Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   EQH37_RS04600 Genome accession   NZ_CP035243
Coordinates   909876..912116 (+) Length   746 a.a.
NCBI ID   WP_054366115.1    Uniprot ID   -
Organism   Streptococcus pneumoniae strain TVO_1901946     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 904876..917116
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH37_RS04570 (EQH37_04850) - 905026..905994 (+) 969 WP_000658191.1 PhoH family protein -
  EQH37_RS04575 (EQH37_04860) - 906187..906687 (+) 501 WP_000566988.1 GNAT family N-acetyltransferase -
  EQH37_RS04580 (EQH37_04865) - 906690..907064 (+) 375 Protein_919 TfoX/Sxy family protein -
  EQH37_RS04585 (EQH37_04870) ald 907318..908428 (-) 1111 Protein_920 alanine dehydrogenase -
  EQH37_RS04590 (EQH37_04875) - 908605..909174 (+) 570 WP_000443749.1 GNAT family N-acetyltransferase -
  EQH37_RS04595 (EQH37_04880) comEA/celA/cilE 909242..909892 (+) 651 WP_000387336.1 ComEA family DNA-binding protein Machinery gene
  EQH37_RS04600 (EQH37_04885) comEC/celB 909876..912116 (+) 2241 WP_054366115.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  EQH37_RS04605 (EQH37_04890) - 912295..912483 (+) 189 WP_001809102.1 hypothetical protein -
  EQH37_RS04610 (EQH37_04895) - 912515..913102 (+) 588 WP_000933542.1 ATP-binding cassette domain-containing protein -
  EQH37_RS04615 (EQH37_04900) - 913106..914287 (+) 1182 WP_000655951.1 membrane protein -
  EQH37_RS04620 (EQH37_04905) infC 914594..915124 (+) 531 WP_000848180.1 translation initiation factor IF-3 -
  EQH37_RS04625 (EQH37_04910) rpmI 915157..915357 (+) 201 WP_001125943.1 50S ribosomal protein L35 -
  EQH37_RS04630 (EQH37_04915) rplT 915409..915768 (+) 360 WP_000124836.1 50S ribosomal protein L20 -
  EQH37_RS04635 (EQH37_04920) - 915826..916206 (+) 381 WP_054366114.1 VOC family protein -

Sequence


Protein


Download         Length: 746 a.a.        Molecular weight: 84552.05 Da        Isoelectric Point: 9.5761

>NTDB_id=337201 EQH37_RS04600 WP_054366115.1 909876..912116(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901946]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVFLLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVASRPLVFGQPNEWLLILLLISLPLVYDLRKNIKGLTVLSLLITGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR

Nucleotide


Download         Length: 2241 bp        

>NTDB_id=337201 EQH37_RS04600 WP_054366115.1 909876..912116(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901946]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACAGTGTTTCTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
CTCACAGGTGGCAAGTAGGCCTCTAGTCTTTGGACAACCCAATGAATGGCTTTTAATCCTATTGTTAATTTCTTTGCCTT
TGGTCTATGATTTGAGGAAAAACATTAAAGGATTAACAGTATTGAGTTTATTGATTACAGGTCTCTTTTTCCTTACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGGGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAATTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus pneumoniae TIGR4

98.794

100

0.988

  comEC/celB Streptococcus pneumoniae Rx1

97.855

100

0.979

  comEC/celB Streptococcus pneumoniae D39

97.855

100

0.979

  comEC/celB Streptococcus pneumoniae R6

97.855

100

0.979

  comEC/celB Streptococcus mitis SK321

91.823

100

0.918

  comEC/celB Streptococcus mitis NCTC 12261

91.678

99.866

0.916

  comEC Lactococcus lactis subsp. cremoris KW2

44.13

99.33

0.438


Multiple sequence alignment