Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   EQH20_RS04185 Genome accession   NZ_CP035260
Coordinates   845238..847478 (+) Length   746 a.a.
NCBI ID   WP_000942403.1    Uniprot ID   B1IBB4
Organism   Streptococcus pneumoniae strain TVO_1901925     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 840238..852478
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH20_RS04155 (EQH20_04380) - 840389..841357 (+) 969 WP_000658183.1 PhoH family protein -
  EQH20_RS04160 (EQH20_04390) - 841550..842050 (+) 501 WP_000566982.1 GNAT family N-acetyltransferase -
  EQH20_RS04165 (EQH20_04395) - 842053..842378 (+) 326 Protein_830 TfoX/Sxy family protein -
  EQH20_RS04170 ald 842679..843790 (-) 1112 Protein_831 alanine dehydrogenase -
  EQH20_RS04175 (EQH20_04415) - 843967..844536 (+) 570 WP_000443774.1 GNAT family N-acetyltransferase -
  EQH20_RS04180 (EQH20_04420) comEA/celA/cilE 844604..845254 (+) 651 WP_000387331.1 ComEA family DNA-binding protein Machinery gene
  EQH20_RS04185 (EQH20_04425) comEC/celB 845238..847478 (+) 2241 WP_000942403.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  EQH20_RS04190 (EQH20_04430) - 847657..847845 (+) 189 WP_001809102.1 hypothetical protein -
  EQH20_RS04195 (EQH20_04435) - 847877..848464 (+) 588 WP_000933542.1 ATP-binding cassette domain-containing protein -
  EQH20_RS04200 (EQH20_04440) - 848468..849649 (+) 1182 WP_061647495.1 hypothetical protein -
  EQH20_RS04205 (EQH20_04445) infC 849956..850486 (+) 531 WP_061647496.1 translation initiation factor IF-3 -
  EQH20_RS04210 (EQH20_04450) rpmI 850519..850719 (+) 201 WP_001125943.1 50S ribosomal protein L35 -
  EQH20_RS04215 (EQH20_04455) rplT 850771..851130 (+) 360 WP_000124836.1 50S ribosomal protein L20 -
  EQH20_RS04220 (EQH20_04460) - 851188..851568 (+) 381 WP_000157154.1 VOC family protein -

Sequence


Protein


Download         Length: 746 a.a.        Molecular weight: 84530.99 Da        Isoelectric Point: 9.5181

>NTDB_id=338300 EQH20_RS04185 WP_000942403.1 845238..847478(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901925]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVTSESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
ENVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQTTQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR

Nucleotide


Download         Length: 2241 bp        

>NTDB_id=338300 EQH20_RS04185 WP_000942403.1 845238..847478(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901925]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTACTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGAATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGACGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGACACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB B1IBB4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus pneumoniae TIGR4

98.123

100

0.981

  comEC/celB Streptococcus pneumoniae D39

97.855

100

0.979

  comEC/celB Streptococcus pneumoniae R6

97.855

100

0.979

  comEC/celB Streptococcus pneumoniae Rx1

97.855

100

0.979

  comEC/celB Streptococcus mitis NCTC 12261

91.946

99.866

0.918

  comEC/celB Streptococcus mitis SK321

91.689

100

0.917

  comEC Lactococcus lactis subsp. cremoris KW2

44.13

99.33

0.438


Multiple sequence alignment