Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   EQH22_RS04405 Genome accession   NZ_CP035258
Coordinates   875640..877880 (+) Length   746 a.a.
NCBI ID   WP_000942406.1    Uniprot ID   A0A4J2I0U7
Organism   Streptococcus pneumoniae strain TVO_1901927     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 870640..882880
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH22_RS04375 (EQH22_04625) - 870790..871758 (+) 969 WP_000658183.1 PhoH family protein -
  EQH22_RS04380 (EQH22_04635) - 871951..872451 (+) 501 WP_000566988.1 GNAT family N-acetyltransferase -
  EQH22_RS04385 (EQH22_04640) - 872454..872780 (+) 327 Protein_883 TfoX/Sxy family protein -
  EQH22_RS04390 ald 873081..874192 (-) 1112 Protein_884 alanine dehydrogenase -
  EQH22_RS04395 (EQH22_04660) - 874369..874938 (+) 570 WP_000443899.1 GNAT family N-acetyltransferase -
  EQH22_RS04400 (EQH22_04665) comEA/celA/cilE 875006..875656 (+) 651 WP_000387330.1 ComEA family DNA-binding protein Machinery gene
  EQH22_RS04405 (EQH22_04670) comEC/celB 875640..877880 (+) 2241 WP_000942406.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  EQH22_RS04410 (EQH22_04675) - 878059..878247 (+) 189 WP_001810955.1 hypothetical protein -
  EQH22_RS04415 (EQH22_04680) - 878279..878866 (+) 588 WP_000933550.1 ATP-binding cassette domain-containing protein -
  EQH22_RS04420 (EQH22_04685) - 878870..880051 (+) 1182 WP_000655933.1 hypothetical protein -
  EQH22_RS04425 (EQH22_04690) infC 880358..880888 (+) 531 WP_000848180.1 translation initiation factor IF-3 -
  EQH22_RS04430 (EQH22_04695) rpmI 880921..881121 (+) 201 WP_001125943.1 50S ribosomal protein L35 -
  EQH22_RS04435 (EQH22_04700) rplT 881173..881532 (+) 360 WP_000124836.1 50S ribosomal protein L20 -
  EQH22_RS04440 (EQH22_04705) - 881590..881970 (+) 381 WP_000157154.1 VOC family protein -

Sequence


Protein


Download         Length: 746 a.a.        Molecular weight: 84545.06 Da        Isoelectric Point: 9.5148

>NTDB_id=338145 EQH22_RS04405 WP_000942406.1 875640..877880(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901927]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKADGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKKIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFFTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
ENVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR

Nucleotide


Download         Length: 2241 bp        

>NTDB_id=338145 EQH22_RS04405 WP_000942406.1 875640..877880(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901927]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTCTGCCTGACACTGTTAAGGTCAATGGTGATAGTCTGTCCTTTCGCGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTTACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGAATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4J2I0U7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus pneumoniae Rx1

98.928

100

0.989

  comEC/celB Streptococcus pneumoniae D39

98.928

100

0.989

  comEC/celB Streptococcus pneumoniae R6

98.928

100

0.989

  comEC/celB Streptococcus pneumoniae TIGR4

97.319

100

0.973

  comEC/celB Streptococcus mitis NCTC 12261

92.349

99.866

0.922

  comEC/celB Streptococcus mitis SK321

92.225

100

0.922

  comEC Lactococcus lactis subsp. cremoris KW2

43.725

99.33

0.434


Multiple sequence alignment