Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC/celB
Type: Machinery gene
Locus tag: EQH21_RS04415
Genome accession: NZ_CP035259
Coordinates: 871711..873951 (+)
Length: 746 a.a.
NCBI ID: WP_000942406.1
UniProt ID: A0A4J2I0U7
Organism: Streptococcus pneumoniae strain TVO_1901926
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
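
The coordinates and the protein length above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the coding sequence is a stop codon; the function names are illustrative only and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(871711, 873951)
print(cds_bp)                  # 2241
print(protein_length(cds_bp))  # 746
```

Both results agree with the 2241 bp and 746 a.a. lengths reported in this entry.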

Genomic Context


Location: 866711..878951
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  EQH21_RS04385 (EQH21_04610) - 866861..867829 (+) 969 WP_000658183.1 PhoH family protein -
  EQH21_RS04390 (EQH21_04620) - 868022..868522 (+) 501 WP_000566988.1 GNAT family N-acetyltransferase -
  EQH21_RS04395 (EQH21_04625) - 868525..868851 (+) 327 Protein_884 TfoX/Sxy family protein -
  EQH21_RS04400 ald 869152..870263 (-) 1112 Protein_885 alanine dehydrogenase -
  EQH21_RS04405 (EQH21_04645) - 870440..871009 (+) 570 WP_000443899.1 GNAT family N-acetyltransferase -
  EQH21_RS04410 (EQH21_04650) comEA/celA/cilE 871077..871727 (+) 651 WP_000387330.1 ComEA family DNA-binding protein Machinery gene
  EQH21_RS04415 (EQH21_04655) comEC/celB 871711..873951 (+) 2241 WP_000942406.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  EQH21_RS04420 (EQH21_04660) - 874130..874318 (+) 189 WP_001810955.1 hypothetical protein -
  EQH21_RS04425 (EQH21_04665) - 874350..874937 (+) 588 WP_000933550.1 ATP-binding cassette domain-containing protein -
  EQH21_RS04430 (EQH21_04670) - 874941..876122 (+) 1182 WP_000655933.1 hypothetical protein -
  EQH21_RS04435 (EQH21_04675) infC 876429..876959 (+) 531 WP_000848180.1 translation initiation factor IF-3 -
  EQH21_RS04440 (EQH21_04680) rpmI 876992..877192 (+) 201 WP_001125943.1 50S ribosomal protein L35 -
  EQH21_RS04445 (EQH21_04685) rplT 877244..877603 (+) 360 WP_000124836.1 50S ribosomal protein L20 -
  EQH21_RS04450 (EQH21_04690) - 877661..878041 (+) 381 WP_000157154.1 VOC family protein -
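
Two features of the table above can be reproduced numerically. The displayed location (866711..878951) equals the comEC/celB coordinates extended by 5000 bp on each side, and the end of comEA/celA/cilE runs past the start of comEC/celB. The sketch below assumes 1-based inclusive coordinates; the 5000 bp flank is inferred from the numbers on this page rather than stated by the database, and the helper names are hypothetical.

```python
def flank_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a range by `flank` bp on each side (flank size is an assumption)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


com_ea = (871077, 871727)  # comEA/celA/cilE
com_ec = (871711, 873951)  # comEC/celB

print(flank_window(*com_ec))       # (866711, 878951)
print(overlap_bp(com_ea, com_ec))  # 17
```

Under these assumptions the two machinery genes share 17 bp on the plus strand.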

Sequence


Protein


Length: 746 a.a.        Molecular weight: 84545.06 Da        Isoelectric Point: 9.5148

>NTDB_id=338223 EQH21_RS04415 WP_000942406.1 871711..873951(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901926]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKADGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKKIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFFTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
ENVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
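
The FASTA header of this record packs the database ID, locus tag, protein ID, coordinates, strand, gene name and organism into one line. A minimal sketch for splitting it into fields is shown below. The field layout is inferred from this single header, so the pattern and the field names are assumptions and may not cover every record.

```python
import re

# Pattern inferred from the header shown in this entry (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=338223 EQH21_RS04415 WP_000942406.1 871711..873951(+) "
    "(comEC/celB) [Streptococcus pneumoniae strain TVO_1901926]"
)
record = parse_header(header)
print(record["locus_tag"], record["gene"], record["strand"])
```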

Nucleotide


Length: 2241 bp

>NTDB_id=338223 EQH21_RS04415 WP_000942406.1 871711..873951(+) (comEC/celB) [Streptococcus pneumoniae strain TVO_1901926]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTCTGCCTGACACTGTTAAGGTCAATGGTGATAGTCTGTCCTTTCGCGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTTACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGAATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4J2I0U7

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein      Organism                                  Identities (%)   Coverage (%)   Ha-value
  comEC/celB   Streptococcus pneumoniae Rx1              98.928           100            0.989
  comEC/celB   Streptococcus pneumoniae D39              98.928           100            0.989
  comEC/celB   Streptococcus pneumoniae R6               98.928           100            0.989
  comEC/celB   Streptococcus pneumoniae TIGR4            97.319           100            0.973
  comEC/celB   Streptococcus mitis NCTC 12261            92.349           99.866         0.922
  comEC/celB   Streptococcus mitis SK321                 92.225           100            0.922
  comEC        Lactococcus lactis subsp. cremoris KW2    43.725           99.33          0.434
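
The Ha-values listed here are consistent with the product of the identity and coverage fractions. That definition is an assumption inferred from the numbers on this page, not one stated in the entry. A short sketch that reproduces three of the listed values under that assumption:

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (organism, identities %, coverage %) copied from the similar-proteins list
hits = [
    ("Streptococcus pneumoniae Rx1", 98.928, 100.0),
    ("Streptococcus mitis NCTC 12261", 92.349, 99.866),
    ("Lactococcus lactis subsp. cremoris KW2", 43.725, 99.33),
]

for organism, identity, coverage in hits:
    print(f"{organism}: {ha_value(identity, coverage):.3f}")
```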


Multiple sequence alignment