Detailed information    

Experimentally validated

Overview


Name: comEC/celB
Type: Machinery gene
Locus tag: KZH43_RS04200
Genome accession: NZ_CP079923
Coordinates: 846509..848749 (+)
Length: 746 a.a.
NCBI ID: WP_000942407.1
UniProt ID: A0A2U3RZI0
Organism: Streptococcus pneumoniae Rx1
Function: ssDNA transport into the cell (DNA binding and uptake)
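The coordinates and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Overview of this record.
start, end = 846509, 848749          # assumed 1-based, inclusive
protein_length_aa = 746

gene_length_bp = end - start + 1     # inclusive span
codons = gene_length_bp // 3         # includes the stop codon

print(gene_length_bp)                    # 2241
print(gene_length_bp % 3 == 0)           # True: whole number of codons
print(codons - 1 == protein_length_aa)   # True: 747 codons minus the stop
```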

Function


ComEC is a transmembrane channel protein essential for transporting extracellular DNA into the cell during natural transformation. It forms a pore in the cell membrane through which single-stranded DNA (ssDNA), generated from incoming double-stranded DNA (dsDNA), enters the cytoplasm. ComEC works together with the DNA-binding protein ComEA to take up and process DNA for subsequent homologous recombination.


Genomic Context


Location: 841509..853749
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
KZH43_RS04155 (KZH43_04155) | - | 841659..842627 (+) | 969 | WP_000658183.1 | PhoH family protein | -
KZH43_RS04165 (KZH43_04165) | - | 842820..843320 (+) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | -
KZH43_RS04170 (KZH43_04170) | - | 843323..843649 (+) | 327 | Protein_842 | TfoX/Sxy family protein | -
KZH43_RS10510 | ald | 843950..845061 (-) | 1112 | Protein_843 | alanine dehydrogenase | -
KZH43_RS04190 (KZH43_04190) | - | 845238..845807 (+) | 570 | WP_000443899.1 | GNAT family N-acetyltransferase | -
KZH43_RS04195 (KZH43_04195) | comEA/celA/cilE | 845875..846525 (+) | 651 | WP_000387330.1 | ComEA family DNA-binding protein | Machinery gene
KZH43_RS04200 (KZH43_04200) | comEC/celB | 846509..848749 (+) | 2241 | WP_000942407.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
KZH43_RS04205 (KZH43_04205) | - | 848928..849116 (+) | 189 | WP_010976492.1 | hypothetical protein | -
KZH43_RS04210 (KZH43_04210) | - | 849148..849735 (+) | 588 | Protein_848 | ATP-binding cassette domain-containing protein | -
KZH43_RS04215 (KZH43_04215) | - | 849739..850920 (+) | 1182 | WP_000655958.1 | ABC transporter permease | -
KZH43_RS04220 (KZH43_04220) | infC | 851227..851757 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | -
KZH43_RS04225 (KZH43_04225) | rpmI | 851790..851990 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | -
KZH43_RS04230 (KZH43_04230) | rplT | 852042..852401 (+) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | -
KZH43_RS04235 (KZH43_04235) | - | 852459..852839 (+) | 381 | WP_000157154.1 | VOC family protein | -
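The coordinates listed for comEA/celA/cilE and comEC/celB overlap on the same strand. A minimal sketch for measuring that overlap follows, assuming 1-based inclusive coordinates; `overlap_bp` is a hypothetical helper written for this example, not part of any database tooling.

```python
# (name, start, end) taken from the Genomic Context table; assumed 1-based, inclusive.
comEA = ("comEA/celA/cilE", 845875, 846525)
comEC = ("comEC/celB", 846509, 848749)

def overlap_bp(a, b):
    """Number of bases shared by two inclusive intervals (0 if they are disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)

print(overlap_bp(comEA, comEC))  # 17
```

An overlap of this kind between adjacent same-strand genes is consistent with the two genes being transcribed together, although the coordinates alone do not establish that.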

Regulatory network


Regulator Target Regulation
  htrA comEC/celB negative effect
  htrA comEA/celA/cilE negative effect
  htrA comC/comC1 negative effect
  comC/comC1 comD/comD1 positive effect
  comB comC/comC1 positive effect
  ciaR comC/comC1 negative effect
  ciaH comC/comC1 negative effect
  comA comC/comC1 positive effect
  comE comC/comC1 positive effect
  ciaR htrA positive effect
  ciaH htrA positive effect
  comD/comD1 comE positive effect
  comE comA positive effect
  comE comB positive effect
  comE comE positive effect
  comE comX/comX1 positive effect
  comE comX/comX2 positive effect
  comE comD/comD1 positive effect
  comE comM positive effect
  comE comW positive effect
  stkP comE positive effect
  comX/comX1 late competence genes positive effect
  comW comX/comX1 positive effect
  clpE comX/comX1 negative effect
  clpP comX/comX1 negative effect
  comX/comX2 late competence genes positive effect
  comW comX/comX2 positive effect
  clpE comX/comX2 negative effect
  clpP comX/comX2 negative effect
  comM cbpD negative effect
  clpC comW negative effect
  clpP comW negative effect
  mecA comW negative effect
  cbpD lytA positive effect
  cbpD lytC positive effect
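The regulator/target rows can be read as a signed directed graph. The sketch below uses only a subset of the rows listed in this section and walks the graph backwards from comEC/celB; `upstream` is a hypothetical helper, and multiplying signs along a path is a simplifying assumption rather than a statement about the underlying biology.

```python
from collections import defaultdict

# Subset of (regulator, target, sign) rows from this record; +1 positive, -1 negative.
EDGES = [
    ("htrA", "comEC/celB", -1),
    ("htrA", "comEA/celA/cilE", -1),
    ("ciaR", "htrA", +1),
    ("ciaH", "htrA", +1),
]

def upstream(target, edges):
    """Return {regulator: net sign} for every regulator with a path to target."""
    incoming = defaultdict(list)
    for reg, tgt, sign in edges:
        incoming[tgt].append((reg, sign))
    result, stack = {}, [(target, +1)]
    while stack:
        node, sign_to_target = stack.pop()
        for reg, sign in incoming[node]:
            if reg not in result:  # first path found wins; also guards against cycles
                result[reg] = sign * sign_to_target
                stack.append((reg, result[reg]))
    return result

print(upstream("comEC/celB", EDGES))
```

With these rows, htrA is the only direct regulator of comEC/celB, and ciaR and ciaH act on it indirectly through htrA.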

Sequence


Protein


Length: 746 a.a.    Molecular weight: 84619.29 Da    Isoelectric Point: 9.6173

>NTDB_id=264 KZH43_RS04200 WP_000942407.1 846509..848749(+) (comEC/celB) [Streptococcus pneumoniae Rx1]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKADGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKKIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFFTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNTWLLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIKKWQEKMTTSNAQRSLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEMTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMIVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKHYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPYQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
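The FASTA header used for the sequences in this record packs several fields into one line. The following is a minimal parsing sketch; the field layout is inferred from this single header and the group names are assumptions, so other records may need a looser pattern.

```python
import re

header = (">NTDB_id=264 KZH43_RS04200 WP_000942407.1 "
          "846509..848749(+) (comEC/celB) [Streptococcus pneumoniae Rx1]")

# Layout inferred from the header above: id, locus tag, protein ID,
# coordinates with strand, gene name in parentheses, organism in brackets.
PATTERN = re.compile(
    r">NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]"
)

fields = PATTERN.match(header).groupdict()
print(fields["locus_tag"], fields["gene"], fields["strand"])
```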

Nucleotide


Length: 2241 bp

>NTDB_id=264 KZH43_RS04200 WP_000942407.1 846509..848749(+) (comEC/celB) [Streptococcus pneumoniae Rx1]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTCTGCCTGACACTGTTAAGGTCAATGGTGATAGTCTGTCCTTTCGCGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTTACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATACATGGCTTTTAATCCTATTGTTAATTTCTTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCAAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAAGCTTGATACCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGATGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGATAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCACTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCTATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
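As a consistency check between the nucleotide and protein sequences, the first and last codons can be translated by hand or in code. This sketch assumes the standard genetic code and builds its own codon table instead of relying on a library; `translate` is an illustrative helper.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed to apply to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate an in-frame DNA string codon by codon; '*' marks a stop."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

first_codons = "ATGTTACAGTGGATTAAGAATTTC"   # first 24 nt of the nucleotide record
last_codons = "AAAATCGAAAGTGTTCGATAG"       # last 21 nt of the nucleotide record

print(translate(first_codons))  # MLQWIKNF
print(translate(last_codons))   # KIESVR*
```

Both results match the start (MLQWIKNF) and end (KIESVR) of the protein sequence given in this record.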


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2U3RZI0

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.
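For a rough intuition about where membrane-spanning segments might lie, a sliding-window hydropathy profile can be computed. This is not TMHMM, which uses a hidden Markov model; the sketch below swaps in the much simpler Kyte-Doolittle scale with a 19-residue window, and the 1.6 cutoff is a commonly used rule of thumb, not a value taken from this record.

```python
# Kyte-Doolittle hydropathy values per residue.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def hydropathy(seq, window=19):
    """Mean Kyte-Doolittle score for every full window along the sequence."""
    return [
        sum(KD[aa] for aa in seq[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]

# First 50 residues of the ComEC protein sequence in this record.
n_term = "MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPW"

profile = hydropathy(n_term)
peak = max(profile)
print(round(peak, 2), peak > 1.6)  # a peak above ~1.6 hints at a membrane-spanning stretch
```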





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEC/celB | Streptococcus pneumoniae D39 | 100 | 100 | 1
comEC/celB | Streptococcus pneumoniae R6 | 100 | 100 | 1
comEC/celB | Streptococcus pneumoniae TIGR4 | 97.587 | 100 | 0.976
comEC/celB | Streptococcus mitis SK321 | 91.823 | 100 | 0.918
comEC/celB | Streptococcus mitis NCTC 12261 | 91.544 | 99.866 | 0.914
comEC | Lactococcus lactis subsp. cremoris KW2 | 43.725 | 100 | 0.44
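The listed Ha-values appear to track the product of identity and coverage, each taken as a fraction. That reading is inferred from the numbers in this record and is an assumption, not a documented definition; the sketch below only compares the two and reports the difference.

```python
# (organism, identities %, coverage %, listed Ha-value) copied from this record.
rows = [
    ("Streptococcus pneumoniae D39", 100.0, 100.0, 1.0),
    ("Streptococcus pneumoniae R6", 100.0, 100.0, 1.0),
    ("Streptococcus pneumoniae TIGR4", 97.587, 100.0, 0.976),
    ("Streptococcus mitis SK321", 91.823, 100.0, 0.918),
    ("Streptococcus mitis NCTC 12261", 91.544, 99.866, 0.914),
    ("Lactococcus lactis subsp. cremoris KW2", 43.725, 100.0, 0.44),
]

# Assumed relation: Ha-value ~ (identities / 100) * (coverage / 100).
diffs = []
for organism, identities, coverage, listed in rows:
    estimate = identities * coverage / 10000
    diffs.append(abs(estimate - listed))
    print(f"{organism}: estimate {estimate:.3f}, listed {listed}")
```

All six estimates fall within 0.005 of the listed values; the Lactococcus row differs most, so the database may round or compute that value slightly differently.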





References


[1] Yanni Liu et al. (2019) HtrA-mediated selective degradation of DNA uptake apparatus accelerates termination of pneumococcal transformation. Molecular Microbiology 112(4):1308-1325. [PMID: 31396996]
[2] Scott N Peterson et al. (2004) Identification of competence pheromone responsive genes in Streptococcus pneumoniae by use of DNA microarrays. Molecular Microbiology 51(4):1051-1070. [PMID: 14763980]
[3] E V Pestova et al. (1998) Isolation and characterization of three Streptococcus pneumoniae transformation-specific loci by use of a lacZ reporter insertion vector. Journal of Bacteriology 180(10):2701-2710. [PMID: 9573156]