Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SM12261_RS04305 | Genome accession | NZ_CP028414 |
| Coordinates | 795536..797776 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942409.1 | Uniprot ID | - |
| Organism | Streptococcus mitis NCTC 12261 | ||
| Function | ssDNA transport into the cell DNA binding and uptake |
||
Genomic Context
Location: 790536..802776
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SM12261_RS04275 (SM12261_0830) | cvfB | 790767..791621 (+) | 855 | WP_001095458.1 | RNA-binding virulence regulatory protein CvfB | - |
| SM12261_RS04280 (SM12261_0831) | - | 791630..791845 (+) | 216 | WP_001232083.1 | YozE family protein | - |
| SM12261_RS04285 (SM12261_0832) | - | 791930..792925 (+) | 996 | WP_000658173.1 | PhoH family protein | - |
| SM12261_RS04290 (SM12261_0833) | ald | 792982..794094 (-) | 1113 | WP_000904719.1 | alanine dehydrogenase | - |
| SM12261_RS04295 (SM12261_0834) | - | 794266..794835 (+) | 570 | WP_000443732.1 | GNAT family N-acetyltransferase | - |
| SM12261_RS04300 (SM12261_0835) | comEA/celA/cilE | 794902..795552 (+) | 651 | WP_000387351.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| SM12261_RS04305 (SM12261_0836) | comEC/celB | 795536..797776 (+) | 2241 | WP_000942409.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SM12261_RS04310 (SM12261_0837) | infC | 797996..798526 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| SM12261_RS04315 (SM12261_0838) | rpmI | 798559..798759 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| SM12261_RS04320 (SM12261_0839) | rplT | 798811..799170 (+) | 360 | WP_000124834.1 | 50S ribosomal protein L20 | - |
| SM12261_RS04325 (SM12261_0840) | - | 799346..801010 (+) | 1665 | WP_000635031.1 | molecular chaperone HscC | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84460.75 Da Isoelectric Point: 9.5655
>NTDB_id=494 SM12261_RS04305 WP_000942409.1 795536..797776(+) (comEC/celB) [Streptococcus mitis NCTC 12261]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSSSYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFLFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKSDGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPDGQRNFGGFDYQAY
LKTQGIYQTLTIKRIQSLQKVSSWDIGENLSSLRRKTVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNGLYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFALSFLHPVIQLNFIFEWLEGMIRFVSQVASRPLVFGQPNAWILILLLISLALVYDLRKNIKRLAVLSLLITGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKIILIDVGGKAESSKKIEKWQEKTTTSNAQRTLIPYLKSRGVSKIDQLILTNTDK
EYVGDLLEVTKAFHVGEILVSKGSLKQKQFVAELQATQTKVRSITAGENLSIFGSQLEVLSPRKIGDGGYEDSLVLYGKL
LDKYFLFTGNLEEKGEKELLKHYPDLKVDVLKAGQHGSKKSSSSDFLEKLKPEFTLISVGKNNRAKLPHQETLTRLETIN
SKVYRTDQHGAIRFKGWNSWKVETVG
MLQWIKNFSIPLIYLSFLLLWLYYAIFSSSYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFLFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKSDGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPDGQRNFGGFDYQAY
LKTQGIYQTLTIKRIQSLQKVSSWDIGENLSSLRRKTVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNGLYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFALSFLHPVIQLNFIFEWLEGMIRFVSQVASRPLVFGQPNAWILILLLISLALVYDLRKNIKRLAVLSLLITGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKIILIDVGGKAESSKKIEKWQEKTTTSNAQRTLIPYLKSRGVSKIDQLILTNTDK
EYVGDLLEVTKAFHVGEILVSKGSLKQKQFVAELQATQTKVRSITAGENLSIFGSQLEVLSPRKIGDGGYEDSLVLYGKL
LDKYFLFTGNLEEKGEKELLKHYPDLKVDVLKAGQHGSKKSSSSDFLEKLKPEFTLISVGKNNRAKLPHQETLTRLETIN
SKVYRTDQHGAIRFKGWNSWKVETVG
Nucleotide
Download Length: 2241 bp
>NTDB_id=494 SM12261_RS04305 WP_000942409.1 795536..797776(+) (comEC/celB) [Streptococcus mitis NCTC 12261]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTTAGTTTTCTGTTGCTTTGGCTTTACTACGCCATTTT
CTCATCATCTTATCTCGCACTACTAGGTTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGTGGAATCTTTGGTTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
TTGGCGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTGTTAAGGTCAATGGTGATAGTCTATCCTTTCGTGGTAA
GTCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TTCATGAGATAGGACTAGAAGGAAAACTTTCAGAACCAGACGGGCAGAGAAATTTTGGTGGTTTTGACTACCAAGCCTAT
CTGAAAACTCAAGGGATTTACCAAACATTAACTATCAAAAGAATCCAGTCACTTCAAAAGGTTAGCAGTTGGGATATAGG
AGAAAATCTGTCCAGTTTACGTCGAAAGACTGTGGTTTGGATTAAGACGCATTTTCCAGACCCTATGCGCAATTACATGA
CAGGGCTCTTGTTAGGACATCTGGACACCGACTTTGAGGAGATGAACGGGCTTTATTCTAGTTTAGGAATTATCCACCTC
TTTGCCTTGTCGGGTATGCAGGTAGGTTTTTTCATGGACGGATTTAAGAAACTCCTCTTGCGATTGGGCTTGACTCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCAGGGCTGACAGGATTTTCAGCATCAGTTATTCGCAGTC
TTTTGCAAAAGTTACTGGCACAACATGGTGTTAAGGGCTTGGATAATTTTGCCTTGACGGTCCTTGTCCTCTTTATCGTC
ATGCCCAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCTTGACCATGACCAGCAAAGAAGG
AGAAGGCCTCAAGGCTGTTGCTAGAGAAAGTCTAGTCATTTCCTTGGGAATATTACCCATCCTATCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGCCCTTTCCTTTCTTCATCCAGTCATTCAGCTGAATTTTATTTTTGAATGGTTGGAGGGAATGATTCGCTTTGT
ATCACAGGTGGCAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGATTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAGGCTAGCAGTGTTGAGCTTATTGATTACAGGTCTCTTTTTCTTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTTGGGCAAGGGGAAAGTATTTTCCTACGGGATGTAACTGGTAAAAT
CATTCTCATAGATGTGGGTGGCAAGGCAGAGTCTAGTAAGAAAATAGAAAAATGGCAAGAAAAGACGACGACCAGTAATG
CCCAGCGAACCTTGATACCCTATCTCAAAAGTCGCGGAGTATCCAAGATTGACCAGCTAATTTTGACCAATACGGACAAG
GAATATGTTGGAGATTTGTTGGAGGTGACCAAGGCTTTTCATGTAGGCGAAATTTTAGTGTCAAAAGGCAGTTTGAAACA
GAAGCAATTTGTGGCAGAACTACAAGCGACCCAAACAAAAGTTAGAAGTATAACAGCAGGAGAGAACTTGTCAATTTTTG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGAAAATTGGAGATGGAGGTTATGAAGATTCCCTGGTTCTGTATGGAAAACTT
TTGGATAAGTACTTTCTCTTCACAGGAAATTTGGAGGAAAAAGGAGAGAAGGAATTGCTGAAGCACTATCCAGACTTGAA
AGTGGATGTTTTAAAAGCTGGCCAACATGGCTCTAAAAAATCATCAAGTTCAGACTTTCTAGAAAAACTCAAACCAGAGT
TTACTCTTATCTCAGTTGGAAAGAACAATCGAGCGAAACTCCCCCATCAGGAAACTTTGACACGACTGGAAACTATCAAT
AGTAAAGTTTACCGAACTGACCAGCATGGAGCTATACGCTTTAAAGGGTGGAATAGTTGGAAAGTCGAAACGGTTGGTTA
A
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTTAGTTTTCTGTTGCTTTGGCTTTACTACGCCATTTT
CTCATCATCTTATCTCGCACTACTAGGTTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGTGGAATCTTTGGTTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
TTGGCGGATTCTGTTGAAAGGGTACGGATTTTACCAGACACTGTTAAGGTCAATGGTGATAGTCTATCCTTTCGTGGTAA
GTCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TTCATGAGATAGGACTAGAAGGAAAACTTTCAGAACCAGACGGGCAGAGAAATTTTGGTGGTTTTGACTACCAAGCCTAT
CTGAAAACTCAAGGGATTTACCAAACATTAACTATCAAAAGAATCCAGTCACTTCAAAAGGTTAGCAGTTGGGATATAGG
AGAAAATCTGTCCAGTTTACGTCGAAAGACTGTGGTTTGGATTAAGACGCATTTTCCAGACCCTATGCGCAATTACATGA
CAGGGCTCTTGTTAGGACATCTGGACACCGACTTTGAGGAGATGAACGGGCTTTATTCTAGTTTAGGAATTATCCACCTC
TTTGCCTTGTCGGGTATGCAGGTAGGTTTTTTCATGGACGGATTTAAGAAACTCCTCTTGCGATTGGGCTTGACTCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCAGGGCTGACAGGATTTTCAGCATCAGTTATTCGCAGTC
TTTTGCAAAAGTTACTGGCACAACATGGTGTTAAGGGCTTGGATAATTTTGCCTTGACGGTCCTTGTCCTCTTTATCGTC
ATGCCCAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCTTGACCATGACCAGCAAAGAAGG
AGAAGGCCTCAAGGCTGTTGCTAGAGAAAGTCTAGTCATTTCCTTGGGAATATTACCCATCCTATCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTCTTGTCTATC
TTATTTGCCCTTTCCTTTCTTCATCCAGTCATTCAGCTGAATTTTATTTTTGAATGGTTGGAGGGAATGATTCGCTTTGT
ATCACAGGTGGCAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGATTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAGGCTAGCAGTGTTGAGCTTATTGATTACAGGTCTCTTTTTCTTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTTGGGCAAGGGGAAAGTATTTTCCTACGGGATGTAACTGGTAAAAT
CATTCTCATAGATGTGGGTGGCAAGGCAGAGTCTAGTAAGAAAATAGAAAAATGGCAAGAAAAGACGACGACCAGTAATG
CCCAGCGAACCTTGATACCCTATCTCAAAAGTCGCGGAGTATCCAAGATTGACCAGCTAATTTTGACCAATACGGACAAG
GAATATGTTGGAGATTTGTTGGAGGTGACCAAGGCTTTTCATGTAGGCGAAATTTTAGTGTCAAAAGGCAGTTTGAAACA
GAAGCAATTTGTGGCAGAACTACAAGCGACCCAAACAAAAGTTAGAAGTATAACAGCAGGAGAGAACTTGTCAATTTTTG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGAAAATTGGAGATGGAGGTTATGAAGATTCCCTGGTTCTGTATGGAAAACTT
TTGGATAAGTACTTTCTCTTCACAGGAAATTTGGAGGAAAAAGGAGAGAAGGAATTGCTGAAGCACTATCCAGACTTGAA
AGTGGATGTTTTAAAAGCTGGCCAACATGGCTCTAAAAAATCATCAAGTTCAGACTTTCTAGAAAAACTCAAACCAGAGT
TTACTCTTATCTCAGTTGGAAAGAACAATCGAGCGAAACTCCCCCATCAGGAAACTTTGACACGACTGGAAACTATCAAT
AGTAAAGTTTACCGAACTGACCAGCATGGAGCTATACGCTTTAAAGGGTGGAATAGTTGGAAAGTCGAAACGGTTGGTTA
A
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
91.946 |
99.866 |
0.918 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
91.812 |
99.866 |
0.917 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
91.544 |
99.866 |
0.914 |
| comEC/celB | Streptococcus pneumoniae R6 |
91.544 |
99.866 |
0.914 |
| comEC/celB | Streptococcus pneumoniae D39 |
91.544 |
99.866 |
0.914 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.459 |
100 |
0.447 |
Multiple sequence alignment
References
| [1] | G Salvadori et al. (2018) High-resolution profiles of the Streptococcus mitis CSP signaling pathway reveal core and strain-specific regulated genes. BMC Genomics 19(1):453. [PMID: 29898666] |