Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SMSK321_RS04965 | Genome accession | NZ_AEDT01000012 |
| Coordinates | 27303..29543 (+) | Length | 746 a.a. |
| NCBI ID | WP_000942453.1 | Uniprot ID | - |
| Organism | Streptococcus mitis SK321 | ||
| Function | ssDNA transport into the cell DNA binding and uptake |
||
Genomic Context
Location: 22303..34543
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SMSK321_RS04935 (SMSK321_0416) | - | 22799..23356 (+) | 558 | WP_004234343.1 | GrpB family protein | - |
| SMSK321_RS04940 (SMSK321_0417) | - | 23402..23617 (+) | 216 | WP_001232084.1 | YozE family protein | - |
| SMSK321_RS04945 (SMSK321_0418) | - | 23698..24693 (+) | 996 | WP_000658203.1 | PhoH family protein | - |
| SMSK321_RS04950 (SMSK321_0419) | ald | 24749..25861 (-) | 1113 | WP_000904691.1 | alanine dehydrogenase | - |
| SMSK321_RS04955 (SMSK321_0420) | - | 26033..26602 (+) | 570 | WP_000443733.1 | GNAT family N-acetyltransferase | - |
| SMSK321_RS04960 (SMSK321_0421) | comEA/celA/cilE | 26669..27319 (+) | 651 | WP_000443804.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| SMSK321_RS04965 (SMSK321_0422) | comEC/celB | 27303..29543 (+) | 2241 | WP_000942453.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SMSK321_RS04975 (SMSK321_0423) | - | 29943..30530 (+) | 588 | WP_000939891.1 | ABC transporter ATP-binding protein | - |
| SMSK321_RS04980 (SMSK321_0424) | - | 30534..31718 (+) | 1185 | WP_000655927.1 | hypothetical protein | - |
| SMSK321_RS04985 (SMSK321_0425) | infC | 32027..32557 (+) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| SMSK321_RS04990 (SMSK321_0426) | rpmI | 32590..32790 (+) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| SMSK321_RS04995 (SMSK321_0427) | rplT | 32842..33201 (+) | 360 | WP_000124832.1 | 50S ribosomal protein L20 | - |
| SMSK321_RS05000 (SMSK321_0428) | - | 33260..33640 (+) | 381 | WP_000157156.1 | lactoylglutathione lyase | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84423.84 Da Isoelectric Point: 9.0434
>NTDB_id=522 SMSK321_RS04965 WP_000942453.1 27303..29543(+) (comEC/celB) [Streptococcus mitis SK321]
MLQWTKNFPIPLIYLSFLLLWLYYAIFSVSYLALLGFVFLLVCLFIQFLWKSAGKVLVICGIFGFWFVFQNWQQSQASQN
LSDSVEIVRILPDTIKVNGDSLSFRGKADGRIFQVYYKLQSEEEKETFQILTALHDLELEGKLSEPEGKRNFGGFDYQAY
LKTQGIYQTLNIKKIQSLQKVGSWDLGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSV
LFALSFLYPVIQLNFIFEWLEGMIRLVSQVASRPLVFGQPNAWLLILLLISLALVYDLRKNIKRLAVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVAGKTILIDVGGKAESDKKIEKWQEKATTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKEFVAELQATQIKVHSVAEGENVPIFGSQLEVLSPRKIGDGGHDDSLVLYGKL
LDKHFLFTGNLEEKGEKDLLKQYPDLEVDVLKASQHGSKKSSSSAFLEQLKPEMTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGWNSWKIESVR
MLQWTKNFPIPLIYLSFLLLWLYYAIFSVSYLALLGFVFLLVCLFIQFLWKSAGKVLVICGIFGFWFVFQNWQQSQASQN
LSDSVEIVRILPDTIKVNGDSLSFRGKADGRIFQVYYKLQSEEEKETFQILTALHDLELEGKLSEPEGKRNFGGFDYQAY
LKTQGIYQTLNIKKIQSLQKVGSWDLGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSV
LFALSFLYPVIQLNFIFEWLEGMIRLVSQVASRPLVFGQPNAWLLILLLISLALVYDLRKNIKRLAVLSLLITGLFFLTK
YPLENEITMLDVGQGESIFLRDVAGKTILIDVGGKAESDKKIEKWQEKATTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKEFVAELQATQIKVHSVAEGENVPIFGSQLEVLSPRKIGDGGHDDSLVLYGKL
LDKHFLFTGNLEEKGEKDLLKQYPDLEVDVLKASQHGSKKSSSSAFLEQLKPEMTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGWNSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=522 SMSK321_RS04965 WP_000942453.1 27303..29543(+) (comEC/celB) [Streptococcus mitis SK321]
ATGTTACAGTGGACTAAGAATTTCCCCATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTACTACGCTATTTT
CTCAGTATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCTTTGGAAATCTGCTG
GAAAAGTTCTAGTAATTTGTGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGTCAGATTCTGTTGAAATTGTACGGATTTTACCTGACACTATTAAGGTTAATGGTGATAGTTTGTCCTTTCGTGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAACCTTTCAAATTTTAACAGCCC
TTCATGATTTGGAACTAGAAGGGAAACTTTCGGAGCCAGAAGGGAAGAGAAATTTTGGTGGTTTTGACTATCAAGCCTAT
CTGAAAACTCAGGGAATTTACCAGACTCTAAATATCAAAAAAATCCAGTCACTTCAAAAGGTTGGCAGTTGGGATCTAGG
TGAAAACCTGTCCAGTTTACGCCGTAAGGCTGTAGTTTGGATTAAGACGCATTTTCCAGATCCTATGCGCAATTACATGA
CGGGGCTTCTATTAGGACATCTGGATACAGACTTTGAGGAGATGAATGAGCTTTATTCTAGTCTAGGAATTATCCACCTA
TTTGCCTTGTCAGGCATGCAGGTAGGATTTTTCATGGATGGATTTAAGAAACTACTCTTGCGATTAGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCAGGGCTGACAGGATTTTCAGCATCGGTCATTCGCAGTC
TCTTGCAAAAGTTACTGGCTCAACATGGTGTTAAGGGTTTGGATAATTTTGCCTTGACGGTTCTTGTCCTCTTTATCATC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGCTTATGCTTTTATCTTGACCATGACCAGCAAAGAAGG
AGAGGGCCTCAAGGCTGTGGCTAGAGAAAGTCTAGTCATTTCCTTGGGAATATTGCCCATTCTGTCCTTCTATTTTGCGG
AATTTCAACCTTGGTCCATCCTCTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTTTTGTCTGTC
TTATTTGCCCTTTCCTTTCTCTACCCAGTCATTCAGCTTAATTTTATTTTTGAATGGTTGGAGGGGATGATTCGCTTGGT
ATCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCTAATGCGTGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAGGCTAGCAGTATTGAGCTTATTGATTACAGGGCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATTACCATGCTGGATGTGGGGCAAGGGGAAAGTATTTTCCTACGGGATGTAGCTGGTAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTGATAAGAAAATTGAAAAATGGCAAGAAAAGGCAACGACCAGCAATG
CCCAGCGAACCTTGATTCCTTACCTCAAAAGTCGAGGAGTAGCCAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTTGGAGATTTGTTGGAGGTGACCAAGGCTTTCCATGTAGGCGAAATTTTAGTGTCCAAAGGCAGTTTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAATTAAGGTGCATAGTGTGGCAGAAGGGGAAAATGTGCCAATTTTTG
GAAGTCAGTTAGAAGTCCTATCTCCGAGAAAAATTGGAGATGGAGGTCATGATGATTCTCTGGTTCTGTATGGAAAACTC
TTGGATAAGCACTTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGATTTGCTGAAGCAATATCCTGACCTGGA
GGTGGATGTTTTGAAAGCTAGCCAACATGGCTCTAAAAAATCATCAAGTTCAGCCTTTCTAGAACAGTTAAAACCAGAGA
TGACTCTCATCTCAGTTGGAAAGAGCAATAGAATGAAACTCCCCCATCAGGAAACCTTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGCTTTAAAGGTTGGAATAGTTGGAAGATCGAAAGTGTTCGATA
G
ATGTTACAGTGGACTAAGAATTTCCCCATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTACTACGCTATTTT
CTCAGTATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCTTTGGAAATCTGCTG
GAAAAGTTCTAGTAATTTGTGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGTCAGATTCTGTTGAAATTGTACGGATTTTACCTGACACTATTAAGGTTAATGGTGATAGTTTGTCCTTTCGTGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAACCTTTCAAATTTTAACAGCCC
TTCATGATTTGGAACTAGAAGGGAAACTTTCGGAGCCAGAAGGGAAGAGAAATTTTGGTGGTTTTGACTATCAAGCCTAT
CTGAAAACTCAGGGAATTTACCAGACTCTAAATATCAAAAAAATCCAGTCACTTCAAAAGGTTGGCAGTTGGGATCTAGG
TGAAAACCTGTCCAGTTTACGCCGTAAGGCTGTAGTTTGGATTAAGACGCATTTTCCAGATCCTATGCGCAATTACATGA
CGGGGCTTCTATTAGGACATCTGGATACAGACTTTGAGGAGATGAATGAGCTTTATTCTAGTCTAGGAATTATCCACCTA
TTTGCCTTGTCAGGCATGCAGGTAGGATTTTTCATGGATGGATTTAAGAAACTACTCTTGCGATTAGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCAGGGCTGACAGGATTTTCAGCATCGGTCATTCGCAGTC
TCTTGCAAAAGTTACTGGCTCAACATGGTGTTAAGGGTTTGGATAATTTTGCCTTGACGGTTCTTGTCCTCTTTATCATC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGTGCTTATGCTTTTATCTTGACCATGACCAGCAAAGAAGG
AGAGGGCCTCAAGGCTGTGGCTAGAGAAAGTCTAGTCATTTCCTTGGGAATATTGCCCATTCTGTCCTTCTATTTTGCGG
AATTTCAACCTTGGTCCATCCTCTTGACCTTTGTCTTTTCCTTTCTTTTTGACTTGGTCTTCTTACCGCTTTTGTCTGTC
TTATTTGCCCTTTCCTTTCTCTACCCAGTCATTCAGCTTAATTTTATTTTTGAATGGTTGGAGGGGATGATTCGCTTGGT
ATCGCAGGTGGCAAGTAGACCGCTTGTCTTTGGACAACCTAATGCGTGGCTTTTAATCTTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGGAAAAACATTAAAAGGCTAGCAGTATTGAGCTTATTGATTACAGGGCTCTTTTTCCTTACCAAG
TATCCACTGGAAAATGAAATTACCATGCTGGATGTGGGGCAAGGGGAAAGTATTTTCCTACGGGATGTAGCTGGTAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTGATAAGAAAATTGAAAAATGGCAAGAAAAGGCAACGACCAGCAATG
CCCAGCGAACCTTGATTCCTTACCTCAAAAGTCGAGGAGTAGCCAAGATTGACCAGCTGATTTTGACAAATACGGACAAG
GAACATGTTGGAGATTTGTTGGAGGTGACCAAGGCTTTCCATGTAGGCGAAATTTTAGTGTCCAAAGGCAGTTTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAATTAAGGTGCATAGTGTGGCAGAAGGGGAAAATGTGCCAATTTTTG
GAAGTCAGTTAGAAGTCCTATCTCCGAGAAAAATTGGAGATGGAGGTCATGATGATTCTCTGGTTCTGTATGGAAAACTC
TTGGATAAGCACTTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGATTTGCTGAAGCAATATCCTGACCTGGA
GGTGGATGTTTTGAAAGCTAGCCAACATGGCTCTAAAAAATCATCAAGTTCAGCCTTTCTAGAACAGTTAAAACCAGAGA
TGACTCTCATCTCAGTTGGAAAGAGCAATAGAATGAAACTCCCCCATCAGGAAACCTTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGCTTTAAAGGTTGGAATAGTTGGAAGATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae TIGR4 |
92.225 |
100 |
0.922 |
| comEC/celB | Streptococcus pneumoniae D39 |
91.823 |
100 |
0.918 |
| comEC/celB | Streptococcus pneumoniae R6 |
91.823 |
100 |
0.918 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
91.823 |
100 |
0.918 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
91.946 |
99.866 |
0.918 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
45.479 |
100 |
0.458 |
Multiple sequence alignment
References
| [1] | G Salvadori et al. (2018) High-resolution profiles of the Streptococcus mitis CSP signaling pathway reveal core and strain-specific regulated genes. BMC Genomics 19(1):453. [PMID: 29898666] |