Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | R4699_RS04730 | Genome accession | NZ_CP137105 |
| Coordinates | 934439..936679 (-) | Length | 746 a.a. |
| NCBI ID | WP_317786088.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain 16H2092 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 929439..941679
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R4699_RS04695 | - | 930349..930729 (-) | 381 | WP_000157154.1 | VOC family protein | - |
| R4699_RS04700 | rplT | 930787..931146 (-) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| R4699_RS04705 | rpmI | 931198..931398 (-) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| R4699_RS04710 | infC | 931431..931961 (-) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| R4699_RS04715 | - | 932268..933449 (-) | 1182 | WP_000655951.1 | membrane protein | - |
| R4699_RS04720 | - | 933453..934040 (-) | 588 | WP_000933542.1 | ATP-binding cassette domain-containing protein | - |
| R4699_RS04725 | - | 934072..934260 (-) | 189 | WP_001809102.1 | hypothetical protein | - |
| R4699_RS04730 | comEC/celB | 934439..936679 (-) | 2241 | WP_317786088.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| R4699_RS04735 | comEA/celA/cilE | 936663..937313 (-) | 651 | WP_000387336.1 | ComEA family DNA-binding protein | Machinery gene |
| R4699_RS04740 | - | 937381..937950 (-) | 570 | WP_000443744.1 | GNAT family N-acetyltransferase | - |
| R4699_RS04745 | ald | 938127..939238 (+) | 1112 | Protein_923 | alanine dehydrogenase | - |
| R4699_RS04750 | - | 939539..939865 (-) | 327 | Protein_924 | TfoX/Sxy family protein | - |
| R4699_RS04755 | - | 939868..940368 (-) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| R4699_RS04760 | - | 940561..941529 (-) | 969 | WP_000658183.1 | PhoH family protein | - |
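The coordinates in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and assumes that the Genomic Context window extends 5000 bp on either side of the gene. Both assumptions are inferred from the numbers shown on this page, not from documented behavior, and the function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Gene range widened by `flank` bp on each side (flank size is an assumption)."""
    return max(1, start - flank), end + flank


# comEC/celB (R4699_RS04730), coordinates taken from the Overview table
gene_start, gene_end = 934439, 936679

cds_bp = span_length(gene_start, gene_end)
print(cds_bp)                                # 2241
print(protein_length(cds_bp))                # 746
print(context_window(gene_start, gene_end))  # (929439, 941679)
```

Under these assumptions the results agree with the listed size (2241 bp), protein length (746 a.a.), and Location (929439..941679).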
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84600.12 Da; Isoelectric point: 9.4167
>NTDB_id=895479 R4699_RS04730 WP_317786088.1 934439..936679(-) (comEC/celB) [Streptococcus pneumoniae strain 16H2092]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSAGKVLIICGIFGFWFVFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSNGRAFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKTIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQVTQTKVRSMTVGEYLPIFGSQLEVLSPREIGYGGHEDSLVLYGKL
LNNHFLFIGNLEEKGEKDLLKYYPDLKVNVLKASQHGNKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
Nucleotide
Length: 2241 bp
>NTDB_id=895479 R4699_RS04730 WP_317786088.1 934439..936679(-) (comEC/celB) [Streptococcus pneumoniae strain 16H2092]
ATGTTACAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTATTACTTTGGCTTTATTACGCTATTTT
CTCAGCATCTTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTATCCAATTTCCGTGGAAATCTGCTG
GTAAAGTTCTAATAATTTGCGGAATCTTTGGATTTTGGTTTGTTTTTCAAAATTGGCAACAGAGTCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGATACTATTAAGGTTAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTAACGGTCGTGCTTTCCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAACAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTCTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGTATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTAGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATTACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGGAAAAC
CATTCTCATAGATGTAGGTGGTAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGGTGACTAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGTGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGTACTTGCCCATTTTCG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGGAAATTGGATATGGAGGTCATGAAGATTCCCTGGTTCTTTATGGGAAACTC
TTGAATAATCACTTTCTCTTCATAGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGTATTATCCAGACTTGAA
AGTAAATGTTTTGAAAGCTAGCCAACATGGCAATAAAAAATCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTATCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae R6 | 96.515 | 100 | 0.965 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 96.515 | 100 | 0.965 |
| comEC/celB | Streptococcus pneumoniae D39 | 96.515 | 100 | 0.965 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 96.515 | 100 | 0.965 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 91.678 | 99.866 | 0.916 |
| comEC/celB | Streptococcus mitis SK321 | 91.555 | 100 | 0.916 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.13 | 99.33 | 0.438 |
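The Ha-value column is not defined on this page. The listed values are consistent with the product of the identity and coverage fractions, rounded to three decimals. The sketch below checks that reading against the rows above. The formula and the name `ha_value` are inferred here, not taken from the database's documentation.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (organism, identities %, coverage %, listed Ha-value), copied from the table
rows = [
    ("Streptococcus pneumoniae R6", 96.515, 100.0, 0.965),
    ("Streptococcus pneumoniae Rx1", 96.515, 100.0, 0.965),
    ("Streptococcus pneumoniae D39", 96.515, 100.0, 0.965),
    ("Streptococcus pneumoniae TIGR4", 96.515, 100.0, 0.965),
    ("Streptococcus mitis NCTC 12261", 91.678, 99.866, 0.916),
    ("Streptococcus mitis SK321", 91.555, 100.0, 0.916),
    ("Lactococcus lactis subsp. cremoris KW2", 44.13, 99.33, 0.438),
]

for organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    # Listed values carry three decimals, so allow half a unit in the last place.
    agrees = abs(computed - listed) <= 0.00051
    print(f"{organism}: computed {computed:.3f}, listed {listed:.3f}, agrees={agrees}")
```

If the database defines the Ha-value differently, the agreement here may be coincidental. Treat the formula as a reading aid rather than a specification.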