Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | U0449_RS00810 | Genome accession | NZ_CP139862 |
| Coordinates | 162983..165223 (-) | Length | 746 a.a. |
| NCBI ID | WP_050210311.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain 05H0020-2 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
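The coordinates and the protein length in the overview can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the coding region includes the stop codon; `parse_coordinates` is a hypothetical helper written for this illustration, not part of any database tooling.

```python
import re

def parse_coordinates(text):
    """Parse a string like '162983..165223 (-)' into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)

start, end, strand = parse_coordinates("162983..165223 (-)")

# Inclusive coordinates: both end points belong to the gene.
gene_length_bp = end - start + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 2241 746
```

Under these assumptions the 2241 bp span gives 746 residues, which agrees with the length listed above.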
Genomic Context
Location: 157983..170223
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| U0449_RS00775 | - | 158893..159273 (-) | 381 | WP_050219036.1 | VOC family protein | - |
| U0449_RS00780 | rplT | 159331..159690 (-) | 360 | WP_000124836.1 | 50S ribosomal protein L20 | - |
| U0449_RS00785 | rpmI | 159742..159942 (-) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| U0449_RS00790 | infC | 159975..160505 (-) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| U0449_RS00795 | - | 160812..161993 (-) | 1182 | WP_000655935.1 | membrane protein | - |
| U0449_RS00800 | - | 161997..162584 (-) | 588 | WP_050210313.1 | ATP-binding cassette domain-containing protein | - |
| U0449_RS00805 | - | 162616..162804 (-) | 189 | WP_050210312.1 | hypothetical protein | - |
| U0449_RS00810 | comEC/celB | 162983..165223 (-) | 2241 | WP_050210311.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| U0449_RS00815 | comEA/celA/cilE | 165207..165857 (-) | 651 | WP_050210310.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| U0449_RS00820 | - | 165925..166350 (-) | 426 | Protein_164 | GNAT family N-acetyltransferase | - |
| U0449_RS00825 | ald | 166500..167611 (+) | 1112 | Protein_165 | alanine dehydrogenase | - |
| U0449_RS00830 | - | 167912..168238 (-) | 327 | Protein_166 | TfoX/Sxy family protein | - |
| U0449_RS00835 | - | 168241..168741 (-) | 501 | WP_000566988.1 | GNAT family N-acetyltransferase | - |
| U0449_RS00840 | - | 168921..169889 (-) | 969 | WP_138702972.1 | PhoH family protein | - |
| U0449_RS00845 | - | 169974..170189 (-) | 216 | WP_004250177.1 | YozE family protein | - |
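The numbers in this table suggest two relationships that are easy to check. The listed location (157983..170223) matches the comEC/celB coordinates extended by 5,000 bp on each side, and comEC/celB overlaps the upstream comEA/celA/cilE gene by a few bases. The 5,000 bp flank is an inference from the coordinates, not something the page states, and both function names below are hypothetical helpers.

```python
def flanking_window(start, end, flank=5000):
    """Extend an inclusive interval by `flank` bp on each side (assumed flank size)."""
    return max(1, start - flank), end + flank

def overlap_bp(a, b):
    """Number of positions shared by two inclusive (start, end) intervals."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)

comec = (162983, 165223)   # U0449_RS00810
comea = (165207, 165857)   # U0449_RS00815

window = flanking_window(*comec)
shared = overlap_bp(comec, comea)

print(window)  # (157983, 170223)
print(shared)  # 17
```

The 17 bp overlap places the two machinery genes directly adjacent on the minus strand.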
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84606.14 Da; Isoelectric point: 9.2732
>NTDB_id=911456 U0449_RS00810 WP_050210311.1 162983..165223(-) (comEC/celB) [Streptococcus pneumoniae strain 05H0020-2]
MLQWIKNIPIPLIYLSFLLLWFYYAIFSASYLALLGFVFLLVCLFFQFPWKSAGKVLVICGVFGFWFLFQNWQQSQASQN
LADSVERVRILPDTVKVNGDSLSFRGKADGRIFQVYYKLQSEEEKEAFQALTDLHEIGLEGKLSEPEGQRNFGGFNYQAY
LKTQGIYQTLNIKKIQSLQKIGSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMNGFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFIV
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVASESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLTFLPLLSI
LFVLSFLYPVIQLNFIFEWLEGIIRLVSQVTSRPLVFGQPNAWFLILLLISLALVYDLRKNIKKLTVLCLLITGLFLLTK
HPLENEITMLDVGQGESIFLWDITGKTILIDVGGKAESYKKIEKWQEKMTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLSEVTKAFHVGEILVSKDSLKQKEFVAELQATQTKVRSMTVGENLPIFGSQLEVLSPRKMGDGGHDDTLVLYGKF
LDKQFLFTGNLEEKGEKDLLKQYPYLEVDVLKAAQHGSKKSSSPAFLEKLKPELTLISVGKSNRMKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGLDSWKIESVR
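The FASTA header used for this record packs several fields into one line. A minimal sketch of pulling them apart with a regular expression follows; the field names (`ntdb_id`, `locus_tag`, and so on) are labels chosen for this example, and the pattern is only assumed to fit headers shaped like the one shown here.

```python
import re

HEADER_PATTERN = re.compile(
    r">NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) "
    r"\[(?P<organism>[^\]]*)\]"
)

def parse_header(line):
    """Split a header of the form shown above into a dict of fields."""
    match = HEADER_PATTERN.fullmatch(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    # Coordinates are more useful as integers than as strings.
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields

header = (
    ">NTDB_id=911456 U0449_RS00810 WP_050210311.1 162983..165223(-) "
    "(comEC/celB) [Streptococcus pneumoniae strain 05H0020-2]"
)
record = parse_header(header)
print(record["locus_tag"], record["gene"], record["strand"])
```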
Nucleotide
Length: 2241 bp
>NTDB_id=911456 U0449_RS00810 WP_050210311.1 162983..165223(-) (comEC/celB) [Streptococcus pneumoniae strain 05H0020-2]
ATGTTACAGTGGATTAAGAACATCCCCATTCCCCTAATTTACCTGAGTTTTTTGTTACTTTGGTTTTATTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGCTTTGTTTTTCTGCTAGTCTGTCTCTTTTTCCAATTTCCATGGAAATCAGCAG
GCAAAGTTCTAGTGATTTGTGGAGTCTTTGGCTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGCCAAGCGAGTCAAAAT
CTGGCGGATTCTGTTGAAAGGGTACGGATTCTGCCTGACACTGTTAAGGTCAATGGTGATAGTCTGTCCTTTCGCGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAGCCTTTCAAGCTTTAACCGACC
TGCATGAGATAGGACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGCAGAGAAATTTTGGTGGCTTTAATTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAAGATTGGCAGTTGGGATATAGG
AGAAAACTTGTCCAGTTTACGTCGAAAGGCTGTGGTTTGGATTAAGACGCACTTTCCAGACCCTATGCGCAATTACATGA
CAGGACTTTTGCTGGGACATCTGGACACCGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCCTATCTGGCATGCAGGTAGGTTTTTTCATGAATGGATTTAAGAAACTTCTCTTGCGATTGGGCTTGACCCAAGA
AAAGTTGAAATGGCTAACTTATCCCTTTTCCCTTATCTATGCGGGACTAACTGGATTTTCAGCATCGGTTATTCGCAGTC
TCTTGCAAAAGCTACTGGCTCAACATGGGGTTAAGGGCTTGGATAATTTTGCCTTGACGGTGCTTGTCCTCTTTATTGTC
ATGCCAAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCCTGACCATGACCAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCTAGTGAAAGTCTAGTCATCTCCTTGGGCATATTGCCCATTCTATCCTTCTATTTTGCGG
AATTTCAACCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTGACCTTCTTACCGCTCTTGTCTATT
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAATGGTTGGAGGGCATTATTCGCTTGGT
GTCACAGGTGACAAGTAGACCTCTGGTCTTTGGACAACCCAATGCATGGTTTTTAATCCTATTGTTAATTTCCTTGGCTT
TGGTCTATGATTTGAGAAAAAACATTAAAAAGCTAACGGTATTGTGCTTATTGATTACAGGGCTCTTTCTCCTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGAGAAAGTATTTTCCTATGGGATATAACTGGGAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTTATAAGAAAATCGAAAAATGGCAAGAAAAGATGACGACCAGCAATG
CCCAGCGAACCTTGATTCCCTATCTCAAAAGTCGAGGAGTAGCTAAGATTGACCAGCTAATTTTGACTAACACGGACAAG
GAGCATGTTGGAGATTTGTCAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTCTAGTATCAAAAGACAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACAAAGGTGCGTAGTATGACAGTAGGGGAGAACTTGCCCATTTTTG
GAAGTCAGTTAGAAGTTCTATCTCCAAGGAAAATGGGAGATGGAGGACACGATGATACCCTAGTTCTGTATGGGAAATTC
TTGGATAAGCAATTTCTCTTCACGGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCAATATCCTTACTTAGA
GGTGGATGTATTGAAAGCTGCCCAACATGGCTCTAAAAAGTCATCAAGTCCAGCCTTTCTAGAAAAACTCAAACCAGAGC
TTACTCTTATCTCAGTTGGAAAGAGCAATCGAATGAAACTCCCCCATCAGGAAACATTGACACGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGTTTTAAGGGGTTGGATAGTTGGAAAATCGAAAGTGTTCGATA
G
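The nucleotide and protein sequences can be checked against each other by translating the coding sequence. The sketch below builds the standard codon table from the usual TCAG ordering; it assumes the sequence is given 5' to 3' in the coding orientation (as it is here, starting with ATG) and that the standard codon assignments apply. `translate` is a hypothetical helper, and only the first and last few codons of the gene are used as input.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its amino acid; '*' marks a stop codon.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # 'X' for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

first_codons = "ATGTTACAGTGGATTAAGAACATCCCCATT"  # start of the gene
last_codons = "AGTGTTCGATAG"                      # end of the gene, including TAG

print(translate(first_codons))  # MLQWIKNIPI
print(translate(last_codons))   # SVR
```

Both fragments match the corresponding ends of the protein sequence listed above.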
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus pneumoniae Rx1 | 96.917 | 100 | 0.969 |
| comEC/celB | Streptococcus pneumoniae D39 | 96.917 | 100 | 0.969 |
| comEC/celB | Streptococcus pneumoniae R6 | 96.917 | 100 | 0.969 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 95.576 | 100 | 0.956 |
| comEC/celB | Streptococcus mitis SK321 | 92.091 | 100 | 0.921 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 91.409 | 99.866 | 0.913 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.355 | 99.732 | 0.442 |
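The page does not define the Ha-value column. The listed values are consistent with the product of the identity and coverage fractions rounded to three decimals, so the sketch below assumes that formula; `ha_value` is a hypothetical name and the definition should be confirmed against the database documentation.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: identity fraction times coverage fraction."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)

# (organism, identities %, coverage %) taken from the table above.
hits = [
    ("Streptococcus pneumoniae Rx1", 96.917, 100),
    ("Streptococcus pneumoniae TIGR4", 95.576, 100),
    ("Streptococcus mitis NCTC 12261", 91.409, 99.866),
    ("Lactococcus lactis subsp. cremoris KW2", 44.355, 99.732),
]

scores = {organism: ha_value(identity, coverage) for organism, identity, coverage in hits}
for organism, score in scores.items():
    print(f"{organism}: {score}")
```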