Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | UKS_RS04375 | Genome accession | NZ_AP021887 |
| Coordinates | 847576..849816 (+) | Length | 746 a.a. |
| NCBI ID | WP_156011949.1 | Uniprot ID | - |
| Organism | Streptococcus sp. 116-D4 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 842576..854816
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| UKS_RS04345 (UKS_08110) | - | 843079..843636 (+) | 558 | WP_049495610.1 | GrpB family protein | - |
| UKS_RS04350 (UKS_08120) | - | 843682..843897 (+) | 216 | WP_001232084.1 | YozE family protein | - |
| UKS_RS04355 (UKS_08130) | - | 843982..844956 (+) | 975 | WP_156011945.1 | PhoH family protein | - |
| UKS_RS04360 (UKS_08140) | ald | 845020..846132 (-) | 1113 | WP_156011946.1 | alanine dehydrogenase | - |
| UKS_RS04365 (UKS_08150) | - | 846306..846875 (+) | 570 | WP_156011947.1 | GNAT family N-acetyltransferase | - |
| UKS_RS04370 (UKS_08160) | comEA/celA/cilE | 846942..847592 (+) | 651 | WP_156011948.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| UKS_RS04375 (UKS_08170) | comEC/celB | 847576..849816 (+) | 2241 | WP_156011949.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| UKS_RS04380 (UKS_08180) | - | 849947..850195 (+) | 249 | WP_332066893.1 | hypothetical protein | - |
| UKS_RS04385 (UKS_08190) | - | 850228..850815 (+) | 588 | WP_156011951.1 | ABC transporter ATP-binding protein | - |
| UKS_RS04390 (UKS_08200) | - | 850819..852003 (+) | 1185 | WP_156011952.1 | hypothetical protein | - |
| UKS_RS04395 (UKS_08210) | - | 852103..853359 (-) | 1257 | WP_173020495.1 | ISL3 family transposase | - |
| UKS_RS04400 (UKS_08220) | infC | 853732..854262 (+) | 531 | WP_000848184.1 | translation initiation factor IF-3 | - |
| UKS_RS04405 (UKS_08230) | rpmI | 854295..854495 (+) | 201 | WP_049496086.1 | 50S ribosomal protein L35 | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84507.81 Da Isoelectric Point: 9.3783
>NTDB_id=75354 UKS_RS04375 WP_156011949.1 847576..849816(+) (comEC/celB) [Streptococcus sp. 116-D4]
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSTSKVLAICGIFGFWFLFQTWQQTQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSDGRIFQVYYKLQSEEEKETFQALTALHDLELEGKLSEPEGRRNFGGFDYQSY
LKTQGIYQTLNIKRIQSLQKAGSWDIGENLSSLRRKAVVWIKTKFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWMTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGFKGLDNFALTVLVLFIA
MPNFFLTAGGILSCAYAFILTMTSKEGEGLKAVTRESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNVIFEWLEGIIRLVSQVASRPLVFGQPTTWLLILLLVSLALLYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKVESDKKIEKWQEKATTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKEFVAELEASQTKVRSVTAGENLPIFGSQLEVLSPGKIGEVGSNDSLVLYGKL
LDKHFLFTENLEEKGEKDLLKQYPDLEVDVLKAGQHGSKKSSSSAFLEQLKPEITLISVGKSNRTKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGWKSWKIESIR
MLQWIKNFSIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLVCLFIQFPWKSTSKVLAICGIFGFWFLFQTWQQTQASQN
LADSVERVRILPDTIKVNGDSLSFRGKSDGRIFQVYYKLQSEEEKETFQALTALHDLELEGKLSEPEGRRNFGGFDYQSY
LKTQGIYQTLNIKRIQSLQKAGSWDIGENLSSLRRKAVVWIKTKFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLLLRLGLTQEKLKWMTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGFKGLDNFALTVLVLFIA
MPNFFLTAGGILSCAYAFILTMTSKEGEGLKAVTRESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFVLSFLYPVIQLNVIFEWLEGIIRLVSQVASRPLVFGQPTTWLLILLLVSLALLYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKVESDKKIEKWQEKATTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKEFVAELEASQTKVRSVTAGENLPIFGSQLEVLSPGKIGEVGSNDSLVLYGKL
LDKHFLFTENLEEKGEKDLLKQYPDLEVDVLKAGQHGSKKSSSSAFLEQLKPEITLISVGKSNRTKLPHQETLTRLEGIN
SKVYRTDQQGAIRFKGWKSWKIESIR
Nucleotide
Download Length: 2241 bp
>NTDB_id=75354 UKS_RS04375 WP_156011949.1 847576..849816(+) (comEC/celB) [Streptococcus sp. 116-D4]
ATGTTGCAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCCATTTT
TTCAGCATCCTATCTTGCTTTATTGGGCTTTGTTTTTCTGCTAGTTTGTCTCTTTATTCAATTTCCTTGGAAATCTACTA
GCAAAGTTCTAGCAATTTGTGGAATCTTTGGATTTTGGTTTCTATTTCAAACTTGGCAGCAGACACAAGCTAGTCAGAAC
CTAGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGACACTATTAAGGTCAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAACCTTTCAAGCCTTAACAGCTC
TTCATGATTTGGAACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGAGGAGAAATTTTGGTGGCTTTGACTACCAATCCTAT
CTGAAAACTCAGGGAATTTACCAGACTCTCAATATCAAAAGAATCCAGTCGCTTCAAAAGGCTGGCAGTTGGGATATAGG
TGAAAACCTATCCAGTTTACGTCGAAAGGCTGTAGTTTGGATTAAGACAAAGTTTCCAGATCCTATGCGCAATTACATGA
CGGGGCTTCTATTAGGACATCTCGACACCGACTTCGAGGAAATGAATGAACTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCTTGTCGGGTATGCAGGTAGGCTTTTTCATGGATGGCTTTAAGAAACTTCTTTTGCGACTGGGGTTGACTCAAGA
AAAGTTGAAATGGATGACTTATCCCTTTTCTCTTATTTATGCAGGTCTGACAGGATTTTCAGCTTCGGTCATTCGCAGTC
TCTTGCAAAAGTTACTGGCTCAACATGGTTTTAAGGGCTTGGATAATTTTGCCTTGACAGTCCTTGTCCTCTTTATCGCC
ATGCCCAACTTTTTCCTGACGGCGGGAGGTATTTTGTCTTGTGCCTACGCTTTTATCTTGACCATGACCAGCAAAGAAGG
AGAGGGGCTTAAGGCTGTGACCAGAGAAAGTCTGGTTATTTCTTTGGGCATATTACCCATCCTATCCTTCTATTTTGCAG
AATTTCAACCTTGGTCCATCCTCTTGACCTTTGTCTTTTCCTTTCTATTTGACTTAGTCTTCTTACCGCTCTTGTCCATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAACTGAATGTTATCTTTGAATGGTTGGAAGGCATTATTCGCTTGGT
ATCACAGGTGGCAAGTAGACCCCTGGTCTTTGGTCAACCCACCACATGGCTTTTGATTCTTCTCTTAGTTTCATTAGCCT
TGCTCTATGATATGAGAAAAAATATCAAAAGACTAGCAGGATTTAGTCTCTTTATCGTGGGGCTCTTTTTCTTGACCAAG
CATCCACTGGAAAATGAAATTACCATGCTGGATGTGGGGCAAGGCGAAAGTATTTTCCTAAGGGATGTAACTGGTAAGAC
CATTCTCATAGATGTCGGTGGCAAGGTAGAATCTGATAAGAAAATCGAAAAATGGCAAGAAAAAGCGACAACCAGTAATG
CGCAGAGAACCTTGATTCCCTATCTTAAAAGTCGCGGAGTAGCCAAGATTGACCAGCTAATTTTGACCAACACAGACAAG
GAACATGTTGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTGGTATCAAAAGGAAGTTTGAAACA
GAAGGAATTTGTGGCGGAACTAGAAGCAAGCCAAACCAAGGTACGCAGTGTGACAGCAGGGGAGAATTTACCGATTTTTG
GCAGTCAGTTAGAAGTCCTATCTCCAGGGAAGATTGGAGAAGTTGGTTCCAATGATTCCTTGGTTCTTTATGGGAAACTC
TTGGATAAGCACTTTCTCTTCACGGAAAATTTGGAGGAGAAAGGAGAGAAGGATCTTCTAAAGCAATATCCTGACCTAGA
GGTGGATGTTTTGAAAGCTGGCCAACATGGCTCTAAAAAATCATCAAGTTCGGCCTTTTTAGAACAGCTTAAACCGGAGA
TCACTCTCATCTCAGTTGGAAAGAGCAATCGAACGAAACTCCCCCATCAGGAAACCCTGACCCGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGATTTAAAGGTTGGAAGAGTTGGAAGATCGAAAGTATTCGATA
G
ATGTTGCAGTGGATTAAGAATTTCTCTATTCCCCTAATTTACCTGAGTTTTCTGTTACTTTGGCTTTACTACGCCATTTT
TTCAGCATCCTATCTTGCTTTATTGGGCTTTGTTTTTCTGCTAGTTTGTCTCTTTATTCAATTTCCTTGGAAATCTACTA
GCAAAGTTCTAGCAATTTGTGGAATCTTTGGATTTTGGTTTCTATTTCAAACTTGGCAGCAGACACAAGCTAGTCAGAAC
CTAGCGGATTCTGTTGAAAGGGTACGGATTTTGCCTGACACTATTAAGGTCAATGGTGATAGTCTATCCTTTCGTGGCAA
GTCTGATGGACGCATTTTTCAAGTCTATTATAAACTCCAGTCCGAGGAGGAGAAAGAAACCTTTCAAGCCTTAACAGCTC
TTCATGATTTGGAACTAGAAGGGAAGCTTTCGGAGCCAGAAGGGAGGAGAAATTTTGGTGGCTTTGACTACCAATCCTAT
CTGAAAACTCAGGGAATTTACCAGACTCTCAATATCAAAAGAATCCAGTCGCTTCAAAAGGCTGGCAGTTGGGATATAGG
TGAAAACCTATCCAGTTTACGTCGAAAGGCTGTAGTTTGGATTAAGACAAAGTTTCCAGATCCTATGCGCAATTACATGA
CGGGGCTTCTATTAGGACATCTCGACACCGACTTCGAGGAAATGAATGAACTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCTTGTCGGGTATGCAGGTAGGCTTTTTCATGGATGGCTTTAAGAAACTTCTTTTGCGACTGGGGTTGACTCAAGA
AAAGTTGAAATGGATGACTTATCCCTTTTCTCTTATTTATGCAGGTCTGACAGGATTTTCAGCTTCGGTCATTCGCAGTC
TCTTGCAAAAGTTACTGGCTCAACATGGTTTTAAGGGCTTGGATAATTTTGCCTTGACAGTCCTTGTCCTCTTTATCGCC
ATGCCCAACTTTTTCCTGACGGCGGGAGGTATTTTGTCTTGTGCCTACGCTTTTATCTTGACCATGACCAGCAAAGAAGG
AGAGGGGCTTAAGGCTGTGACCAGAGAAAGTCTGGTTATTTCTTTGGGCATATTACCCATCCTATCCTTCTATTTTGCAG
AATTTCAACCTTGGTCCATCCTCTTGACCTTTGTCTTTTCCTTTCTATTTGACTTAGTCTTCTTACCGCTCTTGTCCATC
TTATTTGTCCTTTCCTTTCTCTATCCAGTCATTCAACTGAATGTTATCTTTGAATGGTTGGAAGGCATTATTCGCTTGGT
ATCACAGGTGGCAAGTAGACCCCTGGTCTTTGGTCAACCCACCACATGGCTTTTGATTCTTCTCTTAGTTTCATTAGCCT
TGCTCTATGATATGAGAAAAAATATCAAAAGACTAGCAGGATTTAGTCTCTTTATCGTGGGGCTCTTTTTCTTGACCAAG
CATCCACTGGAAAATGAAATTACCATGCTGGATGTGGGGCAAGGCGAAAGTATTTTCCTAAGGGATGTAACTGGTAAGAC
CATTCTCATAGATGTCGGTGGCAAGGTAGAATCTGATAAGAAAATCGAAAAATGGCAAGAAAAAGCGACAACCAGTAATG
CGCAGAGAACCTTGATTCCCTATCTTAAAAGTCGCGGAGTAGCCAAGATTGACCAGCTAATTTTGACCAACACAGACAAG
GAACATGTTGGAGATTTGTTAGAGGTGACCAAGGCTTTCCATGTAGGGGAGATTTTGGTATCAAAAGGAAGTTTGAAACA
GAAGGAATTTGTGGCGGAACTAGAAGCAAGCCAAACCAAGGTACGCAGTGTGACAGCAGGGGAGAATTTACCGATTTTTG
GCAGTCAGTTAGAAGTCCTATCTCCAGGGAAGATTGGAGAAGTTGGTTCCAATGATTCCTTGGTTCTTTATGGGAAACTC
TTGGATAAGCACTTTCTCTTCACGGAAAATTTGGAGGAGAAAGGAGAGAAGGATCTTCTAAAGCAATATCCTGACCTAGA
GGTGGATGTTTTGAAAGCTGGCCAACATGGCTCTAAAAAATCATCAAGTTCGGCCTTTTTAGAACAGCTTAAACCGGAGA
TCACTCTCATCTCAGTTGGAAAGAGCAATCGAACGAAACTCCCCCATCAGGAAACCCTGACCCGACTGGAAGGTATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGAGCTATACGATTTAAAGGTTGGAAGAGTTGGAAGATCGAAAGTATTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
92.091 |
100 |
0.921 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
90.201 |
99.866 |
0.901 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
90.08 |
100 |
0.901 |
| comEC/celB | Streptococcus pneumoniae D39 |
89.544 |
100 |
0.895 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
89.544 |
100 |
0.895 |
| comEC/celB | Streptococcus pneumoniae R6 |
89.544 |
100 |
0.895 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.669 |
99.33 |
0.444 |