Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | DG474_RS06015 | Genome accession | NZ_CP029257 |
| Coordinates | 1222228..1224468 (-) | Length | 746 a.a. |
| NCBI ID | WP_200371573.1 | Uniprot ID | - |
| Organism | Streptococcus oralis strain CCUG 53468 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1217228..1229468
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DG474_RS05995 (DG474_06105) | - | 1218996..1220660 (-) | 1665 | WP_255777667.1 | molecular chaperone HscC | - |
| DG474_RS06000 (DG474_06110) | rplT | 1220834..1221193 (-) | 360 | WP_000124830.1 | 50S ribosomal protein L20 | - |
| DG474_RS06005 (DG474_06115) | rpmI | 1221245..1221445 (-) | 201 | WP_001125942.1 | 50S ribosomal protein L35 | - |
| DG474_RS06010 (DG474_06120) | infC | 1221478..1222008 (-) | 531 | WP_000848184.1 | translation initiation factor IF-3 | - |
| DG474_RS06015 (DG474_06125) | comEC/celB | 1222228..1224468 (-) | 2241 | WP_200371573.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| DG474_RS06020 (DG474_06130) | comEA/celA/cilE | 1224452..1225102 (-) | 651 | WP_125397719.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| DG474_RS06025 (DG474_06135) | - | 1225169..1225738 (-) | 570 | WP_255777670.1 | GNAT family N-acetyltransferase | - |
| DG474_RS06030 (DG474_06145) | ald | 1225914..1227026 (+) | 1113 | WP_255777671.1 | alanine dehydrogenase | - |
| DG474_RS06035 (DG474_06150) | - | 1227077..1228063 (-) | 987 | WP_084921953.1 | PhoH family protein | - |
| DG474_RS06040 (DG474_06155) | - | 1228144..1228359 (-) | 216 | WP_001232084.1 | YozE family protein | - |
| DG474_RS06045 (DG474_06160) | - | 1228356..1228961 (-) | 606 | WP_255777672.1 | GrpB family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84789.07 Da Isoelectric Point: 9.2887
>NTDB_id=291423 DG474_RS06015 WP_200371573.1 1222228..1224468(-) (comEC/celB) [Streptococcus oralis strain CCUG 53468]
MSQRIKNFPIPLIYLSFLLLWLYFVILGASYLALLGFVFLLVCLFFQFPWKSAGKVLAICGVFGIWFLFQNWQQTQASQN
LVDSVERVRILPDTIKVNGDSLSFRGKAEGRTFQVYYKLQSEEEKELFQALTDLHEIELEGKLSEPEGQRNFGGFDYRAY
LKTQGIYQTLTIKSIQSLKQVSSWDIGENLSALRRKAVVWIKAHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMEAFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTILVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGDGIKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDVVFLPLLSI
LFILSFIYSVTQFNFVFEWLENIIRLVSQLASRPLVLGQPTTWLLILLLVSLALLYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESDKKIEAWQEKSTTSNAQRTLIPYLKSRGVDKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLTQKEFVAELEASQNKVRSVTAGEYFPIFGSYLEVLSPRQIRDGNRDDSLVLYGKL
LDKHFFFTGNLKEKGEKDLMKHYPDLEVDVLKAGQHGAKTSSNPVFLEQLKPAITLISVGKSNRTKLPHQETLTRMESIK
SKIYRTDQQGAIRFKGWNSWRIETVR
MSQRIKNFPIPLIYLSFLLLWLYFVILGASYLALLGFVFLLVCLFFQFPWKSAGKVLAICGVFGIWFLFQNWQQTQASQN
LVDSVERVRILPDTIKVNGDSLSFRGKAEGRTFQVYYKLQSEEEKELFQALTDLHEIELEGKLSEPEGQRNFGGFDYRAY
LKTQGIYQTLTIKSIQSLKQVSSWDIGENLSALRRKAVVWIKAHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMEAFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTILVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGDGIKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDVVFLPLLSI
LFILSFIYSVTQFNFVFEWLENIIRLVSQLASRPLVLGQPTTWLLILLLVSLALLYDMRKNIKRLAGFSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESDKKIEAWQEKSTTSNAQRTLIPYLKSRGVDKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLTQKEFVAELEASQNKVRSVTAGEYFPIFGSYLEVLSPRQIRDGNRDDSLVLYGKL
LDKHFFFTGNLKEKGEKDLMKHYPDLEVDVLKAGQHGAKTSSNPVFLEQLKPAITLISVGKSNRTKLPHQETLTRMESIK
SKIYRTDQQGAIRFKGWNSWRIETVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=291423 DG474_RS06015 WP_200371573.1 1222228..1224468(-) (comEC/celB) [Streptococcus oralis strain CCUG 53468]
ATGTCACAGCGGATTAAGAATTTCCCTATCCCCTTAATCTATCTGAGTTTTCTGTTACTCTGGCTTTACTTTGTCATTTT
AGGAGCGTCCTATCTCGCACTGCTAGGTTTTGTTTTTTTGCTGGTCTGCCTCTTTTTCCAATTTCCTTGGAAATCGGCTG
GTAAAGTTCTAGCGATTTGTGGAGTTTTTGGAATTTGGTTTTTGTTTCAAAACTGGCAACAGACACAAGCTAGTCAGAAC
CTAGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGATACCATCAAAGTCAATGGAGACAGTCTGTCCTTTCGGGGCAA
GGCTGAGGGCCGCACCTTCCAAGTTTATTATAAACTCCAGTCCGAGGAAGAGAAAGAGCTCTTTCAGGCCTTAACAGACC
TTCACGAGATAGAGCTAGAAGGAAAACTTTCTGAGCCTGAAGGGCAGAGAAATTTTGGTGGATTTGACTACCGAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACACTGACTATCAAGAGCATCCAGTCACTTAAACAGGTTAGCAGTTGGGATATAGG
TGAAAATTTGTCGGCTTTACGTCGAAAGGCTGTAGTTTGGATCAAGGCGCACTTTCCAGACCCTATGCGCAACTATATGA
CGGGGCTCTTGCTGGGACATTTGGACACGGACTTTGAGGAGATGAATGAACTCTATTCCAGTCTTGGAATTATCCATCTC
TTTGCCTTGTCGGGTATGCAGGTGGGCTTCTTTATGGAGGCCTTTAAGAAACTCCTCTTGCGATTGGGCTTGACTCAAGA
GAAGTTGAAGTGGCTAACTTATCCATTTTCTCTTATCTATGCAGGTCTGACAGGATTTTCAGCATCTGTCATTCGCAGTC
TCTTACAAAAGTTACTGGCCCAACATGGTGTTAAAGGTTTGGATAATTTTGCCTTGACGATCCTTGTCCTCTTTATCATC
ATGCCCAACTTTTTCTTGACAGCTGGAGGGGTCTTGTCCTGCGCTTATGCTTTTATCTTGACTATGACTAGCAAAGAAGG
CGATGGGATCAAGGCTGTTGCCAGAGAAAGTCTGGTCATTTCTTTGGGAATATTGCCTATTCTATCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTGTTTGATGTGGTCTTCTTGCCGCTTTTGTCCATC
TTATTCATTCTGTCTTTTATTTACTCAGTCACTCAGTTTAACTTTGTTTTTGAGTGGTTGGAGAACATCATTCGCTTGGT
ATCGCAGCTGGCAAGTAGGCCCCTGGTCCTTGGTCAACCCACCACATGGCTTTTGATTCTCCTCTTAGTTTCATTAGCCT
TGCTATATGATATGAGGAAAAACATTAAAAGACTAGCAGGATTCAGTCTCTTTATTGTGGGGCTCTTTTTCTTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGTGAAAGTATTTTCCTACGGGATGTGACTGGGAAAAC
TATTCTCATAGATGTAGGTGGTAAGGCAGAATCTGATAAGAAAATCGAGGCTTGGCAAGAAAAGTCGACGACCAGCAATG
CCCAGCGTACCTTGATTCCCTATCTAAAAAGTCGAGGAGTAGACAAGATTGACCAGCTGATTTTGACCAATACAGACAAG
GAGCATGTTGGCGATTTGCTGGAGGTGACCAAGGCTTTCCATGTGGGGGAGATTTTAGTATCAAAAGGAAGTCTGACACA
GAAGGAATTTGTAGCGGAACTAGAAGCAAGTCAAAACAAGGTACGTAGTGTGACAGCTGGGGAGTATTTCCCGATTTTTG
GCAGTTACTTAGAAGTTCTATCTCCAAGGCAGATTCGAGATGGGAATCGTGATGATTCTCTGGTTCTTTATGGAAAACTT
TTGGATAAGCACTTTTTCTTTACTGGAAATTTGAAAGAGAAGGGAGAGAAGGACTTGATGAAGCATTATCCTGACCTAGA
GGTGGATGTCTTGAAAGCAGGCCAACATGGTGCTAAAACCTCATCAAATCCAGTTTTCCTAGAACAGCTCAAACCAGCGA
TTACTCTCATTTCAGTCGGAAAGAGCAATCGTACGAAACTCCCCCATCAGGAAACCTTGACACGAATGGAAAGTATCAAG
AGTAAGATTTACCGAACTGACCAGCAAGGGGCTATCCGCTTTAAAGGATGGAATAGTTGGAGAATTGAAACGGTTCGATA
A
ATGTCACAGCGGATTAAGAATTTCCCTATCCCCTTAATCTATCTGAGTTTTCTGTTACTCTGGCTTTACTTTGTCATTTT
AGGAGCGTCCTATCTCGCACTGCTAGGTTTTGTTTTTTTGCTGGTCTGCCTCTTTTTCCAATTTCCTTGGAAATCGGCTG
GTAAAGTTCTAGCGATTTGTGGAGTTTTTGGAATTTGGTTTTTGTTTCAAAACTGGCAACAGACACAAGCTAGTCAGAAC
CTAGTGGATTCTGTTGAAAGGGTACGGATTTTACCAGATACCATCAAAGTCAATGGAGACAGTCTGTCCTTTCGGGGCAA
GGCTGAGGGCCGCACCTTCCAAGTTTATTATAAACTCCAGTCCGAGGAAGAGAAAGAGCTCTTTCAGGCCTTAACAGACC
TTCACGAGATAGAGCTAGAAGGAAAACTTTCTGAGCCTGAAGGGCAGAGAAATTTTGGTGGATTTGACTACCGAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACACTGACTATCAAGAGCATCCAGTCACTTAAACAGGTTAGCAGTTGGGATATAGG
TGAAAATTTGTCGGCTTTACGTCGAAAGGCTGTAGTTTGGATCAAGGCGCACTTTCCAGACCCTATGCGCAACTATATGA
CGGGGCTCTTGCTGGGACATTTGGACACGGACTTTGAGGAGATGAATGAACTCTATTCCAGTCTTGGAATTATCCATCTC
TTTGCCTTGTCGGGTATGCAGGTGGGCTTCTTTATGGAGGCCTTTAAGAAACTCCTCTTGCGATTGGGCTTGACTCAAGA
GAAGTTGAAGTGGCTAACTTATCCATTTTCTCTTATCTATGCAGGTCTGACAGGATTTTCAGCATCTGTCATTCGCAGTC
TCTTACAAAAGTTACTGGCCCAACATGGTGTTAAAGGTTTGGATAATTTTGCCTTGACGATCCTTGTCCTCTTTATCATC
ATGCCCAACTTTTTCTTGACAGCTGGAGGGGTCTTGTCCTGCGCTTATGCTTTTATCTTGACTATGACTAGCAAAGAAGG
CGATGGGATCAAGGCTGTTGCCAGAGAAAGTCTGGTCATTTCTTTGGGAATATTGCCTATTCTATCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTGTTTGATGTGGTCTTCTTGCCGCTTTTGTCCATC
TTATTCATTCTGTCTTTTATTTACTCAGTCACTCAGTTTAACTTTGTTTTTGAGTGGTTGGAGAACATCATTCGCTTGGT
ATCGCAGCTGGCAAGTAGGCCCCTGGTCCTTGGTCAACCCACCACATGGCTTTTGATTCTCCTCTTAGTTTCATTAGCCT
TGCTATATGATATGAGGAAAAACATTAAAAGACTAGCAGGATTCAGTCTCTTTATTGTGGGGCTCTTTTTCTTGACCAAG
CATCCACTGGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGTGAAAGTATTTTCCTACGGGATGTGACTGGGAAAAC
TATTCTCATAGATGTAGGTGGTAAGGCAGAATCTGATAAGAAAATCGAGGCTTGGCAAGAAAAGTCGACGACCAGCAATG
CCCAGCGTACCTTGATTCCCTATCTAAAAAGTCGAGGAGTAGACAAGATTGACCAGCTGATTTTGACCAATACAGACAAG
GAGCATGTTGGCGATTTGCTGGAGGTGACCAAGGCTTTCCATGTGGGGGAGATTTTAGTATCAAAAGGAAGTCTGACACA
GAAGGAATTTGTAGCGGAACTAGAAGCAAGTCAAAACAAGGTACGTAGTGTGACAGCTGGGGAGTATTTCCCGATTTTTG
GCAGTTACTTAGAAGTTCTATCTCCAAGGCAGATTCGAGATGGGAATCGTGATGATTCTCTGGTTCTTTATGGAAAACTT
TTGGATAAGCACTTTTTCTTTACTGGAAATTTGAAAGAGAAGGGAGAGAAGGACTTGATGAAGCATTATCCTGACCTAGA
GGTGGATGTCTTGAAAGCAGGCCAACATGGTGCTAAAACCTCATCAAATCCAGTTTTCCTAGAACAGCTCAAACCAGCGA
TTACTCTCATTTCAGTCGGAAAGAGCAATCGTACGAAACTCCCCCATCAGGAAACCTTGACACGAATGGAAAGTATCAAG
AGTAAGATTTACCGAACTGACCAGCAAGGGGCTATCCGCTTTAAAGGATGGAATAGTTGGAGAATTGAAACGGTTCGATA
A
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
86.863 |
100 |
0.869 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
86.309 |
99.866 |
0.862 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
85.389 |
100 |
0.854 |
| comEC/celB | Streptococcus pneumoniae D39 |
85.121 |
100 |
0.851 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
85.121 |
100 |
0.851 |
| comEC/celB | Streptococcus pneumoniae R6 |
85.121 |
100 |
0.851 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.669 |
99.33 |
0.444 |