Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | GOM47_RS06105 | Genome accession | NZ_CP046524 |
| Coordinates | 1197297..1199537 (-) | Length | 746 a.a. |
| NCBI ID | WP_235080191.1 | Uniprot ID | - |
| Organism | Streptococcus oralis strain SOT | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1192297..1204537
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GOM47_RS06075 (GOM47_06080) | - | 1193466..1194452 (-) | 987 | WP_235080189.1 | dihydroorotate dehydrogenase | - |
| GOM47_RS06080 (GOM47_06085) | - | 1194415..1195215 (-) | 801 | WP_042902385.1 | dihydroorotate dehydrogenase electron transfer subunit | - |
| GOM47_RS06085 (GOM47_06090) | - | 1195464..1195844 (-) | 381 | WP_235080190.1 | VOC family protein | - |
| GOM47_RS06090 (GOM47_06095) | rplT | 1195903..1196262 (-) | 360 | WP_000124830.1 | 50S ribosomal protein L20 | - |
| GOM47_RS06095 (GOM47_06100) | rpmI | 1196314..1196514 (-) | 201 | WP_001125942.1 | 50S ribosomal protein L35 | - |
| GOM47_RS06100 (GOM47_06105) | infC | 1196547..1197077 (-) | 531 | WP_000848184.1 | translation initiation factor IF-3 | - |
| GOM47_RS06105 (GOM47_06110) | comEC/celB | 1197297..1199537 (-) | 2241 | WP_235080191.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| GOM47_RS06110 (GOM47_06115) | comEA/celA/cilE | 1199521..1200171 (-) | 651 | WP_235080192.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| GOM47_RS06115 (GOM47_06120) | - | 1200238..1200807 (-) | 570 | WP_235080193.1 | GNAT family N-acetyltransferase | - |
| GOM47_RS06120 (GOM47_06125) | ald | 1200986..1202098 (+) | 1113 | WP_235080194.1 | alanine dehydrogenase | - |
| GOM47_RS06125 (GOM47_06130) | - | 1202150..1203136 (-) | 987 | WP_000658168.1 | PhoH family protein | - |
| GOM47_RS06130 (GOM47_06135) | - | 1203217..1203432 (-) | 216 | WP_001232084.1 | YozE family protein | - |
| GOM47_RS06135 (GOM47_06140) | - | 1203456..1204034 (-) | 579 | WP_235080195.1 | GrpB family protein | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84662.86 Da Isoelectric Point: 9.0412
>NTDB_id=404997 GOM47_RS06105 WP_235080191.1 1197297..1199537(-) (comEC/celB) [Streptococcus oralis strain SOT]
MSQWIKNSPIPLIYLSFLLLWLYYAIFGASYLALLGFVFLLVCLFFQFPWKSAGRVLAICGVFGFWFLFQTWQQTQASQN
LVDSVEKVRILPDTIKVNGDSLSFRGKAEGRTLQVYYKLQSEEEKELFQALTDLHEIELEGKPSEPEGQRNFGGFNYQAY
LKTQGIYQTLTIKSIQSMKQVSSWDIRENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMEAFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQAWSILLTFVFSFLFDLVFLPLLSI
LFILSFAYPVTQFNLVFEWLENIIRLVSQLASRPLVFGQPNAWLLILLLVSLALVYDMRKNIKRLAGVSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKVEASKKIEAWQEKVTTSNAQRTLIPYLKSRGVDKIDQLILTNTDK
EHVGDLLEVTKAFHIGEILVSKGSLTQKEFVAELEASKNKVRSVTAGEYFPIFGSYLELLSPRQIGDGDRDDSLVLYGKL
LDKHFLFTGNLKEKGEKDLLKQYPDLEVDVLKAGQHGAKTSSNSAFLEQLKPEFTLISVGKSNRAKLPHQETLTQLENIK
SKIYRTDQQGAIRFKGWNSWRIETVR
MSQWIKNSPIPLIYLSFLLLWLYYAIFGASYLALLGFVFLLVCLFFQFPWKSAGRVLAICGVFGFWFLFQTWQQTQASQN
LVDSVEKVRILPDTIKVNGDSLSFRGKAEGRTLQVYYKLQSEEEKELFQALTDLHEIELEGKPSEPEGQRNFGGFNYQAY
LKTQGIYQTLTIKSIQSMKQVSSWDIRENLSSLRRKAVVWIKMHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMEAFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQAWSILLTFVFSFLFDLVFLPLLSI
LFILSFAYPVTQFNLVFEWLENIIRLVSQLASRPLVFGQPNAWLLILLLVSLALVYDMRKNIKRLAGVSLFIVGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKVEASKKIEAWQEKVTTSNAQRTLIPYLKSRGVDKIDQLILTNTDK
EHVGDLLEVTKAFHIGEILVSKGSLTQKEFVAELEASKNKVRSVTAGEYFPIFGSYLELLSPRQIGDGDRDDSLVLYGKL
LDKHFLFTGNLKEKGEKDLLKQYPDLEVDVLKAGQHGAKTSSNSAFLEQLKPEFTLISVGKSNRAKLPHQETLTQLENIK
SKIYRTDQQGAIRFKGWNSWRIETVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=404997 GOM47_RS06105 WP_235080191.1 1197297..1199537(-) (comEC/celB) [Streptococcus oralis strain SOT]
ATGTCACAGTGGATTAAGAATTCCCCAATTCCCCTAATCTATCTGAGTTTTCTATTGCTCTGGCTTTACTATGCAATTTT
TGGAGCGTCCTATCTCGCACTGCTAGGTTTTGTTTTTTTGCTTGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCGGCCG
GTAGAGTTCTAGCGATTTGTGGAGTTTTTGGATTTTGGTTTTTGTTTCAAACTTGGCAACAGACACAAGCTAGTCAAAAC
CTAGTGGATTCTGTTGAGAAGGTACGGATTTTGCCTGACACTATTAAGGTCAATGGAGATAGTCTGTCCTTTCGGGGCAA
GGCTGAGGGCCGCACCCTCCAAGTTTACTATAAACTCCAGTCTGAGGAGGAGAAAGAGCTCTTTCAGGCCTTAACAGACC
TTCACGAGATAGAGCTAGAAGGAAAACCTTCAGAGCCTGAAGGTCAGAGGAATTTTGGTGGCTTTAACTATCAAGCATAT
CTAAAGACTCAAGGAATTTACCAAACTTTGACTATCAAGAGTATCCAGTCAATGAAACAGGTTAGCAGTTGGGATATAAG
AGAAAATCTGTCTAGTTTACGTCGAAAGGCTGTAGTTTGGATCAAGATGCACTTTCCAGATCCTATGCGCAACTATATGA
CAGGTCTTCTTTTAGGACATTTGGACACGGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTTGGAATTATCCATCTC
TTTGCCTTGTCTGGTATGCAGGTGGGCTTCTTTATGGAGGCCTTTAAGAAACTCCTCTTGCGATTGGGCTTGACTCAAGA
AAAGTTGAAGTGGCTAACCTATCCCTTTTCCCTTATCTATGCAGGTCTGACAGGATTTTCAGCTTCGGTCATTCGTAGTC
TCTTACAAAAGTTACTAGCCCAACATGGTGTTAAGGGCTTGGACAATTTTGCCTTAACAGTCCTTGTCCTCTTTATCATC
ATGCCCAACTTTTTCCTAACAGCAGGAGGAGTCTTGTCTTGCGCTTATGCTTTTATCTTGACCATGACAAGCAAAGAAGG
GGAGGGACTCAAGGCTGTTGCCAGAGAAAGTTTGGTCATTTCTTTGGGAATATTGCCCATTCTATCCTTTTATTTTGCAG
AATTTCAGGCTTGGTCTATCCTTTTGACCTTTGTATTTTCCTTTCTGTTTGATCTGGTCTTCTTGCCGCTTTTGTCCATC
TTATTCATTCTGTCTTTTGCTTACCCAGTCACTCAGTTTAACCTTGTCTTTGAGTGGTTGGAGAACATCATTCGCTTGGT
ATCGCAGCTGGCAAGCAGGCCTCTGGTCTTTGGTCAACCCAACGCATGGCTGCTGATTCTACTGTTAGTTTCATTAGCCT
TAGTCTATGACATGAGGAAAAACATCAAAAGACTAGCAGGAGTCAGTCTTTTTATCGTGGGGCTCTTTTTCTTGACCAAG
CACCCACTTGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGTGAAAGTATTTTCCTACGGGATGTAACTGGTAAAAC
TATTCTCATAGATGTTGGTGGTAAGGTAGAAGCTAGTAAGAAAATCGAGGCTTGGCAAGAAAAGGTGACAACCAGCAATG
CCCAGAGAACCTTGATTCCCTATCTTAAAAGTCGAGGAGTAGACAAGATTGACCAGCTGATTTTGACCAACACAGACAAG
GAACATGTTGGTGATTTACTGGAGGTGACCAAAGCTTTCCATATCGGGGAGATTTTAGTGTCAAAAGGAAGTTTGACACA
GAAGGAATTTGTAGCGGAACTAGAAGCAAGTAAAAACAAGGTACGTAGTGTGACAGCTGGGGAGTATTTCCCGATTTTTG
GCAGTTACTTAGAACTTCTATCTCCAAGGCAGATTGGAGATGGGGATCGTGATGATTCTCTGGTTCTTTATGGAAAACTT
TTGGATAAGCACTTTCTCTTCACAGGAAATTTGAAAGAGAAGGGAGAGAAGGATCTTCTAAAGCAATACCCTGACTTAGA
GGTGGATGTCCTGAAAGCAGGCCAACATGGTGCTAAAACATCATCCAATTCAGCTTTCCTAGAACAGCTCAAACCTGAGT
TCACTCTCATTTCAGTTGGAAAGAGCAATCGAGCAAAACTCCCTCATCAGGAAACCTTGACACAACTGGAAAATATCAAG
AGTAAGATTTACCGAACTGACCAGCAAGGGGCTATCCGCTTTAAAGGATGGAATAGTTGGAGAATTGAAACGGTTCGATA
A
ATGTCACAGTGGATTAAGAATTCCCCAATTCCCCTAATCTATCTGAGTTTTCTATTGCTCTGGCTTTACTATGCAATTTT
TGGAGCGTCCTATCTCGCACTGCTAGGTTTTGTTTTTTTGCTTGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCGGCCG
GTAGAGTTCTAGCGATTTGTGGAGTTTTTGGATTTTGGTTTTTGTTTCAAACTTGGCAACAGACACAAGCTAGTCAAAAC
CTAGTGGATTCTGTTGAGAAGGTACGGATTTTGCCTGACACTATTAAGGTCAATGGAGATAGTCTGTCCTTTCGGGGCAA
GGCTGAGGGCCGCACCCTCCAAGTTTACTATAAACTCCAGTCTGAGGAGGAGAAAGAGCTCTTTCAGGCCTTAACAGACC
TTCACGAGATAGAGCTAGAAGGAAAACCTTCAGAGCCTGAAGGTCAGAGGAATTTTGGTGGCTTTAACTATCAAGCATAT
CTAAAGACTCAAGGAATTTACCAAACTTTGACTATCAAGAGTATCCAGTCAATGAAACAGGTTAGCAGTTGGGATATAAG
AGAAAATCTGTCTAGTTTACGTCGAAAGGCTGTAGTTTGGATCAAGATGCACTTTCCAGATCCTATGCGCAACTATATGA
CAGGTCTTCTTTTAGGACATTTGGACACGGACTTTGAGGAGATGAATGAGCTTTATTCCAGTCTTGGAATTATCCATCTC
TTTGCCTTGTCTGGTATGCAGGTGGGCTTCTTTATGGAGGCCTTTAAGAAACTCCTCTTGCGATTGGGCTTGACTCAAGA
AAAGTTGAAGTGGCTAACCTATCCCTTTTCCCTTATCTATGCAGGTCTGACAGGATTTTCAGCTTCGGTCATTCGTAGTC
TCTTACAAAAGTTACTAGCCCAACATGGTGTTAAGGGCTTGGACAATTTTGCCTTAACAGTCCTTGTCCTCTTTATCATC
ATGCCCAACTTTTTCCTAACAGCAGGAGGAGTCTTGTCTTGCGCTTATGCTTTTATCTTGACCATGACAAGCAAAGAAGG
GGAGGGACTCAAGGCTGTTGCCAGAGAAAGTTTGGTCATTTCTTTGGGAATATTGCCCATTCTATCCTTTTATTTTGCAG
AATTTCAGGCTTGGTCTATCCTTTTGACCTTTGTATTTTCCTTTCTGTTTGATCTGGTCTTCTTGCCGCTTTTGTCCATC
TTATTCATTCTGTCTTTTGCTTACCCAGTCACTCAGTTTAACCTTGTCTTTGAGTGGTTGGAGAACATCATTCGCTTGGT
ATCGCAGCTGGCAAGCAGGCCTCTGGTCTTTGGTCAACCCAACGCATGGCTGCTGATTCTACTGTTAGTTTCATTAGCCT
TAGTCTATGACATGAGGAAAAACATCAAAAGACTAGCAGGAGTCAGTCTTTTTATCGTGGGGCTCTTTTTCTTGACCAAG
CACCCACTTGAAAATGAAATCACCATGCTGGATGTGGGGCAAGGTGAAAGTATTTTCCTACGGGATGTAACTGGTAAAAC
TATTCTCATAGATGTTGGTGGTAAGGTAGAAGCTAGTAAGAAAATCGAGGCTTGGCAAGAAAAGGTGACAACCAGCAATG
CCCAGAGAACCTTGATTCCCTATCTTAAAAGTCGAGGAGTAGACAAGATTGACCAGCTGATTTTGACCAACACAGACAAG
GAACATGTTGGTGATTTACTGGAGGTGACCAAAGCTTTCCATATCGGGGAGATTTTAGTGTCAAAAGGAAGTTTGACACA
GAAGGAATTTGTAGCGGAACTAGAAGCAAGTAAAAACAAGGTACGTAGTGTGACAGCTGGGGAGTATTTCCCGATTTTTG
GCAGTTACTTAGAACTTCTATCTCCAAGGCAGATTGGAGATGGGGATCGTGATGATTCTCTGGTTCTTTATGGAAAACTT
TTGGATAAGCACTTTCTCTTCACAGGAAATTTGAAAGAGAAGGGAGAGAAGGATCTTCTAAAGCAATACCCTGACTTAGA
GGTGGATGTCCTGAAAGCAGGCCAACATGGTGCTAAAACATCATCCAATTCAGCTTTCCTAGAACAGCTCAAACCTGAGT
TCACTCTCATTTCAGTTGGAAAGAGCAATCGAGCAAAACTCCCTCATCAGGAAACCTTGACACAACTGGAAAATATCAAG
AGTAAGATTTACCGAACTGACCAGCAAGGGGCTATCCGCTTTAAAGGATGGAATAGTTGGAGAATTGAAACGGTTCGATA
A
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
87.802 |
100 |
0.878 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
87.248 |
99.866 |
0.871 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
86.059 |
100 |
0.861 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
85.523 |
100 |
0.855 |
| comEC/celB | Streptococcus pneumoniae D39 |
85.523 |
100 |
0.855 |
| comEC/celB | Streptococcus pneumoniae R6 |
85.523 |
100 |
0.855 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
44.818 |
99.598 |
0.446 |