Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | I6G43_RS04425 | Genome accession | NZ_CP065706 |
| Coordinates | 933003..935243 (+) | Length | 746 a.a. |
| NCBI ID | WP_038805080.1 | Uniprot ID | - |
| Organism | Streptococcus oralis strain FDAARGOS_886 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
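The coordinates and lengths listed in the overview can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one stop codon at the end."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values copied from the overview table for I6G43_RS04425 (comEC/celB).
gene_start, gene_end = 933003, 935243

cds_bp = span_length(gene_start, gene_end)
aa_len = protein_length(cds_bp)

print(cds_bp)  # 2241
print(aa_len)  # 746
```

Both results agree with the 2241 bp and 746 a.a. reported for this locus.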
Genomic Context
Location: 928003..940243
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| I6G43_RS04395 (I6G43_04395) | - | 928498..929076 (+) | 579 | WP_009014028.1 | GrpB family protein | - |
| I6G43_RS04400 (I6G43_04400) | - | 929100..929315 (+) | 216 | WP_001232084.1 | YozE family protein | - |
| I6G43_RS04405 (I6G43_04405) | - | 929396..930382 (+) | 987 | WP_000658176.1 | PhoH family protein | - |
| I6G43_RS04410 (I6G43_04410) | ald | 930444..931556 (-) | 1113 | WP_038806202.1 | alanine dehydrogenase | - |
| I6G43_RS04415 (I6G43_04415) | - | 931733..932302 (+) | 570 | WP_038805078.1 | GNAT family N-acetyltransferase | - |
| I6G43_RS04420 (I6G43_04420) | comEA/celA/cilE | 932369..933019 (+) | 651 | WP_038805079.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| I6G43_RS04425 (I6G43_04425) | comEC/celB | 933003..935243 (+) | 2241 | WP_038805080.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| I6G43_RS04430 (I6G43_04430) | infC | 935463..935993 (+) | 531 | WP_000848184.1 | translation initiation factor IF-3 | - |
| I6G43_RS04435 (I6G43_04435) | rpmI | 936026..936226 (+) | 201 | WP_001125942.1 | 50S ribosomal protein L35 | - |
| I6G43_RS04440 (I6G43_04440) | rplT | 936278..936637 (+) | 360 | WP_000124830.1 | 50S ribosomal protein L20 | - |
| I6G43_RS04445 (I6G43_04445) | - | 936696..937076 (+) | 381 | WP_038806201.1 | VOC family protein | - |
| I6G43_RS04450 (I6G43_04450) | - | 937323..938123 (+) | 801 | WP_038806200.1 | dihydroorotate dehydrogenase electron transfer subunit | - |
| I6G43_RS04455 (I6G43_04455) | - | 938086..939072 (+) | 987 | WP_161603075.1 | dihydroorotate dehydrogenase | - |
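The context window and the neighbouring gene coordinates above lend themselves to a quick consistency check. This sketch assumes 1-based inclusive coordinates; the helper name `overlap_bp` is hypothetical. It shows that the displayed window extends 5000 bp on each side of comEC/celB and that the comEA/celA/cilE and comEC/celB reading frames overlap by a few bases.

```python
def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of shared positions between two 1-based inclusive ranges."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


# Coordinates copied from the genomic context table.
window = (928003, 940243)
com_ea = (932369, 933019)  # I6G43_RS04420
com_ec = (933003, 935243)  # I6G43_RS04425

flank_up = com_ec[0] - window[0]
flank_down = window[1] - com_ec[1]
shared = overlap_bp(com_ea, com_ec)

print(flank_up, flank_down)  # 5000 5000
print(shared)                # 17
```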
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84494.79 Da; Isoelectric point: 9.2903
>NTDB_id=513218 I6G43_RS04425 WP_038805080.1 933003..935243(+) (comEC/celB) [Streptococcus oralis strain FDAARGOS_886]
MSQWIKNFPIPLIYLSFLLLWLYYAIFGASYLALLGFVFLLVCLFFQFPWKSAGKVLAICGVFGFWFLFQTWQQTQASQN
LVDSVEKVKILPDTIKVNGDSLSFRGKADGHTFQVYYKLQSEEEKEQFQTLTDLHEIELEGKVSEPEGQRNFGGFNYQAY
LKTQGIYQTLTIKRIQSVKQISSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDAFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGIKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDVVFLPLLSI
LFILSFVYPITQLNLVFEWLEDIIRLVSQLASRPLVFGQPNAWLLILLLISLAILYDFRKNIKRVAGVSLLIIGLFFLTK
HPLENEITMLDVGQGESIFLRDVTGKAILIDVGGKAESDKKIEKWQEKATTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGILTQKEFVAELEASQTKVRSVTAGENLPIFGSQLEVLSPRQIGDGDRDGSLVLYGKL
LDKHFLFTGNLKEKGEKDILKQYPNLEVDVLKVGQHGSKTSSNPAFLEKLKPEISLISVGKNNRAKLPHQETLTRLETIK
SKIYRTDQQGAIRFKGWNSWRIETVR
Nucleotide
Length: 2241 bp
>NTDB_id=513218 I6G43_RS04425 WP_038805080.1 933003..935243(+) (comEC/celB) [Streptococcus oralis strain FDAARGOS_886]
ATGTCACAGTGGATTAAGAATTTCCCTATTCCTCTAATCTATCTGAGCTTTCTGTTGCTCTGGCTTTACTATGCCATTTT
TGGAGCGTCCTATCTCGCACTGCTAGGTTTTGTTTTTTTGCTCGTCTGTCTCTTTTTTCAATTTCCTTGGAAATCTGCTG
GAAAAGTTCTAGCGATTTGTGGAGTTTTTGGATTTTGGTTTCTGTTTCAAACTTGGCAACAGACACAAGCTAGTCAAAAC
CTAGTGGATTCTGTTGAAAAGGTGAAGATCTTGCCTGATACCATCAAAGTCAATGGAGACAGTCTGTCCTTTCGGGGCAA
GGCTGACGGCCATACCTTCCAAGTTTACTATAAACTCCAGTCTGAGGAGGAGAAAGAGCAATTTCAAACCTTAACAGATC
TTCATGAGATTGAACTAGAAGGGAAAGTTTCAGAGCCTGAAGGTCAGAGGAATTTTGGTGGATTTAACTACCAAGCCTAC
CTGAAGACTCAAGGGATTTACCAGACACTGACTATCAAGAGAATCCAGTCAGTTAAACAGATTAGCAGTTGGGATATAGG
TGAAAATTTATCAAGTTTACGTCGAAAGGCTGTAGTTTGGATCAAGACGCACTTTCCAGACCCTATGCGCAACTATATGA
CAGGTCTTCTGTTAGGACATCTGGATACGGACTTTGAGGAGATGAATGAACTTTATTCCAGTCTAGGAATTATCCACCTC
TTTGCCTTGTCGGGTATGCAGGTGGGATTCTTTATGGATGCCTTTAAGAAACTCCTCTTGCGATTGGGCTTGACTCAAGA
AAAGTTGAAGTGGCTGACCTATCCCTTTTCCCTTATCTATGCAGGTCTGACAGGATTTTCAGCTTCAGTCATTCGCAGTC
TCTTGCAAAAGTTACTAGCACAACATGGTATTAAGGGCTTGGATAATTTTGCCTTGACAGTCCTTGTCCTCTTTATCATC
ATGCCCAACTTTTTCTTGACAGCTGGAGGGGTCTTGTCCTGCGCTTATGCTTTTATCTTGACTATGACAAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCCAGAGAAAGTTTGGTCATTTCCTTGGGAATATTGCCTATTTTGTCCTTTTATTTTGCAG
AATTTCAGCCTTGGTCAATTCTTTTAACCTTTGTCTTTTCCTTTCTGTTTGATGTTGTCTTCTTGCCGCTTTTGTCCATC
TTATTTATTCTGTCTTTTGTTTACCCAATCACCCAGCTTAACCTTGTCTTTGAGTGGTTGGAGGACATCATTCGCTTGGT
ATCGCAGCTGGCAAGCAGGCCCCTGGTCTTTGGTCAACCCAACGCATGGCTGTTGATTCTACTATTAATTTCCTTGGCAA
TACTGTATGATTTTAGGAAAAACATCAAAAGAGTAGCTGGAGTCAGCTTATTGATTATAGGTCTCTTTTTCCTAACCAAA
CATCCGCTGGAAAATGAAATCACCATGCTGGATGTCGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTGGTAAAGC
CATTCTCATAGATGTGGGAGGCAAGGCAGAGTCTGATAAGAAAATCGAAAAATGGCAAGAAAAGGCGACGACCAGCAATG
CCCAGAGAACCTTGATTCCCTATCTTAAAAGTCGTGGGGTAGCCAAGATTGACCAGCTGATTTTGACCAACACAGATAAA
GAACATGTTGGAGATTTGCTGGAGGTGACCAAGGCTTTCCATGTTGGGGAGATTTTAGTATCAAAAGGAATTCTGACACA
GAAGGAATTTGTAGCAGAACTAGAAGCAAGCCAAACCAAGGTGCGCAGTGTGACAGCAGGGGAGAACTTGCCAATTTTTG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGCAGATTGGAGATGGAGATCGTGATGGTTCCCTGGTTCTTTATGGAAAACTT
TTGGATAAGCACTTTCTCTTTACAGGAAACTTGAAAGAGAAGGGAGAGAAGGATATTCTAAAGCAATACCCTAACTTAGA
GGTGGATGTCTTGAAAGTCGGTCAACATGGTTCTAAAACATCATCAAATCCAGCTTTTCTAGAAAAACTTAAACCAGAGA
TTTCTCTCATCTCAGTTGGAAAGAACAATCGTGCGAAACTCCCCCATCAGGAAACCTTGACTCGACTAGAGACCATCAAG
AGTAAGATTTACCGAACTGACCAGCAAGGAGCTATTCGCTTTAAAGGATGGAATAGTTGGCGAATTGAAACGGTTCGATA
A
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 88.472 | 100 | 0.885 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 88.322 | 99.866 | 0.882 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 86.997 | 100 | 0.87 |
| comEC/celB | Streptococcus pneumoniae D39 | 86.595 | 100 | 0.866 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 86.595 | 100 | 0.866 |
| comEC/celB | Streptococcus pneumoniae R6 | 86.595 | 100 | 0.866 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 45.074 | 99.33 | 0.448 |
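The page does not define the Ha-value. As an observation only, the listed values are numerically consistent with identity (%) multiplied by coverage (%) and divided by 10,000, rounded to three decimals. The sketch below checks that relationship against the rows above; the formula and the name `approx_ha_value` are assumptions, not the database's documented method.

```python
def approx_ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relationship: (identity % x coverage %) / 10000, 3 decimals."""
    return round(identity_pct * coverage_pct / 10000, 3)


# (organism, identity %, coverage %, listed Ha-value), copied from the table.
rows = [
    ("Streptococcus mitis SK321", 88.472, 100, 0.885),
    ("Streptococcus mitis NCTC 12261", 88.322, 99.866, 0.882),
    ("Streptococcus pneumoniae TIGR4", 86.997, 100, 0.87),
    ("Streptococcus pneumoniae D39", 86.595, 100, 0.866),
    ("Lactococcus lactis subsp. cremoris KW2", 45.074, 99.33, 0.448),
]

for organism, identity, coverage, listed in rows:
    computed = approx_ha_value(identity, coverage)
    print(f"{organism}: computed={computed}, listed={listed}")
```

If the database computes the score differently, the agreement here may be coincidental for these rows.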