Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | I6H78_RS01790 | Genome accession | NZ_CP066059 |
| Coordinates | 375510..377750 (-) | Length | 746 a.a. |
| NCBI ID | WP_198459754.1 | Uniprot ID | - |
| Organism | Streptococcus oralis strain FDAARGOS_1021 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 370510..382750
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| I6H78_RS01760 (I6H78_01760) | rplT | 371844..372203 (-) | 360 | WP_000124830.1 | 50S ribosomal protein L20 | - |
| I6H78_RS01765 (I6H78_01765) | rpmI | 372255..372455 (-) | 201 | WP_001125942.1 | 50S ribosomal protein L35 | - |
| I6H78_RS01770 (I6H78_01770) | infC | 372488..373018 (-) | 531 | WP_000848184.1 | translation initiation factor IF-3 | - |
| I6H78_RS01775 (I6H78_01775) | - | 373325..374509 (-) | 1185 | WP_198459751.1 | hypothetical protein | - |
| I6H78_RS01780 (I6H78_01780) | - | 374513..375100 (-) | 588 | WP_198459752.1 | ATP-binding cassette domain-containing protein | - |
| I6H78_RS01785 (I6H78_01785) | - | 375133..375321 (-) | 189 | WP_061419442.1 | hypothetical protein | - |
| I6H78_RS01790 (I6H78_01790) | comEC/celB | 375510..377750 (-) | 2241 | WP_198459754.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| I6H78_RS01795 (I6H78_01795) | comEA/celA/cilE | 377734..378384 (-) | 651 | WP_198459755.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| I6H78_RS01800 (I6H78_01800) | - | 378451..379020 (-) | 570 | WP_198459756.1 | GNAT family N-acetyltransferase | - |
| I6H78_RS01805 (I6H78_01805) | ald | 379197..380309 (+) | 1113 | WP_198459757.1 | alanine dehydrogenase | - |
| I6H78_RS01810 (I6H78_01810) | tnpA | 380469..380932 (-) | 464 | Protein_360 | IS200/IS605 family transposase | - |
| I6H78_RS01815 (I6H78_01815) | - | 381122..382108 (-) | 987 | WP_125425819.1 | PhoH family protein | - |
| I6H78_RS01820 (I6H78_01820) | - | 382189..382404 (-) | 216 | WP_198459758.1 | YozE family protein | - |
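
The coordinates listed for I6H78_RS01790 can be cross-checked with a little arithmetic. The sketch below uses only numbers copied from this record. The variable names are illustrative, and the 5,000 bp flank on each side of the gene is inferred from the listed positions, not something the record states.

```python
# Values copied from the record above; names are illustrative only.
gene_start, gene_end = 375510, 377750        # comEC/celB, minus strand
window_start, window_end = 370510, 382750    # "Location" of the genomic context
comea_start = 377734                         # start of the neighbouring comEA/celA/cilE
protein_length = 746                         # a.a., from the Overview table

# Coordinates are 1-based and inclusive, so the span is end - start + 1.
cds_length = gene_end - gene_start + 1
codons = cds_length // 3

# Flanks on either side of the gene (assumed here to be a fixed display window).
left_flank = gene_start - window_start
right_flank = window_end - gene_end

# Overlap between the end of comEC and the start of comEA, in bp.
overlap = gene_end - comea_start + 1

print("CDS length (bp):", cds_length)
print("Codons excluding stop:", codons - 1)
print("Flanks (bp):", left_flank, right_flank)
print("comEC/comEA overlap (bp):", overlap)
```

The CDS length matches the 2241 bp in the table, and removing the stop codon from the 747 codons gives the 746 a.a. reported for the protein.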
Sequence
Protein
Length: 746 a.a.; Molecular weight: 84552.79 Da; Isoelectric point: 8.7852
>NTDB_id=516933 I6H78_RS01790 WP_198459754.1 375510..377750(-) (comEC/celB) [Streptococcus oralis strain FDAARGOS_1021]
MSQWIKNSPIPLIYLSFLLLWLYYAIFGASYLALLGFVFLLVCLFFQFPWKSAGKVLAICGVFGFWFLFQNWQQTQVSQK
LVDSVEKVRILPDTIKVNGDSLSFRGKADGRTFQVYYKLQSEEEKEQFQALTDLHEIELEGKLSEPEGQRNFGGFDYQAY
LKTQGIYQTLTIKNIQSLKQVSSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDAFKKLLLRLGLTQEKLKWLTYPFSLIYAGLTGFSASVIRSLLQKLLAQHGVKGLDNFALTILVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEGEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDVVFLPLLSI
LFILSFVYPVTQFNFVFEWLEGVIRLVSQLASRPLVFGQPNAWLLILLLVSLALVYDMRKNIKRLAGFSLFIVCLFFLTK
HPLENEITMLDVGQGESIFLRDVTSKTILIDVGGKVEASKKIGVWQEKVTTSNAQRTLIPYLKSRGVDKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLTQKEFVAELEASQTKVRSVTVGENLPIFGSQLEVLSPRQIGDDDRDGSLVLYGKL
LDKHFLFTGNLKEKGEKDLLKQYPDLEVEVLKAGQHGAKTSSNPAFLENLKPEITLISVGKNNLAKLPHQETLTRLETIK
SKIYRTDQQGAIRFKGWNSWRMESVR
Nucleotide
Length: 2241 bp
>NTDB_id=516933 I6H78_RS01790 WP_198459754.1 375510..377750(-) (comEC/celB) [Streptococcus oralis strain FDAARGOS_1021]
ATGTCACAGTGGATTAAGAATTCCCCTATCCCCCTAATCTATCTGAGTTTTCTGTTACTCTGGCTTTACTATGCCATTTT
TGGAGCGTCCTATCTCGCACTGCTAGGTTTTGTTTTTTTGCTTGTCTGTCTCTTTTTCCAATTTCCTTGGAAATCGGCTG
GAAAAGTTCTAGCGATTTGTGGAGTTTTTGGATTTTGGTTTCTGTTTCAAAATTGGCAACAGACACAAGTTAGTCAAAAG
TTAGTGGATTCTGTTGAGAAGGTACGGATTTTACCAGATACTATCAAAGTCAATGGAGATAGTCTGTCCTTTCGGGGCAA
GGCTGACGGCCGCACCTTCCAAGTTTACTATAAACTCCAGTCCGAGGAGGAGAAAGAGCAATTTCAAGCCTTGACAGACC
TTCATGAGATAGAACTAGAAGGAAAACTTTCAGAGCCTGAAGGTCAGAGGAATTTTGGTGGATTTGACTATCAAGCCTAT
CTAAAGACTCAAGGGATTTACCAGACATTGACTATCAAGAACATCCAGTCACTTAAACAGGTTAGCAGTTGGGATATAGG
AGAAAATCTGTCCAGTTTACGTCGAAAGGCTGTAGTTTGGATAAAGACGCACTTTCCAGATCCTATGCGCAACTATATGA
CAGGTCTTCTTTTAGGACATTTGGACACGGACTTTGAGGAGATGAATGAGCTCTATTCCAGTCTTGGAATTATCCATCTC
TTTGCCTTGTCGGGTATGCAGGTGGGCTTCTTTATGGATGCCTTTAAGAAACTCCTCTTACGATTGGGCTTGACCCAAGA
AAAGTTGAAGTGGCTGACCTATCCCTTTTCCCTTATCTATGCAGGTCTGACAGGATTTTCAGCTTCGGTCATTCGCAGTC
TCTTGCAAAAGTTACTAGCCCAACATGGTGTTAAAGGTTTGGATAATTTTGCCTTGACGATCCTTGTCCTTTTTATCATT
ATGCCCAACTTTTTCCTAACTGCAGGAGGAGTCTTGTCCTGCGCCTATGCTTTTATCTTGACCATGACAAGCAAAGAAGG
GGAGGGGCTCAAGGCTGTTGCCAGAGAAAGTTTGGTCATTTCTTTGGGAATATTGCCCATTCTATCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATTCTTTTGACCTTTGTTTTTTCCTTTCTGTTTGATGTGGTATTCTTACCGCTTTTGTCCATC
TTATTCATTCTGTCTTTTGTTTACCCAGTCACTCAGTTTAACTTTGTCTTTGAGTGGTTAGAGGGCGTCATTCGCTTGGT
ATCGCAGCTGGCAAGCAGGCCTCTGGTCTTTGGTCAACCCAACGCATGGCTGTTGATTCTACTGTTAGTTTCATTAGCCT
TAGTCTATGACATGAGGAAAAACATTAAAAGACTAGCAGGATTTAGTCTCTTTATTGTGTGTCTCTTTTTCCTGACCAAG
CATCCACTTGAAAATGAAATTACCATGCTGGATGTTGGGCAAGGAGAAAGTATTTTCCTACGGGATGTAACTAGTAAAAC
TATTCTCATAGATGTTGGTGGTAAGGTAGAAGCTAGTAAGAAAATCGGGGTTTGGCAAGAAAAGGTGACGACCAGCAATG
CTCAGAGAACCTTGATTCCCTATCTTAAAAGTCGAGGAGTAGACAAGATTGACCAGCTGATTTTGACCAATACAGACAAG
GAGCATGTTGGAGATTTACTAGAGGTGACCAAGGCTTTCCATGTTGGGGAGATTTTAGTATCAAAAGGAAGTCTGACACA
GAAGGAATTTGTAGCAGAATTAGAAGCAAGCCAAACCAAGGTGCGCAGTGTAACAGTAGGGGAGAACTTGCCGATTTTTG
GGAGTCAGTTAGAAGTCCTATCTCCAAGGCAGATTGGAGATGATGATCGTGATGGTTCCCTGGTTCTTTATGGAAAACTT
TTGGATAAGCACTTTCTCTTCACAGGAAATTTGAAAGAGAAGGGAGAGAAAGACTTGCTGAAGCAATACCCTGACCTAGA
GGTGGAGGTCTTGAAAGCAGGCCAGCATGGTGCTAAAACCTCATCCAATCCAGCTTTCCTAGAAAACCTCAAACCAGAAA
TTACTCTCATCTCAGTTGGAAAAAACAATCTTGCGAAACTCCCCCATCAGGAAACCTTGACTCGACTAGAAACCATCAAG
AGTAAGATTTACCGAACTGACCAGCAAGGGGCTATCCGCTTTAAAGGGTGGAATAGTTGGCGAATGGAAAGTGTTCGTTA
G
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 88.472 | 100 | 0.885 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 88.322 | 99.866 | 0.882 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 87.131 | 100 | 0.871 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 86.729 | 100 | 0.867 |
| comEC/celB | Streptococcus pneumoniae D39 | 86.729 | 100 | 0.867 |
| comEC/celB | Streptococcus pneumoniae R6 | 86.729 | 100 | 0.867 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 44.804 | 99.33 | 0.445 |
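
The record does not define the Ha-value, but the listed numbers are consistent with the product of identity and coverage, each expressed as a fraction. The sketch below checks that reading against the rows above. The formula and the `ha_value` helper are assumptions inferred from the table, not a documented definition.

```python
# Rows copied from the table above: (organism, identities %, coverage %, listed Ha-value).
hits = [
    ("Streptococcus mitis SK321", 88.472, 100.0, 0.885),
    ("Streptococcus mitis NCTC 12261", 88.322, 99.866, 0.882),
    ("Streptococcus pneumoniae TIGR4", 87.131, 100.0, 0.871),
    ("Streptococcus pneumoniae Rx1", 86.729, 100.0, 0.867),
    ("Streptococcus pneumoniae D39", 86.729, 100.0, 0.867),
    ("Streptococcus pneumoniae R6", 86.729, 100.0, 0.867),
    ("Lactococcus lactis subsp. cremoris KW2", 44.804, 99.33, 0.445),
]


def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return identity_pct * coverage_pct / 10_000


for organism, identity, coverage, listed in hits:
    computed = ha_value(identity, coverage)
    # Compare with a tolerance because the table reports three decimals.
    agrees = abs(computed - listed) < 0.001
    print(f"{organism}: computed {computed:.3f}, listed {listed:.3f}, agrees={agrees}")
```

Under this assumed definition, a hit needs both high identity and near-complete coverage to score close to 1, which is why the Lactococcus lactis homolog scores much lower despite covering almost the whole protein.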