Detailed information
Overview
| Field | Value |
|---|---|
| Name | comEC/celB |
| Type | Machinery gene |
| Locus tag | I6H72_RS02330 |
| Genome accession | NZ_CP066055 |
| Coordinates | 465405..467633 (-) |
| Length | 742 a.a. |
| NCBI ID | WP_020997762.1 |
| Uniprot ID | - |
| Organism | Streptococcus constellatus strain FDAARGOS_1015 |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake |
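The coordinates and the protein length in the overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


cds_bp = gene_length_bp(465405, 467633)
print(cds_bp)                     # 2229
print(protein_length_aa(cds_bp))  # 742
```

Both values agree with the record: 2229 bp for the gene and 742 a.a. for the protein.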
Genomic Context
Location: 460405..472633
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| I6H72_RS02305 (I6H72_02305) | - | 460704..461981 (-) | 1278 | WP_198458219.1 | dihydroorotase | - |
| I6H72_RS02310 (I6H72_02310) | - | 461994..462464 (-) | 471 | WP_195674093.1 | NUDIX hydrolase | - |
| I6H72_RS02315 (I6H72_02315) | - | 462501..463154 (-) | 654 | WP_198458220.1 | uracil-DNA glycosylase | - |
| I6H72_RS02320 (I6H72_02320) | - | 463176..464162 (-) | 987 | WP_198458221.1 | Gfo/Idh/MocA family protein | - |
| I6H72_RS10190 | - | 464607..464747 (-) | 141 | WP_022525502.1 | hypothetical protein | - |
| I6H72_RS10195 | - | 464838..465242 (-) | 405 | WP_006267075.1 | DUF805 domain-containing protein | - |
| I6H72_RS02330 (I6H72_02330) | comEC/celB | 465405..467633 (-) | 2229 | WP_020997762.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| I6H72_RS02335 (I6H72_02335) | comEA/celA/cilE | 467617..468321 (-) | 705 | WP_150225670.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| I6H72_RS02340 (I6H72_02340) | smpB | 468489..468956 (-) | 468 | WP_006267184.1 | SsrA-binding protein SmpB | - |
| I6H72_RS02345 (I6H72_02345) | rnr | 469122..471258 (-) | 2137 | Protein_464 | ribonuclease R | - |
| I6H72_RS02350 (I6H72_02350) | secG | 471351..471584 (-) | 234 | WP_003024744.1 | preprotein translocase subunit SecG | - |
| I6H72_RS02355 (I6H72_02355) | rpmG | 471624..471773 (-) | 150 | WP_003024746.1 | 50S ribosomal protein L33 | - |
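The location range above appears to be the gene coordinates extended by 5000 bp on each side; that flank size is inferred from the numbers and is not stated in the record. The sketch below reproduces the window under that assumption and also measures the overlap between comEC/celB and the adjacent comEA/celA/cilE gene. `Feature`, `context_window`, and `overlap_bp` are hypothetical helper names.

```python
from typing import NamedTuple, Tuple


class Feature(NamedTuple):
    locus_tag: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def context_window(feature: Feature, flank: int = 5000) -> Tuple[int, int]:
    """Window around a feature; the 5000 bp flank is an assumption."""
    return max(1, feature.start - flank), feature.end + flank


def overlap_bp(a: Feature, b: Feature) -> int:
    """Number of positions shared by two features (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


com_ec = Feature("I6H72_RS02330", 465405, 467633)
com_ea = Feature("I6H72_RS02335", 467617, 468321)

print(context_window(com_ec))      # (460405, 472633)
print(overlap_bp(com_ec, com_ea))  # 17
```

The 17 bp overlap follows directly from the coordinates listed in the table: comEA/celA/cilE starts at 467617, before comEC/celB ends at 467633.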
Sequence
Protein
Length: 742 a.a. | Molecular weight: 85327.29 Da | Isoelectric point: 9.9939
>NTDB_id=516834 I6H72_RS02330 WP_020997762.1 465405..467633(-) (comEC/celB) [Streptococcus constellatus strain FDAARGOS_1015]
MSQWIKIFPIKPIYIAFLLVWLYFAIYQSSWLAWFCLIFLIVRLFVLYSPKKCFTTLMFLACFAAFFFVRREMAEWQTKA
EPSSVRQVAVLPDTIKVNGDSLSFRGKANGQTYQIYYKLKSKEEQLAFQNLSSLVTLTVEGEFESPEKQRNFSGFDYQVY
LKTQGIYRNLKVDQILSSQDRVSFQPFEWLSSWRRKALVFIKRNFPNPMNHYMTGLLFGALETDFDEMSDLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLKLGLTKEMVHKCQYPFSFFYAGMTGFSVSVVRSLIQKLLSQHGITKLDNFALTIMVLSLL
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRKVVVESSIISLGVLPILIFYFGEFQPWSILLTFVFSLIFDIVMLPGLT
LIFLISPFIKLIQVNFLFEGLENSIRWIANVFGRPIVFGQPSSLLLVVMLLVLAILYDVRKNKKWVILLSLCLAILFFIT
KFPLQNEITMVDVGQGDSLFLRDWKGRNVLIDVGGREEIRTKESWQKRTTKSNAEKTLIPYLKSRGIDTIDTLILTNPNA
DYAGDVLKVVKKFAVKKIFISRSSLNDADFLNKLKETKTFVHVVKQGDKLPIFDHHLQVLSGTNKNDQSLVLYGQFFRTR
FLFMSNLTEEDEVKLMQLYPKLKTDVLKVGQHGAKNSSHSKFLQQIEPAVALISVGKNNQSKSPNQEMIERLNRLNAKIY
RTDERGAIKFSGWTKWQLETVQ
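The FASTA header of this record packs several fields into one line. A minimal sketch of how it could be parsed follows; the regular expression is written against this single header and is an assumption about the layout, not a documented format specification.

```python
import re

# Pattern inferred from the header shown in this record (assumed layout).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognised header layout")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=516834 I6H72_RS02330 WP_020997762.1 465405..467633(-) "
    "(comEC/celB) [Streptococcus constellatus strain FDAARGOS_1015]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```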
Nucleotide
Length: 2229 bp
>NTDB_id=516834 I6H72_RS02330 WP_020997762.1 465405..467633(-) (comEC/celB) [Streptococcus constellatus strain FDAARGOS_1015]
ATGTCACAGTGGATTAAGATATTTCCGATTAAACCAATTTACATTGCTTTCTTACTTGTCTGGCTATATTTTGCAATCTA
TCAAAGTAGCTGGTTGGCTTGGTTTTGTCTTATTTTTCTAATAGTCCGTCTTTTTGTCCTTTATTCACCGAAAAAATGTT
TCACAACTTTAATGTTTCTCGCTTGTTTTGCTGCTTTTTTCTTTGTTCGTAGAGAAATGGCAGAGTGGCAAACAAAAGCA
GAACCTTCTTCTGTAAGACAAGTAGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCGCTTTCTTTTCGCGGTAA
AGCAAATGGGCAGACTTATCAAATTTACTACAAATTGAAATCAAAAGAAGAACAGTTGGCTTTTCAAAATCTATCCAGTC
TTGTCACATTAACTGTTGAGGGGGAATTTGAATCTCCTGAAAAGCAGCGCAATTTTTCTGGTTTTGATTATCAAGTCTAT
CTAAAAACACAAGGGATTTATCGAAATTTAAAGGTGGATCAAATTTTGTCTAGTCAAGATAGGGTTAGCTTCCAACCGTT
TGAGTGGTTGTCTAGCTGGCGAAGAAAGGCATTGGTTTTCATTAAGAGAAATTTTCCAAATCCGATGAATCATTACATGA
CAGGGCTCTTATTTGGCGCATTAGAGACAGATTTTGATGAAATGAGTGATCTTTATTCTAGCTTGGGAATTATTCATTTA
TTTGCGTTATCTGGCATGCAAGTTGGCTTTTTTATGGAAGGATTTCGTAAGTTGCTTTTGAAGCTGGGACTTACGAAGGA
AATGGTTCATAAATGCCAATATCCGTTTTCTTTCTTTTATGCAGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAATTCAGAAATTATTGTCGCAACATGGTATCACTAAATTAGATAATTTTGCTTTAACAATAATGGTGTTGTCCTTGCTT
ATGCCATCTTTTCTTTTAACGGCAGGAGGAGTTCTCTCCTGTGCCTATGCTTTTGTTATTAGTGTTATAGATTTTGAAAG
TCTGACTTCTTGGAGAAAAGTTGTGGTAGAGAGTAGTATCATTTCACTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GAGAATTTCAACCTTGGTCTATTTTATTGACATTTGTTTTTTCACTAATTTTTGATATAGTGATGCTACCGGGTTTAACA
CTGATTTTTCTCATTTCACCTTTCATAAAGCTCATTCAAGTAAATTTTCTTTTTGAAGGCTTAGAAAATAGTATTCGTTG
GATAGCAAATGTCTTTGGCAGACCAATCGTTTTTGGGCAACCTAGCTCGCTTTTATTAGTTGTTATGCTGCTTGTACTGG
CTATTTTGTATGATGTTAGAAAAAATAAAAAATGGGTGATATTGCTCAGCTTGTGTCTTGCTATACTATTTTTTATAACT
AAATTTCCTTTGCAAAATGAGATCACAATGGTTGATGTTGGACAGGGAGATAGTCTTTTTTTGAGAGACTGGAAAGGTAG
AAATGTATTGATTGACGTTGGAGGACGTGAAGAAATTAGAACAAAAGAATCTTGGCAAAAACGAACAACTAAATCAAATG
CGGAGAAAACTTTGATTCCTTACCTAAAAAGTCGTGGTATAGATACGATTGATACTCTAATCTTAACAAATCCTAATGCA
GATTATGCGGGAGATGTATTAAAAGTGGTTAAAAAGTTTGCGGTAAAGAAAATTTTTATTTCCAGAAGTAGTTTGAATGA
TGCAGACTTTTTAAATAAATTAAAGGAGACAAAGACGTTTGTTCATGTCGTAAAACAAGGAGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCTGGTACGAATAAGAATGATCAATCGCTAGTTTTATATGGTCAATTTTTTCGTACAAGG
TTTTTATTTATGAGCAATTTAACAGAAGAGGATGAAGTAAAGCTAATGCAACTTTATCCAAAGCTAAAAACAGATGTCTT
GAAGGTTGGGCAACATGGAGCCAAAAATTCATCACATTCAAAATTTTTACAGCAAATAGAACCGGCAGTTGCACTCATTT
CTGTTGGGAAAAATAACCAATCCAAATCTCCGAATCAAGAAATGATAGAGCGATTGAATCGCTTAAATGCGAAGATTTAT
CGAACAGATGAACGAGGGGCGATCAAGTTTTCAGGATGGACAAAATGGCAATTAGAAACTGTTCAATAG
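The nucleotide sequence is given in coding orientation, so it can be translated directly to check it against the protein sequence. The sketch below uses the standard genetic code and translates only the first 30 and last 12 nucleotides of the record to stay short; `translate` is an illustrative helper, not a database function.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGTCACAGTGGATTAAGATATTTCCGATT"))  # MSQWIKIFPI
print(translate("ACTGTTCAATAG"))                    # TVQ*
```

The two fragments match the start (MSQWIKIFPI) and the end (TVQ, followed by the stop codon) of the protein sequence listed above.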
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis NCTC 12261 | 54.826 | 100 | 0.551 |
| comEC/celB | Streptococcus mitis SK321 | 54.752 | 100 | 0.551 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 53.949 | 100 | 0.543 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 53.146 | 100 | 0.535 |
| comEC/celB | Streptococcus pneumoniae D39 | 53.146 | 100 | 0.535 |
| comEC/celB | Streptococcus pneumoniae R6 | 53.146 | 100 | 0.535 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 46.433 | 100 | 0.465 |