Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | I6K87_RS03120 | Genome accession | NZ_CP069892 |
| Coordinates | 632130..634358 (-) | Length | 742 a.a. |
| NCBI ID | WP_204982753.1 | Uniprot ID | - |
| Organism | Streptococcus anginosus strain FDAARGOS_1357 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 627130..639358
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| I6K87_RS03090 (I6K87_03090) | - | 627347..628624 (-) | 1278 | WP_070497022.1 | dihydroorotase | - |
| I6K87_RS03095 (I6K87_03095) | - | 628637..629107 (-) | 471 | WP_204982751.1 | NUDIX hydrolase | - |
| I6K87_RS03100 (I6K87_03100) | - | 629144..629797 (-) | 654 | WP_049517463.1 | uracil-DNA glycosylase | - |
| I6K87_RS03105 (I6K87_03105) | - | 629821..630807 (-) | 987 | WP_204982752.1 | Gfo/Idh/MocA family protein | - |
| I6K87_RS03110 (I6K87_03110) | - | 631050..631208 (+) | 159 | WP_003042740.1 | hypothetical protein | - |
| I6K87_RS03115 (I6K87_03115) | - | 631351..632049 (-) | 699 | WP_003029857.1 | DUF805 domain-containing protein | - |
| I6K87_RS03120 (I6K87_03120) | comEC/celB | 632130..634358 (-) | 2229 | WP_204982753.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| I6K87_RS03125 (I6K87_03125) | comEA/celA/cilE | 634342..635046 (-) | 705 | WP_204982754.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| I6K87_RS03130 (I6K87_03130) | smpB | 635228..635695 (-) | 468 | WP_022526922.1 | SsrA-binding protein SmpB | - |
| I6K87_RS03135 (I6K87_03135) | rnr | 635658..637991 (-) | 2334 | WP_204982755.1 | ribonuclease R | - |
| I6K87_RS03140 (I6K87_03140) | secG | 638083..638316 (-) | 234 | WP_003024744.1 | preprotein translocase subunit SecG | - |
| I6K87_RS03145 (I6K87_03145) | rpmG | 638356..638505 (-) | 150 | WP_003024746.1 | 50S ribosomal protein L33 | - |
Sequence
Protein
Download Length: 742 a.a. Molecular weight: 85037.92 Da Isoelectric Point: 10.0271
>NTDB_id=537234 I6K87_RS03120 WP_204982753.1 632130..634358(-) (comEC/celB) [Streptococcus anginosus strain FDAARGOS_1357]
MSQWIKRFPIKPIYIAFLLVWLYFAIYQSSWLGWLGFIFLVICLFRFYSPKECFMTFMILICFAGFFFIRREIAEQKTKV
EPSPIRQVAVLPDTIKVNGDSLSFRGKANGQTYQVYYKLKSKEEQLAFQNLSSLVTLTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPFEWLSSWRRKALVFIKNHFPNPMSNYMTGLLFGALDTSFDEMSNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRNIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDAMMLPGLT
LIFLFSPLIKLTQVNFLFEGLENSIRWIASVFGRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLLLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATGSNAEKTLIPYLKSRGVDTIDTLVLTNPNP
DYAGDVLEVAKKFAIKKIYIARSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDHHLKVLSSASKNDHSIVLYGRFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKNSSHSKFLQQIEPAVALISVGKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK
MSQWIKRFPIKPIYIAFLLVWLYFAIYQSSWLGWLGFIFLVICLFRFYSPKECFMTFMILICFAGFFFIRREIAEQKTKV
EPSPIRQVAVLPDTIKVNGDSLSFRGKANGQTYQVYYKLKSKEEQLAFQNLSSLVTLTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPFEWLSSWRRKALVFIKNHFPNPMSNYMTGLLFGALDTSFDEMSNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRNIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDAMMLPGLT
LIFLFSPLIKLTQVNFLFEGLENSIRWIASVFGRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLLLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATGSNAEKTLIPYLKSRGVDTIDTLVLTNPNP
DYAGDVLEVAKKFAIKKIYIARSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDHHLKVLSSASKNDHSIVLYGRFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKNSSHSKFLQQIEPAVALISVGKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK
Nucleotide
Download Length: 2229 bp
>NTDB_id=537234 I6K87_RS03120 WP_204982753.1 632130..634358(-) (comEC/celB) [Streptococcus anginosus strain FDAARGOS_1357]
ATGTCACAGTGGATTAAAAGATTTCCGATTAAGCCGATTTACATTGCTTTTTTGCTCGTCTGGTTGTATTTTGCAATCTA
TCAAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTGATTTGTCTTTTTCGCTTTTATTCACCGAAAGAATGTT
TCATGACCTTCATGATTCTAATTTGTTTTGCTGGTTTTTTCTTTATTCGTAGGGAAATAGCAGAGCAGAAGACGAAAGTA
GAACCTTCTCCCATAAGACAAGTGGCAGTTCTACCTGACACGATTAAGGTAAATGGTGATTCGCTTTCTTTTCGTGGTAA
AGCTAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTACATTGACTGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGCACTGGTATTTATCAAGAACCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACATCCTTTGACGAAATGAGCAATCTTTACTCTAGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGCTGGGTCTTACACAGGA
AATGGTTCATAAATGCCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTATTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCTTCCTTTCTTTTAACGGCGGGAGGAGTTCTCTCTTGCGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAACATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTGATTTTCGATGCAATGATGTTGCCAGGATTGACG
TTGATTTTTCTTTTTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GATAGCAAGTGTCTTTGGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGCTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCCTTGCAAAATGAGATCACAATGGTTGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGTGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAGAAACGAGCAACTGGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGTGTAGATACGATTGATACTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTGCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATCATCATTTGAAAGTTCTTTCGAGTGCTAGCAAGAACGACCATTCGATTGTTTTATATGGTCGGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAAAATTCATCACATTCAAAGTTTTTACAGCAAATAGAGCCCGCAGTTGCGCTCATTT
CTGTTGGGAAAAATAATCAATCTAAGCAACCAAGTCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAAGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA
ATGTCACAGTGGATTAAAAGATTTCCGATTAAGCCGATTTACATTGCTTTTTTGCTCGTCTGGTTGTATTTTGCAATCTA
TCAAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTGATTTGTCTTTTTCGCTTTTATTCACCGAAAGAATGTT
TCATGACCTTCATGATTCTAATTTGTTTTGCTGGTTTTTTCTTTATTCGTAGGGAAATAGCAGAGCAGAAGACGAAAGTA
GAACCTTCTCCCATAAGACAAGTGGCAGTTCTACCTGACACGATTAAGGTAAATGGTGATTCGCTTTCTTTTCGTGGTAA
AGCTAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTACATTGACTGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGCACTGGTATTTATCAAGAACCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACATCCTTTGACGAAATGAGCAATCTTTACTCTAGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGCTGGGTCTTACACAGGA
AATGGTTCATAAATGCCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTATTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCTTCCTTTCTTTTAACGGCGGGAGGAGTTCTCTCTTGCGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAACATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTGATTTTCGATGCAATGATGTTGCCAGGATTGACG
TTGATTTTTCTTTTTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GATAGCAAGTGTCTTTGGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGCTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCCTTGCAAAATGAGATCACAATGGTTGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGTGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAGAAACGAGCAACTGGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGTGTAGATACGATTGATACTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTGCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATCATCATTTGAAAGTTCTTTCGAGTGCTAGCAAGAACGACCATTCGATTGTTTTATATGGTCGGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAAAATTCATCACATTCAAAGTTTTTACAGCAAATAGAGCCCGCAGTTGCGCTCATTT
CTGTTGGGAAAAATAATCAATCTAAGCAACCAAGTCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAAGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
55.154 |
100 |
0.555 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
54.558 |
100 |
0.549 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
54.083 |
100 |
0.544 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
53.28 |
100 |
0.536 |
| comEC/celB | Streptococcus pneumoniae D39 |
53.28 |
100 |
0.536 |
| comEC/celB | Streptococcus pneumoniae R6 |
53.28 |
100 |
0.536 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
47.043 |
100 |
0.472 |