Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | ABKA15_RS04675 | Genome accession | NZ_CP157941 |
| Coordinates | 944627..946855 (-) | Length | 742 a.a. |
| NCBI ID | WP_049512791.1 | Uniprot ID | - |
| Organism | Streptococcus sp. KHUD_010 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 939627..951855
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ABKA15_RS04645 (ABKA15_04640) | - | 940094..940564 (-) | 471 | WP_003043000.1 | 8-oxo-dGTP diphosphatase | - |
| ABKA15_RS04650 (ABKA15_04645) | - | 940601..941254 (-) | 654 | WP_049513377.1 | uracil-DNA glycosylase | - |
| ABKA15_RS04655 (ABKA15_04650) | - | 941276..942262 (-) | 987 | WP_049513380.1 | Gfo/Idh/MocA family oxidoreductase | - |
| ABKA15_RS04660 (ABKA15_04655) | - | 942543..943480 (+) | 938 | Protein_907 | IS30 family transposase | - |
| ABKA15_RS04665 (ABKA15_04660) | - | 943558..943716 (+) | 159 | WP_185952229.1 | hypothetical protein | - |
| ABKA15_RS04670 (ABKA15_04665) | - | 943860..944546 (-) | 687 | WP_225620253.1 | DUF805 domain-containing protein | - |
| ABKA15_RS04675 (ABKA15_04670) | comEC/celB | 944627..946855 (-) | 2229 | WP_049512791.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| ABKA15_RS04680 (ABKA15_04675) | comEA/celA/cilE | 946839..947543 (-) | 705 | WP_049512789.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| ABKA15_RS04685 (ABKA15_04680) | smpB | 947725..948192 (-) | 468 | WP_049512786.1 | SsrA-binding protein SmpB | - |
| ABKA15_RS04690 (ABKA15_04685) | rnr | 948155..950488 (-) | 2334 | WP_049512784.1 | ribonuclease R | - |
| ABKA15_RS04695 (ABKA15_04690) | secG | 950580..950813 (-) | 234 | WP_003024744.1 | preprotein translocase subunit SecG | - |
| ABKA15_RS04700 (ABKA15_04695) | rpmG | 950853..951002 (-) | 150 | WP_003024746.1 | 50S ribosomal protein L33 | - |
Sequence
Protein
Download Length: 742 a.a. Molecular weight: 85261.19 Da Isoelectric Point: 10.0513
>NTDB_id=1009118 ABKA15_RS04675 WP_049512791.1 944627..946855(-) (comEC/celB) [Streptococcus sp. KHUD_010]
MSQWIKRFPIKPIYIAFLLVWLYFAIYQSSWLACLGFIFLVIRLFRFYSPKECFMTFMILSCFAGFFFVRREMVEQKTKV
EPSPVRQVAVLPDTIKVNGDSLSFRGKTNGQTYQVYYKLKSKEEQLAFQNLSSLVILTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPFEWLSSWRRKVLVFIKKHFPNPMSNYMTGLLFGALDTDFDEMNNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTIMVLSLL
MPSFLLTAGGVLSCAYAFVISVIDFESLTFWRKIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLT
LIFISSPLIKLTQVNFLFEGLENSIRWIASVFSRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLFLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATSSNAEKTLIPYLKSRGVDTIDTLVLTNPNP
DYAGDVLEVAKKFAIKKIYISRSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDHHLKVLSSASKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKNSSHSKFLQQIEPAVALISVGKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMIK
MSQWIKRFPIKPIYIAFLLVWLYFAIYQSSWLACLGFIFLVIRLFRFYSPKECFMTFMILSCFAGFFFVRREMVEQKTKV
EPSPVRQVAVLPDTIKVNGDSLSFRGKTNGQTYQVYYKLKSKEEQLAFQNLSSLVILTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPFEWLSSWRRKVLVFIKKHFPNPMSNYMTGLLFGALDTDFDEMNNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTIMVLSLL
MPSFLLTAGGVLSCAYAFVISVIDFESLTFWRKIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLT
LIFISSPLIKLTQVNFLFEGLENSIRWIASVFSRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLFLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATSSNAEKTLIPYLKSRGVDTIDTLVLTNPNP
DYAGDVLEVAKKFAIKKIYISRSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDHHLKVLSSASKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKNSSHSKFLQQIEPAVALISVGKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMIK
Nucleotide
Download Length: 2229 bp
>NTDB_id=1009118 ABKA15_RS04675 WP_049512791.1 944627..946855(-) (comEC/celB) [Streptococcus sp. KHUD_010]
ATGTCACAGTGGATTAAAAGATTTCCGATTAAGCCAATTTACATAGCTTTTTTGCTTGTCTGGTTGTATTTTGCAATCTA
CCAAAGTAGTTGGTTAGCTTGTCTTGGTTTTATCTTTCTGGTGATTCGTCTTTTTCGCTTTTATTCACCGAAAGAATGTT
TCATGACCTTCATGATCCTCTCTTGTTTTGCTGGTTTTTTCTTTGTTCGTAGGGAAATGGTAGAGCAGAAGACGAAAGTA
GAACCTTCTCCCGTAAGACAAGTAGCAGTTCTACCCGACACGATTAAGGTAAATGGAGATTCACTTTCTTTCCGTGGTAA
AACAAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTATATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGTACTGGTATTTATCAAGAAGCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACAGACTTCGATGAAATGAACAATCTTTACTCTAGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGCTGGGTCTTACACAGGA
AATGGTCCATAAATGCCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTATTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACAATAATGGTGTTGTCCTTGCTT
ATGCCATCTTTTCTTTTAACGGCAGGAGGAGTTCTCTCCTGTGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTTTTGGCGAAAGATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTACTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTGATTTTCGATACAATGATGCTGCCAGGCTTGACG
TTGATTTTTATTTCTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GATAGCAAGTGTCTTTAGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTTTGTTTCTTTCATTGCTCTTTTTCATAAAT
AAATTTCCTTTGCAAAATGAGATCACAATGGTGGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGTGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCGACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGTGTAGATACGATTGATACTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTTCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATCATCATTTGAAAGTTCTTTCGAGTGCTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAAAATTCATCACATTCAAAGTTTTTACAGCAAATAGAGCCCGCAGTTGCGCTCATTT
CTGTTGGGAAAAATAATCAATCTAAGCAACCAAGTCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAGGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGATCAAGTAA
ATGTCACAGTGGATTAAAAGATTTCCGATTAAGCCAATTTACATAGCTTTTTTGCTTGTCTGGTTGTATTTTGCAATCTA
CCAAAGTAGTTGGTTAGCTTGTCTTGGTTTTATCTTTCTGGTGATTCGTCTTTTTCGCTTTTATTCACCGAAAGAATGTT
TCATGACCTTCATGATCCTCTCTTGTTTTGCTGGTTTTTTCTTTGTTCGTAGGGAAATGGTAGAGCAGAAGACGAAAGTA
GAACCTTCTCCCGTAAGACAAGTAGCAGTTCTACCCGACACGATTAAGGTAAATGGAGATTCACTTTCTTTCCGTGGTAA
AACAAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTATATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGTACTGGTATTTATCAAGAAGCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACAGACTTCGATGAAATGAACAATCTTTACTCTAGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGCTGGGTCTTACACAGGA
AATGGTCCATAAATGCCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTATTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACAATAATGGTGTTGTCCTTGCTT
ATGCCATCTTTTCTTTTAACGGCAGGAGGAGTTCTCTCCTGTGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTTTTGGCGAAAGATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTACTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTGATTTTCGATACAATGATGCTGCCAGGCTTGACG
TTGATTTTTATTTCTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GATAGCAAGTGTCTTTAGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTTTGTTTCTTTCATTGCTCTTTTTCATAAAT
AAATTTCCTTTGCAAAATGAGATCACAATGGTGGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGTGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCGACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGTGTAGATACGATTGATACTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTTCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATCATCATTTGAAAGTTCTTTCGAGTGCTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAAAATTCATCACATTCAAAGTTTTTACAGCAAATAGAGCCCGCAGTTGCGCTCATTT
CTGTTGGGAAAAATAATCAATCTAAGCAACCAAGTCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAGGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGATCAAGTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
55.288 |
100 |
0.557 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
55.094 |
100 |
0.554 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
54.351 |
100 |
0.547 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
53.548 |
100 |
0.539 |
| comEC/celB | Streptococcus pneumoniae D39 |
53.548 |
100 |
0.539 |
| comEC/celB | Streptococcus pneumoniae R6 |
53.548 |
100 |
0.539 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
46.98 |
100 |
0.472 |