Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | PUW49_RS05655 | Genome accession | NZ_CP118054 |
| Coordinates | 1124602..1126830 (-) | Length | 742 a.a. |
| NCBI ID | WP_264341579.1 | Uniprot ID | - |
| Organism | Streptococcus anginosus strain VSI37 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1119602..1131830
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| PUW49_RS05630 (PUW49_05630) | - | 1119870..1121147 (-) | 1278 | WP_070815179.1 | dihydroorotase | - |
| PUW49_RS05635 (PUW49_05635) | - | 1121160..1121630 (-) | 471 | WP_024051595.1 | 8-oxo-dGTP diphosphatase | - |
| PUW49_RS05640 (PUW49_05640) | - | 1121667..1122320 (-) | 654 | WP_024051596.1 | uracil-DNA glycosylase | - |
| PUW49_RS05645 (PUW49_05645) | - | 1122344..1123330 (-) | 987 | WP_024051597.1 | Gfo/Idh/MocA family oxidoreductase | - |
| PUW49_RS05650 (PUW49_05650) | - | 1123960..1124520 (-) | 561 | WP_224783657.1 | DUF805 domain-containing protein | - |
| PUW49_RS05655 (PUW49_05655) | comEC/celB | 1124602..1126830 (-) | 2229 | WP_264341579.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| PUW49_RS05660 (PUW49_05660) | comEA/celA/cilE | 1126814..1127518 (-) | 705 | WP_024051601.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| PUW49_RS05665 (PUW49_05665) | smpB | 1127701..1128168 (-) | 468 | WP_024051602.1 | SsrA-binding protein SmpB | - |
| PUW49_RS05670 (PUW49_05670) | rnr | 1128131..1130470 (-) | 2340 | WP_101750372.1 | ribonuclease R | - |
| PUW49_RS05675 (PUW49_05675) | secG | 1130562..1130795 (-) | 234 | WP_003024744.1 | preprotein translocase subunit SecG | - |
| PUW49_RS05680 (PUW49_05680) | rpmG | 1130835..1130984 (-) | 150 | WP_003024746.1 | 50S ribosomal protein L33 | - |
Sequence
Protein
Download Length: 742 a.a. Molecular weight: 85069.90 Da Isoelectric Point: 9.9550
>NTDB_id=789677 PUW49_RS05655 WP_264341579.1 1124602..1126830(-) (comEC/celB) [Streptococcus anginosus strain VSI37]
MSQWIKKFPIKPIYIAFLLVWLYFAIYPSSWLGWLGFIFLVIRLFRFYSPKECFMTFMILSCFASFFFIRREIAEQQTKA
EPSPIKQVAVLPDTIKVNGDSLSFRGKANRQTYQVYHKLKSKEEQLAFQNLSSLVALTVEGDFEIPEKKRDFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPLEWLSSWRRKALVFIKNHFPNPMSNYMTGLLFGALDTDFDEMSNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGFTQEMVRKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLSAGGVLSCVYAFVISVIDFEGLASWRKVVVESSVISLGILPILIFYFGEFQPWSILLTFVFSLIFDAMMLPGLT
LIFLFSPLIKLTQVNFLFEGLENSIRWIASVFGKPIVFGQPSPLLLIVMLLVLAILYDVRKNKKWVIFLSLFLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGKNVLIDVGGREEIRTKEAWQKRAISSNAEKTLIPYLKSRGVDTIDTLVLTNPNP
DYAGDVLGVAKKFAIKKIYISRSSLSNADFLEKLRKLNTFIHVVKQGDKLPIFDHHLQVLSGVSKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHEAKDSSSSKFLQQIEPTVALISVEKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK
MSQWIKKFPIKPIYIAFLLVWLYFAIYPSSWLGWLGFIFLVIRLFRFYSPKECFMTFMILSCFASFFFIRREIAEQQTKA
EPSPIKQVAVLPDTIKVNGDSLSFRGKANRQTYQVYHKLKSKEEQLAFQNLSSLVALTVEGDFEIPEKKRDFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPLEWLSSWRRKALVFIKNHFPNPMSNYMTGLLFGALDTDFDEMSNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGFTQEMVRKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLSAGGVLSCVYAFVISVIDFEGLASWRKVVVESSVISLGILPILIFYFGEFQPWSILLTFVFSLIFDAMMLPGLT
LIFLFSPLIKLTQVNFLFEGLENSIRWIASVFGKPIVFGQPSPLLLIVMLLVLAILYDVRKNKKWVIFLSLFLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGKNVLIDVGGREEIRTKEAWQKRAISSNAEKTLIPYLKSRGVDTIDTLVLTNPNP
DYAGDVLGVAKKFAIKKIYISRSSLSNADFLEKLRKLNTFIHVVKQGDKLPIFDHHLQVLSGVSKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHEAKDSSSSKFLQQIEPTVALISVEKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK
Nucleotide
Download Length: 2229 bp
>NTDB_id=789677 PUW49_RS05655 WP_264341579.1 1124602..1126830(-) (comEC/celB) [Streptococcus anginosus strain VSI37]
ATGTCACAGTGGATTAAAAAATTTCCGATTAAGCCGATTTACATTGCTTTCTTACTTGTCTGGTTATATTTTGCAATCTA
TCCAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTAATTCGTCTTTTTCGCTTTTATTCACCGAAAGAATGTT
TCATGACCTTCATGATCCTCTCTTGTTTTGCTAGTTTTTTCTTTATTCGTAGGGAAATAGCAGAGCAGCAAACAAAAGCA
GAGCCTTCTCCCATAAAACAAGTGGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCGCTTTCTTTTCGTGGTAA
AGCTAATAGGCAGACTTATCAAGTTTACCACAAATTGAAATCAAAGGAAGAGCAGTTGGCTTTTCAAAATCTCTCTAGTT
TGGTTGCGTTGACCGTTGAAGGAGATTTTGAAATTCCTGAGAAGAAGCGTGATTTTGCTGGTTTTGATTACCAATCTTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTAGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACCCTTT
GGAGTGGCTCTCTAGCTGGCGAAGAAAGGCACTGGTATTTATCAAGAACCATTTTCCAAATCCGATGAGTAATTATATGA
CGGGGCTCTTATTTGGCGCCTTGGATACAGACTTTGACGAAATGAGCAATCTTTATTCTAGCTTGGGAATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGATTTCGCAAGTTACTGTTAAGGCTGGGGTTCACACAAGA
AATGGTTCGTAAATGCCAATATCCATTTTCTTTCTTTTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTATTATCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCGTCCTTTCTTTTGTCGGCAGGAGGAGTTCTCTCCTGTGTCTATGCTTTTGTCATTAGTGTGATAGATTTTGAAGG
TCTGGCTTCTTGGCGAAAAGTTGTTGTAGAGAGTAGCGTTATTTCACTTGGTATTTTACCGATTCTAATCTTTTACTTTG
GCGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTGATTTTTGATGCAATGATGTTGCCAGGCTTGACG
TTAATTTTTCTTTTTTCGCCTTTGATAAAGTTGACTCAAGTCAATTTTCTATTTGAAGGTTTAGAAAATAGTATTCGTTG
GATAGCAAGTGTCTTTGGTAAACCAATCGTTTTTGGGCAGCCCAGTCCGCTTTTGCTGATTGTTATGCTGCTTGTACTGG
CTATTTTGTATGATGTTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGTTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCTTTGCAAAACGAGATCACAATGGTGGATGTCGGGCAAGGAGATAGTATTTTTCTGAGAGACTGGAAAGGAAA
AAATGTATTGATTGATGTTGGCGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCAATTAGCTCAAATG
CAGAAAAAACTTTGATTCCTTATCTAAAAAGTCGTGGTGTAGATACGATTGATACTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGGAGTTGCTAAAAAGTTTGCAATAAAAAAAATTTATATTTCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAACTAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCGGGTGTTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCAAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGAAGCCAAGGACTCATCAAGTTCAAAATTTTTGCAACAAATAGAACCAACGGTTGCGCTCATTT
CTGTTGAGAAAAATAATCAATCTAAGCAACCTAGTCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAAGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA
ATGTCACAGTGGATTAAAAAATTTCCGATTAAGCCGATTTACATTGCTTTCTTACTTGTCTGGTTATATTTTGCAATCTA
TCCAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTAATTCGTCTTTTTCGCTTTTATTCACCGAAAGAATGTT
TCATGACCTTCATGATCCTCTCTTGTTTTGCTAGTTTTTTCTTTATTCGTAGGGAAATAGCAGAGCAGCAAACAAAAGCA
GAGCCTTCTCCCATAAAACAAGTGGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCGCTTTCTTTTCGTGGTAA
AGCTAATAGGCAGACTTATCAAGTTTACCACAAATTGAAATCAAAGGAAGAGCAGTTGGCTTTTCAAAATCTCTCTAGTT
TGGTTGCGTTGACCGTTGAAGGAGATTTTGAAATTCCTGAGAAGAAGCGTGATTTTGCTGGTTTTGATTACCAATCTTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTAGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACCCTTT
GGAGTGGCTCTCTAGCTGGCGAAGAAAGGCACTGGTATTTATCAAGAACCATTTTCCAAATCCGATGAGTAATTATATGA
CGGGGCTCTTATTTGGCGCCTTGGATACAGACTTTGACGAAATGAGCAATCTTTATTCTAGCTTGGGAATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGATTTCGCAAGTTACTGTTAAGGCTGGGGTTCACACAAGA
AATGGTTCGTAAATGCCAATATCCATTTTCTTTCTTTTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTATTATCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCGTCCTTTCTTTTGTCGGCAGGAGGAGTTCTCTCCTGTGTCTATGCTTTTGTCATTAGTGTGATAGATTTTGAAGG
TCTGGCTTCTTGGCGAAAAGTTGTTGTAGAGAGTAGCGTTATTTCACTTGGTATTTTACCGATTCTAATCTTTTACTTTG
GCGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTGATTTTTGATGCAATGATGTTGCCAGGCTTGACG
TTAATTTTTCTTTTTTCGCCTTTGATAAAGTTGACTCAAGTCAATTTTCTATTTGAAGGTTTAGAAAATAGTATTCGTTG
GATAGCAAGTGTCTTTGGTAAACCAATCGTTTTTGGGCAGCCCAGTCCGCTTTTGCTGATTGTTATGCTGCTTGTACTGG
CTATTTTGTATGATGTTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGTTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCTTTGCAAAACGAGATCACAATGGTGGATGTCGGGCAAGGAGATAGTATTTTTCTGAGAGACTGGAAAGGAAA
AAATGTATTGATTGATGTTGGCGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCAATTAGCTCAAATG
CAGAAAAAACTTTGATTCCTTATCTAAAAAGTCGTGGTGTAGATACGATTGATACTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGGAGTTGCTAAAAAGTTTGCAATAAAAAAAATTTATATTTCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAACTAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCGGGTGTTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCAAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGAAGCCAAGGACTCATCAAGTTCAAAATTTTTGCAACAAATAGAACCAACGGTTGCGCTCATTT
CTGTTGAGAAAAATAATCAATCTAAGCAACCTAGTCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAAGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
54.485 |
100 |
0.549 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
53.753 |
100 |
0.54 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
53.414 |
100 |
0.538 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
52.61 |
100 |
0.53 |
| comEC/celB | Streptococcus pneumoniae D39 |
52.61 |
100 |
0.53 |
| comEC/celB | Streptococcus pneumoniae R6 |
52.61 |
100 |
0.53 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
47.383 |
100 |
0.476 |