Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SANR_RS04795 | Genome accession | NC_022239 |
| Coordinates | 967321..969549 (+) | Length | 742 a.a. |
| NCBI ID | WP_020999585.1 | Uniprot ID | - |
| Organism | Streptococcus anginosus C238 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 962321..974549
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SANR_RS11130 (SANR_0936) | rpmG | 963183..963332 (+) | 150 | WP_003034401.1 | 50S ribosomal protein L33 | - |
| SANR_RS04775 (SANR_0937) | secG | 963372..963605 (+) | 234 | WP_003024744.1 | preprotein translocase subunit SecG | - |
| SANR_RS04780 (SANR_0938) | rnr | 963696..966035 (+) | 2340 | WP_003034491.1 | ribonuclease R | - |
| SANR_RS04785 (SANR_0939) | smpB | 965998..966465 (+) | 468 | WP_003034458.1 | SsrA-binding protein SmpB | - |
| SANR_RS04790 (SANR_0940) | comEA/celA/cilE | 966633..967337 (+) | 705 | WP_020999584.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| SANR_RS04795 (SANR_0941) | comEC/celB | 967321..969549 (+) | 2229 | WP_020999585.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SANR_RS04800 (SANR_0942) | - | 969630..970322 (+) | 693 | WP_020999586.1 | DUF805 domain-containing protein | - |
| SANR_RS04805 (SANR_0943) | - | 970827..971813 (+) | 987 | WP_003034384.1 | Gfo/Idh/MocA family protein | - |
| SANR_RS04810 (SANR_0944) | - | 971835..972488 (+) | 654 | WP_003034489.1 | uracil-DNA glycosylase | - |
| SANR_RS04815 (SANR_0945) | - | 972525..972995 (+) | 471 | WP_003031692.1 | NUDIX hydrolase | - |
| SANR_RS04820 (SANR_0946) | - | 973008..974285 (+) | 1278 | WP_003034545.1 | dihydroorotase | - |
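The sizes in the table follow from the coordinates if each range is read as 1-based and inclusive. That reading is an assumption, but it is consistent with every row listed. The sketch below recomputes the comEC/celB gene length, the protein length it implies once the stop codon is subtracted, and the overlap with the upstream comEA/celA/cilE gene. The helper names `span_length` and `overlap_length` are hypothetical and used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def overlap_length(a: tuple, b: tuple) -> int:
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Coordinates copied from the genomic context table.
com_ea = (966633, 967337)  # comEA/celA/cilE
com_ec = (967321, 969549)  # comEC/celB

gene_bp = span_length(*com_ec)            # gene length in bp
codons = gene_bp // 3                     # includes the terminal stop codon
protein_aa = codons - 1                   # residues, stop codon excluded
overlap_bp = overlap_length(com_ea, com_ec)

print(gene_bp, protein_aa, overlap_bp)
```

Under these assumptions the comEC/celB range gives 2229 bp, that is 743 codons, or 742 residues plus a stop codon, and the start of comEC/celB overlaps the end of comEA/celA/cilE by 17 bp.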
Sequence
Protein
Length: 742 a.a.; Molecular weight: 85066.91 Da; Isoelectric point: 9.9699
>NTDB_id=61675 SANR_RS04795 WP_020999585.1 967321..969549(+) (comEC/celB) [Streptococcus anginosus C238]
MSQWIKKFPIKPIYIAFLLVWLYFAIYQSSWLGWLGFIFLVIRLFRFYSPKECFMTFMILSCFVGFFFIRREMAERQANA
EPSPVKQIAVLPDTIKVNGDSLSFRGKTNGQTYQVYYKLKSKEEQLAFQNLSSLVILTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHPFEWLSSWRRKALVFIKKHFPNPMSNYMTGLLFGALDTDFDEMNNLYSGLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRKIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLT
LIFLSSPLIKLTQVNFLFEGLENSIRWLASVFGRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLLLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATSSNAEKTLIPYLKSRGIDTIDALVLTNPNP
DYAGDVLEVAKKFAIKKIYISRSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDYHLQVLSGASKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKDSSSSKFLQQIEPTVALISVGKNNQSKQPSQDIIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK
Nucleotide
Length: 2229 bp
>NTDB_id=61675 SANR_RS04795 WP_020999585.1 967321..969549(+) (comEC/celB) [Streptococcus anginosus C238]
ATGTCACAGTGGATTAAAAAATTTCCGATTAAGCCGATTTACATTGCTTTCTTACTCGTCTGGTTATATTTTGCAATCTA
TCAAAGTAGCTGGTTAGGCTGGTTGGGTTTTATCTTTCTGGTGATTCGTCTTTTTCGCTTTTATTCACCAAAAGAATGTT
TCATGACCTTCATGATTCTCTCTTGTTTTGTTGGTTTTTTCTTTATTCGTAGGGAAATGGCAGAGCGGCAAGCAAATGCA
GAACCTTCTCCCGTAAAACAAATAGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCACTTTCTTTCCGTGGTAA
AACAAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGTTATATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGACAGAATCAGCTTGCACCCTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAAAGCACTGGTATTTATCAAGAAGCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACAGACTTCGATGAAATGAACAATCTTTACTCTGGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGCTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGTTGGGTCTTACACAGGA
AATGGTCCATAAATGCCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGAAGCT
TAGTTCAGAAATTATTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCTTCCTTTCTTTTAACGGCAGGAGGAGTTCTCTCTTGCGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAAGATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTAATTTTCGATACAATGATGCTGCCAGGCTTGACG
TTGATTTTTCTTTCTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GCTAGCAAGTGTCTTTGGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGCTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCCTTGCAAAATGAGATCACAATGGTTGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGGGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCGACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGCATAGATACGATTGATGCTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTTCCAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATTATCATTTGCAAGTTCTTTCGGGTGCTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAGGACTCATCAAGTTCAAAATTTTTACAACAAATAGAACCAACGGTCGCTCTCATTT
CTGTTGGGAAAAATAATCAATCTAAGCAACCAAGTCAAGATATAATAGAGCGATTTGCACAATTGCCTGCTAAGGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA
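The protein and nucleotide FASTA headers above share one layout: an `NTDB_id`, the locus tag, the protein ID, the coordinates with strand, the gene name in parentheses, and the organism in square brackets. As a minimal sketch, the fields can be pulled apart with a regular expression. The pattern is inferred from these two headers only and is not a documented format, and `parse_header` is a hypothetical helper name.

```python
import re

# Pattern inferred from the headers in this record (an assumption, not a spec).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split one header line into named fields; raise if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header layout: {line!r}")
    record = match.groupdict()
    # Numeric fields are converted so they can be used in arithmetic.
    for key in ("ntdb_id", "start", "end"):
        record[key] = int(record[key])
    return record


header = (
    ">NTDB_id=61675 SANR_RS04795 WP_020999585.1 967321..969549(+) "
    "(comEC/celB) [Streptococcus anginosus C238]"
)
record = parse_header(header)
print(record["locus_tag"], record["start"], record["end"], record["strand"])
```

A header that deviates from this layout raises `ValueError` rather than yielding partial fields, which keeps silent mis-parses out of downstream tables.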
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 | 55.615 | 100 | 0.561 |
| comEC/celB | Streptococcus mitis NCTC 12261 | 54.886 | 100 | 0.553 |
| comEC/celB | Streptococcus pneumoniae TIGR4 | 54.545 | 100 | 0.55 |
| comEC/celB | Streptococcus pneumoniae Rx1 | 53.61 | 100 | 0.54 |
| comEC/celB | Streptococcus pneumoniae D39 | 53.61 | 100 | 0.54 |
| comEC/celB | Streptococcus pneumoniae R6 | 53.61 | 100 | 0.54 |
| comEC | Lactococcus lactis subsp. cremoris KW2 | 47.383 | 100 | 0.476 |