Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SanJ4206_RS04350 | Genome accession | NZ_CP012719 |
| Coordinates | 890104..892332 (+) | Length | 742 a.a. |
| NCBI ID | WP_003029858.1 | Uniprot ID | - |
| Organism | Streptococcus anginosus strain J4206 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 885104..897332
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SanJ4206_RS04325 | rpmG | 885957..886106 (+) | 150 | WP_003024746.1 | 50S ribosomal protein L33 | - |
| SanJ4206_RS04330 (SanJ4206_0819) | secG | 886146..886379 (+) | 234 | WP_003024744.1 | preprotein translocase subunit SecG | - |
| SanJ4206_RS04335 (SanJ4206_0820) | rnr | 886471..888804 (+) | 2334 | WP_003029862.1 | ribonuclease R | - |
| SanJ4206_RS04340 (SanJ4206_0821) | smpB | 888767..889234 (+) | 468 | WP_003029860.1 | SsrA-binding protein SmpB | - |
| SanJ4206_RS04345 (SanJ4206_0822) | comEA/celA/cilE | 889416..890120 (+) | 705 | WP_003029859.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| SanJ4206_RS04350 (SanJ4206_0823) | comEC/celB | 890104..892332 (+) | 2229 | WP_003029858.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SanJ4206_RS04355 (SanJ4206_0824) | - | 892413..893111 (+) | 699 | WP_003029857.1 | DUF805 domain-containing protein | - |
| SanJ4206_RS09840 (SanJ4206_0825c) | - | 893254..893412 (-) | 159 | WP_003042740.1 | hypothetical protein | - |
| SanJ4206_RS04360 (SanJ4206_0826) | - | 893655..894641 (+) | 987 | WP_022525428.1 | Gfo/Idh/MocA family protein | - |
| SanJ4206_RS04365 (SanJ4206_0827) | - | 894663..895316 (+) | 654 | WP_003029851.1 | uracil-DNA glycosylase | - |
| SanJ4206_RS04370 (SanJ4206_0828) | - | 895353..895823 (+) | 471 | WP_003029850.1 | NUDIX hydrolase | - |
| SanJ4206_RS04375 (SanJ4206_0829) | - | 895836..897113 (+) | 1278 | WP_003029848.1 | dihydroorotase | - |
Sequence
Protein
Download Length: 742 a.a. Molecular weight: 85067.86 Da Isoelectric Point: 10.0445
>NTDB_id=156782 SanJ4206_RS04350 WP_003029858.1 890104..892332(+) (comEC/celB) [Streptococcus anginosus strain J4206]
MSQWIKRFPIKPIYIAFLLVWLYFAIYQSSWLACLGFIFLVIRLFRFYSPKEYFMTFMILSCFAGFFFVRREMVEQKTKV
APSPVRQVAVLPDTIKVNGDSLSFRGKTNGQTYQVYYKLKSKEEQLAFQNLSSLAILTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHTFEWLSSWRRRALVFIKKHFPNPMSNYMTGLLFGALDTDFDEMNNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRKIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLT
LIFLSSPLIKLTQVNFLFEGLENSIRWIASVFGRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLLLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATSSNAEKTLIPYLKSRGIDTIDALVLTNPNP
DYAGDVLEVAKKFAIKKIYISRSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDHHLQVLSGASKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKDSSSSKFLQQIEPTVALISVGKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK
MSQWIKRFPIKPIYIAFLLVWLYFAIYQSSWLACLGFIFLVIRLFRFYSPKEYFMTFMILSCFAGFFFVRREMVEQKTKV
APSPVRQVAVLPDTIKVNGDSLSFRGKTNGQTYQVYYKLKSKEEQLAFQNLSSLAILTVEGEFEIPEKKRNFAGFDYQSY
LKTQGIYRILKVDTILSSQDRISLHTFEWLSSWRRRALVFIKKHFPNPMSNYMTGLLFGALDTDFDEMNNLYSSLGIIHL
FALSGMQVGFFMEGFRKLLLRLGLTQEMVHKCQYPFSFFYAGMTGFSVSVVRSLVQKLLSQHGITKLDNFALTMMILSLI
MPSFLLTAGGVLSCAYAFVISVIDFESLTSWRKIVVESSVISLGVLPILIFYFGEFQPWSILLTFVFSLIFDTMMLPGLT
LIFLSSPLIKLTQVNFLFEGLENSIRWIASVFGRPIVFGQPSPLLLIVMLLVLAILYDIRKNKKWVIFLSLLLSLLFFIN
KFPLQNEITMVDVGQGDSIFLRDWKGRNVLIDVGGREEIRTKEAWQKRATSSNAEKTLIPYLKSRGIDTIDALVLTNPNP
DYAGDVLEVAKKFAIKKIYISRSSLSNADFLEKLRKINTFIHVVKQGDKLPIFDHHLQVLSGASKNDHSIVLYGQFFRTR
FLFASDLKEEGEAKLMQHYPKLKTDVLKVGQHGAKDSSSSKFLQQIEPTVALISVGKNNQSKQPSQDTIERFAQLPAKVY
RTDEQGAVKFSGWTNWRLEMVK
Nucleotide
Download Length: 2229 bp
>NTDB_id=156782 SanJ4206_RS04350 WP_003029858.1 890104..892332(+) (comEC/celB) [Streptococcus anginosus strain J4206]
ATGTCACAGTGGATTAAAAGATTTCCGATTAAGCCAATTTACATAGCTTTTTTGCTTGTCTGGTTATATTTTGCAATCTA
CCAAAGTAGTTGGTTAGCTTGTCTTGGTTTTATCTTTCTAGTGATTCGTCTTTTTCGCTTTTATTCACCGAAAGAATATT
TCATGACCTTCATGATCCTCTCTTGTTTTGCTGGTTTTTTCTTTGTTCGTAGGGAAATGGTAGAGCAGAAGACGAAAGTA
GCACCCTCTCCCGTAAGACAAGTAGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCACTTTCTTTCCGTGGTAA
AACAAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGCTATATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACACTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAGAGCACTGGTATTTATCAAGAAGCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACAGACTTCGATGAAATGAACAATCTTTACTCTAGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGTTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGCTGGGTCTTACACAGGA
AATGGTCCATAAATGTCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTACTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCTTCCTTTCTTTTAACGGCAGGAGGAGTTCTCTCTTGCGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAAGATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTGATTTTCGATACAATGATGCTGCCAGGCTTGACG
TTGATTTTTCTTTCTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GATAGCAAGTGTCTTTGGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGCTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCTTTGCAAAATGAGATCACAATGGTTGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGTGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCGACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGCATAGATACGATTGATGCTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTTCTAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCGGGTGCTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAGGACTCATCAAGTTCAAAATTTTTGCAACAAATAGAACCAACGGTTGCGCTCATTT
CTGTCGGGAAAAATAATCAATCTAAGCAACCAAGCCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAAGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA
ATGTCACAGTGGATTAAAAGATTTCCGATTAAGCCAATTTACATAGCTTTTTTGCTTGTCTGGTTATATTTTGCAATCTA
CCAAAGTAGTTGGTTAGCTTGTCTTGGTTTTATCTTTCTAGTGATTCGTCTTTTTCGCTTTTATTCACCGAAAGAATATT
TCATGACCTTCATGATCCTCTCTTGTTTTGCTGGTTTTTTCTTTGTTCGTAGGGAAATGGTAGAGCAGAAGACGAAAGTA
GCACCCTCTCCCGTAAGACAAGTAGCAGTTCTACCTGACACGATTAAGGTAAATGGAGATTCACTTTCTTTCCGTGGTAA
AACAAATGGGCAGACTTATCAAGTTTACTACAAATTGAAATCAAAGGAAGAACAGTTGGCTTTTCAAAATCTCTCTAGTC
TGGCTATATTGACCGTTGAAGGGGAATTTGAAATCCCTGAGAAGAAGCGTAATTTTGCTGGTTTTGATTACCAATCCTAT
TTAAAAACGCAAGGGATTTATCGAATTTTAAAAGTGGATACCATTTTATCGAGCCAAGATAGAATCAGCTTGCACACTTT
TGAGTGGCTTTCTAGCTGGCGAAGAAGAGCACTGGTATTTATCAAGAAGCATTTTCCAAATCCGATGAGTAATTACATGA
CAGGACTCTTATTTGGTGCCTTGGATACAGACTTCGATGAAATGAACAATCTTTACTCTAGCTTGGGGATTATTCATTTA
TTTGCGCTGTCTGGCATGCAAGTTGGTTTTTTTATGGAGGGTTTTCGTAAGTTGCTGTTAAGGCTGGGTCTTACACAGGA
AATGGTCCATAAATGTCAATATCCATTTTCTTTCTTCTATGCGGGAATGACTGGATTTTCAGTATCCGTTGTACGGAGCT
TAGTTCAGAAATTACTGTCGCAACATGGTATCACTAAGTTAGATAATTTTGCTTTAACGATGATGATATTGTCCTTGATT
ATGCCTTCCTTTCTTTTAACGGCAGGAGGAGTTCTCTCTTGCGCCTATGCTTTTGTTATTAGTGTGATAGATTTTGAAAG
TCTGACTTCTTGGCGAAAGATTGTTGTAGAGAGCAGTGTCATTTCGCTTGGTGTTTTACCGATTCTAATCTTTTATTTTG
GTGAATTTCAACCTTGGTCTATTTTGTTGACATTTGTTTTTTCACTGATTTTCGATACAATGATGCTGCCAGGCTTGACG
TTGATTTTTCTTTCTTCGCCTTTGATAAAGCTAACTCAGGTCAATTTTCTATTTGAAGGGTTAGAAAATAGCATTCGTTG
GATAGCAAGTGTCTTTGGCAGACCAATCGTTTTTGGGCAACCCAGTCCGCTTTTGCTGATTGTTATGCTACTTGTACTGG
CTATTTTGTATGATATTAGAAAAAATAAAAAATGGGTAATATTTCTTAGTCTGCTTCTTTCATTACTCTTTTTCATAAAT
AAATTTCCTTTGCAAAATGAGATCACAATGGTTGATGTTGGGCAGGGAGATAGTATTTTTCTGAGAGACTGGAAAGGACG
AAATGTATTGATTGATGTTGGTGGACGTGAGGAAATTAGAACAAAAGAAGCTTGGCAAAAACGAGCGACTAGCTCAAATG
CAGAGAAAACTTTGATTCCTTACTTAAAAAGTCGTGGCATAGATACGATTGATGCTTTAGTCTTAACAAATCCTAATCCA
GATTATGCAGGAGATGTATTAGAAGTTGCTAAAAAATTTGCAATAAAGAAAATTTATATTTCTAGAAGTAGTCTAAGCAA
TGCAGATTTTCTAGAGAAATTAAGGAAAATAAACACATTCATTCATGTTGTAAAACAAGGCGACAAACTTCCTATTTTTG
ATCATCATTTGCAAGTTCTTTCGGGTGCTAGCAAGAACGACCATTCGATCGTTTTATATGGTCAGTTTTTCCGTACAAGA
TTTTTATTTGCGAGCGACTTGAAAGAAGAGGGCGAAGCAAAGTTAATGCAGCATTATCCAAAGTTGAAAACAGATGTTTT
GAAAGTCGGACAGCATGGAGCCAAGGACTCATCAAGTTCAAAATTTTTGCAACAAATAGAACCAACGGTTGCGCTCATTT
CTGTCGGGAAAAATAATCAATCTAAGCAACCAAGCCAAGATACAATAGAGCGATTTGCACAATTGCCTGCTAAAGTCTAT
CGGACAGATGAACAAGGGGCAGTTAAATTTTCAGGGTGGACAAACTGGCGATTAGAGATGGTCAAGTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
55.556 |
100 |
0.559 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
54.96 |
100 |
0.553 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
54.618 |
100 |
0.55 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
53.681 |
100 |
0.54 |
| comEC/celB | Streptococcus pneumoniae D39 |
53.681 |
100 |
0.54 |
| comEC/celB | Streptococcus pneumoniae R6 |
53.681 |
100 |
0.54 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
47.39 |
100 |
0.477 |