Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SON02_RS09120 | Genome accession | NZ_CP139211 |
| Coordinates | 2090803..2093040 (+) | Length | 745 a.a. |
| NCBI ID | WP_135027225.1 | Uniprot ID | - |
| Organism | Streptococcus sp. LysM4 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2085803..2098040
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SON02_RS09090 (SON02_09070) | - | 2086778..2087293 (+) | 516 | WP_135027232.1 | isoprenylcysteine carboxyl methyltransferase family protein | - |
| SON02_RS09095 (SON02_09075) | - | 2087739..2088146 (+) | 408 | WP_135027377.1 | GNAT family N-acetyltransferase | - |
| SON02_RS09100 (SON02_09080) | - | 2088260..2088517 (-) | 258 | WP_135027375.1 | GIY-YIG nuclease family protein | - |
| SON02_RS09105 (SON02_09085) | - | 2088525..2089274 (-) | 750 | WP_135027230.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| SON02_RS09110 (SON02_09090) | - | 2089370..2090116 (+) | 747 | WP_135027229.1 | lysophospholipid acyltransferase family protein | - |
| SON02_RS09115 (SON02_09095) | comEA/celA/cilE | 2090181..2090819 (+) | 639 | WP_135027227.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| SON02_RS09120 (SON02_09100) | comEC/celB | 2090803..2093040 (+) | 2238 | WP_135027225.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SON02_RS09125 (SON02_09105) | - | 2093745..2095268 (+) | 1524 | WP_135027223.1 | AMP-binding protein | - |
| SON02_RS09130 (SON02_09110) | holA | 2095448..2096479 (+) | 1032 | WP_135027221.1 | DNA polymerase III subunit delta | - |
| SON02_RS09135 | - | 2096573..2097045 (-) | 473 | Protein_1757 | IS630 family transposase | - |
| SON02_RS09140 (SON02_09120) | - | 2097083..2097409 (-) | 327 | WP_306299843.1 | IS630 transposase-related protein | - |
| SON02_RS09145 (SON02_09125) | - | 2097638..2097838 (+) | 201 | WP_135029514.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 745 a.a. Molecular weight: 85438.58 Da Isoelectric Point: 9.3994
>NTDB_id=908085 SON02_RS09120 WP_135027225.1 2090803..2093040(+) (comEC/celB) [Streptococcus sp. LysM4]
MSQLIKGLPLTPIHLAVLLLALYFVMHHLSFLSMVILVAMLLLLYFQQGKMIVYKVLPILACFFLLFGLQRMKVAMDAAS
TLTEISYLDVKPDTIHINGDSLSFRARSSGYRYMVFYQLKSQREQVYFKSLSHLVRLEVEATVSVPESQRNFNGFDYQDY
LGTQEIYRTVKISQIKKIAQQTSWNPLDWLSLLRRKLLVYIKEHFPNPMRHYMTGLLLGDLDKEFEQMSDLYSSLGIIHL
FALSGMQVGFFVDRFRSFFLRFGIRKEIVDWLQLPFAFIYASLTGFSVSVNRSLLQRILSNMGMSKLDNIACTIILSFLI
MPHFLLTVGGVLSFAYAFLLAVFDFEDLAAYKRVAVESLAISLAMFPLLIYYFYSFQPLSILLTFLFSFLFDLVLLPGLS
LVFLLSPLMKLTQVNALFVWLEACIRWIVDLDLKPLIFGKPTVVLLLILLLILLSLYDLYRNWKWCWGLISLLALLVFIV
KYPLENEVTVVDVGQGDSIFLRDIRGRTVLIDVGGRVNVATKEVWQQGTNQANAKRTLLPYLRSRGVSRIDYLILTHAHT
DHMGDLLEVVREMAIGRIYISEGSASSQKLAEILQSVKVRPHLVKVGDTIPIGDGFLHVLYPYQKGDGGNDDSVVLYGEF
LQTRFLFTGDLEDSELELMKQYPQLSVDVLKVGHHGSKGSSHPEFLAAISPRIALISVGKNNRYKHPHQETLTRFQERQI
QVFRTDEQGAIRFRGWRKWKIETVR
MSQLIKGLPLTPIHLAVLLLALYFVMHHLSFLSMVILVAMLLLLYFQQGKMIVYKVLPILACFFLLFGLQRMKVAMDAAS
TLTEISYLDVKPDTIHINGDSLSFRARSSGYRYMVFYQLKSQREQVYFKSLSHLVRLEVEATVSVPESQRNFNGFDYQDY
LGTQEIYRTVKISQIKKIAQQTSWNPLDWLSLLRRKLLVYIKEHFPNPMRHYMTGLLLGDLDKEFEQMSDLYSSLGIIHL
FALSGMQVGFFVDRFRSFFLRFGIRKEIVDWLQLPFAFIYASLTGFSVSVNRSLLQRILSNMGMSKLDNIACTIILSFLI
MPHFLLTVGGVLSFAYAFLLAVFDFEDLAAYKRVAVESLAISLAMFPLLIYYFYSFQPLSILLTFLFSFLFDLVLLPGLS
LVFLLSPLMKLTQVNALFVWLEACIRWIVDLDLKPLIFGKPTVVLLLILLLILLSLYDLYRNWKWCWGLISLLALLVFIV
KYPLENEVTVVDVGQGDSIFLRDIRGRTVLIDVGGRVNVATKEVWQQGTNQANAKRTLLPYLRSRGVSRIDYLILTHAHT
DHMGDLLEVVREMAIGRIYISEGSASSQKLAEILQSVKVRPHLVKVGDTIPIGDGFLHVLYPYQKGDGGNDDSVVLYGEF
LQTRFLFTGDLEDSELELMKQYPQLSVDVLKVGHHGSKGSSHPEFLAAISPRIALISVGKNNRYKHPHQETLTRFQERQI
QVFRTDEQGAIRFRGWRKWKIETVR
Nucleotide
Download Length: 2238 bp
>NTDB_id=908085 SON02_RS09120 WP_135027225.1 2090803..2093040(+) (comEC/celB) [Streptococcus sp. LysM4]
ATGTCACAGTTGATTAAAGGTCTGCCCCTTACTCCCATCCACTTGGCTGTCTTGTTACTAGCCCTTTACTTTGTTATGCA
TCATTTGTCCTTCTTATCAATGGTGATTTTGGTGGCAATGTTACTTCTTCTCTATTTTCAGCAAGGGAAGATGATAGTTT
ATAAGGTGCTTCCTATCTTAGCCTGTTTTTTTCTCCTATTTGGCCTGCAACGTATGAAAGTGGCAATGGATGCTGCATCT
ACTCTGACAGAGATTAGCTACTTAGATGTCAAACCAGATACCATTCATATCAATGGTGACAGCCTTTCGTTTCGTGCAAG
ATCGTCAGGATATCGGTATATGGTTTTTTACCAATTAAAAAGTCAAAGGGAGCAGGTTTACTTCAAAAGTCTGTCGCATC
TGGTGAGGCTAGAGGTGGAAGCGACTGTATCTGTCCCAGAAAGTCAGCGGAATTTTAATGGCTTTGATTACCAAGATTAC
TTGGGGACACAAGAGATTTATCGAACGGTCAAGATTAGTCAAATTAAGAAGATTGCCCAGCAGACTTCATGGAATCCTTT
GGACTGGCTCTCCTTACTTCGACGAAAGCTTTTAGTCTATATTAAGGAGCATTTTCCAAATCCAATGCGGCACTATATGA
CAGGGTTGTTGCTTGGAGATTTGGACAAGGAATTTGAGCAGATGAGCGATTTGTATTCTAGCCTAGGGATAATCCACTTA
TTTGCGCTTTCTGGTATGCAGGTTGGCTTCTTTGTAGATAGGTTCCGTTCCTTTTTCTTACGTTTCGGAATAAGAAAAGA
AATAGTTGATTGGCTACAGCTTCCGTTTGCCTTTATTTATGCAAGCTTGACAGGATTTTCAGTTTCAGTCAATCGTTCCC
TTTTGCAAAGGATTTTGAGCAATATGGGAATGAGTAAGTTAGATAATATAGCTTGTACGATTATCCTTTCTTTTCTAATC
ATGCCTCATTTTTTACTAACAGTAGGCGGTGTTCTGAGTTTTGCCTATGCTTTTTTATTGGCTGTTTTTGATTTTGAAGA
TTTGGCAGCTTATAAGCGAGTAGCTGTAGAAAGTTTAGCGATTTCTCTAGCTATGTTTCCCTTATTGATTTATTATTTTT
ACAGCTTTCAACCCTTATCTATTCTATTAACATTTCTCTTTTCCTTTCTTTTTGATCTAGTCTTGCTACCAGGCTTGAGC
CTTGTCTTTCTTCTTTCGCCGCTTATGAAGCTCACGCAAGTGAATGCTCTTTTTGTATGGTTAGAAGCCTGTATCCGGTG
GATAGTAGACTTGGACTTAAAACCATTGATTTTTGGGAAGCCGACAGTTGTTTTACTTCTAATCTTGCTGCTTATTTTGC
TCTCTTTGTATGATCTGTATCGGAATTGGAAATGGTGTTGGGGACTTATCAGCTTGTTAGCTCTGCTCGTTTTCATAGTC
AAATATCCCTTAGAAAATGAGGTGACAGTAGTGGATGTTGGGCAAGGGGATAGTATTTTTTTGAGAGATATACGAGGGCG
GACGGTCTTGATTGATGTAGGTGGTAGAGTGAATGTTGCTACAAAAGAAGTTTGGCAACAAGGGACAAATCAAGCGAATG
CGAAGCGAACGTTGCTCCCCTATCTTCGTAGTCGCGGTGTGAGTAGGATTGATTATCTAATCTTAACTCATGCCCATACG
GATCATATGGGAGATTTACTGGAAGTGGTGAGAGAAATGGCTATTGGTAGGATATATATTTCTGAGGGAAGTGCTAGCAG
TCAGAAATTAGCAGAGATCCTGCAGAGCGTAAAGGTGCGACCTCATCTTGTGAAAGTGGGAGATACTATTCCGATAGGTG
ACGGCTTTTTACATGTACTCTATCCTTATCAAAAAGGAGACGGTGGTAACGATGATTCGGTTGTGTTATATGGGGAGTTT
TTGCAGACTCGTTTTCTATTCACAGGGGATTTGGAAGATAGTGAGTTAGAGTTGATGAAACAGTATCCTCAGTTGTCTGT
TGATGTTTTAAAGGTAGGACATCATGGCTCGAAAGGTTCCTCTCATCCAGAATTTTTGGCTGCTATTTCTCCAAGGATTG
CCTTGATTTCAGTTGGGAAAAACAATCGTTACAAGCATCCGCATCAAGAAACTCTGACGCGTTTTCAAGAACGGCAGATC
CAGGTGTTTCGAACGGATGAACAAGGTGCTATTCGTTTCAGAGGTTGGAGGAAGTGGAAGATTGAAACGGTGAGATAG
ATGTCACAGTTGATTAAAGGTCTGCCCCTTACTCCCATCCACTTGGCTGTCTTGTTACTAGCCCTTTACTTTGTTATGCA
TCATTTGTCCTTCTTATCAATGGTGATTTTGGTGGCAATGTTACTTCTTCTCTATTTTCAGCAAGGGAAGATGATAGTTT
ATAAGGTGCTTCCTATCTTAGCCTGTTTTTTTCTCCTATTTGGCCTGCAACGTATGAAAGTGGCAATGGATGCTGCATCT
ACTCTGACAGAGATTAGCTACTTAGATGTCAAACCAGATACCATTCATATCAATGGTGACAGCCTTTCGTTTCGTGCAAG
ATCGTCAGGATATCGGTATATGGTTTTTTACCAATTAAAAAGTCAAAGGGAGCAGGTTTACTTCAAAAGTCTGTCGCATC
TGGTGAGGCTAGAGGTGGAAGCGACTGTATCTGTCCCAGAAAGTCAGCGGAATTTTAATGGCTTTGATTACCAAGATTAC
TTGGGGACACAAGAGATTTATCGAACGGTCAAGATTAGTCAAATTAAGAAGATTGCCCAGCAGACTTCATGGAATCCTTT
GGACTGGCTCTCCTTACTTCGACGAAAGCTTTTAGTCTATATTAAGGAGCATTTTCCAAATCCAATGCGGCACTATATGA
CAGGGTTGTTGCTTGGAGATTTGGACAAGGAATTTGAGCAGATGAGCGATTTGTATTCTAGCCTAGGGATAATCCACTTA
TTTGCGCTTTCTGGTATGCAGGTTGGCTTCTTTGTAGATAGGTTCCGTTCCTTTTTCTTACGTTTCGGAATAAGAAAAGA
AATAGTTGATTGGCTACAGCTTCCGTTTGCCTTTATTTATGCAAGCTTGACAGGATTTTCAGTTTCAGTCAATCGTTCCC
TTTTGCAAAGGATTTTGAGCAATATGGGAATGAGTAAGTTAGATAATATAGCTTGTACGATTATCCTTTCTTTTCTAATC
ATGCCTCATTTTTTACTAACAGTAGGCGGTGTTCTGAGTTTTGCCTATGCTTTTTTATTGGCTGTTTTTGATTTTGAAGA
TTTGGCAGCTTATAAGCGAGTAGCTGTAGAAAGTTTAGCGATTTCTCTAGCTATGTTTCCCTTATTGATTTATTATTTTT
ACAGCTTTCAACCCTTATCTATTCTATTAACATTTCTCTTTTCCTTTCTTTTTGATCTAGTCTTGCTACCAGGCTTGAGC
CTTGTCTTTCTTCTTTCGCCGCTTATGAAGCTCACGCAAGTGAATGCTCTTTTTGTATGGTTAGAAGCCTGTATCCGGTG
GATAGTAGACTTGGACTTAAAACCATTGATTTTTGGGAAGCCGACAGTTGTTTTACTTCTAATCTTGCTGCTTATTTTGC
TCTCTTTGTATGATCTGTATCGGAATTGGAAATGGTGTTGGGGACTTATCAGCTTGTTAGCTCTGCTCGTTTTCATAGTC
AAATATCCCTTAGAAAATGAGGTGACAGTAGTGGATGTTGGGCAAGGGGATAGTATTTTTTTGAGAGATATACGAGGGCG
GACGGTCTTGATTGATGTAGGTGGTAGAGTGAATGTTGCTACAAAAGAAGTTTGGCAACAAGGGACAAATCAAGCGAATG
CGAAGCGAACGTTGCTCCCCTATCTTCGTAGTCGCGGTGTGAGTAGGATTGATTATCTAATCTTAACTCATGCCCATACG
GATCATATGGGAGATTTACTGGAAGTGGTGAGAGAAATGGCTATTGGTAGGATATATATTTCTGAGGGAAGTGCTAGCAG
TCAGAAATTAGCAGAGATCCTGCAGAGCGTAAAGGTGCGACCTCATCTTGTGAAAGTGGGAGATACTATTCCGATAGGTG
ACGGCTTTTTACATGTACTCTATCCTTATCAAAAAGGAGACGGTGGTAACGATGATTCGGTTGTGTTATATGGGGAGTTT
TTGCAGACTCGTTTTCTATTCACAGGGGATTTGGAAGATAGTGAGTTAGAGTTGATGAAACAGTATCCTCAGTTGTCTGT
TGATGTTTTAAAGGTAGGACATCATGGCTCGAAAGGTTCCTCTCATCCAGAATTTTTGGCTGCTATTTCTCCAAGGATTG
CCTTGATTTCAGTTGGGAAAAACAATCGTTACAAGCATCCGCATCAAGAAACTCTGACGCGTTTTCAAGAACGGCAGATC
CAGGTGTTTCGAACGGATGAACAAGGTGCTATTCGTTTCAGAGGTTGGAGGAAGTGGAAGATTGAAACGGTGAGATAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
51.004 |
100 |
0.511 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
49.866 |
100 |
0.499 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
49.264 |
100 |
0.494 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
48.461 |
100 |
0.486 |
| comEC/celB | Streptococcus pneumoniae D39 |
48.461 |
100 |
0.486 |
| comEC/celB | Streptococcus pneumoniae R6 |
48.461 |
100 |
0.486 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
45.332 |
99.195 |
0.45 |