Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | SOL79_RS09180 | Genome accession | NZ_CP139210 |
| Coordinates | 2091238..2093475 (+) | Length | 745 a.a. |
| NCBI ID | WP_135027225.1 | Uniprot ID | - |
| Organism | Streptococcus sp. VEG1o | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2086238..2098475
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| SOL79_RS09150 (SOL79_09135) | - | 2087213..2087728 (+) | 516 | WP_135027232.1 | isoprenylcysteine carboxyl methyltransferase family protein | - |
| SOL79_RS09155 (SOL79_09140) | - | 2088174..2088581 (+) | 408 | WP_135027377.1 | GNAT family N-acetyltransferase | - |
| SOL79_RS09160 (SOL79_09145) | - | 2088695..2088952 (-) | 258 | WP_135027375.1 | GIY-YIG nuclease family protein | - |
| SOL79_RS09165 (SOL79_09150) | - | 2088960..2089709 (-) | 750 | WP_135027230.1 | tRNA1(Val) (adenine(37)-N6)-methyltransferase | - |
| SOL79_RS09170 (SOL79_09155) | - | 2089805..2090551 (+) | 747 | WP_135027229.1 | lysophospholipid acyltransferase family protein | - |
| SOL79_RS09175 (SOL79_09160) | comEA/celA/cilE | 2090616..2091254 (+) | 639 | WP_135027227.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| SOL79_RS09180 (SOL79_09165) | comEC/celB | 2091238..2093475 (+) | 2238 | WP_135027225.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| SOL79_RS09185 (SOL79_09170) | - | 2094180..2095703 (+) | 1524 | WP_135027223.1 | AMP-binding protein | - |
| SOL79_RS09190 (SOL79_09175) | holA | 2095883..2096914 (+) | 1032 | WP_135027221.1 | DNA polymerase III subunit delta | - |
| SOL79_RS09195 | - | 2097008..2097480 (-) | 473 | Protein_1769 | IS630 family transposase | - |
| SOL79_RS09200 (SOL79_09185) | - | 2097518..2097844 (-) | 327 | WP_306299843.1 | IS630 transposase-related protein | - |
| SOL79_RS09205 (SOL79_09190) | - | 2098073..2098273 (+) | 201 | WP_135029514.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 745 a.a. Molecular weight: 85438.58 Da Isoelectric Point: 9.3994
>NTDB_id=908034 SOL79_RS09180 WP_135027225.1 2091238..2093475(+) (comEC/celB) [Streptococcus sp. VEG1o]
MSQLIKGLPLTPIHLAVLLLALYFVMHHLSFLSMVILVAMLLLLYFQQGKMIVYKVLPILACFFLLFGLQRMKVAMDAAS
TLTEISYLDVKPDTIHINGDSLSFRARSSGYRYMVFYQLKSQREQVYFKSLSHLVRLEVEATVSVPESQRNFNGFDYQDY
LGTQEIYRTVKISQIKKIAQQTSWNPLDWLSLLRRKLLVYIKEHFPNPMRHYMTGLLLGDLDKEFEQMSDLYSSLGIIHL
FALSGMQVGFFVDRFRSFFLRFGIRKEIVDWLQLPFAFIYASLTGFSVSVNRSLLQRILSNMGMSKLDNIACTIILSFLI
MPHFLLTVGGVLSFAYAFLLAVFDFEDLAAYKRVAVESLAISLAMFPLLIYYFYSFQPLSILLTFLFSFLFDLVLLPGLS
LVFLLSPLMKLTQVNALFVWLEACIRWIVDLDLKPLIFGKPTVVLLLILLLILLSLYDLYRNWKWCWGLISLLALLVFIV
KYPLENEVTVVDVGQGDSIFLRDIRGRTVLIDVGGRVNVATKEVWQQGTNQANAKRTLLPYLRSRGVSRIDYLILTHAHT
DHMGDLLEVVREMAIGRIYISEGSASSQKLAEILQSVKVRPHLVKVGDTIPIGDGFLHVLYPYQKGDGGNDDSVVLYGEF
LQTRFLFTGDLEDSELELMKQYPQLSVDVLKVGHHGSKGSSHPEFLAAISPRIALISVGKNNRYKHPHQETLTRFQERQI
QVFRTDEQGAIRFRGWRKWKIETVR
MSQLIKGLPLTPIHLAVLLLALYFVMHHLSFLSMVILVAMLLLLYFQQGKMIVYKVLPILACFFLLFGLQRMKVAMDAAS
TLTEISYLDVKPDTIHINGDSLSFRARSSGYRYMVFYQLKSQREQVYFKSLSHLVRLEVEATVSVPESQRNFNGFDYQDY
LGTQEIYRTVKISQIKKIAQQTSWNPLDWLSLLRRKLLVYIKEHFPNPMRHYMTGLLLGDLDKEFEQMSDLYSSLGIIHL
FALSGMQVGFFVDRFRSFFLRFGIRKEIVDWLQLPFAFIYASLTGFSVSVNRSLLQRILSNMGMSKLDNIACTIILSFLI
MPHFLLTVGGVLSFAYAFLLAVFDFEDLAAYKRVAVESLAISLAMFPLLIYYFYSFQPLSILLTFLFSFLFDLVLLPGLS
LVFLLSPLMKLTQVNALFVWLEACIRWIVDLDLKPLIFGKPTVVLLLILLLILLSLYDLYRNWKWCWGLISLLALLVFIV
KYPLENEVTVVDVGQGDSIFLRDIRGRTVLIDVGGRVNVATKEVWQQGTNQANAKRTLLPYLRSRGVSRIDYLILTHAHT
DHMGDLLEVVREMAIGRIYISEGSASSQKLAEILQSVKVRPHLVKVGDTIPIGDGFLHVLYPYQKGDGGNDDSVVLYGEF
LQTRFLFTGDLEDSELELMKQYPQLSVDVLKVGHHGSKGSSHPEFLAAISPRIALISVGKNNRYKHPHQETLTRFQERQI
QVFRTDEQGAIRFRGWRKWKIETVR
Nucleotide
Download Length: 2238 bp
>NTDB_id=908034 SOL79_RS09180 WP_135027225.1 2091238..2093475(+) (comEC/celB) [Streptococcus sp. VEG1o]
ATGTCACAGTTGATTAAAGGTCTGCCCCTTACTCCCATCCACTTGGCTGTCTTGTTACTAGCCCTTTACTTTGTTATGCA
TCATTTGTCCTTCTTATCAATGGTGATTTTGGTGGCAATGTTACTTCTTCTCTATTTTCAGCAAGGGAAGATGATAGTTT
ATAAGGTGCTTCCTATCTTAGCCTGTTTTTTTCTCCTATTTGGCCTGCAACGTATGAAAGTGGCAATGGATGCTGCATCT
ACTCTGACAGAGATTAGCTACTTAGATGTCAAACCAGATACCATTCATATCAATGGTGACAGCCTTTCGTTTCGTGCAAG
ATCGTCAGGATATCGGTATATGGTTTTTTACCAATTAAAAAGTCAAAGGGAGCAGGTTTACTTCAAAAGTCTGTCGCATC
TGGTGAGGCTAGAGGTGGAAGCGACTGTATCTGTCCCAGAAAGTCAGCGGAATTTTAATGGCTTTGATTACCAAGATTAC
TTGGGGACACAAGAGATTTATCGAACGGTCAAGATTAGTCAAATTAAGAAGATTGCCCAGCAGACTTCATGGAATCCTTT
GGACTGGCTCTCCTTACTTCGACGAAAGCTTTTAGTCTATATTAAGGAGCATTTTCCAAATCCAATGCGGCACTATATGA
CAGGGTTGTTGCTTGGAGATTTGGACAAGGAATTTGAGCAGATGAGCGATTTGTATTCTAGCCTAGGGATAATCCACTTA
TTTGCGCTTTCTGGTATGCAGGTTGGCTTCTTTGTAGATAGGTTCCGTTCCTTTTTCTTACGTTTCGGAATAAGAAAAGA
AATAGTTGATTGGCTACAGCTTCCGTTTGCCTTTATTTATGCAAGCTTGACAGGATTTTCAGTTTCAGTCAATCGTTCCC
TTTTGCAAAGGATTTTGAGCAATATGGGAATGAGTAAGTTAGATAATATAGCTTGTACGATTATCCTTTCTTTTCTAATC
ATGCCTCATTTTTTACTAACAGTAGGCGGTGTTCTGAGTTTTGCCTATGCTTTTTTATTGGCTGTTTTTGATTTTGAAGA
TTTGGCAGCTTATAAGCGAGTAGCTGTAGAAAGTTTAGCGATTTCTCTAGCTATGTTTCCCTTATTGATTTATTATTTTT
ACAGCTTTCAACCCTTATCTATTCTATTAACATTTCTCTTTTCCTTTCTTTTTGATCTAGTCTTGCTACCAGGCTTGAGC
CTTGTCTTTCTTCTTTCGCCGCTTATGAAGCTCACGCAAGTGAATGCTCTTTTTGTATGGTTAGAAGCCTGTATCCGGTG
GATAGTAGACTTGGACTTAAAACCATTGATTTTTGGGAAGCCGACAGTTGTTTTACTTCTAATCTTGCTGCTTATTTTGC
TCTCTTTGTATGATCTGTATCGGAATTGGAAATGGTGTTGGGGACTTATCAGCTTGTTAGCTCTGCTCGTTTTCATAGTC
AAATATCCCTTAGAAAATGAGGTGACAGTAGTGGATGTTGGGCAAGGGGATAGTATTTTTTTGAGAGATATACGAGGGCG
GACGGTCTTGATTGATGTAGGTGGTAGAGTGAATGTTGCTACAAAAGAAGTTTGGCAACAAGGGACAAATCAAGCGAATG
CGAAGCGAACGTTGCTCCCCTATCTTCGTAGTCGCGGTGTGAGTAGGATTGATTATCTAATCTTAACTCATGCCCATACG
GATCATATGGGAGATTTACTGGAAGTGGTGAGAGAAATGGCTATTGGTAGGATATATATTTCTGAGGGAAGTGCTAGCAG
TCAGAAATTAGCAGAGATCCTGCAGAGCGTAAAGGTGCGACCTCATCTTGTGAAAGTGGGAGATACTATTCCGATAGGTG
ACGGCTTTTTACATGTACTCTATCCTTATCAAAAAGGAGACGGTGGTAACGATGATTCGGTTGTGTTATATGGGGAGTTT
TTGCAGACTCGTTTTCTATTCACAGGGGATTTGGAAGATAGTGAGTTAGAGTTGATGAAACAGTATCCTCAGTTGTCTGT
TGATGTTTTAAAGGTAGGACATCATGGCTCGAAAGGTTCCTCTCATCCAGAATTTTTGGCTGCTATTTCTCCAAGGATTG
CCTTGATTTCAGTTGGGAAAAACAATCGTTACAAGCATCCGCATCAAGAAACTCTGACGCGTTTTCAAGAACGGCAGATC
CAGGTGTTTCGAACGGATGAACAAGGTGCTATTCGTTTCAGAGGTTGGAGGAAGTGGAAGATTGAAACGGTGAGATAG
ATGTCACAGTTGATTAAAGGTCTGCCCCTTACTCCCATCCACTTGGCTGTCTTGTTACTAGCCCTTTACTTTGTTATGCA
TCATTTGTCCTTCTTATCAATGGTGATTTTGGTGGCAATGTTACTTCTTCTCTATTTTCAGCAAGGGAAGATGATAGTTT
ATAAGGTGCTTCCTATCTTAGCCTGTTTTTTTCTCCTATTTGGCCTGCAACGTATGAAAGTGGCAATGGATGCTGCATCT
ACTCTGACAGAGATTAGCTACTTAGATGTCAAACCAGATACCATTCATATCAATGGTGACAGCCTTTCGTTTCGTGCAAG
ATCGTCAGGATATCGGTATATGGTTTTTTACCAATTAAAAAGTCAAAGGGAGCAGGTTTACTTCAAAAGTCTGTCGCATC
TGGTGAGGCTAGAGGTGGAAGCGACTGTATCTGTCCCAGAAAGTCAGCGGAATTTTAATGGCTTTGATTACCAAGATTAC
TTGGGGACACAAGAGATTTATCGAACGGTCAAGATTAGTCAAATTAAGAAGATTGCCCAGCAGACTTCATGGAATCCTTT
GGACTGGCTCTCCTTACTTCGACGAAAGCTTTTAGTCTATATTAAGGAGCATTTTCCAAATCCAATGCGGCACTATATGA
CAGGGTTGTTGCTTGGAGATTTGGACAAGGAATTTGAGCAGATGAGCGATTTGTATTCTAGCCTAGGGATAATCCACTTA
TTTGCGCTTTCTGGTATGCAGGTTGGCTTCTTTGTAGATAGGTTCCGTTCCTTTTTCTTACGTTTCGGAATAAGAAAAGA
AATAGTTGATTGGCTACAGCTTCCGTTTGCCTTTATTTATGCAAGCTTGACAGGATTTTCAGTTTCAGTCAATCGTTCCC
TTTTGCAAAGGATTTTGAGCAATATGGGAATGAGTAAGTTAGATAATATAGCTTGTACGATTATCCTTTCTTTTCTAATC
ATGCCTCATTTTTTACTAACAGTAGGCGGTGTTCTGAGTTTTGCCTATGCTTTTTTATTGGCTGTTTTTGATTTTGAAGA
TTTGGCAGCTTATAAGCGAGTAGCTGTAGAAAGTTTAGCGATTTCTCTAGCTATGTTTCCCTTATTGATTTATTATTTTT
ACAGCTTTCAACCCTTATCTATTCTATTAACATTTCTCTTTTCCTTTCTTTTTGATCTAGTCTTGCTACCAGGCTTGAGC
CTTGTCTTTCTTCTTTCGCCGCTTATGAAGCTCACGCAAGTGAATGCTCTTTTTGTATGGTTAGAAGCCTGTATCCGGTG
GATAGTAGACTTGGACTTAAAACCATTGATTTTTGGGAAGCCGACAGTTGTTTTACTTCTAATCTTGCTGCTTATTTTGC
TCTCTTTGTATGATCTGTATCGGAATTGGAAATGGTGTTGGGGACTTATCAGCTTGTTAGCTCTGCTCGTTTTCATAGTC
AAATATCCCTTAGAAAATGAGGTGACAGTAGTGGATGTTGGGCAAGGGGATAGTATTTTTTTGAGAGATATACGAGGGCG
GACGGTCTTGATTGATGTAGGTGGTAGAGTGAATGTTGCTACAAAAGAAGTTTGGCAACAAGGGACAAATCAAGCGAATG
CGAAGCGAACGTTGCTCCCCTATCTTCGTAGTCGCGGTGTGAGTAGGATTGATTATCTAATCTTAACTCATGCCCATACG
GATCATATGGGAGATTTACTGGAAGTGGTGAGAGAAATGGCTATTGGTAGGATATATATTTCTGAGGGAAGTGCTAGCAG
TCAGAAATTAGCAGAGATCCTGCAGAGCGTAAAGGTGCGACCTCATCTTGTGAAAGTGGGAGATACTATTCCGATAGGTG
ACGGCTTTTTACATGTACTCTATCCTTATCAAAAAGGAGACGGTGGTAACGATGATTCGGTTGTGTTATATGGGGAGTTT
TTGCAGACTCGTTTTCTATTCACAGGGGATTTGGAAGATAGTGAGTTAGAGTTGATGAAACAGTATCCTCAGTTGTCTGT
TGATGTTTTAAAGGTAGGACATCATGGCTCGAAAGGTTCCTCTCATCCAGAATTTTTGGCTGCTATTTCTCCAAGGATTG
CCTTGATTTCAGTTGGGAAAAACAATCGTTACAAGCATCCGCATCAAGAAACTCTGACGCGTTTTCAAGAACGGCAGATC
CAGGTGTTTCGAACGGATGAACAAGGTGCTATTCGTTTCAGAGGTTGGAGGAAGTGGAAGATTGAAACGGTGAGATAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
51.004 |
100 |
0.511 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
49.866 |
100 |
0.499 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
49.264 |
100 |
0.494 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
48.461 |
100 |
0.486 |
| comEC/celB | Streptococcus pneumoniae D39 |
48.461 |
100 |
0.486 |
| comEC/celB | Streptococcus pneumoniae R6 |
48.461 |
100 |
0.486 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
45.332 |
99.195 |
0.45 |