Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | ACN076_RS07670 | Genome accession | NZ_CP184836 |
| Coordinates | 1561227..1563467 (-) | Length | 746 a.a. |
| NCBI ID | WP_420789768.1 | Uniprot ID | - |
| Organism | Streptococcus sp. K0074 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1556227..1568467
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ACN076_RS07640 (ACN076_07640) | - | 1557131..1557511 (-) | 381 | WP_070842964.1 | VOC family protein | - |
| ACN076_RS07645 (ACN076_07645) | rplT | 1557570..1557929 (-) | 360 | WP_000124834.1 | 50S ribosomal protein L20 | - |
| ACN076_RS07650 (ACN076_07650) | rpmI | 1557981..1558181 (-) | 201 | WP_001125943.1 | 50S ribosomal protein L35 | - |
| ACN076_RS07655 (ACN076_07655) | infC | 1558214..1558744 (-) | 531 | WP_000848180.1 | translation initiation factor IF-3 | - |
| ACN076_RS07660 (ACN076_07660) | - | 1559052..1560236 (-) | 1185 | WP_420789766.1 | hypothetical protein | - |
| ACN076_RS07665 (ACN076_07665) | - | 1560240..1560827 (-) | 588 | WP_420789767.1 | ATP-binding cassette domain-containing protein | - |
| ACN076_RS07670 (ACN076_07670) | comEC/celB | 1561227..1563467 (-) | 2241 | WP_420789768.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| ACN076_RS07675 (ACN076_07675) | comEA/celA/cilE | 1563451..1564101 (-) | 651 | WP_420789769.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| ACN076_RS07680 (ACN076_07680) | - | 1564169..1564738 (-) | 570 | WP_420789770.1 | GNAT family N-acetyltransferase | - |
| ACN076_RS07685 (ACN076_07685) | ald | 1564909..1566021 (+) | 1113 | WP_420789771.1 | alanine dehydrogenase | - |
| ACN076_RS07690 (ACN076_07690) | - | 1566078..1567073 (-) | 996 | WP_420789772.1 | PhoH family protein | - |
| ACN076_RS07695 (ACN076_07695) | - | 1567158..1567373 (-) | 216 | WP_001232089.1 | YozE family protein | - |
| ACN076_RS07700 (ACN076_07700) | cvfB | 1567382..1568236 (-) | 855 | WP_020900277.1 | RNA-binding virulence regulatory protein CvfB | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 84651.84 Da Isoelectric Point: 9.0542
>NTDB_id=1111465 ACN076_RS07670 WP_420789768.1 1561227..1563467(-) (comEC/celB) [Streptococcus sp. K0074]
MLQWIKNLPIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLICLFFQFAWKSASKVLIICGIFGFWFLFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKADGRIFQVYYKFQSEEEKETFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQAY
LKTQGIYQTLNIKKIQSLQKVSSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLFLRSGLTQEKLKWLTYPFSLLYAGMTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEEEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFAFSFLYPVIQLNFIFEWLEGTIRLVSQLASRPLVFGQPNAWLLILLLISLALIYDLRKNVKKLAVLSLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESDKKIEKWQEKVTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKEFVAELQATQTKVRSVSVGENLPIFGSQLEVLSPRKIGDGDHEDSLVLYGKL
LDKNFLFTGNLEEKGEKDLLKQYPDLEVDVLKASQHGSKKSSSSAFLEQLKPEITLISVGKNNRTKLPHQETLTRLETIN
SKVYRTDQQGAIRFKGWNSWKIESVR
MLQWIKNLPIPLIYLSFLLLWLYYAIFSASYLALLGFVFLLICLFFQFAWKSASKVLIICGIFGFWFLFQNWQQSQASQN
LADSVERVRILPDTIKVNGDSLSFRGKADGRIFQVYYKFQSEEEKETFQALTDLHEIGLEGKLSEPEGQRNFGGFDYQAY
LKTQGIYQTLNIKKIQSLQKVSSWDIGENLSSLRRKAVVWIKTHFPDPMRNYMTGLLLGHLDTDFEEMNELYSSLGIIHL
FALSGMQVGFFMDGFKKLFLRSGLTQEKLKWLTYPFSLLYAGMTGFSASVIRSLLQKLLAQHGVKGLDNFALTVLVLFII
MPNFFLTAGGVLSCAYAFILTMTSKEEEGLKAVARESLVISLGILPILSFYFAEFQPWSILLTFVFSFLFDLVFLPLLSI
LFAFSFLYPVIQLNFIFEWLEGTIRLVSQLASRPLVFGQPNAWLLILLLISLALIYDLRKNVKKLAVLSLLITGLFLLTK
HPLENEITMLDVGQGESIFLRDVTGKTILIDVGGKAESDKKIEKWQEKVTTSNAQRTLIPYLKSRGVAKIDQLILTNTDK
EHVGDLLEVTKAFHVGEILVSKGSLKQKEFVAELQATQTKVRSVSVGENLPIFGSQLEVLSPRKIGDGDHEDSLVLYGKL
LDKNFLFTGNLEEKGEKDLLKQYPDLEVDVLKASQHGSKKSSSSAFLEQLKPEITLISVGKNNRTKLPHQETLTRLETIN
SKVYRTDQQGAIRFKGWNSWKIESVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=1111465 ACN076_RS07670 WP_420789768.1 1561227..1563467(-) (comEC/celB) [Streptococcus sp. K0074]
ATGTTACAGTGGATTAAGAACCTCCCTATTCCCCTAATTTACCTGAGTTTTCTATTGCTCTGGCTTTACTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGTTTCGTTTTTCTGCTAATCTGTCTTTTTTTCCAATTTGCTTGGAAATCTGCTA
GTAAAGTTCTAATAATTTGCGGAATCTTTGGCTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGTCAAGCTAGTCAAAAC
CTAGCGGATTCTGTTGAAAGGGTACGGATTTTGCCCGACACTATTAAGGTCAATGGTGACAGTCTATCCTTTCGTGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAATTCCAGTCCGAGGAGGAGAAAGAAACCTTTCAAGCTTTAACCGACC
TTCATGAGATAGGACTAGAAGGAAAACTTTCAGAACCAGAGGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAAGGTTAGCAGTTGGGATATAGG
TGAAAACCTGTCCAGCTTACGTCGAAAGGCTGTAGTTTGGATTAAGACACATTTTCCAGACCCTATGCGCAATTACATGA
CAGGGCTTTTGCTAGGGCATCTGGACACTGACTTTGAGGAGATGAATGAACTTTATTCTAGTCTAGGAATTATCCACCTC
TTTGCCTTGTCGGGCATGCAGGTAGGTTTTTTCATGGATGGATTTAAGAAACTATTCTTGCGATCAGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACCTATCCCTTTTCCCTTCTCTATGCTGGTATGACAGGATTTTCAGCTTCGGTCATTCGCAGTC
TCTTGCAAAAATTACTGGCTCAACATGGTGTTAAGGGCTTGGATAATTTTGCCTTGACGGTTCTTGTCCTCTTTATTATC
ATGCCTAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCTTGACCATGACCAGCAAAGAAGA
GGAGGGGCTCAAGGCTGTTGCTAGAGAAAGTCTAGTCATATCCTTGGGCATATTGCCCATTCTGTCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTAGTCTTCTTACCGCTCCTGTCTATT
TTATTTGCCTTTTCATTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAGTGGTTGGAGGGGACTATTCGCTTGGT
ATCGCAGCTGGCAAGCAGACCGCTTGTCTTTGGACAACCTAATGCATGGCTTTTAATTTTATTGTTAATTTCCTTGGCTT
TGATCTATGATTTGAGGAAAAATGTTAAAAAGCTAGCAGTGTTGAGCTTATTGATTACAGGTCTCTTTTTATTGACCAAG
CATCCACTGGAAAATGAAATTACCATGCTGGACGTGGGACAAGGCGAAAGTATTTTCCTAAGGGATGTAACCGGTAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTGATAAGAAAATTGAAAAATGGCAAGAAAAGGTAACGACTAGCAATG
CCCAGCGAACCTTGATTCCCTACCTCAAAAGTCGTGGAGTGGCCAAGATTGACCAGCTAATTTTGACCAATACGGACAAG
GAACATGTTGGAGATTTGTTGGAGGTGACCAAGGCTTTCCATGTAGGGGAAATTTTAGTGTCAAAAGGCAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACCAAGGTGCGTAGTGTGTCAGTGGGAGAGAACTTGCCAATTTTTG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGAAAATAGGAGATGGAGATCATGAAGATTCCCTGGTTTTGTATGGAAAACTC
TTGGATAAAAATTTTCTTTTCACAGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCAATATCCTGACTTAGA
GGTGGATGTTTTGAAAGCTAGCCAACATGGCTCTAAAAAATCATCAAGTTCAGCTTTTCTAGAACAGCTCAAACCAGAGA
TCACTCTCATCTCAGTTGGAAAGAACAATCGAACGAAACTCCCCCATCAGGAAACCCTGACCCGACTGGAAACGATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGGGCTATACGCTTTAAGGGGTGGAATAGTTGGAAAATCGAAAGTGTTCGATA
G
ATGTTACAGTGGATTAAGAACCTCCCTATTCCCCTAATTTACCTGAGTTTTCTATTGCTCTGGCTTTACTACGCTATTTT
CTCAGCATCCTATCTTGCTTTGTTGGGTTTCGTTTTTCTGCTAATCTGTCTTTTTTTCCAATTTGCTTGGAAATCTGCTA
GTAAAGTTCTAATAATTTGCGGAATCTTTGGCTTCTGGTTTCTGTTTCAAAATTGGCAACAGAGTCAAGCTAGTCAAAAC
CTAGCGGATTCTGTTGAAAGGGTACGGATTTTGCCCGACACTATTAAGGTCAATGGTGACAGTCTATCCTTTCGTGGCAA
GGCTGATGGACGCATTTTTCAAGTCTATTATAAATTCCAGTCCGAGGAGGAGAAAGAAACCTTTCAAGCTTTAACCGACC
TTCATGAGATAGGACTAGAAGGAAAACTTTCAGAACCAGAGGGGCAGAGAAATTTTGGTGGCTTTGACTACCAAGCCTAT
CTGAAGACTCAGGGAATTTACCAGACTCTCAATATCAAAAAAATCCAGTCACTTCAAAAGGTTAGCAGTTGGGATATAGG
TGAAAACCTGTCCAGCTTACGTCGAAAGGCTGTAGTTTGGATTAAGACACATTTTCCAGACCCTATGCGCAATTACATGA
CAGGGCTTTTGCTAGGGCATCTGGACACTGACTTTGAGGAGATGAATGAACTTTATTCTAGTCTAGGAATTATCCACCTC
TTTGCCTTGTCGGGCATGCAGGTAGGTTTTTTCATGGATGGATTTAAGAAACTATTCTTGCGATCAGGCTTGACCCAAGA
AAAGTTGAAATGGCTGACCTATCCCTTTTCCCTTCTCTATGCTGGTATGACAGGATTTTCAGCTTCGGTCATTCGCAGTC
TCTTGCAAAAATTACTGGCTCAACATGGTGTTAAGGGCTTGGATAATTTTGCCTTGACGGTTCTTGTCCTCTTTATTATC
ATGCCTAACTTTTTCTTGACAGCAGGAGGAGTCTTGTCCTGCGCTTATGCTTTTATCTTGACCATGACCAGCAAAGAAGA
GGAGGGGCTCAAGGCTGTTGCTAGAGAAAGTCTAGTCATATCCTTGGGCATATTGCCCATTCTGTCCTTCTATTTTGCAG
AATTTCAGCCTTGGTCTATCCTTTTGACCTTTGTCTTTTCCTTTCTATTTGACTTAGTCTTCTTACCGCTCCTGTCTATT
TTATTTGCCTTTTCATTTCTCTATCCAGTCATTCAGCTGAACTTTATCTTTGAGTGGTTGGAGGGGACTATTCGCTTGGT
ATCGCAGCTGGCAAGCAGACCGCTTGTCTTTGGACAACCTAATGCATGGCTTTTAATTTTATTGTTAATTTCCTTGGCTT
TGATCTATGATTTGAGGAAAAATGTTAAAAAGCTAGCAGTGTTGAGCTTATTGATTACAGGTCTCTTTTTATTGACCAAG
CATCCACTGGAAAATGAAATTACCATGCTGGACGTGGGACAAGGCGAAAGTATTTTCCTAAGGGATGTAACCGGTAAAAC
CATTCTCATAGATGTCGGTGGCAAGGCAGAATCTGATAAGAAAATTGAAAAATGGCAAGAAAAGGTAACGACTAGCAATG
CCCAGCGAACCTTGATTCCCTACCTCAAAAGTCGTGGAGTGGCCAAGATTGACCAGCTAATTTTGACCAATACGGACAAG
GAACATGTTGGAGATTTGTTGGAGGTGACCAAGGCTTTCCATGTAGGGGAAATTTTAGTGTCAAAAGGCAGTCTGAAACA
GAAGGAATTTGTGGCAGAACTACAGGCGACTCAAACCAAGGTGCGTAGTGTGTCAGTGGGAGAGAACTTGCCAATTTTTG
GAAGTCAGTTAGAAGTCCTATCTCCAAGGAAAATAGGAGATGGAGATCATGAAGATTCCCTGGTTTTGTATGGAAAACTC
TTGGATAAAAATTTTCTTTTCACAGGAAATTTGGAGGAGAAAGGAGAGAAGGACTTGCTGAAGCAATATCCTGACTTAGA
GGTGGATGTTTTGAAAGCTAGCCAACATGGCTCTAAAAAATCATCAAGTTCAGCTTTTCTAGAACAGCTCAAACCAGAGA
TCACTCTCATCTCAGTTGGAAAGAACAATCGAACGAAACTCCCCCATCAGGAAACCCTGACCCGACTGGAAACGATCAAT
AGCAAAGTTTACCGAACTGACCAGCAAGGGGCTATACGCTTTAAGGGGTGGAATAGTTGGAAAATCGAAAGTGTTCGATA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
93.566 |
100 |
0.936 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
92.483 |
99.866 |
0.924 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
91.823 |
100 |
0.918 |
| comEC/celB | Streptococcus pneumoniae D39 |
91.823 |
100 |
0.918 |
| comEC/celB | Streptococcus pneumoniae R6 |
91.823 |
100 |
0.918 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
91.555 |
100 |
0.916 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
45.209 |
99.33 |
0.449 |