Detailed information
Overview
| Name | comEC/celB | Type | Machinery gene |
| Locus tag | DQM54_RS07790 | Genome accession | NZ_LS483375 |
| Coordinates | 1575060..1577300 (-) | Length | 746 a.a. |
| NCBI ID | WP_111723981.1 | Uniprot ID | - |
| Organism | Streptococcus gordonii strain NCTC3165 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1570060..1582300
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DQM54_RS07760 (NCTC3165_01546) | - | 1570329..1570826 (-) | 498 | WP_111724306.1 | DUF1648 domain-containing protein | - |
| DQM54_RS07765 (NCTC3165_01547) | - | 1570813..1571097 (-) | 285 | WP_111723977.1 | autorepressor SdpR family transcription factor | - |
| DQM54_RS07770 (NCTC3165_01548) | - | 1571075..1571863 (-) | 789 | WP_111723978.1 | YhfC family intramembrane metalloprotease | - |
| DQM54_RS07775 (NCTC3165_01549) | - | 1572069..1573142 (-) | 1074 | WP_111723979.1 | serine hydrolase | - |
| DQM54_RS07780 (NCTC3165_01550) | - | 1573254..1573859 (-) | 606 | WP_045773124.1 | superoxide dismutase | - |
| DQM54_RS07785 (NCTC3165_01551) | holA | 1573932..1574969 (-) | 1038 | WP_111723980.1 | DNA polymerase III subunit delta | - |
| DQM54_RS07790 (NCTC3165_01552) | comEC/celB | 1575060..1577300 (-) | 2241 | WP_111723981.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| DQM54_RS07795 (NCTC3165_01553) | comEA/celA/cilE | 1577284..1577949 (-) | 666 | WP_111723982.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| DQM54_RS07800 (NCTC3165_01554) | - | 1578046..1578573 (-) | 528 | WP_111723983.1 | HXXEE domain-containing protein | - |
| DQM54_RS07805 (NCTC3165_01555) | - | 1578612..1579352 (-) | 741 | WP_041789898.1 | lysophospholipid acyltransferase family protein | - |
| DQM54_RS07810 (NCTC3165_01556) | - | 1579482..1581851 (+) | 2370 | WP_111723984.1 | cation-translocating P-type ATPase | - |
Sequence
Protein
Download Length: 746 a.a. Molecular weight: 85516.43 Da Isoelectric Point: 9.8787
>NTDB_id=1139125 DQM54_RS07790 WP_111723981.1 1575060..1577300(-) (comEC/celB) [Streptococcus gordonii strain NCTC3165]
MSQWIKKLPLSPIYLCFLLVWLYFAIYSGEKLAYLGVFLLIARLIWHYPRKKWLPTVAILIAFSIFFYARRELAERTFQS
QPAPARQVLVLPDTVKVNGDSLFFRGRIDGRLYQLYYKLASPREKKSFQKLADLVTLEIEGEFNLAEGRRNFSGFDYQAY
LKSQGIYRTVKISRIMSSHSSQSINPFDWLSVWRRKALVFIKSTFPSPMSHYMTGLLFGDLDIEFAEMNDLYSSLGIIHL
FALSGMQVGFFMDAFRKILLRIGLRMETVDWLQFPLSFIYAGLTGFSVSVVRSLVQKLLSQFGVRRLDNFAMAMMALMFL
MPSFLLTTGGVLSCAYAFIITMLDFEDFSGFRKAVLESLSISLGILPILIHYFAEFQPWSILLTFLFSIVFDTFMLPLLS
LIFLISPLFAFTQVNVFFQWLEMVIRWVASLSTRPIILGQPNLPLLIILLLVLALLYDFRKQKKIRAGLSLLAILLFFLS
KYPLQNEITVVDIGQGDSIFLRDVRGRTIVIDTGGRVEIGKKEAWQERVRKSNAETTLIPYLKSRGVDRLDKLVLTHTDT
DHMGDMLELAKHFSIREIYVSKGSMTQVDFVDKLKKMKAKVHVVEVGDRLPIFDSALEVLYPLGQGDGGNDDSIVLYGEF
FRTKFLFTGDLEAPGEGQMVTAYPDLRVDVLKAGHHGSKGSSSPEFLEHIKPKLALISAGKNNRYQHPHKETLDRFEKIQ
TKIFRTDEQGAIRFKGWNSWQIETVR
MSQWIKKLPLSPIYLCFLLVWLYFAIYSGEKLAYLGVFLLIARLIWHYPRKKWLPTVAILIAFSIFFYARRELAERTFQS
QPAPARQVLVLPDTVKVNGDSLFFRGRIDGRLYQLYYKLASPREKKSFQKLADLVTLEIEGEFNLAEGRRNFSGFDYQAY
LKSQGIYRTVKISRIMSSHSSQSINPFDWLSVWRRKALVFIKSTFPSPMSHYMTGLLFGDLDIEFAEMNDLYSSLGIIHL
FALSGMQVGFFMDAFRKILLRIGLRMETVDWLQFPLSFIYAGLTGFSVSVVRSLVQKLLSQFGVRRLDNFAMAMMALMFL
MPSFLLTTGGVLSCAYAFIITMLDFEDFSGFRKAVLESLSISLGILPILIHYFAEFQPWSILLTFLFSIVFDTFMLPLLS
LIFLISPLFAFTQVNVFFQWLEMVIRWVASLSTRPIILGQPNLPLLIILLLVLALLYDFRKQKKIRAGLSLLAILLFFLS
KYPLQNEITVVDIGQGDSIFLRDVRGRTIVIDTGGRVEIGKKEAWQERVRKSNAETTLIPYLKSRGVDRLDKLVLTHTDT
DHMGDMLELAKHFSIREIYVSKGSMTQVDFVDKLKKMKAKVHVVEVGDRLPIFDSALEVLYPLGQGDGGNDDSIVLYGEF
FRTKFLFTGDLEAPGEGQMVTAYPDLRVDVLKAGHHGSKGSSSPEFLEHIKPKLALISAGKNNRYQHPHKETLDRFEKIQ
TKIFRTDEQGAIRFKGWNSWQIETVR
Nucleotide
Download Length: 2241 bp
>NTDB_id=1139125 DQM54_RS07790 WP_111723981.1 1575060..1577300(-) (comEC/celB) [Streptococcus gordonii strain NCTC3165]
ATGTCACAGTGGATTAAAAAACTTCCCCTTTCACCAATCTATTTGTGCTTTCTTTTGGTTTGGCTCTATTTCGCTATTTA
TAGTGGGGAAAAGCTCGCTTATCTGGGAGTTTTTCTGCTTATAGCTCGACTAATCTGGCATTATCCAAGAAAGAAATGGT
TGCCAACTGTAGCTATCCTAATTGCTTTTTCTATCTTTTTTTATGCTAGACGAGAGTTGGCGGAGCGAACCTTTCAGTCT
CAACCGGCTCCAGCAAGACAAGTTCTTGTTTTACCAGATACGGTTAAAGTGAATGGAGATTCCCTGTTTTTCCGTGGTAG
AATAGACGGAAGACTTTATCAGCTCTACTACAAATTAGCAAGTCCAAGAGAGAAAAAATCATTTCAAAAACTAGCAGACT
TGGTCACTTTAGAAATAGAAGGAGAGTTTAATCTAGCAGAAGGTCGGCGAAATTTTTCTGGTTTTGACTATCAGGCTTAT
TTAAAAAGCCAGGGAATTTATCGGACAGTTAAGATCAGTAGGATTATGTCCAGTCACTCTAGTCAATCTATCAATCCATT
TGATTGGCTGTCTGTTTGGCGTAGGAAGGCTTTGGTTTTCATTAAGTCTACTTTTCCAAGCCCGATGAGTCACTACATGA
CAGGACTTTTGTTTGGAGATTTAGATATTGAATTTGCAGAGATGAATGACTTGTACTCAAGTTTAGGAATTATCCATCTT
TTTGCTTTATCAGGAATGCAGGTTGGCTTTTTTATGGATGCCTTTCGGAAAATTCTTCTGCGCATAGGCTTAAGGATGGA
AACGGTAGATTGGTTGCAATTCCCGTTGTCCTTTATTTATGCAGGTTTGACTGGATTTTCTGTGTCAGTAGTGAGAAGTT
TAGTGCAAAAATTATTGTCTCAATTTGGAGTGAGGCGCTTGGATAATTTTGCTATGGCCATGATGGCCTTGATGTTTCTC
ATGCCGAGTTTTCTCCTGACAACAGGCGGAGTCCTATCTTGCGCTTATGCCTTTATCATCACTATGTTGGATTTTGAGGA
TTTTAGTGGCTTTCGTAAAGCCGTACTAGAGAGTTTAAGTATCTCGTTAGGCATTTTACCGATTCTTATCCACTATTTTG
CAGAATTCCAACCTTGGTCTATCCTCTTGACCTTCCTTTTTTCAATCGTCTTTGATACATTTATGTTACCTCTCTTGAGT
CTAATTTTTCTCATTTCGCCTTTGTTTGCCTTCACTCAAGTTAATGTCTTCTTCCAATGGCTGGAAATGGTGATTCGTTG
GGTAGCTAGTTTGTCAACAAGGCCGATAATTTTAGGACAGCCAAATCTGCCTTTGCTTATTATTCTCCTGCTGGTCTTAG
CCTTGCTCTATGACTTTAGAAAACAAAAAAAGATTAGGGCTGGTCTGAGTCTCTTAGCAATCTTACTATTTTTCCTAAGT
AAATATCCTTTACAAAATGAAATCACAGTGGTAGATATTGGGCAGGGAGACAGTATATTTCTGAGGGATGTCAGAGGTAG
GACTATCGTGATTGATACAGGAGGACGGGTGGAGATTGGAAAAAAAGAAGCTTGGCAAGAGCGCGTGAGAAAAAGTAATG
CGGAAACGACCTTGATCCCTTATCTAAAAAGCCGAGGAGTGGATAGATTGGATAAATTGGTCTTGACTCACACCGATACA
GACCATATGGGAGATATGTTGGAGTTAGCTAAGCACTTTTCTATCCGAGAAATTTATGTTTCTAAAGGAAGTATGACTCA
AGTTGATTTTGTAGACAAGTTAAAGAAGATGAAAGCTAAAGTTCATGTGGTCGAGGTGGGAGATCGGCTTCCTATCTTTG
ATTCGGCCCTTGAAGTGCTCTATCCACTAGGTCAAGGAGATGGTGGTAATGATGACTCGATTGTCTTGTATGGTGAATTT
TTCCGGACCAAGTTTCTTTTCACAGGAGATTTAGAAGCTCCAGGTGAAGGTCAGATGGTGACAGCCTATCCAGATTTAAG
AGTAGATGTGCTCAAAGCTGGGCATCATGGTTCTAAAGGATCTTCTAGTCCAGAATTTCTAGAGCATATTAAGCCTAAGT
TGGCCTTGATTTCAGCTGGTAAAAACAACCGTTATCAACATCCCCATAAGGAAACTTTAGATAGATTCGAAAAAATCCAG
ACTAAGATTTTTCGGACAGATGAGCAGGGAGCTATTCGATTTAAAGGCTGGAATTCTTGGCAGATAGAAACGGTTCGCTA
G
ATGTCACAGTGGATTAAAAAACTTCCCCTTTCACCAATCTATTTGTGCTTTCTTTTGGTTTGGCTCTATTTCGCTATTTA
TAGTGGGGAAAAGCTCGCTTATCTGGGAGTTTTTCTGCTTATAGCTCGACTAATCTGGCATTATCCAAGAAAGAAATGGT
TGCCAACTGTAGCTATCCTAATTGCTTTTTCTATCTTTTTTTATGCTAGACGAGAGTTGGCGGAGCGAACCTTTCAGTCT
CAACCGGCTCCAGCAAGACAAGTTCTTGTTTTACCAGATACGGTTAAAGTGAATGGAGATTCCCTGTTTTTCCGTGGTAG
AATAGACGGAAGACTTTATCAGCTCTACTACAAATTAGCAAGTCCAAGAGAGAAAAAATCATTTCAAAAACTAGCAGACT
TGGTCACTTTAGAAATAGAAGGAGAGTTTAATCTAGCAGAAGGTCGGCGAAATTTTTCTGGTTTTGACTATCAGGCTTAT
TTAAAAAGCCAGGGAATTTATCGGACAGTTAAGATCAGTAGGATTATGTCCAGTCACTCTAGTCAATCTATCAATCCATT
TGATTGGCTGTCTGTTTGGCGTAGGAAGGCTTTGGTTTTCATTAAGTCTACTTTTCCAAGCCCGATGAGTCACTACATGA
CAGGACTTTTGTTTGGAGATTTAGATATTGAATTTGCAGAGATGAATGACTTGTACTCAAGTTTAGGAATTATCCATCTT
TTTGCTTTATCAGGAATGCAGGTTGGCTTTTTTATGGATGCCTTTCGGAAAATTCTTCTGCGCATAGGCTTAAGGATGGA
AACGGTAGATTGGTTGCAATTCCCGTTGTCCTTTATTTATGCAGGTTTGACTGGATTTTCTGTGTCAGTAGTGAGAAGTT
TAGTGCAAAAATTATTGTCTCAATTTGGAGTGAGGCGCTTGGATAATTTTGCTATGGCCATGATGGCCTTGATGTTTCTC
ATGCCGAGTTTTCTCCTGACAACAGGCGGAGTCCTATCTTGCGCTTATGCCTTTATCATCACTATGTTGGATTTTGAGGA
TTTTAGTGGCTTTCGTAAAGCCGTACTAGAGAGTTTAAGTATCTCGTTAGGCATTTTACCGATTCTTATCCACTATTTTG
CAGAATTCCAACCTTGGTCTATCCTCTTGACCTTCCTTTTTTCAATCGTCTTTGATACATTTATGTTACCTCTCTTGAGT
CTAATTTTTCTCATTTCGCCTTTGTTTGCCTTCACTCAAGTTAATGTCTTCTTCCAATGGCTGGAAATGGTGATTCGTTG
GGTAGCTAGTTTGTCAACAAGGCCGATAATTTTAGGACAGCCAAATCTGCCTTTGCTTATTATTCTCCTGCTGGTCTTAG
CCTTGCTCTATGACTTTAGAAAACAAAAAAAGATTAGGGCTGGTCTGAGTCTCTTAGCAATCTTACTATTTTTCCTAAGT
AAATATCCTTTACAAAATGAAATCACAGTGGTAGATATTGGGCAGGGAGACAGTATATTTCTGAGGGATGTCAGAGGTAG
GACTATCGTGATTGATACAGGAGGACGGGTGGAGATTGGAAAAAAAGAAGCTTGGCAAGAGCGCGTGAGAAAAAGTAATG
CGGAAACGACCTTGATCCCTTATCTAAAAAGCCGAGGAGTGGATAGATTGGATAAATTGGTCTTGACTCACACCGATACA
GACCATATGGGAGATATGTTGGAGTTAGCTAAGCACTTTTCTATCCGAGAAATTTATGTTTCTAAAGGAAGTATGACTCA
AGTTGATTTTGTAGACAAGTTAAAGAAGATGAAAGCTAAAGTTCATGTGGTCGAGGTGGGAGATCGGCTTCCTATCTTTG
ATTCGGCCCTTGAAGTGCTCTATCCACTAGGTCAAGGAGATGGTGGTAATGATGACTCGATTGTCTTGTATGGTGAATTT
TTCCGGACCAAGTTTCTTTTCACAGGAGATTTAGAAGCTCCAGGTGAAGGTCAGATGGTGACAGCCTATCCAGATTTAAG
AGTAGATGTGCTCAAAGCTGGGCATCATGGTTCTAAAGGATCTTCTAGTCCAGAATTTCTAGAGCATATTAAGCCTAAGT
TGGCCTTGATTTCAGCTGGTAAAAACAACCGTTATCAACATCCCCATAAGGAAACTTTAGATAGATTCGAAAAAATCCAG
ACTAAGATTTTTCGGACAGATGAGCAGGGAGCTATTCGATTTAAAGGCTGGAATTCTTGGCAGATAGAAACGGTTCGCTA
G
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC/celB | Streptococcus mitis SK321 |
57.564 |
100 |
0.576 |
| comEC/celB | Streptococcus mitis NCTC 12261 |
56.702 |
100 |
0.567 |
| comEC/celB | Streptococcus pneumoniae TIGR4 |
56.359 |
100 |
0.564 |
| comEC/celB | Streptococcus pneumoniae Rx1 |
55.823 |
100 |
0.559 |
| comEC/celB | Streptococcus pneumoniae D39 |
55.823 |
100 |
0.559 |
| comEC/celB | Streptococcus pneumoniae R6 |
55.823 |
100 |
0.559 |
| comEC | Lactococcus lactis subsp. cremoris KW2 |
49.597 |
99.732 |
0.495 |