Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | CGZ47_RS04300 | Genome accession | NZ_CP022475 |
| Coordinates | 828959..831073 (+) | Length | 704 a.a. |
| NCBI ID | WP_223315707.1 | Uniprot ID | - |
| Organism | Latilactobacillus curvatus strain KG6 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 823959..836073
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CGZ47_RS04265 (CGZ47_04265) | - | 824011..825099 (+) | 1089 | WP_065825727.1 | CAP-associated domain-containing protein | - |
| CGZ47_RS04270 (CGZ47_04270) | - | 825119..825409 (+) | 291 | WP_004270748.1 | YlbG family protein | - |
| CGZ47_RS04275 (CGZ47_04275) | rsmD | 825415..825969 (+) | 555 | WP_004270750.1 | 16S rRNA (guanine(966)-N(2))-methyltransferase RsmD | - |
| CGZ47_RS04280 (CGZ47_04280) | coaD | 825981..826475 (+) | 495 | WP_065825725.1 | pantetheine-phosphate adenylyltransferase | - |
| CGZ47_RS04285 (CGZ47_04285) | - | 826477..827523 (+) | 1047 | WP_089542073.1 | SepM family pheromone-processing serine protease | - |
| CGZ47_RS04290 (CGZ47_04290) | comEA | 827600..828271 (+) | 672 | WP_004270751.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| CGZ47_RS04295 (CGZ47_04295) | - | 828330..828818 (+) | 489 | WP_004270740.1 | ComE operon protein 2 | - |
| CGZ47_RS04300 (CGZ47_04300) | comEC | 828959..831073 (+) | 2115 | WP_223315707.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| CGZ47_RS04305 (CGZ47_04305) | holA | 831074..832102 (+) | 1029 | WP_089542075.1 | DNA polymerase III subunit delta | - |
| CGZ47_RS04310 (CGZ47_04310) | rpsT | 832158..832412 (-) | 255 | WP_004271117.1 | 30S ribosomal protein S20 | - |
| CGZ47_RS04315 (CGZ47_04315) | rpsO | 832689..832958 (+) | 270 | WP_004271112.1 | 30S ribosomal protein S15 | - |
| CGZ47_RS04320 (CGZ47_04320) | - | 833124..834044 (+) | 921 | WP_011373852.1 | IS30-like element ISLsa1 family transposase | - |
| CGZ47_RS04325 (CGZ47_04325) | - | 834198..835916 (+) | 1719 | WP_056966011.1 | ribonuclease J | - |
Sequence
Protein
Download Length: 704 a.a. Molecular weight: 79837.22 Da Isoelectric Point: 9.4030
>NTDB_id=239831 CGZ47_RS04300 WP_223315707.1 828959..831073(+) (comEC) [Latilactobacillus curvatus strain KG6]
MVSGLVLCLGGCRFWLARRTYEQPVRPNNQVYKIQPDTIKVAGDQIQLMAQGQNDDQRVVCYYRCQSSAEKRRWQTTNQP
LLLAGDIELERIQGATNRNEFDYARFMGQQKHCFYQANIIDGTVFRRIPPQGWLDWFHQRRQQGLIYLRQLPPALSFHAE
ALLCGVRESDGLAYQNVLGQLGIIHLLSLSGLHVFYFVTIIRRLATLCRVPREWVNVSLFCLLPLYALFVGGGTSITRAI
GLILLRLICEVSHFQQSRLDSWSWVLLVNLLWQPYLLISMGGLLSYLMAFGLIYQRRHGTFTTAFWLGLLSLPVCLRFNY
RWHILTILINGIVAPIFLPVMLGLIMVAICLWPVSHMIVWLEEGLLTTLYKGLAWIATFPYATITFGKIQLIPLFLIVIG
TLWLMASTGRRAKWGWGAVISLYVISFIGIHFNPVGRVIMFDIGQGDSLLIQQPFNRHNLLIDTGGRLALPQEKWQRRQI
VSRAEKVTVNYLYSCGIDHLEAIALSHQDADHIGDLNEIMQHIRVKRLICAAGLPQNRQFQRQVRPHLTKVTIEPYLADQ
HFMIGHQQINVLAPVVAGPGGNADSLVLQTQIGGASWLFTGDLEKAGEVAIVDRYPQLRVDYLKVGHHGSQTASDPQSIA
TWQVKGALISAGRHNRYGHPHAATLQTLKRANVPFWNTADCGMLEWRYGFGQVPMIKTTLKDCD
MVSGLVLCLGGCRFWLARRTYEQPVRPNNQVYKIQPDTIKVAGDQIQLMAQGQNDDQRVVCYYRCQSSAEKRRWQTTNQP
LLLAGDIELERIQGATNRNEFDYARFMGQQKHCFYQANIIDGTVFRRIPPQGWLDWFHQRRQQGLIYLRQLPPALSFHAE
ALLCGVRESDGLAYQNVLGQLGIIHLLSLSGLHVFYFVTIIRRLATLCRVPREWVNVSLFCLLPLYALFVGGGTSITRAI
GLILLRLICEVSHFQQSRLDSWSWVLLVNLLWQPYLLISMGGLLSYLMAFGLIYQRRHGTFTTAFWLGLLSLPVCLRFNY
RWHILTILINGIVAPIFLPVMLGLIMVAICLWPVSHMIVWLEEGLLTTLYKGLAWIATFPYATITFGKIQLIPLFLIVIG
TLWLMASTGRRAKWGWGAVISLYVISFIGIHFNPVGRVIMFDIGQGDSLLIQQPFNRHNLLIDTGGRLALPQEKWQRRQI
VSRAEKVTVNYLYSCGIDHLEAIALSHQDADHIGDLNEIMQHIRVKRLICAAGLPQNRQFQRQVRPHLTKVTIEPYLADQ
HFMIGHQQINVLAPVVAGPGGNADSLVLQTQIGGASWLFTGDLEKAGEVAIVDRYPQLRVDYLKVGHHGSQTASDPQSIA
TWQVKGALISAGRHNRYGHPHAATLQTLKRANVPFWNTADCGMLEWRYGFGQVPMIKTTLKDCD
Nucleotide
Download Length: 2115 bp
>NTDB_id=239831 CGZ47_RS04300 WP_223315707.1 828959..831073(+) (comEC) [Latilactobacillus curvatus strain KG6]
ATGGTCAGTGGACTGGTACTGTGTTTAGGCGGTTGCCGTTTTTGGCTGGCACGGCGCACCTACGAACAGCCGGTTAGACC
AAATAATCAAGTATATAAGATTCAACCCGATACGATCAAAGTGGCTGGCGATCAGATTCAATTAATGGCGCAGGGGCAGA
ATGATGACCAACGTGTTGTTTGCTACTATCGCTGCCAGAGTTCAGCGGAAAAACGCCGTTGGCAAACGACGAATCAGCCG
TTATTATTAGCGGGTGATATTGAATTAGAGCGTATCCAAGGTGCTACTAACCGGAATGAGTTCGACTATGCGCGCTTTAT
GGGGCAGCAAAAGCATTGTTTTTATCAAGCAAATATAATTGACGGAACGGTCTTTAGACGCATTCCACCACAAGGTTGGC
TAGATTGGTTCCACCAACGGCGCCAACAGGGGCTGATTTATCTCCGACAATTGCCACCAGCGTTGAGTTTTCATGCAGAG
GCGTTGTTATGTGGTGTTCGAGAGTCAGATGGCCTGGCCTACCAGAATGTGTTAGGTCAATTGGGGATTATTCATTTATT
GAGTTTATCCGGATTGCATGTATTCTACTTTGTGACAATCATTCGTCGATTGGCAACGCTGTGTCGAGTCCCGCGTGAAT
GGGTGAATGTTAGCTTGTTTTGTTTGCTACCCCTTTATGCGTTGTTTGTGGGGGGAGGCACGAGTATTACACGCGCTATC
GGGCTGATCTTACTACGATTAATCTGCGAAGTAAGTCATTTTCAGCAATCGCGTTTAGATAGTTGGAGTTGGGTATTGCT
GGTCAATCTTTTATGGCAGCCGTATTTATTAATCAGTATGGGGGGATTGCTGAGCTATCTGATGGCTTTTGGATTGATTT
ATCAGCGGCGCCATGGTACTTTTACAACGGCATTTTGGCTAGGCTTGTTAAGTTTACCGGTGTGCTTGCGGTTTAATTAT
CGTTGGCATATCTTGACGATTTTGATTAATGGGATTGTCGCGCCGATTTTCTTACCAGTCATGTTGGGTTTGATTATGGT
AGCGATTTGCTTATGGCCTGTTAGTCATATGATAGTTTGGTTAGAAGAAGGCCTGCTAACGACGCTGTATAAAGGATTAG
CTTGGATTGCAACGTTCCCATATGCGACAATTACTTTCGGTAAAATCCAGCTAATACCACTCTTTTTAATTGTGATAGGC
ACACTATGGTTGATGGCAAGTACTGGTCGGCGAGCAAAGTGGGGCTGGGGGGCTGTTATCAGCCTGTATGTCATTAGTTT
TATTGGTATCCACTTTAATCCGGTTGGCCGCGTGATTATGTTTGATATCGGGCAAGGCGATAGCCTGTTGATTCAACAAC
CCTTCAATCGACATAACCTGTTAATTGACACCGGCGGCCGACTAGCTTTACCGCAAGAAAAGTGGCAACGGCGGCAAATT
GTGAGTCGGGCGGAAAAAGTAACAGTTAATTACCTCTACAGCTGTGGCATTGATCATTTAGAGGCAATCGCATTATCGCA
TCAGGATGCTGATCATATAGGGGACTTAAATGAAATTATGCAACATATCCGAGTTAAACGGTTGATTTGTGCTGCGGGCT
TACCGCAAAACCGGCAGTTTCAACGGCAAGTGCGGCCGCATCTTACAAAAGTTACAATTGAGCCTTATTTGGCGGACCAA
CATTTTATGATTGGTCACCAACAAATCAATGTGTTAGCGCCAGTGGTGGCCGGACCAGGCGGGAATGCTGACTCACTCGT
TTTGCAAACGCAAATTGGTGGTGCGAGTTGGTTATTTACTGGTGATTTAGAAAAAGCGGGTGAAGTGGCAATCGTTGATC
GCTACCCACAATTACGCGTTGATTACCTAAAGGTGGGTCACCATGGCAGTCAAACGGCGAGTGATCCCCAGTCAATTGCG
ACGTGGCAGGTAAAAGGGGCGTTGATTTCTGCGGGACGCCATAATCGTTATGGTCATCCACATGCCGCAACGTTGCAGAC
TTTAAAACGTGCGAACGTCCCATTTTGGAATACGGCTGACTGTGGGATGCTGGAATGGCGCTATGGTTTTGGCCAAGTGC
CAATGATTAAAACAACACTAAAGGATTGTGACTAA
ATGGTCAGTGGACTGGTACTGTGTTTAGGCGGTTGCCGTTTTTGGCTGGCACGGCGCACCTACGAACAGCCGGTTAGACC
AAATAATCAAGTATATAAGATTCAACCCGATACGATCAAAGTGGCTGGCGATCAGATTCAATTAATGGCGCAGGGGCAGA
ATGATGACCAACGTGTTGTTTGCTACTATCGCTGCCAGAGTTCAGCGGAAAAACGCCGTTGGCAAACGACGAATCAGCCG
TTATTATTAGCGGGTGATATTGAATTAGAGCGTATCCAAGGTGCTACTAACCGGAATGAGTTCGACTATGCGCGCTTTAT
GGGGCAGCAAAAGCATTGTTTTTATCAAGCAAATATAATTGACGGAACGGTCTTTAGACGCATTCCACCACAAGGTTGGC
TAGATTGGTTCCACCAACGGCGCCAACAGGGGCTGATTTATCTCCGACAATTGCCACCAGCGTTGAGTTTTCATGCAGAG
GCGTTGTTATGTGGTGTTCGAGAGTCAGATGGCCTGGCCTACCAGAATGTGTTAGGTCAATTGGGGATTATTCATTTATT
GAGTTTATCCGGATTGCATGTATTCTACTTTGTGACAATCATTCGTCGATTGGCAACGCTGTGTCGAGTCCCGCGTGAAT
GGGTGAATGTTAGCTTGTTTTGTTTGCTACCCCTTTATGCGTTGTTTGTGGGGGGAGGCACGAGTATTACACGCGCTATC
GGGCTGATCTTACTACGATTAATCTGCGAAGTAAGTCATTTTCAGCAATCGCGTTTAGATAGTTGGAGTTGGGTATTGCT
GGTCAATCTTTTATGGCAGCCGTATTTATTAATCAGTATGGGGGGATTGCTGAGCTATCTGATGGCTTTTGGATTGATTT
ATCAGCGGCGCCATGGTACTTTTACAACGGCATTTTGGCTAGGCTTGTTAAGTTTACCGGTGTGCTTGCGGTTTAATTAT
CGTTGGCATATCTTGACGATTTTGATTAATGGGATTGTCGCGCCGATTTTCTTACCAGTCATGTTGGGTTTGATTATGGT
AGCGATTTGCTTATGGCCTGTTAGTCATATGATAGTTTGGTTAGAAGAAGGCCTGCTAACGACGCTGTATAAAGGATTAG
CTTGGATTGCAACGTTCCCATATGCGACAATTACTTTCGGTAAAATCCAGCTAATACCACTCTTTTTAATTGTGATAGGC
ACACTATGGTTGATGGCAAGTACTGGTCGGCGAGCAAAGTGGGGCTGGGGGGCTGTTATCAGCCTGTATGTCATTAGTTT
TATTGGTATCCACTTTAATCCGGTTGGCCGCGTGATTATGTTTGATATCGGGCAAGGCGATAGCCTGTTGATTCAACAAC
CCTTCAATCGACATAACCTGTTAATTGACACCGGCGGCCGACTAGCTTTACCGCAAGAAAAGTGGCAACGGCGGCAAATT
GTGAGTCGGGCGGAAAAAGTAACAGTTAATTACCTCTACAGCTGTGGCATTGATCATTTAGAGGCAATCGCATTATCGCA
TCAGGATGCTGATCATATAGGGGACTTAAATGAAATTATGCAACATATCCGAGTTAAACGGTTGATTTGTGCTGCGGGCT
TACCGCAAAACCGGCAGTTTCAACGGCAAGTGCGGCCGCATCTTACAAAAGTTACAATTGAGCCTTATTTGGCGGACCAA
CATTTTATGATTGGTCACCAACAAATCAATGTGTTAGCGCCAGTGGTGGCCGGACCAGGCGGGAATGCTGACTCACTCGT
TTTGCAAACGCAAATTGGTGGTGCGAGTTGGTTATTTACTGGTGATTTAGAAAAAGCGGGTGAAGTGGCAATCGTTGATC
GCTACCCACAATTACGCGTTGATTACCTAAAGGTGGGTCACCATGGCAGTCAAACGGCGAGTGATCCCCAGTCAATTGCG
ACGTGGCAGGTAAAAGGGGCGTTGATTTCTGCGGGACGCCATAATCGTTATGGTCATCCACATGCCGCAACGTTGCAGAC
TTTAAAACGTGCGAACGTCCCATTTTGGAATACGGCTGACTGTGGGATGCTGGAATGGCGCTATGGTTTTGGCCAAGTGC
CAATGATTAAAACAACACTAAAGGATTGTGACTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Latilactobacillus sakei subsp. sakei 23K |
62.942 |
100 |
0.632 |