Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | CG798_RS18120 | Genome accession | NZ_CP022531 |
| Coordinates | 3628921..3631272 (-) | Length | 783 a.a. |
| NCBI ID | WP_094031847.1 | Uniprot ID | - |
| Organism | Bacillus velezensis strain TB1501 | ||
| Function | ssDNA transport into the cell (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 3623921..3636272
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| CG798_RS18090 (CG798_18090) | - | 3624287..3624625 (-) | 339 | WP_094031846.1 | YqxA family protein | - |
| CG798_RS18095 (CG798_18095) | - | 3624642..3625835 (-) | 1194 | WP_007612673.1 | stage II sporulation protein P | - |
| CG798_RS18100 (CG798_18100) | gpr | 3625903..3627009 (-) | 1107 | WP_007408268.1 | GPR endopeptidase | - |
| CG798_RS18105 (CG798_18105) | rpsT | 3627212..3627478 (+) | 267 | WP_003152876.1 | 30S ribosomal protein S20 | - |
| CG798_RS18110 (CG798_18110) | holA | 3627495..3628536 (-) | 1042 | Protein_3473 | DNA polymerase III subunit delta | - |
| CG798_RS19960 | - | 3628576..3628727 (-) | 152 | Protein_3474 | hypothetical protein | - |
| CG798_RS18115 (CG798_18115) | - | 3628768..3628902 (+) | 135 | WP_003152870.1 | YqzM family protein | - |
| CG798_RS18120 (CG798_18120) | comEC | 3628921..3631272 (-) | 2352 | WP_094031847.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
| CG798_RS18125 (CG798_18125) | - | 3631273..3631842 (-) | 570 | WP_094031848.1 | ComE operon protein 2 | - |
| CG798_RS18130 (CG798_18130) | comEA | 3631909..3632523 (-) | 615 | WP_043021637.1 | helix-hairpin-helix domain-containing protein | Machinery gene |
| CG798_RS18135 (CG798_18135) | comER | 3632582..3633403 (+) | 822 | WP_012118027.1 | late competence protein ComER | - |
| CG798_RS18140 (CG798_18140) | - | 3633472..3634209 (-) | 738 | WP_076424998.1 | class I SAM-dependent methyltransferase | - |
| CG798_RS18145 (CG798_18145) | rsfS | 3634206..3634562 (-) | 357 | WP_007408260.1 | ribosome silencing factor | - |
| CG798_RS18150 (CG798_18150) | yqeK | 3634580..3635140 (-) | 561 | WP_014418422.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | - |
| CG798_RS18155 (CG798_18155) | - | 3635130..3635699 (-) | 570 | WP_007408258.1 | nicotinate-nucleotide adenylyltransferase | - |
| CG798_RS18160 (CG798_18160) | yhbY | 3635710..3636000 (-) | 291 | WP_007408257.1 | ribosome assembly RNA-binding protein YhbY | - |
Sequence
Protein
Download Length: 783 a.a. Molecular weight: 86502.42 Da Isoelectric Point: 8.9041
>NTDB_id=240440 CG798_RS18120 WP_094031847.1 3628921..3631272(-) (comEC) [Bacillus velezensis strain TB1501]
MKYKYLLLPLAAVSATAGIAAAHVFLVLLLFLLYLLFIIVKTKQHAPVIVCLVSFCLYFFLYTVCDVANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYSQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGESKLAGLAMASLIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFGMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD
MKYKYLLLPLAAVSATAGIAAAHVFLVLLLFLLYLLFIIVKTKQHAPVIVCLVSFCLYFFLYTVCDVANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYSQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGESKLAGLAMASLIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFGMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD
Nucleotide
Download Length: 2352 bp
>NTDB_id=240440 CG798_RS18120 WP_094031847.1 3628921..3631272(-) (comEC) [Bacillus velezensis strain TB1501]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTTGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATCATTGTAAAAACAAAGCAGCATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTCTTTATTTCTTTCTTTATACGGTTTGTGACGTTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCAGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACACTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGGAAAGCAAACTTGCCGGGCTTGCGATGGCTTCATTGATTGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCTTTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTCTCAAGGCAGATGGGAGAATGTTTGTTTGGTATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCATTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCACTGATTTTAACCCATGCGGATCAAGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTAGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAAGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAGCAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTAAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTTGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATCATTGTAAAAACAAAGCAGCATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTCTTTATTTCTTTCTTTATACGGTTTGTGACGTTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCAGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACACTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGGAAAGCAAACTTGCCGGGCTTGCGATGGCTTCATTGATTGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCTTTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTCTCAAGGCAGATGGGAGAATGTTTGTTTGGTATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCATTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCACTGATTTTAACCCATGCGGATCAAGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTAGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAAGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAGCAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTAAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Bacillus subtilis subsp. subtilis str. 168 |
57.124 |
98.595 |
0.563 |