Detailed information
Overview
| Name | comA/nlmT | Type | Regulator |
| Locus tag | GO995_RS08545 | Genome accession | NZ_CP046624 |
| Coordinates | 1711545..1713692 (+) | Length | 715 a.a. |
| NCBI ID | WP_157629401.1 | Uniprot ID | - |
| Organism | Streptococcus ruminicola strain CNU_G3 | ||
| Function | transport of ComC (predicted from homology) Competence regulation |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 1707460..1716303 | 1711545..1713692 | within | 0 |
Gene organization within MGE regions
Location: 1707460..1716303
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GO995_RS08515 (GO995_08515) | - | 1707765..1708838 (-) | 1074 | WP_238385871.1 | glycosyltransferase family 2 protein | - |
| GO995_RS08520 (GO995_08520) | - | 1709060..1709359 (-) | 300 | WP_157327346.1 | DUF3884 family protein | - |
| GO995_RS08525 (GO995_08525) | - | 1709385..1709567 (-) | 183 | WP_157327348.1 | hypothetical protein | - |
| GO995_RS08530 (GO995_08530) | - | 1709633..1709857 (-) | 225 | WP_039697856.1 | helix-turn-helix domain-containing protein | - |
| GO995_RS08535 (GO995_08535) | - | 1709847..1710845 (-) | 999 | WP_157629400.1 | thioredoxin family protein | - |
| GO995_RS08540 (GO995_08540) | - | 1711075..1711305 (-) | 231 | WP_074602958.1 | class IIb bacteriocin, lactobin A/cerein 7B family | - |
| GO995_RS08545 (GO995_08545) | comA/nlmT | 1711545..1713692 (+) | 2148 | WP_157629401.1 | peptide cleavage/export ABC transporter | Regulator |
| GO995_RS08550 (GO995_08550) | - | 1713702..1715084 (+) | 1383 | WP_157629402.1 | bacteriocin secretion accessory protein | - |
| GO995_RS08555 (GO995_08555) | - | 1715110..1715289 (-) | 180 | WP_157629403.1 | hypothetical protein | - |
| GO995_RS08560 (GO995_08560) | - | 1715629..1716303 (-) | 675 | WP_133018006.1 | CPBP family intramembrane glutamic endopeptidase | - |
Sequence
Protein
Download Length: 715 a.a. Molecular weight: 80114.24 Da Isoelectric Point: 8.6397
>NTDB_id=406192 GO995_RS08545 WP_157629401.1 1711545..1713692(+) (comA/nlmT) [Streptococcus ruminicola strain CNU_G3]
MRRYKYVSQIDMRDCGVAALASVAKHYGSHFSLAHLRELAKTNKEGTTALGIVEAAKKIGFETRAVKADIALFEMDDIPY
PCIVHVNKQGKLQHYYVIYKAKKDYLIIGDPDPSVGVTKMAKADFAKEWTGVTIFLAPAASYKPHKDKKNGLMSFLPIIF
KQKSLLTYIILASLLVTLINIVGSYYLQGILDDYIPNQLQSTLGIVSIGLVVTYIMQQIMSFSQQYLLVVLSQRLTIDVI
LSYIRHIFELPMSFFATRRTGEVISRFTDANSIIDALASTILSLFLDVSILFIVGSVLILQNAKLFLITLIALPVYAIII
FAFMKPFERMNHDVMQANSMVSSAIIEDINGIETIKSLTSEETRYRNIDSEFVEYLDKSFTLNKYEAMQASLKQGARLIL
NVAILWYGSRLVMTGNISVGQLITYNTLLSYFTRPMENIINLQTKLQSAKVANKRLNEVYLVSSEFDNKQVIDNPEFLKG
DLVFDDVSYKYGFGRDTLSHIKLRIKEGEKVSLVGVSGSGKTTLAKMMVNFYSPNQGQITLGSYDYKTIDKKVLRQHINY
LPQQSYVFSGSILDNLTLGASDAITQEDIIKACEIAEIRADIEAMPMAYHTELSDGAGLSGGQKQRLAIARALLTKSPVL
ILDEATSGLDVLTEKRVIHNLLALKDKTIIFVAHRLSIAKQTDHVIVLDKGQIIEQGHHEQLMQNPGFYAQLFQE
MRRYKYVSQIDMRDCGVAALASVAKHYGSHFSLAHLRELAKTNKEGTTALGIVEAAKKIGFETRAVKADIALFEMDDIPY
PCIVHVNKQGKLQHYYVIYKAKKDYLIIGDPDPSVGVTKMAKADFAKEWTGVTIFLAPAASYKPHKDKKNGLMSFLPIIF
KQKSLLTYIILASLLVTLINIVGSYYLQGILDDYIPNQLQSTLGIVSIGLVVTYIMQQIMSFSQQYLLVVLSQRLTIDVI
LSYIRHIFELPMSFFATRRTGEVISRFTDANSIIDALASTILSLFLDVSILFIVGSVLILQNAKLFLITLIALPVYAIII
FAFMKPFERMNHDVMQANSMVSSAIIEDINGIETIKSLTSEETRYRNIDSEFVEYLDKSFTLNKYEAMQASLKQGARLIL
NVAILWYGSRLVMTGNISVGQLITYNTLLSYFTRPMENIINLQTKLQSAKVANKRLNEVYLVSSEFDNKQVIDNPEFLKG
DLVFDDVSYKYGFGRDTLSHIKLRIKEGEKVSLVGVSGSGKTTLAKMMVNFYSPNQGQITLGSYDYKTIDKKVLRQHINY
LPQQSYVFSGSILDNLTLGASDAITQEDIIKACEIAEIRADIEAMPMAYHTELSDGAGLSGGQKQRLAIARALLTKSPVL
ILDEATSGLDVLTEKRVIHNLLALKDKTIIFVAHRLSIAKQTDHVIVLDKGQIIEQGHHEQLMQNPGFYAQLFQE
Nucleotide
Download Length: 2148 bp
>NTDB_id=406192 GO995_RS08545 WP_157629401.1 1711545..1713692(+) (comA/nlmT) [Streptococcus ruminicola strain CNU_G3]
ATGAGGAGATACAAATACGTTAGTCAGATTGATATGCGAGATTGTGGGGTAGCTGCTTTAGCTTCAGTCGCTAAACACTA
CGGCTCACATTTTTCACTTGCTCACTTGAGGGAACTGGCTAAGACGAATAAAGAAGGAACAACTGCTCTTGGAATCGTCG
AAGCAGCCAAAAAGATTGGCTTTGAAACACGTGCTGTAAAAGCTGATATAGCACTTTTTGAGATGGATGATATCCCGTAT
CCTTGTATTGTCCATGTGAATAAGCAAGGAAAACTGCAACACTATTATGTCATCTATAAGGCTAAAAAAGATTATCTAAT
CATCGGTGATCCTGACCCAAGTGTGGGAGTCACAAAAATGGCTAAAGCTGATTTTGCCAAAGAATGGACAGGGGTTACGA
TCTTTCTAGCTCCTGCAGCTAGCTATAAACCACATAAGGATAAGAAAAATGGCTTAATGAGTTTTTTGCCAATCATCTTT
AAACAAAAATCACTACTAACATATATAATCCTTGCCAGTCTACTTGTAACCTTGATTAATATTGTCGGCTCTTATTATCT
TCAAGGAATCTTGGACGACTATATTCCCAATCAACTCCAATCCACCCTTGGAATTGTCTCAATCGGCTTGGTCGTGACTT
ACATCATGCAACAGATTATGAGCTTTTCACAACAGTATTTGTTAGTTGTGCTCAGTCAACGTTTGACGATTGATGTCATT
TTATCTTATATTCGTCATATTTTTGAATTGCCCATGTCCTTTTTTGCGACAAGGCGAACTGGGGAAGTGATTTCACGCTT
TACCGACGCTAACTCTATCATCGATGCTCTTGCTTCAACAATCTTATCGCTTTTTCTCGATGTGAGTATCCTTTTCATCG
TTGGCAGTGTCTTGATTTTGCAAAATGCGAAACTCTTCTTAATAACCTTAATCGCTTTGCCTGTTTATGCCATCATTATC
TTTGCTTTTATGAAGCCTTTTGAGCGAATGAATCATGACGTCATGCAAGCCAATTCTATGGTCAGCTCTGCCATTATTGA
AGATATCAACGGGATTGAAACCATTAAATCACTAACTAGCGAAGAGACACGTTATCGCAATATCGATAGCGAATTCGTGG
AATACTTGGATAAAAGTTTCACACTTAATAAATACGAAGCAATGCAAGCCTCACTTAAGCAAGGAGCACGGCTGATTTTA
AATGTGGCTATTCTCTGGTACGGCTCACGTCTCGTTATGACAGGGAATATTTCTGTTGGGCAGTTGATTACCTACAACAC
TCTCTTATCTTACTTCACAAGACCAATGGAAAATATTATTAACTTGCAAACTAAGTTACAGTCAGCAAAAGTTGCTAACA
AGCGTCTTAACGAAGTTTATCTGGTATCTTCTGAATTTGATAACAAACAAGTAATTGATAATCCAGAGTTTCTCAAAGGT
GATCTTGTCTTTGATGATGTTTCTTATAAATACGGCTTTGGACGCGATACTCTTAGTCACATTAAACTTCGTATCAAGGA
AGGTGAAAAAGTTAGTCTCGTCGGCGTCAGTGGTTCTGGAAAAACGACTTTGGCTAAGATGATGGTAAACTTTTATAGTC
CAAATCAAGGTCAAATCACCTTGGGAAGCTACGACTATAAAACTATTGATAAAAAAGTTCTTCGTCAACACATCAACTAC
CTTCCACAACAATCTTACGTCTTTTCTGGAAGTATCCTTGATAATCTCACTTTAGGTGCTTCTGATGCGATTACTCAAGA
AGACATTATCAAGGCCTGTGAAATCGCTGAGATTCGTGCTGACATTGAAGCTATGCCAATGGCTTACCACACTGAATTAT
CAGATGGGGCTGGTTTATCTGGCGGTCAAAAACAACGATTAGCTATTGCCAGAGCACTTCTGACTAAGTCACCCGTCTTA
ATCCTAGATGAAGCAACAAGTGGTCTTGATGTGCTGACCGAAAAACGTGTTATCCACAATTTGTTAGCTTTAAAAGACAA
AACCATTATCTTTGTTGCTCACCGTTTAAGCATTGCCAAGCAAACTGACCATGTCATCGTCCTAGATAAAGGACAAATTA
TCGAACAAGGTCATCACGAGCAACTCATGCAAAATCCTGGCTTTTACGCTCAGCTATTTCAAGAATAA
ATGAGGAGATACAAATACGTTAGTCAGATTGATATGCGAGATTGTGGGGTAGCTGCTTTAGCTTCAGTCGCTAAACACTA
CGGCTCACATTTTTCACTTGCTCACTTGAGGGAACTGGCTAAGACGAATAAAGAAGGAACAACTGCTCTTGGAATCGTCG
AAGCAGCCAAAAAGATTGGCTTTGAAACACGTGCTGTAAAAGCTGATATAGCACTTTTTGAGATGGATGATATCCCGTAT
CCTTGTATTGTCCATGTGAATAAGCAAGGAAAACTGCAACACTATTATGTCATCTATAAGGCTAAAAAAGATTATCTAAT
CATCGGTGATCCTGACCCAAGTGTGGGAGTCACAAAAATGGCTAAAGCTGATTTTGCCAAAGAATGGACAGGGGTTACGA
TCTTTCTAGCTCCTGCAGCTAGCTATAAACCACATAAGGATAAGAAAAATGGCTTAATGAGTTTTTTGCCAATCATCTTT
AAACAAAAATCACTACTAACATATATAATCCTTGCCAGTCTACTTGTAACCTTGATTAATATTGTCGGCTCTTATTATCT
TCAAGGAATCTTGGACGACTATATTCCCAATCAACTCCAATCCACCCTTGGAATTGTCTCAATCGGCTTGGTCGTGACTT
ACATCATGCAACAGATTATGAGCTTTTCACAACAGTATTTGTTAGTTGTGCTCAGTCAACGTTTGACGATTGATGTCATT
TTATCTTATATTCGTCATATTTTTGAATTGCCCATGTCCTTTTTTGCGACAAGGCGAACTGGGGAAGTGATTTCACGCTT
TACCGACGCTAACTCTATCATCGATGCTCTTGCTTCAACAATCTTATCGCTTTTTCTCGATGTGAGTATCCTTTTCATCG
TTGGCAGTGTCTTGATTTTGCAAAATGCGAAACTCTTCTTAATAACCTTAATCGCTTTGCCTGTTTATGCCATCATTATC
TTTGCTTTTATGAAGCCTTTTGAGCGAATGAATCATGACGTCATGCAAGCCAATTCTATGGTCAGCTCTGCCATTATTGA
AGATATCAACGGGATTGAAACCATTAAATCACTAACTAGCGAAGAGACACGTTATCGCAATATCGATAGCGAATTCGTGG
AATACTTGGATAAAAGTTTCACACTTAATAAATACGAAGCAATGCAAGCCTCACTTAAGCAAGGAGCACGGCTGATTTTA
AATGTGGCTATTCTCTGGTACGGCTCACGTCTCGTTATGACAGGGAATATTTCTGTTGGGCAGTTGATTACCTACAACAC
TCTCTTATCTTACTTCACAAGACCAATGGAAAATATTATTAACTTGCAAACTAAGTTACAGTCAGCAAAAGTTGCTAACA
AGCGTCTTAACGAAGTTTATCTGGTATCTTCTGAATTTGATAACAAACAAGTAATTGATAATCCAGAGTTTCTCAAAGGT
GATCTTGTCTTTGATGATGTTTCTTATAAATACGGCTTTGGACGCGATACTCTTAGTCACATTAAACTTCGTATCAAGGA
AGGTGAAAAAGTTAGTCTCGTCGGCGTCAGTGGTTCTGGAAAAACGACTTTGGCTAAGATGATGGTAAACTTTTATAGTC
CAAATCAAGGTCAAATCACCTTGGGAAGCTACGACTATAAAACTATTGATAAAAAAGTTCTTCGTCAACACATCAACTAC
CTTCCACAACAATCTTACGTCTTTTCTGGAAGTATCCTTGATAATCTCACTTTAGGTGCTTCTGATGCGATTACTCAAGA
AGACATTATCAAGGCCTGTGAAATCGCTGAGATTCGTGCTGACATTGAAGCTATGCCAATGGCTTACCACACTGAATTAT
CAGATGGGGCTGGTTTATCTGGCGGTCAAAAACAACGATTAGCTATTGCCAGAGCACTTCTGACTAAGTCACCCGTCTTA
ATCCTAGATGAAGCAACAAGTGGTCTTGATGTGCTGACCGAAAAACGTGTTATCCACAATTTGTTAGCTTTAAAAGACAA
AACCATTATCTTTGTTGCTCACCGTTTAAGCATTGCCAAGCAAACTGACCATGTCATCGTCCTAGATAAAGGACAAATTA
TCGAACAAGGTCATCACGAGCAACTCATGCAAAATCCTGGCTTTTACGCTCAGCTATTTCAAGAATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA/nlmT | Streptococcus mutans UA159 |
69.144 |
99.72 |
0.69 |
| comA | Streptococcus mitis NCTC 12261 |
65.96 |
99.021 |
0.653 |
| comA | Streptococcus mitis SK321 |
64.972 |
99.021 |
0.643 |
| comA | Streptococcus pneumoniae Rx1 |
64.972 |
99.021 |
0.643 |
| comA | Streptococcus pneumoniae D39 |
64.972 |
99.021 |
0.643 |
| comA | Streptococcus pneumoniae R6 |
64.972 |
99.021 |
0.643 |
| comA | Streptococcus pneumoniae TIGR4 |
64.972 |
99.021 |
0.643 |
| comA | Streptococcus gordonii str. Challis substr. CH1 |
63.764 |
99.58 |
0.635 |