Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | VOI35_RS09190 | Genome accession | NZ_CP142685 |
| Coordinates | 1971311..1972822 (-) | Length | 503 a.a. |
| NCBI ID | WP_407275749.1 | Uniprot ID | - |
| Organism | Halothiobacillus sp. DCM-1 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1966311..1977822
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| VOI35_RS09170 (VOI35_09155) | erpA | 1967477..1967830 (-) | 354 | WP_407275745.1 | iron-sulfur cluster insertion protein ErpA | - |
| VOI35_RS09175 (VOI35_09160) | argC | 1967851..1968885 (-) | 1035 | WP_407275746.1 | N-acetyl-gamma-glutamyl-phosphate reductase | - |
| VOI35_RS09180 (VOI35_09165) | - | 1969018..1970763 (+) | 1746 | WP_407275747.1 | chloride channel protein | - |
| VOI35_RS09185 (VOI35_09170) | - | 1970809..1971324 (+) | 516 | WP_407275748.1 | peroxiredoxin family protein | - |
| VOI35_RS09190 (VOI35_09175) | comM | 1971311..1972822 (-) | 1512 | WP_407275749.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| VOI35_RS09195 (VOI35_09180) | - | 1972868..1973143 (-) | 276 | WP_407275750.1 | accessory factor UbiK family protein | - |
| VOI35_RS09200 (VOI35_09185) | - | 1973400..1974197 (+) | 798 | WP_407275751.1 | TorF family putative porin | - |
| VOI35_RS09205 (VOI35_09190) | - | 1974222..1974560 (+) | 339 | WP_407275752.1 | P-II family nitrogen regulator | - |
| VOI35_RS09210 (VOI35_09195) | - | 1974572..1975879 (+) | 1308 | WP_407275753.1 | ammonium transporter | - |
| VOI35_RS09215 (VOI35_09200) | - | 1976106..1977554 (+) | 1449 | WP_407275754.1 | DegQ family serine endoprotease | - |
Sequence
Protein
Download Length: 503 a.a. Molecular weight: 52545.59 Da Isoelectric Point: 9.1623
>NTDB_id=926420 VOI35_RS09190 WP_407275749.1 1971311..1972822(-) (comM) [Halothiobacillus sp. DCM-1]
MTIAKVFSRAQAGIDAPLITVEAHVANGLPSITLVGLPETVVRESRDRVRAAILSSGFEFPPRRLTINLAPADLPKEGAR
FDLPIALAILLVTGQIPGDPAEQAHRLAAMECVGELSLDGGVCGVRGALSAALGAQSAGRALFVPPANAEEATVVRGVQA
AAVPHLRALVAHVAGQRALPWLTAVTPPAVDPAAGIDLADVRGQPRAKRALALVAAGAHNLLLIGPPGAGKSMLAQRLPG
ILPPMTEAEAMESARIASASSLGFTAADFGRRRFRHPHHSASAVALIGGGSTPRPGEISLAHHGVLFLDELPEFDRRVLE
ALREPLENGVITISRAAQQVEFPARFQLVAAMNPSPQGEEVHSLAAQRYRARISGPLLDRIDIQLELPRVAAADLQPGGA
AEPSSSEVRAQVAAARAVMLARQAKSNAALLPAELAAVAPLAPRDAALLHQAMDKLRLSARAHDRILKVARTIADFDGKA
AIDTAALTEALGYRALDRLFSPS
MTIAKVFSRAQAGIDAPLITVEAHVANGLPSITLVGLPETVVRESRDRVRAAILSSGFEFPPRRLTINLAPADLPKEGAR
FDLPIALAILLVTGQIPGDPAEQAHRLAAMECVGELSLDGGVCGVRGALSAALGAQSAGRALFVPPANAEEATVVRGVQA
AAVPHLRALVAHVAGQRALPWLTAVTPPAVDPAAGIDLADVRGQPRAKRALALVAAGAHNLLLIGPPGAGKSMLAQRLPG
ILPPMTEAEAMESARIASASSLGFTAADFGRRRFRHPHHSASAVALIGGGSTPRPGEISLAHHGVLFLDELPEFDRRVLE
ALREPLENGVITISRAAQQVEFPARFQLVAAMNPSPQGEEVHSLAAQRYRARISGPLLDRIDIQLELPRVAAADLQPGGA
AEPSSSEVRAQVAAARAVMLARQAKSNAALLPAELAAVAPLAPRDAALLHQAMDKLRLSARAHDRILKVARTIADFDGKA
AIDTAALTEALGYRALDRLFSPS
Nucleotide
Download Length: 1512 bp
>NTDB_id=926420 VOI35_RS09190 WP_407275749.1 1971311..1972822(-) (comM) [Halothiobacillus sp. DCM-1]
ATGACAATCGCCAAGGTATTTAGCCGCGCGCAAGCGGGGATCGATGCCCCGTTGATTACGGTCGAGGCGCATGTGGCTAA
CGGCTTGCCGAGCATCACCTTGGTGGGCTTGCCCGAAACCGTGGTGCGGGAAAGCCGCGATCGGGTGCGTGCGGCGATTT
TGTCCTCGGGGTTTGAGTTTCCGCCGCGTCGCCTGACCATCAATCTCGCACCGGCGGACTTGCCCAAAGAAGGTGCGCGG
TTTGATTTGCCGATTGCGCTCGCCATTTTATTGGTGACCGGGCAGATTCCCGGCGATCCCGCCGAGCAGGCCCATCGGTT
GGCCGCAATGGAGTGCGTGGGCGAGCTCTCGCTCGATGGGGGCGTTTGCGGCGTGCGCGGCGCGCTCTCGGCGGCATTGG
GTGCGCAGTCAGCGGGGCGAGCCTTGTTTGTGCCGCCCGCCAATGCCGAGGAGGCTACCGTGGTGCGCGGCGTGCAGGCT
GCAGCGGTGCCCCATTTGCGGGCATTGGTCGCACATGTCGCGGGGCAGCGCGCCTTACCTTGGCTCACGGCGGTAACGCC
GCCCGCCGTCGATCCGGCAGCCGGGATCGATCTGGCGGATGTGCGTGGACAGCCGCGCGCGAAACGAGCCTTGGCATTGG
TGGCGGCGGGCGCGCACAACCTGCTGCTGATCGGCCCGCCGGGGGCGGGCAAGAGTATGTTGGCGCAGCGTTTGCCGGGA
ATTTTGCCGCCAATGACCGAGGCGGAGGCGATGGAATCGGCACGGATTGCCTCGGCGTCGAGCCTGGGCTTTACGGCAGC
GGATTTCGGCCGCCGACGCTTCCGGCATCCGCATCACTCCGCCTCGGCGGTGGCGCTGATCGGCGGTGGTTCGACCCCGC
GGCCCGGCGAGATTTCGCTCGCGCATCACGGCGTGCTCTTTCTCGATGAACTCCCGGAATTCGACCGGCGGGTGCTCGAG
GCCTTGCGCGAGCCGCTGGAAAACGGCGTGATCACCATTTCCCGCGCGGCGCAGCAGGTGGAATTTCCGGCGCGGTTTCA
ACTGGTGGCGGCGATGAACCCGTCGCCGCAGGGCGAGGAGGTGCACTCGCTGGCCGCCCAGCGGTATCGGGCGCGAATTT
CCGGCCCGCTGCTCGACCGGATCGATATTCAATTGGAGCTGCCGCGCGTGGCGGCCGCCGATTTGCAACCTGGCGGTGCA
GCAGAGCCCTCGTCGTCGGAGGTGCGTGCGCAGGTGGCGGCGGCACGCGCCGTCATGCTGGCTCGCCAAGCTAAGTCCAA
TGCCGCGTTGTTGCCGGCGGAACTGGCCGCCGTGGCGCCGCTGGCGCCCCGCGATGCCGCGTTGCTGCATCAGGCGATGG
ATAAGCTGCGCCTCTCGGCGCGCGCGCATGACCGCATTCTGAAGGTGGCGCGCACGATTGCCGATTTCGACGGCAAGGCC
GCCATCGATACCGCCGCGCTCACTGAGGCGCTGGGGTATCGCGCTCTGGATCGGCTGTTCAGCCCGTCGTGA
ATGACAATCGCCAAGGTATTTAGCCGCGCGCAAGCGGGGATCGATGCCCCGTTGATTACGGTCGAGGCGCATGTGGCTAA
CGGCTTGCCGAGCATCACCTTGGTGGGCTTGCCCGAAACCGTGGTGCGGGAAAGCCGCGATCGGGTGCGTGCGGCGATTT
TGTCCTCGGGGTTTGAGTTTCCGCCGCGTCGCCTGACCATCAATCTCGCACCGGCGGACTTGCCCAAAGAAGGTGCGCGG
TTTGATTTGCCGATTGCGCTCGCCATTTTATTGGTGACCGGGCAGATTCCCGGCGATCCCGCCGAGCAGGCCCATCGGTT
GGCCGCAATGGAGTGCGTGGGCGAGCTCTCGCTCGATGGGGGCGTTTGCGGCGTGCGCGGCGCGCTCTCGGCGGCATTGG
GTGCGCAGTCAGCGGGGCGAGCCTTGTTTGTGCCGCCCGCCAATGCCGAGGAGGCTACCGTGGTGCGCGGCGTGCAGGCT
GCAGCGGTGCCCCATTTGCGGGCATTGGTCGCACATGTCGCGGGGCAGCGCGCCTTACCTTGGCTCACGGCGGTAACGCC
GCCCGCCGTCGATCCGGCAGCCGGGATCGATCTGGCGGATGTGCGTGGACAGCCGCGCGCGAAACGAGCCTTGGCATTGG
TGGCGGCGGGCGCGCACAACCTGCTGCTGATCGGCCCGCCGGGGGCGGGCAAGAGTATGTTGGCGCAGCGTTTGCCGGGA
ATTTTGCCGCCAATGACCGAGGCGGAGGCGATGGAATCGGCACGGATTGCCTCGGCGTCGAGCCTGGGCTTTACGGCAGC
GGATTTCGGCCGCCGACGCTTCCGGCATCCGCATCACTCCGCCTCGGCGGTGGCGCTGATCGGCGGTGGTTCGACCCCGC
GGCCCGGCGAGATTTCGCTCGCGCATCACGGCGTGCTCTTTCTCGATGAACTCCCGGAATTCGACCGGCGGGTGCTCGAG
GCCTTGCGCGAGCCGCTGGAAAACGGCGTGATCACCATTTCCCGCGCGGCGCAGCAGGTGGAATTTCCGGCGCGGTTTCA
ACTGGTGGCGGCGATGAACCCGTCGCCGCAGGGCGAGGAGGTGCACTCGCTGGCCGCCCAGCGGTATCGGGCGCGAATTT
CCGGCCCGCTGCTCGACCGGATCGATATTCAATTGGAGCTGCCGCGCGTGGCGGCCGCCGATTTGCAACCTGGCGGTGCA
GCAGAGCCCTCGTCGTCGGAGGTGCGTGCGCAGGTGGCGGCGGCACGCGCCGTCATGCTGGCTCGCCAAGCTAAGTCCAA
TGCCGCGTTGTTGCCGGCGGAACTGGCCGCCGTGGCGCCGCTGGCGCCCCGCGATGCCGCGTTGCTGCATCAGGCGATGG
ATAAGCTGCGCCTCTCGGCGCGCGCGCATGACCGCATTCTGAAGGTGGCGCGCACGATTGCCGATTTCGACGGCAAGGCC
GCCATCGATACCGCCGCGCTCACTGAGGCGCTGGGGTATCGCGCTCTGGATCGGCTGTTCAGCCCGTCGTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 |
51.181 |
100 |
0.517 |
| comM | Vibrio cholerae strain A1552 |
50.888 |
100 |
0.513 |
| comM | Glaesserella parasuis strain SC1401 |
49.116 |
100 |
0.497 |
| comM | Haemophilus influenzae Rd KW20 |
48.527 |
100 |
0.491 |
| comM | Legionella pneumophila str. Paris |
44.794 |
100 |
0.453 |
| comM | Legionella pneumophila strain ERS1305867 |
44.794 |
100 |
0.453 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
43.58 |
100 |
0.445 |