Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | ACGI6K_RS19960 | Genome accession | NZ_CP171449 |
| Coordinates | 4117763..4119259 (+) | Length | 498 a.a. |
| NCBI ID | WP_376944608.1 | Uniprot ID | - |
| Organism | Azorhizophilus paspali strain ATCC 23833 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 4112763..4124259
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ACGI6K_RS19925 (ACGI6K_19925) | - | 4112803..4113504 (+) | 702 | WP_376944597.1 | HAD family hydrolase | - |
| ACGI6K_RS19930 (ACGI6K_19930) | - | 4113550..4114317 (+) | 768 | Protein_3905 | CPBP family intramembrane glutamic endopeptidase | - |
| ACGI6K_RS19935 (ACGI6K_19935) | sutA | 4114384..4114704 (-) | 321 | WP_376944599.1 | transcriptional regulator SutA | - |
| ACGI6K_RS19940 (ACGI6K_19940) | - | 4114829..4115254 (-) | 426 | WP_376944601.1 | secondary thiamine-phosphate synthase enzyme YjbQ | - |
| ACGI6K_RS19945 (ACGI6K_19945) | - | 4115404..4116720 (-) | 1317 | WP_376944603.1 | ammonium transporter | - |
| ACGI6K_RS19950 (ACGI6K_19950) | glnK | 4116753..4117091 (-) | 339 | WP_012703245.1 | P-II family nitrogen regulator | - |
| ACGI6K_RS19955 (ACGI6K_19955) | - | 4117456..4117731 (+) | 276 | WP_376944606.1 | accessory factor UbiK family protein | - |
| ACGI6K_RS19960 (ACGI6K_19960) | comM | 4117763..4119259 (+) | 1497 | WP_376944608.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| ACGI6K_RS19965 (ACGI6K_19965) | - | 4119333..4120511 (+) | 1179 | Protein_3912 | AAA family ATPase | - |
| ACGI6K_RS19970 (ACGI6K_19970) | - | 4120508..4121389 (-) | 882 | WP_376944610.1 | LysR family transcriptional regulator | - |
| ACGI6K_RS19975 (ACGI6K_19975) | - | 4121504..4121926 (+) | 423 | WP_376944612.1 | DoxX family protein | - |
| ACGI6K_RS19980 (ACGI6K_19980) | - | 4121976..4122704 (+) | 729 | WP_376944614.1 | pirin family protein | - |
Sequence
Protein
Download Length: 498 a.a. Molecular weight: 52422.36 Da Isoelectric Point: 8.3343
>NTDB_id=1061264 ACGI6K_RS19960 WP_376944608.1 4117763..4119259(+) (comM) [Azorhizophilus paspali strain ATCC 23833]
MSLAIVHSRAQVGVEAPAVTVEAHLANGLPALTLVGLPETAVRESKDRVRSAILTSGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASEQLPAGSLDGLECLGELALSGGLRPVRGVLPAALAAHAAGRTLVVPRANAEEASLASGLNVLAIDHL
LELAAHLNGRTPLAPYRSSGLLRQTRPYPDLAEVQGQAAAKRALLVAAAGAHNLLLSGPPGTGKTLLASRLPGLLPPLDE
REALEVAAIHSVAGSAPLSAWPQRPFRQPHHSASGPALVGGGSRPRPGEITLAHRGVLFLDELPEFDRKVLEVLREPLES
GEIVIARASDKVRFPARFQLVAAMNPCPCGYLGDPAGRCRCTPEQIQRYRAKLSGPLLDRIDLHVGVARETTALGAPRLD
GPDSAGAAAQVALARNLQLARQGCPNAFLDLPGLHQHCALSDEDRQWLERACERLGLSLRAAHRVLKVARTLADLEALPS
IARAHLAEALQYRPAAHA
MSLAIVHSRAQVGVEAPAVTVEAHLANGLPALTLVGLPETAVRESKDRVRSAILTSGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASEQLPAGSLDGLECLGELALSGGLRPVRGVLPAALAAHAAGRTLVVPRANAEEASLASGLNVLAIDHL
LELAAHLNGRTPLAPYRSSGLLRQTRPYPDLAEVQGQAAAKRALLVAAAGAHNLLLSGPPGTGKTLLASRLPGLLPPLDE
REALEVAAIHSVAGSAPLSAWPQRPFRQPHHSASGPALVGGGSRPRPGEITLAHRGVLFLDELPEFDRKVLEVLREPLES
GEIVIARASDKVRFPARFQLVAAMNPCPCGYLGDPAGRCRCTPEQIQRYRAKLSGPLLDRIDLHVGVARETTALGAPRLD
GPDSAGAAAQVALARNLQLARQGCPNAFLDLPGLHQHCALSDEDRQWLERACERLGLSLRAAHRVLKVARTLADLEALPS
IARAHLAEALQYRPAAHA
Nucleotide
Download Length: 1497 bp
>NTDB_id=1061264 ACGI6K_RS19960 WP_376944608.1 4117763..4119259(+) (comM) [Azorhizophilus paspali strain ATCC 23833]
ATGTCCCTGGCCATCGTCCACAGCCGTGCCCAGGTAGGCGTCGAGGCGCCCGCCGTCACCGTGGAGGCGCACCTGGCCAA
CGGTTTGCCGGCGTTGACCCTGGTCGGCCTGCCGGAAACCGCGGTCCGCGAGAGCAAGGATCGCGTGCGCAGCGCCATCC
TCACTTCCGGCTTCGATTTCCCGGCACGGCGCATCACCCTCAACCTGGCGCCCGCCGACCTGCCCAAGGACGGCGGACGC
TTCGACCTGGCCATCGCCCTCGGCATCCTCGCCGCCAGCGAGCAATTGCCCGCGGGGTCTCTGGACGGCCTGGAATGCCT
GGGCGAACTGGCCCTCTCCGGCGGCCTGCGCCCGGTTCGGGGCGTGCTGCCCGCCGCGCTGGCCGCGCACGCCGCCGGAC
GCACCCTGGTGGTGCCGCGGGCGAACGCCGAAGAGGCCAGCCTGGCGTCGGGTCTGAACGTGCTGGCGATCGACCACCTG
CTGGAACTGGCCGCCCACCTGAACGGCCGGACCCCGCTCGCGCCCTACCGGTCCAGCGGCCTGCTGCGACAGACGCGCCC
CTACCCCGACCTCGCCGAGGTGCAGGGCCAGGCCGCGGCCAAGCGCGCCCTGCTGGTGGCGGCGGCAGGAGCTCACAATC
TCCTGTTGAGCGGTCCGCCCGGAACCGGCAAGACCTTGCTGGCCAGCCGTCTGCCGGGCCTGCTGCCGCCTTTGGACGAA
CGCGAGGCGCTGGAGGTGGCGGCGATCCATTCGGTGGCCGGTAGCGCGCCGCTCTCCGCCTGGCCGCAGCGGCCGTTCCG
CCAGCCCCATCATAGCGCCTCGGGACCGGCGCTGGTCGGCGGCGGCAGCCGGCCGCGCCCCGGCGAAATCACCCTGGCGC
ACCGGGGCGTGCTGTTTCTCGACGAATTGCCCGAATTCGACCGCAAGGTGCTGGAGGTGCTGCGCGAGCCTCTGGAAAGC
GGCGAAATCGTTATCGCCCGGGCCAGCGATAAGGTGCGCTTTCCGGCGCGCTTCCAGTTGGTGGCGGCGATGAACCCCTG
TCCCTGCGGCTATCTGGGCGACCCCGCCGGCCGTTGTCGCTGCACCCCGGAACAGATCCAGCGTTACCGCGCCAAGCTGT
CCGGCCCGCTGCTCGACCGCATCGACCTGCACGTCGGCGTCGCCCGCGAGACCACCGCCCTGGGCGCGCCGCGCCTGGAC
GGCCCGGACAGCGCCGGCGCCGCCGCCCAGGTGGCGTTGGCGCGCAACCTGCAACTGGCCCGCCAGGGCTGCCCCAATGC
CTTCCTCGACCTGCCCGGGCTGCACCAGCACTGTGCACTGAGCGACGAAGACCGCCAGTGGCTGGAACGCGCCTGCGAAC
GCCTCGGCCTGTCGCTGCGCGCCGCCCACCGCGTCCTCAAGGTGGCACGCACCCTGGCCGATCTGGAGGCGCTGCCGAGC
ATCGCCCGCGCCCACCTGGCCGAAGCGCTGCAATACCGGCCGGCGGCGCATGCCTGA
ATGTCCCTGGCCATCGTCCACAGCCGTGCCCAGGTAGGCGTCGAGGCGCCCGCCGTCACCGTGGAGGCGCACCTGGCCAA
CGGTTTGCCGGCGTTGACCCTGGTCGGCCTGCCGGAAACCGCGGTCCGCGAGAGCAAGGATCGCGTGCGCAGCGCCATCC
TCACTTCCGGCTTCGATTTCCCGGCACGGCGCATCACCCTCAACCTGGCGCCCGCCGACCTGCCCAAGGACGGCGGACGC
TTCGACCTGGCCATCGCCCTCGGCATCCTCGCCGCCAGCGAGCAATTGCCCGCGGGGTCTCTGGACGGCCTGGAATGCCT
GGGCGAACTGGCCCTCTCCGGCGGCCTGCGCCCGGTTCGGGGCGTGCTGCCCGCCGCGCTGGCCGCGCACGCCGCCGGAC
GCACCCTGGTGGTGCCGCGGGCGAACGCCGAAGAGGCCAGCCTGGCGTCGGGTCTGAACGTGCTGGCGATCGACCACCTG
CTGGAACTGGCCGCCCACCTGAACGGCCGGACCCCGCTCGCGCCCTACCGGTCCAGCGGCCTGCTGCGACAGACGCGCCC
CTACCCCGACCTCGCCGAGGTGCAGGGCCAGGCCGCGGCCAAGCGCGCCCTGCTGGTGGCGGCGGCAGGAGCTCACAATC
TCCTGTTGAGCGGTCCGCCCGGAACCGGCAAGACCTTGCTGGCCAGCCGTCTGCCGGGCCTGCTGCCGCCTTTGGACGAA
CGCGAGGCGCTGGAGGTGGCGGCGATCCATTCGGTGGCCGGTAGCGCGCCGCTCTCCGCCTGGCCGCAGCGGCCGTTCCG
CCAGCCCCATCATAGCGCCTCGGGACCGGCGCTGGTCGGCGGCGGCAGCCGGCCGCGCCCCGGCGAAATCACCCTGGCGC
ACCGGGGCGTGCTGTTTCTCGACGAATTGCCCGAATTCGACCGCAAGGTGCTGGAGGTGCTGCGCGAGCCTCTGGAAAGC
GGCGAAATCGTTATCGCCCGGGCCAGCGATAAGGTGCGCTTTCCGGCGCGCTTCCAGTTGGTGGCGGCGATGAACCCCTG
TCCCTGCGGCTATCTGGGCGACCCCGCCGGCCGTTGTCGCTGCACCCCGGAACAGATCCAGCGTTACCGCGCCAAGCTGT
CCGGCCCGCTGCTCGACCGCATCGACCTGCACGTCGGCGTCGCCCGCGAGACCACCGCCCTGGGCGCGCCGCGCCTGGAC
GGCCCGGACAGCGCCGGCGCCGCCGCCCAGGTGGCGTTGGCGCGCAACCTGCAACTGGCCCGCCAGGGCTGCCCCAATGC
CTTCCTCGACCTGCCCGGGCTGCACCAGCACTGTGCACTGAGCGACGAAGACCGCCAGTGGCTGGAACGCGCCTGCGAAC
GCCTCGGCCTGTCGCTGCGCGCCGCCCACCGCGTCCTCAAGGTGGCACGCACCCTGGCCGATCTGGAGGCGCTGCCGAGC
ATCGCCCGCGCCCACCTGGCCGAAGCGCTGCAATACCGGCCGGCGGCGCATGCCTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 |
56.74 |
99.799 |
0.566 |
| comM | Vibrio cholerae strain A1552 |
56.338 |
99.799 |
0.562 |
| comM | Haemophilus influenzae Rd KW20 |
55 |
100 |
0.552 |
| comM | Glaesserella parasuis strain SC1401 |
54.2 |
100 |
0.544 |
| comM | Legionella pneumophila str. Paris |
50.198 |
100 |
0.508 |
| comM | Legionella pneumophila strain ERS1305867 |
50.198 |
100 |
0.508 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
45.219 |
100 |
0.456 |