Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | B1781_RS21380 | Genome accession | NZ_CP019936 |
| Coordinates | 4521493..4523010 (+) | Length | 505 a.a. |
| NCBI ID | WP_078121627.1 | Uniprot ID | - |
| Organism | Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a | ||
| Function | Required for natural transformation (predicted from homology) Unclear |
Genomic Context
Location: 4516493..4528010
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| B1781_RS21355 | - | 4517326..4518375 (+) | 1050 | WP_078121622.1 | succinylglutamate desuccinylase/aspartoacylase family protein | - |
| B1781_RS21360 | - | 4518475..4519761 (-) | 1287 | WP_078121623.1 | ammonium transporter | - |
| B1781_RS21365 | glnK | 4519783..4520121 (-) | 339 | WP_078121624.1 | P-II family nitrogen regulator | - |
| B1781_RS21370 | - | 4520200..4520913 (-) | 714 | WP_078121625.1 | TorF family putative porin | - |
| B1781_RS21375 | - | 4521198..4521446 (+) | 249 | WP_078121626.1 | accessory factor UbiK family protein | - |
| B1781_RS21380 | comM | 4521493..4523010 (+) | 1518 | WP_078121627.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| B1781_RS21385 | - | 4523086..4524987 (+) | 1902 | WP_078121628.1 | ATP-binding cassette domain-containing protein | - |
| B1781_RS21390 | - | 4525006..4525599 (+) | 594 | WP_078121629.1 | hypothetical protein | - |
| B1781_RS21395 | - | 4525630..4526748 (+) | 1119 | WP_078121630.1 | PQQ-dependent sugar dehydrogenase | - |
| B1781_RS21400 | arfB | 4526745..4527161 (+) | 417 | WP_408646378.1 | alternative ribosome rescue aminoacyl-tRNA hydrolase ArfB | - |
| B1781_RS21405 | - | 4527174..4527923 (-) | 750 | WP_334223815.1 | sulfite exporter TauE/SafE family protein | - |
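The Size (bp) column follows from the inclusive coordinates in each row, and for a protein-coding gene the protein length follows from the size. A minimal sketch of that arithmetic is given here, assuming 1-based inclusive coordinates and a complete coding sequence ending in a single stop codon; the function names `parse_coordinates`, `span_length`, and `protein_length` are hypothetical helpers for illustration, not part of any database tooling.

```python
import re


def parse_coordinates(text):
    """Parse a string such as '4521493..4523010 (+)' into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def span_length(text):
    """Length in bp, assuming both endpoints are included (1-based, inclusive)."""
    start, end, _strand = parse_coordinates(text)
    return end - start + 1


def protein_length(size_bp):
    """Amino-acid count, assuming a complete CDS with one terminal stop codon."""
    if size_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return size_bp // 3 - 1


comm_bp = span_length("4521493..4523010 (+)")
print(comm_bp, protein_length(comm_bp))  # prints: 1518 505
```

For the comM row this reproduces the 1518 bp and 505 a.a. values listed in the record. The Location window 4516493..4528010 is also consistent with the gene coordinates extended by 5000 bp on each side, although the record itself does not state how the window was chosen.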
Sequence
Protein
Length: 505 a.a.; Molecular weight: 53705.46 Da; Isoelectric point: 8.1416
>NTDB_id=218982 B1781_RS21380 WP_078121627.1 4521493..4523010(+) (comM) [Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a]
MAFAVTHCRAAIGVDAPPVAVETHLANGLPSFNIVGLPEKAVQESRDRVRSALVNSGFDFPARRITVNLAPADIPKHGSR
FDLAIAIGILLASGQLPDKSADGYEFVGELSLAGALRSISGVLPMALATAAADTKMILPAANADEAALVSSLASYPAEHL
LGVTAHLLGATLLKEHCKPQQQSSNSPSSDLADVWGQSQAKRALEIVAAGRHNLLMVGPPGSGKSMLASRLPGILPPMTE
REALESAAVRSIAGLPFSPGTWMQRPFRAPHHTASGVALVGGGGGSHPRPGEVSLAHFGTLFLDELPEFNGKVLDVLREP
LETGKILISRAARQAEFPADFQLIAAMNPCQCGYANDPERICAGCSPERVARYQRRISGPLRDRIDIQIEVSALPRQALL
GGLKARSEDSATVRQRVCSAWEKQLERQGTANARLGHEDLQRHCSLSPAGQALLANAIDKLGLSARAFHRILRVARTIAD
LAGKESIEDQHLTEAIGYRRLDRIG
Nucleotide
Length: 1518 bp
>NTDB_id=218982 B1781_RS21380 WP_078121627.1 4521493..4523010(+) (comM) [Thiosocius teredinicola strain PMS-2146H.STBD.0c.01a]
ATGGCATTTGCAGTAACACACTGTCGCGCCGCTATTGGCGTCGACGCCCCACCGGTAGCCGTCGAGACTCATCTCGCCAA
CGGTCTGCCCAGCTTCAACATCGTCGGACTGCCCGAGAAGGCGGTACAGGAAAGCCGCGACCGCGTGCGCAGCGCGCTGG
TCAACAGCGGATTTGATTTCCCCGCCAGAAGGATCACGGTCAACCTTGCCCCTGCAGATATACCCAAGCACGGCAGCCGG
TTTGATCTGGCAATCGCGATCGGCATCCTCCTGGCGAGTGGGCAACTGCCGGATAAGTCTGCCGACGGCTATGAGTTTGT
CGGCGAGCTGAGTCTCGCCGGCGCATTGCGCAGCATCAGCGGTGTTCTGCCCATGGCGTTGGCTACTGCCGCGGCCGACA
CCAAGATGATCCTGCCCGCGGCCAATGCCGACGAAGCCGCGCTTGTCTCGTCGCTGGCCTCATACCCCGCTGAGCATCTC
CTGGGAGTTACCGCCCATCTGCTGGGCGCGACCCTTCTGAAAGAACATTGCAAACCCCAGCAACAATCCTCGAACTCACC
GTCATCGGATCTCGCCGACGTATGGGGCCAGTCGCAGGCCAAGCGTGCCCTCGAAATCGTGGCTGCCGGTCGCCACAACC
TGCTAATGGTGGGGCCACCAGGCAGCGGCAAATCAATGCTCGCCAGCCGGCTGCCCGGTATTCTGCCGCCCATGACAGAG
CGAGAAGCACTCGAGAGTGCGGCCGTCAGATCGATCGCCGGTCTGCCCTTCTCGCCGGGCACATGGATGCAACGGCCCTT
TAGGGCACCGCACCATACGGCGTCCGGAGTCGCATTGGTCGGAGGTGGCGGCGGGAGCCACCCGAGACCAGGAGAGGTCT
CGCTCGCCCACTTCGGTACGCTGTTCCTGGACGAACTGCCAGAGTTCAACGGCAAGGTGCTCGACGTACTGCGCGAGCCG
CTCGAAACCGGAAAGATCCTGATCTCGCGCGCCGCTCGCCAAGCCGAGTTTCCGGCCGATTTCCAATTGATCGCGGCGAT
GAATCCCTGCCAGTGCGGCTACGCCAACGACCCTGAACGCATTTGTGCCGGCTGCAGCCCGGAACGCGTGGCGCGCTATC
AGCGGCGCATCTCCGGTCCGTTGCGCGACCGCATCGATATCCAGATCGAAGTATCCGCCTTACCACGTCAAGCGCTGCTG
GGCGGGTTGAAAGCGCGCTCGGAAGACAGCGCAACGGTGCGCCAGCGGGTCTGCTCAGCGTGGGAAAAGCAGCTTGAGCG
CCAGGGAACCGCCAACGCTCGGCTCGGCCACGAAGACCTGCAACGGCACTGCTCATTGTCGCCCGCCGGGCAAGCCCTGC
TCGCCAACGCCATCGATAAACTCGGTCTCTCAGCGCGTGCCTTTCACCGCATCCTGCGCGTGGCGCGTACGATCGCCGAC
CTGGCGGGCAAGGAAAGTATCGAAGATCAACACCTGACCGAGGCCATCGGTTATCGCCGTTTGGATCGCATCGGCTGA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 | 52.063 | 100 | 0.525 |
| comM | Vibrio campbellii strain DS40M4 | 52.579 | 99.802 | 0.525 |
| comM | Vibrio cholerae strain A1552 | 52.579 | 99.802 | 0.525 |
| comM | Glaesserella parasuis strain SC1401 | 51.772 | 100 | 0.521 |
| comM | Legionella pneumophila str. Paris | 48 | 99.01 | 0.475 |
| comM | Legionella pneumophila strain ERS1305867 | 48 | 99.01 | 0.475 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 44.488 | 100 | 0.448 |