Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: AAG689_RS21120
Genome accession: NZ_CP152342
Coordinates: 4296951..4298486 (+)
Length: 511 a.a.
NCBI ID: WP_406623628.1
UniProt ID: -
Organism: Acidovorax sp. SDU_ACID1
Function: ssDNA binding (predicted from homology)
DNA processing
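
The coordinates and the protein length in this record can be checked against each other. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; `cds_lengths` is an illustrative helper name, not part of any database API.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len, nt_len // 3 - 1


# Coordinates of comM (AAG689_RS21120) as listed in this record.
nt_len, aa_len = cds_lengths(4296951, 4298486)
print(nt_len, aa_len)  # 1536 511
```

The result agrees with the 1536 bp and 511 a.a. lengths reported for this gene.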

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type        MGE coordinates   Gene coordinates  Relative position  Distance (bp)
Genomic island  4293320..4304082  4296951..4298486  within             0
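
The "Relative position" and "Distance (bp)" values can be reproduced from the two coordinate ranges. The following is a minimal sketch of one way to derive them, not VRprofile2's actual method. `relative_position` is a hypothetical helper, and the labels other than "within" are illustrative assumptions, since only "within" appears in this record.

```python
def relative_position(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene relative to an MGE and return (label, distance in bp).

    Both ranges are (start, end) with 1-based inclusive coordinates.
    Labels other than "within" are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlap", 0  # gene straddles an MGE boundary


# comM versus the genomic island listed in this record.
print(relative_position((4296951, 4298486), (4293320, 4304082)))  # ('within', 0)
```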


Gene organization within MGE regions


Location: 4293320..4304082
Locus tag       Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                                                     Description
AAG689_RS21100  -          4293454..4294206 (-)  753        Protein_4160    SMP-30/gluconolactonase/LRE family protein                                  -
AAG689_RS21105  -          4294293..4295699 (-)  1407       WP_406623627.1  ammonium transporter                                                        -
AAG689_RS21110  glnK       4295722..4296060 (-)  339        WP_020226721.1  P-II family nitrogen regulator                                              -
AAG689_RS21115  -          4296082..4296813 (-)  732        WP_020226722.1  TorF family putative porin                                                  -
AAG689_RS21120  comM       4296951..4298486 (+)  1536       WP_406623628.1  YifB family Mg chelatase-like AAA ATPase                                    Machinery gene
AAG689_RS21125  -          4298501..4299301 (-)  801        WP_406623629.1  IclR family transcriptional regulator C-terminal domain-containing protein  -
AAG689_RS21130  -          4299420..4300265 (+)  846        WP_406623630.1  TauD/TfdA dioxygenase family protein                                        -
AAG689_RS21135  -          4300277..4301158 (+)  882        WP_406623631.1  alpha/beta fold hydrolase                                                   -
AAG689_RS21140  -          4301209..4302183 (+)  975        WP_406623632.1  tripartite tricarboxylate transporter substrate binding protein             -
AAG689_RS21145  -          4302265..4303365 (+)  1101       WP_406623633.1  glycerophosphodiester phosphodiesterase family protein                      -
AAG689_RS21150  -          4303372..4303632 (-)  261        WP_406626384.1  helix-turn-helix domain-containing protein                                  -
AAG689_RS21155  -          4303570..4303722 (+)  153        WP_406626407.1  hypothetical protein                                                        -
AAG689_RS21160  -          4303744..4303854 (+)  111        Protein_4172    hypothetical protein                                                        -
AAG689_RS21165  -          4303870..4304082 (+)  213        WP_406623634.1  IPTL-CTERM sorting domain-containing protein                                -

Sequence


Protein


Length: 511 a.a.   Molecular weight: 53831.66 Da   Isoelectric point: 8.2037

>NTDB_id=988500 AAG689_RS21120 WP_406623628.1 4296951..4298486(+) (comM) [Acidovorax sp. SDU_ACID1]
MSLALVQSRALMGLQAPAVTVEVHLANGLPSFTLVGLAEIEVKEARERVRSALQNAGLEFPSNKKITVNLAPADLPKDSG
RFDLPIALGLLAASGQIDAARLAGWEFAGELSLSGQLRPVRGALATALALRAQEQPARMVLPLGSAEEAALVPGTEIYGA
RHLLDVVRQFLPVDPGPDTQADGWQRLQSTPPPMPASGPDLADVKGQTAPKRALEIAAAGGHGLLLVGPPGSGKSMLAQR
FAGLLPPMDVEQALESAAIASLAGRFTPAQWMQRATASPHHTSSAVALVGGGSPPRPGEISLAHEGVLYLDEFPEFARSA
LEALREPLETGRITIARAAQRAEFPARFQLIAAMNPCPCGFAGSTQRACRCTPDQVARYQGKLSGPLLDRIDLHVEVPAL
PPQELLHAPPGEASHAVRARVAAARERALARQGKPNHALQGQEIDTHLALQDAAAQFLQTAATRLGWSARSTHRALKVAR
TIADLAGSDMVGTAHVAEAVQYRRVLRGTSP

Nucleotide


Length: 1536 bp

>NTDB_id=988500 AAG689_RS21120 WP_406623628.1 4296951..4298486(+) (comM) [Acidovorax sp. SDU_ACID1]
ATGAGTCTTGCTTTGGTGCAAAGCCGCGCCCTGATGGGCCTGCAGGCGCCCGCAGTCACCGTCGAGGTGCATCTGGCCAA
TGGCCTGCCCAGCTTCACGCTGGTGGGTCTGGCCGAGATCGAGGTGAAGGAGGCGCGCGAGCGCGTGCGCTCCGCGCTGC
AGAACGCGGGGCTGGAGTTCCCCTCGAACAAGAAAATCACCGTCAACCTCGCCCCCGCCGACCTGCCCAAGGACTCGGGC
CGGTTCGACCTGCCCATCGCCCTGGGGCTGCTCGCGGCCAGCGGCCAGATCGATGCCGCGCGGCTGGCGGGCTGGGAGTT
CGCGGGCGAGTTGTCGCTGTCCGGCCAGCTGCGGCCGGTGCGCGGCGCGCTGGCCACGGCGCTGGCGCTGCGCGCCCAGG
AACAGCCTGCGCGCATGGTGCTGCCGCTCGGCAGCGCCGAGGAGGCCGCGCTGGTGCCCGGCACCGAGATCTACGGCGCA
CGCCACCTGCTGGACGTGGTGCGCCAGTTCCTGCCCGTGGACCCCGGCCCCGATACCCAAGCCGATGGCTGGCAGCGCCT
GCAGTCCACGCCGCCGCCCATGCCCGCCAGCGGCCCCGACCTCGCCGACGTGAAAGGCCAGACCGCTCCCAAGCGCGCGC
TGGAGATCGCCGCCGCCGGCGGCCACGGCCTGCTGCTGGTGGGCCCGCCGGGCTCGGGCAAGTCCATGCTGGCCCAGCGC
TTCGCCGGTCTGCTGCCGCCCATGGACGTGGAGCAGGCGCTGGAAAGCGCCGCCATCGCCAGCCTGGCCGGGCGCTTCAC
GCCCGCGCAATGGATGCAGCGCGCCACCGCCAGCCCGCACCACACGAGCAGCGCCGTGGCCCTGGTGGGCGGCGGCTCGC
CGCCGAGGCCCGGCGAGATCTCCCTGGCCCACGAAGGCGTTCTGTATCTGGACGAATTCCCCGAGTTCGCCCGCAGCGCG
CTGGAGGCGCTGCGCGAGCCGCTGGAGACCGGCCGCATCACCATCGCCCGCGCGGCGCAGCGCGCCGAGTTCCCCGCGCG
CTTCCAGCTCATCGCCGCCATGAACCCCTGCCCGTGCGGCTTCGCGGGCTCCACCCAGCGCGCCTGCCGCTGCACGCCTG
ACCAGGTCGCACGCTACCAGGGCAAGCTGAGCGGCCCGCTGCTCGACCGCATCGACCTGCACGTGGAAGTGCCCGCCCTG
CCCCCGCAGGAGCTGCTGCACGCCCCGCCGGGCGAGGCCAGCCATGCCGTGCGCGCACGCGTGGCTGCCGCGCGCGAACG
GGCCCTGGCGCGCCAGGGCAAGCCCAACCACGCGCTGCAGGGGCAGGAGATCGACACACACCTGGCACTGCAGGACGCCG
CCGCGCAGTTCCTGCAGACCGCCGCCACGCGCCTGGGCTGGTCGGCGCGCAGCACGCACCGGGCGCTGAAGGTGGCCCGC
ACCATCGCCGATCTGGCGGGCAGCGACATGGTGGGCACGGCCCACGTGGCCGAGGCCGTGCAGTACCGGCGCGTGCTGCG
GGGCACGTCGCCATAG
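
The nucleotide and protein records can be cross-checked by translating the coding sequence. The sketch below assumes the standard codon assignments and reading frame 1; `translate` and `CODON_TABLE` are illustrative names, and only the first 30 nt of the record are translated here for brevity.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order with the third base
# varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N; trailing bases that do
    not fill a complete codon are ignored.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the comM nucleotide record above.
print(translate("ATGAGTCTTGCTTTGGTGCAAAGCCGCGCC"))  # MSLALVQSRA
```

The output matches the first ten residues of the protein record (MSLALVQSRA).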


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein       Organism                                        Identities (%)  Coverage (%)  Ha-value
comM          Vibrio cholerae strain A1552                    52.549          99.804        0.524
comM          Haemophilus influenzae Rd KW20                  51.064          100           0.517
comM          Vibrio campbellii strain DS40M4                 50.877          100           0.511
comM          Glaesserella parasuis strain SC1401             51.186          99.022        0.507
comM          Legionella pneumophila str. Paris               48.638          100           0.489
comM          Legionella pneumophila strain ERS1305867        48.638          100           0.489
RA0C_RS07335  Riemerella anatipestifer ATCC 11845 = DSM 15868  43.023          100           0.434

