Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: KY496_RS22265
Genome accession: NZ_CP080379
Coordinates: 5048387..5049919 (-)
Length: 510 a.a.
NCBI ID: WP_219862486.1
UniProt ID: -
Organism: Massilia sp. NP310
Function: DNA uptake (predicted from homology)
DNA binding and uptake
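
The coordinates and the protein length listed above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank convention) and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(5048387, 5049919)
print(cds_bp)                  # 1533
print(protein_length(cds_bp))  # 510
```

Under these assumptions the 1533 bp span corresponds to 511 codons, that is, 510 residues plus the stop codon.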

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IS/Tn 5050205..5051188 5048387..5049919 flank 286
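
The reported distance matches the difference between the MGE start and the gene end. The sketch below reproduces that number; how the database or VRprofile2 actually defines "Distance" is an assumption here, and `boundary_distance` is a hypothetical helper.

```python
gene = (5048387, 5049919)  # comM coordinates from the table above
mge = (5050205, 5051188)   # IS/Tn coordinates from the table above


def boundary_distance(a, b):
    """Difference between the facing boundaries of two ranges (0 if they overlap)."""
    (a_start, a_end), (b_start, b_end) = sorted([a, b])
    return max(0, b_start - a_end)


print(boundary_distance(gene, mge))  # 286
```

With this definition, the number of bases lying strictly between the two features is one less (285).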


Gene organization within MGE regions


Location: 5048387..5051188
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KY496_RS22265 (KY496_22265) comM 5048387..5049919 (-) 1533 WP_219862486.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  KY496_RS22270 (KY496_22270) - 5050205..5051188 (+) 984 WP_219862084.1 IS5 family transposase -

Sequence


Protein


Length: 510 a.a.        Molecular weight: 54721.56 Da        Isoelectric Point: 8.3005

>NTDB_id=592433 KY496_RS22265 WP_219862486.1 5048387..5049919(-) (comM) [Massilia sp. NP310]
MSLAVLRSRALSGMEAPAVNVEVHLANGLPGIAIVGLPDTEVREAKDRVRAALQNSGFDIPARRITINLAPADLPKESGR
FDLPIALGILAASRQIRDDALHRYEFAGELSLSGELRPVRGALAMAFAMARSGKDEMRDDEMRAFILPLANADEAALVEA
AAIYPARTLLEVCAHFANAPDAPKLARHHGPGLTRLPSYPDFAEVKGQQHAKRALEVAAAGTHSVLLVGPPGAGKSMLAA
RLPGLLPQMSEAEALESAAVQSLAGGFAPERWRQRPFRSPHHTTSGVALVGGGNLPRPGEVSLAHHGVLFLDELPEFERR
VLEVLREPLESGSITISRAAHQADFPARFQLIAAMNPCPCGWLGHASGKCRCTPDAVLRYQGRISGPLLDRIDLQLPVAA
MAPDSMGAQADGEPSASIAQRVARAHARQLARQGRPNSQLGPGAIDRHCAPDEAGRRLLHDAARHLHWSARAYHRVLKVA
RTIADLADVDDVRSKHVAEAIGYRRALRDD

Nucleotide


Length: 1533 bp

>NTDB_id=592433 KY496_RS22265 WP_219862486.1 5048387..5049919(-) (comM) [Massilia sp. NP310]
ATGAGTCTCGCCGTCCTCAGAAGCCGCGCCCTCTCTGGCATGGAAGCGCCCGCCGTCAACGTCGAGGTCCACCTGGCCAA
CGGTCTACCCGGCATAGCCATCGTCGGCCTGCCCGATACCGAAGTGCGCGAAGCGAAAGACCGCGTGCGCGCGGCGCTGC
AGAATTCCGGCTTCGATATCCCGGCGCGCCGCATCACGATCAACCTGGCGCCCGCCGACCTGCCGAAAGAATCGGGCCGC
TTCGACCTGCCGATCGCGCTGGGCATCCTGGCCGCCTCGCGCCAGATCCGCGACGATGCGCTGCACCGCTACGAATTTGC
GGGAGAACTGTCGCTGTCGGGTGAGCTGCGCCCGGTGCGCGGCGCGCTGGCGATGGCGTTCGCGATGGCGCGCTCAGGTA
AGGACGAGATGCGCGACGACGAAATGCGCGCCTTCATCCTGCCGCTGGCCAATGCCGACGAAGCGGCCCTGGTCGAGGCA
GCCGCCATCTACCCGGCGCGCACCTTGCTCGAGGTCTGCGCCCACTTCGCCAATGCGCCCGACGCGCCGAAGCTGGCGCG
CCACCATGGCCCCGGCCTGACGCGGCTGCCTTCCTATCCCGACTTCGCCGAGGTCAAGGGCCAGCAGCATGCCAAGCGCG
CGCTCGAGGTGGCGGCAGCCGGGACTCATTCCGTGCTGCTGGTCGGCCCGCCGGGCGCGGGCAAGAGCATGCTGGCGGCG
CGCCTGCCCGGCCTGCTGCCGCAGATGAGCGAGGCCGAGGCGCTCGAGTCGGCCGCCGTGCAATCGCTGGCCGGCGGCTT
CGCGCCCGAACGTTGGCGCCAGCGGCCGTTTCGCAGTCCGCATCACACGACGTCCGGCGTGGCCCTGGTCGGCGGCGGCA
ACCTGCCGCGGCCGGGCGAGGTGTCGCTGGCGCATCACGGGGTACTGTTTCTCGACGAGCTGCCCGAGTTCGAGCGCCGC
GTGCTGGAAGTGCTGCGCGAGCCGCTGGAGTCGGGATCGATCACGATCTCGCGCGCGGCCCACCAGGCAGATTTTCCCGC
GCGCTTCCAGCTGATCGCGGCGATGAATCCCTGCCCCTGCGGCTGGCTGGGCCACGCCAGCGGCAAGTGCCGTTGCACGC
CGGACGCCGTGCTGCGCTACCAGGGACGCATCTCCGGGCCGCTGCTCGACCGCATCGACCTCCAGCTGCCGGTGGCGGCC
ATGGCGCCGGACAGCATGGGCGCGCAGGCCGATGGCGAACCGAGCGCGAGCATCGCGCAGCGGGTGGCGCGGGCCCATGC
GCGCCAGCTCGCGCGCCAGGGGCGGCCCAACAGCCAGCTGGGGCCGGGCGCGATCGATCGCCACTGCGCGCCCGACGAAG
CCGGGCGCCGGCTGCTGCACGATGCCGCCCGGCACCTGCATTGGTCGGCGCGGGCCTACCACCGCGTGCTCAAGGTGGCG
CGCACGATCGCCGACCTGGCGGACGTGGACGACGTGCGGTCGAAGCACGTGGCCGAGGCGATCGGCTACCGGCGCGCGCT
GCGCGACGATTGA
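
The nucleotide record starts with ATG and ends with TGA, so it appears to be given as the coding strand, 5' to 3', even though the gene lies on the minus strand. The following is a minimal sketch that translates the first and last codons of the record to show the correspondence with the protein sequence. It assumes the standard codon assignments and uses no external library; `translate` is an illustrative helper, not a database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGAGTCTCGCCGTCCTCAGAAGC"))  # first 8 codons: MSLAVLRS
print(translate("CGCGACGATTGA"))              # last 4 codons:  RDD*
```

The two fragments agree with the start (MSLAVLRS) and the end (RDD) of the protein record above.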


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Vibrio campbellii strain DS40M4   51.677   99.412   0.514
  comM   Glaesserella parasuis strain SC1401   50.973   100   0.514
  comM   Haemophilus influenzae Rd KW20   49.515   100   0.5
  comM   Vibrio cholerae strain A1552   50.395   99.216   0.5
  comM   Legionella pneumophila str. Paris   48.555   100   0.494
  comM   Legionella pneumophila strain ERS1305867   48.555   100   0.494
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   45.437   100   0.459