Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   AACH09_RS23940 Genome accession   NZ_OY970406
Coordinates   5011836..5013359 (+) Length   507 a.a.
NCBI ID   WP_159318777.1    Uniprot ID   -
Organism   Serratia marcescens isolate SERMG     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 5006836..5018359
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AACH09_RS23915 (SERMG_04716) - 5007036..5007962 (-) 927 WP_033636413.1 branched-chain amino acid transaminase -
  AACH09_RS23920 (SERMG_04717) ilvM 5007986..5008243 (-) 258 WP_038874702.1 acetolactate synthase 2 small subunit -
  AACH09_RS23925 (SERMG_04718) ilvG 5008240..5009886 (-) 1647 WP_213873526.1 acetolactate synthase 2 catalytic subunit -
  AACH09_RS23930 ilvL 5010029..5010127 (-) 99 WP_013814970.1 ilv operon leader peptide -
  AACH09_RS23935 (SERMG_04719) - 5010261..5011493 (-) 1233 WP_060438303.1 MFS transporter -
  AACH09_RS23940 (SERMG_04720) comM 5011836..5013359 (+) 1524 WP_159318777.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  AACH09_RS23945 (SERMG_04721) - 5013388..5013726 (-) 339 WP_019455272.1 DUF413 domain-containing protein -
  AACH09_RS23950 (SERMG_04722) hdfR 5013846..5014673 (+) 828 WP_060425370.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 507 a.a.        Molecular weight: 54391.63 Da        Isoelectric Point: 7.8605

>NTDB_id=1162278 AACH09_RS23940 WP_159318777.1 5011836..5013359(+) (comM) [Serratia marcescens isolate SERMG]
MSLAVIYSRAIIGVQAPSVTVEVHISNGLPGLTLVGLPETTVKEARDRVRSALINNGFTFPARRITVNLAPADLPKEGGR
YDLPIALAILAASEQLPLAPLARYEFLGELALSGALRAVRGAIPAALAAADAGRQLILSTDNAAEVGLIAQSHSHTARHL
LEVCAFLLGQGELPVAATPPAADTACDNADLRDIIGQEQAKRALEIAAAGGHNLLLIGPPGTGKTMLASRLTGLLPPLTE
HEALESLAVASLQHHIPAALPWRQRPFRAPHHSASMAALVGGGSLPRPGEISMAHNGVLFLDELPEFERKVLDALREPLE
SGEIVISRANAKVCFPARVQLIAAMNPSPTGHYQGLHNRASPQQVLRYLARLSGPFLDRFDLSIEVPLLPPGTLSQRKTH
GESSEQVRKRVQQARARQLERAGKVNALLSNREVERDCVLQAADAEFLEATLNALGLSVRAWQRILKVARTLADLAGDAE
LNRSHLCEALGYRSMDRLLLQLHRSLE

Nucleotide


Download         Length: 1524 bp        

>NTDB_id=1162278 AACH09_RS23940 WP_159318777.1 5011836..5013359(+) (comM) [Serratia marcescens isolate SERMG]
ATGTCATTGGCGGTAATCTATAGCCGCGCCATCATCGGCGTTCAGGCCCCTTCCGTGACGGTGGAGGTGCATATCAGCAA
TGGCCTGCCCGGCCTGACGCTGGTCGGTCTGCCGGAAACCACGGTGAAAGAGGCGCGCGATCGGGTGCGCAGTGCCCTGA
TCAACAACGGTTTCACCTTTCCTGCCCGGCGCATCACCGTCAATTTGGCGCCCGCCGATCTGCCGAAAGAAGGCGGACGT
TACGATCTGCCGATAGCGCTGGCGATCCTCGCCGCCTCCGAGCAATTGCCCCTCGCACCGTTGGCGCGCTACGAGTTTCT
TGGCGAGCTCGCACTGTCCGGCGCACTGCGCGCGGTCAGGGGCGCGATCCCGGCGGCGCTGGCGGCGGCTGACGCCGGGC
GGCAACTGATCCTCTCGACGGACAACGCCGCCGAAGTCGGCCTGATTGCGCAGTCGCACTCCCATACCGCCCGGCACCTG
TTGGAAGTCTGTGCCTTTCTGCTCGGCCAGGGCGAGCTGCCGGTGGCGGCCACGCCTCCCGCTGCGGACACCGCCTGCGA
CAACGCCGATCTGCGCGACATCATCGGCCAGGAACAGGCCAAGCGGGCTCTGGAGATCGCCGCCGCCGGCGGGCATAACC
TGCTGCTGATTGGGCCGCCGGGCACCGGTAAAACCATGTTGGCCAGCCGCCTCACGGGCTTGCTGCCGCCGCTCACGGAG
CATGAGGCGCTGGAAAGCCTGGCGGTCGCCAGCTTGCAGCATCATATCCCCGCCGCCCTGCCATGGCGCCAGAGGCCGTT
TCGCGCACCGCATCACAGTGCATCGATGGCGGCGTTGGTCGGCGGCGGCTCACTGCCGCGCCCGGGCGAGATCTCTATGG
CGCATAACGGCGTGCTGTTTCTGGATGAGCTACCGGAATTCGAGCGTAAGGTGCTGGATGCACTGCGTGAGCCGCTGGAG
TCCGGCGAGATCGTAATTTCACGCGCCAACGCCAAGGTCTGTTTCCCCGCCAGAGTGCAGTTGATTGCGGCAATGAACCC
TAGCCCGACCGGGCATTATCAGGGGTTGCACAATCGCGCCTCACCGCAGCAGGTATTACGCTATCTGGCCCGGCTGTCAG
GGCCTTTTCTCGACCGTTTCGATCTGTCTATCGAAGTGCCGCTGTTACCGCCGGGTACGCTCAGTCAGCGGAAAACGCAT
GGAGAGAGCAGTGAGCAGGTGCGGAAGCGGGTGCAGCAGGCACGCGCCCGGCAGCTCGAACGCGCCGGGAAAGTCAACGC
GCTGTTGAGCAACCGCGAAGTGGAACGTGATTGCGTTCTGCAGGCAGCAGACGCCGAGTTTTTGGAGGCAACATTAAACG
CGCTGGGGTTATCGGTCCGCGCCTGGCAGCGCATTTTGAAAGTTGCGCGCACGCTGGCGGATTTGGCGGGGGATGCGGAG
CTCAACAGGAGCCACCTCTGCGAAGCGCTGGGCTATCGCAGTATGGACCGTCTGCTGTTACAGCTGCATCGTAGTCTGGA
ATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

65.81

99.803

0.657

  comM Glaesserella parasuis strain SC1401

65.483

100

0.655

  comM Vibrio cholerae strain A1552

65.339

99.014

0.647

  comM Vibrio campbellii strain DS40M4

64.343

99.014

0.637

  comM Legionella pneumophila str. Paris

51.004

98.225

0.501

  comM Legionella pneumophila strain ERS1305867

51.004

98.225

0.501

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.882

100

0.45


Multiple sequence alignment