Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   BVG95_RS19995 Genome accession   NZ_CP018915
Coordinates   4238146..4239669 (+) Length   507 a.a.
NCBI ID   WP_049205986.1    Uniprot ID   -
Organism   Serratia marcescens strain UMH1     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4233146..4244669
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BVG95_RS19970 (BVG95_20005) - 4233347..4234273 (-) 927 WP_016930475.1 branched-chain amino acid transaminase -
  BVG95_RS19975 (BVG95_20010) ilvM 4234297..4234554 (-) 258 WP_004929955.1 acetolactate synthase 2 small subunit -
  BVG95_RS19980 (BVG95_20015) ilvG 4234551..4236197 (-) 1647 WP_089196352.1 acetolactate synthase 2 catalytic subunit -
  BVG95_RS19985 (BVG95_20020) ilvL 4236340..4236438 (-) 99 WP_013814970.1 ilv operon leader peptide -
  BVG95_RS19990 (BVG95_20025) - 4236572..4237804 (-) 1233 WP_089196353.1 MFS transporter -
  BVG95_RS19995 (BVG95_20030) comM 4238146..4239669 (+) 1524 WP_049205986.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  BVG95_RS20000 (BVG95_20035) - 4239697..4240035 (-) 339 WP_004929946.1 DUF413 domain-containing protein -
  BVG95_RS20005 (BVG95_20040) hdfR 4240155..4240982 (+) 828 WP_025304646.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 507 a.a.        Molecular weight: 54477.72 Da        Isoelectric Point: 7.9462

>NTDB_id=211831 BVG95_RS19995 WP_049205986.1 4238146..4239669(+) (comM) [Serratia marcescens strain UMH1]
MSLAVIYSRAIIGVQAPSVTVEVHISNGLPGLTLVGLPETTVKEARDRVRSALINNGFTFPARRITVNLAPADLPKEGGR
YDLPIALAILAASEQLPLAPLARYEFLGELALSGALRAVRGAIPAALAAADAGRQLILSTDNAAEVGLIAQSQSHTAQHL
LEVCAFLLGQGELPVAVTPPAAGNAYENADLRDIIGQEQAKRALEIAAAGGHNLLLIGPPGTGKTMLASRLTGLLPPLTE
PEALESLAIASLQHPVLSALPWRQRPFRAPHHSASMAALVGGGSLPRPGEISMAHNGVLFLDELPEFERKVLDALREPLE
SGEIVISRANAKVCFPARVQLIAAMNPSPTGHYQGLHNRASPQQVLRYLARLSGPFLDRFDLSIEVPLLPPGTLSQRKTH
GESSEQVRKRVQQARARQLERAGKVNALLSNREVERDCVLQAADAEFLETTLNALGLSVRAWQRILKVARTLADLAGDTE
LDRRHLSEALGYRSMDRLLLQLHRSLE

Nucleotide


Download         Length: 1524 bp        

>NTDB_id=211831 BVG95_RS19995 WP_049205986.1 4238146..4239669(+) (comM) [Serratia marcescens strain UMH1]
ATGTCACTGGCGGTAATCTATAGCCGCGCCATCATCGGCGTTCAGGCCCCTTCCGTGACGGTGGAGGTGCATATCAGCAA
TGGCCTGCCCGGCCTGACGCTGGTCGGTCTGCCAGAAACCACGGTAAAAGAGGCGCGCGATCGGGTGCGCAGCGCCCTGA
TCAACAACGGTTTCACCTTTCCCGCCCGGCGCATCACCGTCAATTTGGCACCGGCCGATCTGCCGAAAGAAGGCGGGCGT
TACGATCTGCCGATAGCGCTGGCGATCCTCGCCGCCTCCGAGCAACTGCCCCTCGCACCGTTGGCACGCTACGAGTTTCT
TGGCGAGCTCGCGCTGTCCGGCGCACTGCGTGCGGTCAGAGGCGCCATCCCGGCGGCGCTGGCGGCGGCTGACGCCGGGC
GACAATTGATCCTGTCGACGGACAACGCCGCCGAGGTCGGCCTGATCGCACAGTCACAATCCCATACCGCCCAACACCTG
TTGGAGGTCTGTGCTTTTTTACTCGGCCAGGGCGAACTGCCGGTGGCCGTCACACCCCCCGCAGCCGGCAATGCGTACGA
AAACGCCGATCTGCGCGACATCATCGGCCAGGAGCAGGCCAAGCGGGCGCTGGAGATCGCCGCCGCCGGCGGGCATAACC
TGCTGCTGATTGGGCCGCCGGGCACAGGCAAAACCATGCTGGCCAGCCGACTGACGGGCTTACTGCCGCCGCTGACGGAG
CCTGAGGCGCTGGAAAGCCTGGCGATCGCCAGCTTGCAACACCCTGTTCTGAGCGCTCTGCCATGGCGCCAGAGGCCGTT
TCGCGCGCCGCATCACAGCGCATCGATGGCGGCATTGGTCGGCGGCGGCTCACTGCCGCGTCCGGGCGAGATCTCGATGG
CGCATAACGGCGTGCTGTTTCTGGATGAGCTACCGGAATTCGAGCGTAAGGTCCTGGATGCGCTGCGCGAGCCGTTGGAA
TCCGGCGAGATCGTGATTTCACGCGCCAACGCCAAGGTCTGTTTCCCTGCCAGAGTACAGTTGATCGCGGCAATGAACCC
CAGCCCGACAGGGCATTATCAGGGGTTGCACAACCGCGCCTCGCCGCAGCAGGTGTTGCGCTATCTGGCCCGGCTGTCAG
GGCCTTTTCTCGACCGTTTCGATCTGTCTATCGAAGTGCCGCTGTTACCGCCGGGTACGCTCAGCCAGCGGAAAACGCAT
GGGGAAAGCAGTGAGCAGGTGCGGAAACGGGTGCAGCAGGCACGCGCCCGGCAGCTCGAACGCGCCGGTAAAGTCAACGC
GCTGTTGAGCAACCGTGAAGTGGAACGTGATTGTGTTCTGCAGGCGGCGGACGCCGAGTTTTTGGAGACGACATTAAACG
CGCTGGGGTTATCGGTACGCGCCTGGCAGCGCATCCTGAAAGTGGCTCGCACGCTGGCGGATTTGGCGGGAGATACTGAA
CTCGACAGGCGCCACCTCAGCGAAGCGCTGGGCTATCGCAGTATGGATCGTCTGTTGTTACAGCTACATCGCAGTCTGGA
ATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

66.206

99.803

0.661

  comM Glaesserella parasuis strain SC1401

65.619

100

0.659

  comM Vibrio cholerae strain A1552

65.538

99.014

0.649

  comM Vibrio campbellii strain DS40M4

64.542

99.014

0.639

  comM Legionella pneumophila str. Paris

50.905

98.028

0.499

  comM Legionella pneumophila strain ERS1305867

50.905

98.028

0.499

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

45.652

99.803

0.456


Multiple sequence alignment