Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   E5CHR_RS01525 Genome accession   NZ_LR594675
Coordinates   330431..331966 (-) Length   511 a.a.
NCBI ID   WP_162578062.1    Uniprot ID   A0A6P2E8S5
Organism   Variovorax sp. PBS-H4     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 325431..336966
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E5CHR_RS01505 (H4CHR_00301) - 326242..327051 (+) 810 WP_162578059.1 helix-turn-helix transcriptional regulator -
  E5CHR_RS01510 (H4CHR_00302) - 327144..328163 (+) 1020 WP_162578060.1 aromatic ring-hydroxylating dioxygenase subunit alpha -
  E5CHR_RS01515 (H4CHR_00303) - 328208..329191 (+) 984 WP_232061918.1 tripartite tricarboxylate transporter substrate-binding protein -
  E5CHR_RS01520 (H4CHR_00304) - 329321..330430 (+) 1110 WP_162578061.1 ABC transporter substrate-binding protein -
  E5CHR_RS01525 (H4CHR_00305) comM 330431..331966 (-) 1536 WP_162578062.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  E5CHR_RS01530 (H4CHR_00306) - 332127..332948 (+) 822 WP_162583510.1 TorF family putative porin -
  E5CHR_RS01535 (H4CHR_00307) glnK 333013..333351 (+) 339 WP_162578063.1 P-II family nitrogen regulator -
  E5CHR_RS01540 (H4CHR_00308) amt 333378..334904 (+) 1527 WP_174255679.1 ammonium transporter -
  E5CHR_RS01545 (H4CHR_00309) - 335045..335968 (+) 924 WP_162578064.1 SMP-30/gluconolactonase/LRE family protein -

Sequence


Protein


Download         Length: 511 a.a.        Molecular weight: 53564.25 Da        Isoelectric Point: 8.9412

>NTDB_id=1128239 E5CHR_RS01525 WP_162578062.1 330431..331966(-) (comM) [Variovorax sp. PBS-H4]
MSLSLVQSRALLGLEAASVTVEVHLANGLPSFTLVGLADVEVKEARERVRSAIQNAGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGILAAAGQIDAARLAGHEFAGELSLSGHLRPVRGALAMALALHGRGVATKLVLPAESAREAALVPGADIYGA
AHLLDVVRQFLPGGPAASDASEDGWHRAQPAEPGAVAAGADMADVKGHASARRALEVAAAGQHSLLMVGPPGSGKSMLAQ
RFAGLLPPMSVDEALESAAVASLHGRFAVERWRVRPTCSPHHSASAVALVGGGSPPRPGEISLAHNGVLFLDEFPEFQRA
ALEALREPLETGTITIARAARRAEFPARFQLIAAMNPCPCGYLGSSLKSCRCTPDQVARYQGKLSGPLLDRIDLHIEVPA
LPATQLLETPAGEASADVRARVAEGRERAIRRQGKANQALQGAEIDRHAKLDAAARQFLQTAATRLGWSARGTHRTLKLA
RTIADLAGAGAVQAAHVAEAVQYRRALQRVE

Nucleotide


Download         Length: 1536 bp        

>NTDB_id=1128239 E5CHR_RS01525 WP_162578062.1 330431..331966(-) (comM) [Variovorax sp. PBS-H4]
ATGAGCTTATCTTTGGTGCAAAGCCGTGCTTTGCTGGGCCTCGAAGCGGCAAGCGTCACGGTCGAGGTGCATCTGGCCAA
CGGGTTGCCGAGCTTCACGCTGGTGGGACTGGCGGACGTGGAAGTGAAGGAGGCGCGCGAGCGGGTGCGTTCTGCCATTC
AGAATGCCGGCCTCGAATTCCCGAGCAACAAGCGCATCACGGTCAACCTGGCGCCGGCCGATCTGCCGAAGGATTCGGGG
CGCTTCGACCTGCCAATCGCACTGGGCATCCTGGCGGCAGCCGGGCAGATCGACGCGGCCCGGCTGGCGGGCCACGAATT
TGCGGGGGAGCTCTCGCTTTCAGGCCATCTGCGGCCCGTGCGCGGTGCGCTCGCAATGGCGCTGGCGCTGCATGGCCGCG
GCGTTGCGACGAAGCTGGTGCTGCCGGCCGAGAGCGCGCGGGAGGCCGCCCTGGTACCCGGCGCCGACATCTACGGTGCG
GCCCACCTGCTGGACGTGGTGCGGCAGTTCCTGCCGGGCGGCCCGGCCGCGAGCGATGCGTCGGAAGACGGCTGGCACCG
TGCGCAGCCGGCCGAACCCGGCGCGGTGGCGGCCGGGGCCGACATGGCGGACGTCAAGGGGCACGCGAGCGCAAGACGCG
CGCTCGAGGTCGCGGCGGCCGGCCAGCACAGCCTGCTGATGGTGGGCCCGCCGGGTTCCGGCAAGTCGATGCTGGCGCAG
CGCTTTGCGGGCCTGCTGCCGCCGATGAGCGTCGACGAGGCGCTGGAAAGCGCCGCCGTAGCCAGCCTGCACGGCCGCTT
CGCCGTCGAGCGTTGGCGTGTGCGGCCGACCTGCAGCCCGCACCACAGCGCCAGTGCCGTGGCGCTGGTGGGCGGCGGCT
CGCCGCCGCGGCCTGGCGAGATCTCGCTGGCGCACAACGGCGTGCTGTTCCTGGACGAATTTCCCGAGTTCCAGCGTGCC
GCGCTCGAGGCGCTGCGCGAGCCGCTGGAGACCGGAACCATCACGATCGCGCGGGCCGCGCGGCGTGCCGAGTTCCCGGC
CCGCTTCCAGTTGATCGCCGCCATGAATCCGTGCCCTTGCGGCTACCTGGGCTCATCCCTCAAGTCCTGCCGTTGCACGC
CGGACCAGGTGGCCCGCTACCAGGGCAAGCTCAGCGGGCCGCTGCTCGACCGGATCGACCTGCACATCGAAGTGCCCGCC
CTACCCGCCACCCAGTTGCTGGAGACGCCCGCCGGCGAAGCCAGCGCAGACGTCCGTGCGCGCGTGGCCGAAGGGCGCGA
GCGCGCAATCCGCCGACAGGGCAAGGCCAACCAGGCGCTGCAAGGCGCAGAGATCGACCGCCACGCGAAGCTCGATGCGG
CGGCCCGGCAGTTCCTGCAGACTGCAGCGACGCGACTGGGCTGGTCGGCGCGCGGCACGCACCGCACGCTCAAGCTGGCC
CGAACGATAGCGGACCTGGCCGGAGCCGGCGCGGTGCAGGCGGCGCATGTGGCGGAGGCGGTGCAGTACCGGCGTGCACT
GCAGAGGGTGGAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

51.744

100

0.523

  comM Glaesserella parasuis strain SC1401

51.064

100

0.517

  comM Vibrio cholerae strain A1552

51.581

99.022

0.511

  comM Vibrio campbellii strain DS40M4

50

99.804

0.499

  comM Legionella pneumophila str. Paris

47.368

100

0.476

  comM Legionella pneumophila strain ERS1305867

47.368

100

0.476

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.307

99.413

0.431


Multiple sequence alignment