Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   G0034_RS22410 Genome accession   NZ_CP048736
Coordinates   4684148..4685668 (+) Length   506 a.a.
NCBI ID   WP_023331589.1    Uniprot ID   A0A2J0PD61
Organism   Enterobacter sp. T2     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4679148..4690668
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G0034_RS22390 (G0034_22390) ilvE 4680714..4681643 (-) 930 WP_014885677.1 branched-chain-amino-acid transaminase -
  G0034_RS22395 (G0034_22395) ilvM 4681662..4681925 (-) 264 WP_003860184.1 acetolactate synthase 2 small subunit -
  G0034_RS22400 (G0034_22400) ilvG 4681922..4683568 (-) 1647 WP_023331588.1 acetolactate synthase 2 catalytic subunit -
  G0034_RS23580 ilvX 4683571..4683621 (-) 51 WP_201405605.1 peptide IlvX -
  G0034_RS22405 (G0034_22405) ilvL 4683708..4683806 (-) 99 WP_094948593.1 ilv operon leader peptide -
  G0034_RS22410 (G0034_22410) comM 4684148..4685668 (+) 1521 WP_023331589.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  G0034_RS22415 (G0034_22415) - 4685698..4686036 (-) 339 WP_014885680.1 DUF413 domain-containing protein -
  G0034_RS22420 (G0034_22420) hdfR 4686155..4686976 (+) 822 WP_045269185.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55055.09 Da        Isoelectric Point: 7.3092

>NTDB_id=422606 G0034_RS22410 WP_023331589.1 4684148..4685668(+) (comM) [Enterobacter sp. T2]
MSLSVVYTRAALGVKAPLISVEVHLSNGLPGLTLVGLPETTVKEARDRVRSAIINSGYTFPAKKITINLAPADLPKEGGR
YDLPIAIALLAASEQLNTTRLGSYEFVGELALTGALRGVPGAISGALEAIRAGRQIIVANENASEVSLIAEKGCLVAGHL
QEVCAWLEGRHELSEPEECDNVIADTPEDLSEIMGQEQGKRALEITAAGGHNLLLIGPPGTGKTMLASRLSGLLPPLNNH
EALESAAIYSLISSTSLQKQWRRRPFRSPHHSASLTAMVGGGSIPGPGEISLAHNGILFLDELPEFERRVLDALREPIES
GEIHISRTRAKISYPAQFQLVAAMNPSPTGHYQGNHNRCTPEQTLRYLGKLSGPFLDRFDLSLEIPLPPPGLLRQKGITG
ESSADVRERVIAAQTRQYARQNRLNARLDNAGIRQFCSLNMEDAVWLEETLTRFGLSIRAWQRLLKVARTIADVEGCSDI
QRKHLQEALSYRAIDRLLLHLQKLLA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=422606 G0034_RS22410 WP_023331589.1 4684148..4685668(+) (comM) [Enterobacter sp. T2]
ATGTCACTGTCAGTTGTTTATACGCGTGCGGCTCTCGGGGTGAAGGCACCGCTTATTTCTGTAGAGGTTCATCTGAGTAA
TGGGCTACCCGGGCTCACGCTCGTCGGGCTACCTGAAACGACGGTTAAAGAAGCCCGGGATCGCGTTCGCAGCGCAATAA
TAAATAGCGGTTATACCTTCCCCGCGAAGAAGATCACCATCAACCTGGCTCCCGCCGATCTGCCCAAAGAGGGGGGACGA
TATGATTTACCTATCGCGATTGCGCTTCTCGCGGCCTCTGAGCAGCTTAATACGACCAGGCTAGGCTCGTACGAGTTTGT
GGGAGAACTCGCGCTCACAGGCGCGTTAAGGGGCGTTCCCGGTGCGATATCGGGCGCGCTGGAAGCTATACGCGCAGGGC
GGCAAATCATTGTCGCGAATGAAAATGCATCAGAAGTGAGCCTTATCGCCGAGAAGGGGTGTCTTGTCGCAGGGCATTTA
CAGGAAGTTTGCGCCTGGCTGGAAGGGCGCCATGAACTGTCCGAGCCGGAAGAGTGTGACAATGTTATAGCCGATACCCC
AGAGGATCTCAGCGAGATTATGGGACAGGAGCAAGGGAAACGGGCGCTGGAGATTACGGCCGCAGGTGGACACAATCTTC
TGTTGATTGGCCCGCCTGGTACGGGTAAAACGATGCTGGCGAGCAGGCTGAGTGGCTTGCTGCCACCGCTCAATAATCAT
GAAGCGCTGGAAAGTGCCGCCATTTATAGCCTTATCAGTTCTACCTCGTTGCAAAAACAGTGGCGCCGTCGCCCTTTTCG
TTCACCTCATCATAGTGCTTCACTGACGGCGATGGTCGGCGGTGGGTCTATTCCCGGGCCGGGTGAAATTTCACTGGCGC
ACAATGGCATTCTGTTTCTCGACGAGCTGCCTGAGTTTGAGCGCCGCGTCCTGGATGCCCTGAGAGAACCCATTGAATCT
GGAGAGATACACATCTCACGCACGCGAGCCAAAATAAGCTATCCCGCGCAGTTTCAGCTGGTCGCTGCGATGAATCCCAG
CCCTACGGGTCATTATCAGGGCAATCATAACCGCTGTACGCCAGAGCAGACGTTGCGCTATCTGGGAAAGTTATCCGGCC
CGTTCCTCGACCGTTTCGATTTATCCCTCGAAATCCCCCTTCCCCCGCCGGGCCTGCTCAGGCAAAAAGGCATCACGGGT
GAAAGCTCAGCAGATGTACGCGAGCGGGTCATTGCTGCCCAGACGCGACAATATGCCCGTCAGAACAGGCTGAATGCGCG
GCTGGATAATGCCGGGATCCGGCAGTTTTGTTCTCTTAACATGGAGGATGCGGTTTGGCTGGAGGAAACCTTGACGCGTT
TTGGTCTTTCTATACGCGCGTGGCAGCGTTTGCTGAAAGTGGCCAGAACCATTGCTGACGTGGAGGGTTGTAGTGACATT
CAGAGGAAACACTTGCAGGAGGCGCTGAGCTACCGCGCTATCGATCGTTTGCTGCTCCATCTGCAGAAGTTGCTGGCGTA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2J0PD61

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

58.153

100

0.585

  comM Vibrio cholerae strain A1552

58.648

99.407

0.583

  comM Glaesserella parasuis strain SC1401

58.185

100

0.583

  comM Vibrio campbellii strain DS40M4

58.333

99.605

0.581

  comM Legionella pneumophila str. Paris

50.101

98.221

0.492

  comM Legionella pneumophila strain ERS1305867

50.101

98.221

0.492

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.181

100

0.443


Multiple sequence alignment