Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   CWR52_RS13230 Genome accession   NZ_CP025034
Coordinates   2755240..2756760 (-) Length   506 a.a.
NCBI ID   WP_104949879.1    Uniprot ID   -
Organism   Enterobacter sp. SGAir0187     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 2750240..2761760
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CWR52_RS13220 (CWR52_13320) hdfR 2753930..2754751 (-) 822 WP_024907953.1 HTH-type transcriptional regulator HdfR -
  CWR52_RS13225 (CWR52_13325) - 2754870..2755208 (+) 339 WP_008501577.1 DUF413 domain-containing protein -
  CWR52_RS13230 (CWR52_13330) comM 2755240..2756760 (-) 1521 WP_104949879.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  CWR52_RS13235 (CWR52_13335) ilvL 2757102..2757200 (+) 99 WP_001311244.1 ilv operon leader peptide -
  CWR52_RS23310 ilvX 2757287..2757337 (+) 51 WP_166792073.1 peptide IlvX -
  CWR52_RS13240 (CWR52_13340) ilvG 2757340..2758986 (+) 1647 WP_104949880.1 acetolactate synthase 2 catalytic subunit -
  CWR52_RS13245 (CWR52_13345) ilvM 2758983..2759246 (+) 264 WP_006179206.1 acetolactate synthase 2 small subunit -
  CWR52_RS13250 (CWR52_13350) ilvE 2759265..2760194 (+) 930 WP_023309654.1 branched-chain-amino-acid transaminase -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 54980.99 Da        Isoelectric Point: 7.1226

>NTDB_id=257519 CWR52_RS13230 WP_104949879.1 2755240..2756760(-) (comM) [Enterobacter sp. SGAir0187]
MSLSVVYTRAALGVKAPLVSVEVHLSNGLPGLTLVGLPETTVKEARDRVRSAIINSGYTFPAKKITINLAPADLPKEGGR
YDLPIAIALLAASEQLNTTRLGSYEFVGELALTGALRGVPGAISGALEAIRSGRQIIVANENASEVSLIAEKGCLIAGHL
QEVCAWLEGRHELSEPEECDEVIADVPEDLSEIMGQEQGKRGLEITAAGGHNLLLIGPPGTGKTMLASRLSGLLPPLNNH
EALESAAIYSLISSASLQKQWRRRPFRSPHHSASLTAMVGGGSIPGPGEISLAHNGILFLDELPEFERRVLDALREPIES
GEIHISRTRAKISYPAQFQLVAAMNPSPTGHYQGNHNRCTPEQTLRYLGKLSGPFLDRFDLSLEIPLPPPGLLRQTGITG
ESSAKVRERVIAAQARQYTRQNRLNARLDNAGIRQFCALNSEDAVWLEETLTRFGLSIRAWQRLLKVARTIADVEGCTDI
ERKHLQEALSYRAIDRLLLHLQKLLA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=257519 CWR52_RS13230 WP_104949879.1 2755240..2756760(-) (comM) [Enterobacter sp. SGAir0187]
ATGTCACTGTCAGTTGTTTATACGCGTGCGGCTCTCGGGGTAAAGGCACCGCTTGTTTCCGTTGAGGTTCATTTGAGTAA
TGGGCTACCCGGACTCACGCTTGTCGGTTTACCTGAAACGACGGTTAAAGAGGCCAGGGATCGCGTTCGCAGCGCAATAA
TAAATAGCGGTTATACCTTCCCTGCGAAGAAGATCACCATCAACCTTGCTCCCGCCGACCTGCCTAAAGAAGGGGGCCGA
TACGATTTACCTATCGCAATTGCCCTTCTCGCGGCTTCTGAGCAGCTTAATACGACAAGGCTAGGCTCGTATGAGTTCGT
GGGTGAACTCGCGCTCACAGGCGCGCTGAGAGGCGTTCCCGGCGCGATATCGGGAGCGCTGGAGGCCATACGATCAGGCC
GGCAAATCATTGTGGCGAATGAAAACGCATCAGAAGTGAGTCTTATCGCGGAAAAAGGATGCCTCATCGCGGGGCATTTA
CAGGAAGTTTGTGCCTGGCTGGAAGGACGACATGAATTGTCCGAACCGGAGGAGTGTGACGAGGTTATAGCCGACGTCCC
GGAGGATCTCAGCGAGATTATGGGACAGGAGCAAGGGAAGCGGGGACTGGAGATTACGGCCGCAGGTGGACACAATCTTC
TGTTGATTGGCCCGCCAGGTACGGGGAAAACGATGCTGGCGAGCAGGCTGAGTGGGTTACTTCCACCTCTCAATAACCAT
GAAGCGCTGGAAAGCGCTGCCATATATAGCCTCATCAGTTCTGCATCGTTGCAAAAACAGTGGCGCCGTCGTCCTTTTCG
TTCCCCACATCATAGCGCTTCACTGACGGCAATGGTCGGCGGCGGGTCTATCCCCGGGCCGGGTGAGATCTCACTGGCGC
ACAATGGCATTTTGTTTCTCGATGAGCTGCCCGAGTTTGAGCGCCGCGTGCTGGATGCGCTGAGAGAACCTATTGAATCT
GGCGAAATACATATCTCGCGCACGCGGGCCAAAATAAGCTATCCCGCGCAGTTTCAACTGGTCGCTGCGATGAATCCCAG
CCCGACGGGACACTACCAGGGCAATCATAACCGCTGTACGCCAGAGCAGACGCTGCGCTATCTGGGAAAGTTATCCGGTC
CGTTCCTCGACCGTTTCGATTTATCTCTCGAAATCCCGCTTCCTCCTCCGGGCCTGCTCAGGCAGACGGGCATCACGGGT
GAAAGCTCTGCGAAGGTACGCGAGCGGGTGATCGCCGCCCAGGCACGACAGTACACTCGCCAGAACAGGCTAAATGCGCG
GCTGGATAATGCCGGGATCCGGCAGTTTTGTGCCCTTAACAGTGAGGATGCGGTATGGCTGGAGGAGACGTTGACGCGCT
TTGGGCTATCTATACGTGCGTGGCAGCGTTTGCTAAAAGTGGCCAGAACCATTGCTGACGTGGAGGGATGTACTGACATT
GAGAGGAAACACTTGCAGGAGGCGCTGAGCTATCGCGCTATCGATCGTTTGCTGCTGCATCTGCAGAAGTTGCTGGCATA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

58.546

100

0.589

  comM Vibrio campbellii strain DS40M4

58.648

99.407

0.583

  comM Glaesserella parasuis strain SC1401

58.185

100

0.583

  comM Vibrio cholerae strain A1552

58.367

99.209

0.579

  comM Legionella pneumophila str. Paris

50.101

98.221

0.492

  comM Legionella pneumophila strain ERS1305867

50.101

98.221

0.492

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.59

100

0.437


Multiple sequence alignment