Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: ENTAS_RS21830
Genome accession: NC_015968
Coordinates: 4617452..4618972 (+)
Length: 506 a.a.
NCBI ID: WP_014072312.1
UniProt ID: -
Organism: Enterobacter soli
Function: required for natural transformation (predicted from homology)
Unclear

Genomic Context


Location: 4612452..4623972
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
ENTAS_RS21815 (Entas_4300) | ilvE | 4614016..4614945 (-) | 930 | WP_014072309.1 | branched-chain-amino-acid transaminase | -
ENTAS_RS21820 (Entas_4301) | ilvM | 4614964..4615227 (-) | 264 | WP_014072310.1 | acetolactate synthase 2 small subunit | -
ENTAS_RS21825 (Entas_4302) | ilvG | 4615224..4616870 (-) | 1647 | WP_014072311.1 | acetolactate synthase 2 catalytic subunit | -
ENTAS_RS25590 | ilvX | 4616873..4616923 (-) | 51 | WP_166792073.1 | peptide IlvX | -
ENTAS_RS24570 | ilvL | 4617010..4617108 (-) | 99 | WP_001311244.1 | ilv operon leader peptide | -
ENTAS_RS21830 (Entas_4303) | comM | 4617452..4618972 (+) | 1521 | WP_014072312.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
ENTAS_RS21835 (Entas_4304) | - | 4619005..4619343 (-) | 339 | WP_014072313.1 | DUF413 domain-containing protein | -
ENTAS_RS21840 (Entas_4305) | hdfR | 4619462..4620283 (+) | 822 | WP_014072314.1 | HTH-type transcriptional regulator HdfR | -
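
The sizes and the Location window in this section can be cross-checked from the coordinates alone. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the Location window extends 5,000 bp on either side of comM (an inference from the numbers in this record, not something the record states). The helper names `parse_coords` and `span_length` are hypothetical.

```python
import re


def parse_coords(text):
    """Parse 'start..end (strand)' into (start, end, strand).

    Assumes 1-based, inclusive coordinates as used in this record.
    """
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)


def span_length(start, end):
    """Length in bp of an inclusive interval."""
    return end - start + 1


start, end, strand = parse_coords("4617452..4618972 (+)")

gene_bp = span_length(start, end)   # 1521 bp, matching the Size (bp) column
protein_aa = gene_bp // 3 - 1       # 506 a.a. once the stop codon is excluded

FLANK = 5000                        # assumed flank size, inferred from the record
window = (start - FLANK, end + FLANK)  # (4612452, 4623972)
```

The same arithmetic reproduces the other rows, for example `span_length(4614016, 4614945)` gives 930 for ilvE.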

Sequence


Protein


Length: 506 a.a.        Molecular weight: 55187.29 Da        Isoelectric Point: 7.3089

>NTDB_id=42279 ENTAS_RS21830 WP_014072312.1 4617452..4618972(+) (comM) [Enterobacter soli]
MSLSVVYTRAAIGVKAPLISVEVHLSNGLPGLTLVGLPETTVKEARDRVRSAIINSGYTFPARKITINLAPADLPKEGGR
YDLPIAIALLAASEQLNTTRLGSYEFIGELALTGSLRGVPGAISGALEAISAGRQIIVANENSPEVSLIAEKGCLIAGHL
QEVCAYLEGRHELAEPREAEDILPDNRDDLRDIIGQEQGKRALEITAAGGHNLLLIGPPGTGKTMLASRLNGLLPPLNNR
EALESAAIFSLVSSTSLQKQWRQRPFRSPHHSASLAAMVGGGSIPGPGEISLAHNGILFLDELPEFERRVLDALREPIES
GQIHISRTRAKISYPARFQLVAAMNPSPTGHYQGNHNRCTPEQTLRYLGKLSGPFLDRFDLSLEIPLPPPGLLSQNNGQG
ITSAEVRERVITAQANQYARQCKLNAYLDNAEIRRFCHLAPEDALWLEDTLVRFGLSIRAWQRLLKVARTIADVEGCIEI
QRQHLQEALSYRAIDRLLLHLQKMLA
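
The FASTA header above packs several fields into one line. As an illustration only, the sketch below splits it with a regular expression. The pattern is an assumption based on this single header and may not cover every record in the database; `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

# Pattern inferred from the one header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line):
    """Return the header fields as a dict, with numeric fields converted to int."""
    m = HEADER_RE.match(line.strip())
    if m is None:
        raise ValueError(f"header does not match the assumed layout: {line!r}")
    fields = m.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=42279 ENTAS_RS21830 WP_014072312.1 "
    "4617452..4618972(+) (comM) [Enterobacter soli]"
)
record = parse_header(header)
# record["gene"] is "comM" and record["organism"] is "Enterobacter soli"
```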

Nucleotide


Length: 1521 bp

>NTDB_id=42279 ENTAS_RS21830 WP_014072312.1 4617452..4618972(+) (comM) [Enterobacter soli]
ATGTCACTGTCGGTTGTTTATACACGAGCCGCTATTGGTGTGAAGGCACCACTGATTTCTGTTGAAGTACATCTGAGTAA
TGGCCTGCCAGGGCTCACTCTCGTTGGGCTACCAGAAACCACCGTAAAAGAGGCCAGGGATCGGGTGCGTAGCGCCATCA
TCAATAGCGGTTATACCTTCCCTGCCAGAAAAATTACCATCAACCTGGCACCCGCTGACTTACCCAAAGAGGGAGGACGA
TATGATTTACCTATCGCTATTGCGCTTCTCGCCGCTTCTGAGCAGCTGAATACCACCAGGCTAGGCTCGTATGAGTTCAT
CGGCGAACTTGCTCTTACAGGCTCATTAAGAGGCGTTCCTGGAGCGATCTCAGGTGCCCTGGAAGCGATAAGCGCAGGCC
GACAAATCATTGTGGCCAATGAAAATTCTCCTGAAGTGAGCCTGATTGCAGAAAAAGGGTGCCTGATAGCCGGCCATCTA
CAGGAGGTTTGTGCGTACCTGGAGGGGCGACATGAGCTTGCCGAACCACGGGAGGCCGAAGATATCCTGCCGGATAACCG
GGACGATCTCCGCGATATTATTGGTCAGGAACAGGGTAAGCGGGCCCTGGAGATTACCGCCGCAGGTGGGCATAACCTCT
TATTGATTGGCCCACCAGGAACAGGCAAAACAATGCTGGCAAGCCGCCTGAATGGCTTACTGCCGCCATTAAACAATCGC
GAGGCGCTTGAAAGCGCCGCCATATTTAGCCTCGTTAGCTCAACATCACTGCAAAAGCAGTGGCGGCAGCGCCCCTTTCG
CTCCCCGCACCACAGTGCTTCTTTGGCAGCAATGGTGGGTGGGGGCTCAATCCCCGGGCCTGGCGAAATATCGCTGGCGC
ACAATGGCATCCTGTTTCTTGATGAACTGCCTGAATTTGAACGACGTGTTCTGGACGCCCTGCGTGAGCCGATTGAGTCG
GGACAAATACACATCTCCCGGACACGAGCAAAAATCAGTTACCCTGCACGGTTCCAGCTGGTTGCTGCGATGAACCCCAG
TCCAACAGGACACTACCAGGGGAACCATAACCGGTGTACGCCCGAGCAGACGCTTCGCTATCTTGGAAAGCTATCAGGTC
CTTTCCTCGACCGGTTCGATTTATCGCTGGAGATCCCTCTTCCGCCCCCCGGGCTACTCAGCCAGAACAACGGACAGGGA
ATAACTTCAGCGGAAGTGCGCGAAAGAGTTATCACTGCCCAGGCGAATCAATATGCCCGTCAGTGCAAACTCAATGCTTA
TCTTGATAATGCGGAAATTCGGCGATTCTGCCATCTGGCCCCGGAGGATGCACTTTGGCTGGAGGATACCCTGGTACGGT
TTGGGCTTTCCATTCGTGCCTGGCAGCGTTTATTGAAAGTCGCCAGGACTATCGCTGATGTTGAAGGGTGTATTGAAATC
CAGAGGCAGCATCTGCAAGAAGCATTGAGCTATCGTGCAATCGACCGTTTGCTACTGCATCTGCAAAAAATGCTGGCGTA
A
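
The nucleotide and protein records can be checked against each other by translating the coding sequence. The sketch below is a minimal, standard-library illustration that assumes the standard genetic code (the bacterial table 11 uses the same codon-to-amino-acid assignments). `translate` is a hypothetical helper, not part of any tool named in this record.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, to_stop=True):
    """Translate a coding sequence read in frame from its first base.

    Whitespace is ignored. With to_stop=True, translation ends at the
    first stop codon; otherwise stop codons are emitted as '*'.
    """
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the comM sequence above, spaced for readability.
head = translate("ATG TCA CTG TCG GTT GTT TAT ACA CGA GCC")
# head == "MSLSVVYTRA", the first ten residues of the protein record
```

Applied to the full 1521 bp sequence, the same function would be expected to yield the 506-residue protein, since 1521 / 3 = 507 codons including the terminal TAA stop.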


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comM | Haemophilus influenzae Rd KW20 | 59.136 | 100 | 0.595
comM | Glaesserella parasuis strain SC1401 | 59.369 | 100 | 0.595
comM | Vibrio campbellii strain DS40M4 | 59.127 | 99.605 | 0.589
comM | Vibrio cholerae strain A1552 | 58.367 | 99.209 | 0.579
comM | Legionella pneumophila str. Paris | 50.704 | 98.221 | 0.498
comM | Legionella pneumophila strain ERS1305867 | 50.704 | 98.221 | 0.498
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 42.745 | 100 | 0.431


Multiple sequence alignment