Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   LRS40_RS07590 Genome accession   NZ_CP088265
Coordinates   1309037..1310557 (-) Length   506 a.a.
NCBI ID   WP_231313433.1    Uniprot ID   -
Organism   Leclercia sp. G3L     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1304037..1315557
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LRS40_RS07580 (LRS40_07580) hdfR 1307733..1308554 (-) 822 WP_142487544.1 HTH-type transcriptional regulator HdfR -
  LRS40_RS07585 (LRS40_07585) - 1308673..1309011 (+) 339 WP_142487543.1 DUF413 domain-containing protein -
  LRS40_RS07590 (LRS40_07590) comM 1309037..1310557 (-) 1521 WP_231313433.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  LRS40_RS07595 (LRS40_07595) ilvL 1310901..1310999 (+) 99 WP_023620831.1 ilv operon leader peptide -
  LRS40_RS07600 (LRS40_07600) ilvX 1311086..1311136 (+) 51 WP_197090613.1 peptide IlvX -
  LRS40_RS07605 (LRS40_07605) ilvG 1311139..1312785 (+) 1647 WP_231313434.1 acetolactate synthase 2 catalytic subunit -
  LRS40_RS07610 (LRS40_07610) ilvM 1312782..1313051 (+) 270 WP_032615208.1 acetolactate synthase 2 small subunit -
  LRS40_RS07615 (LRS40_07615) ilvE 1313069..1313998 (+) 930 WP_142487539.1 branched-chain-amino-acid transaminase -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55152.52 Da        Isoelectric Point: 7.8962

>NTDB_id=633574 LRS40_RS07590 WP_231313433.1 1309037..1310557(-) (comM) [Leclercia sp. G3L]
MSLSVVYTRAALGVKAPLISVEVHLSNGLPGLTLVGLPETTVKEARDRVRSAIINSGYTFPAKKITINLAPADLPKEGGR
YDLPIAIALLAASEQLSSPRLSAYEFVGELALTGALRGVPGAISGALAAIHAGREIIVAKDNAAEVSLIEQKGCLVAEHL
QEVCAFLEGRHELAVPAQEPFAVENSDQDISDIIGQEQGKRALEITAAGAHNLLLIGPPGTGKTMLASRLNGLLPPLSNH
EALESAAIVSLVNATSMYKQWRRRPFRAPHHSASLVAMVGGGAIPAPGEISLAHNGILFLDELPEFERRVLDALREPIES
GQINISRTRAKISYPARFQLIAAMNPSPTGHYQGNHNRCSPEQTLRYLGRLSGPFLDRFDLSLEIPLPPPGLLSQAHRAG
ETSLTVRNRVIAAQERQLVRQNKLNAHLDNAEIRRFCPLMAEDALWLEETLTRFGLSVRAWQRLLKVARTIADLGESEKI
ERRHLQEALSYRAIDRLLMHLQKMLE

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=633574 LRS40_RS07590 WP_231313433.1 1309037..1310557(-) (comM) [Leclercia sp. G3L]
ATGTCACTGTCGGTTGTTTATACCCGCGCTGCGTTAGGTGTTAAAGCGCCGCTGATTTCCGTTGAGGTTCACCTCAGTAA
TGGCCTGCCGGGCTTAACGCTAGTTGGCTTGCCGGAAACCACCGTCAAGGAAGCCAGAGACCGGGTACGCAGCGCAATCA
TCAATAGCGGTTATACCTTTCCGGCCAAAAAGATCACGATTAACCTGGCGCCAGCCGATCTACCCAAGGAAGGCGGACGA
TATGATTTACCTATCGCCATTGCGCTTCTCGCCGCCTCTGAGCAACTCTCCTCTCCCAGACTAAGCGCATACGAGTTTGT
GGGTGAGCTGGCGCTCACAGGCGCATTAAGAGGGGTTCCTGGCGCAATATCTGGTGCCCTGGCGGCCATTCATGCTGGCA
GAGAGATTATCGTTGCCAAGGATAACGCTGCCGAAGTCAGCCTTATCGAACAAAAAGGCTGCCTGGTGGCAGAGCACCTG
CAGGAAGTGTGCGCCTTTCTTGAGGGCCGCCACGAGCTGGCTGTGCCCGCTCAGGAGCCGTTTGCAGTAGAAAACAGCGA
TCAGGATATCAGCGATATTATCGGCCAGGAGCAGGGTAAAAGGGCGCTGGAGATCACCGCAGCCGGCGCACACAACCTGC
TGCTGATTGGCCCACCAGGCACAGGCAAAACCATGCTCGCGAGCCGGCTCAATGGTCTGCTACCGCCACTCAGCAACCAT
GAGGCGCTGGAGAGTGCGGCAATCGTCAGTCTGGTCAACGCCACATCGATGTATAAGCAGTGGCGTCGACGTCCCTTTCG
CGCACCGCACCACAGCGCTTCGCTCGTGGCCATGGTAGGCGGGGGTGCAATCCCCGCCCCGGGAGAGATCTCACTGGCGC
ATAACGGCATTCTCTTTCTGGATGAGTTACCCGAGTTTGAACGGCGTGTGCTGGATGCTTTACGTGAGCCTATCGAATCG
GGTCAGATTAACATCTCCCGTACCCGTGCAAAGATCAGTTATCCGGCTCGTTTCCAGCTTATCGCCGCCATGAATCCCAG
CCCAACGGGTCATTATCAAGGTAACCATAACCGCTGTTCGCCCGAACAGACGCTTCGCTATCTGGGACGCTTATCGGGCC
CGTTTCTCGATCGCTTCGATTTATCGCTTGAGATCCCGCTACCGCCGCCAGGTTTGCTAAGCCAGGCGCACAGAGCAGGA
GAGACCAGTCTCACCGTACGAAACAGAGTGATCGCCGCGCAGGAGCGTCAGCTGGTGCGGCAGAATAAACTTAATGCCCA
TCTGGATAATGCGGAGATCCGCAGGTTTTGCCCCCTCATGGCAGAGGATGCCCTGTGGCTGGAAGAGACTCTCACCCGAT
TCGGCCTGTCGGTACGTGCCTGGCAACGGCTGCTAAAGGTGGCCCGTACGATTGCTGACCTGGGGGAAAGTGAAAAGATA
GAACGCAGACACCTGCAGGAAGCCCTGAGCTATCGCGCTATTGACCGTTTACTCATGCACCTGCAAAAGATGCTGGAATA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

60.511

100

0.609

  comM Glaesserella parasuis strain SC1401

60.355

100

0.605

  comM Vibrio campbellii strain DS40M4

59.245

99.407

0.589

  comM Vibrio cholerae strain A1552

58.765

99.209

0.583

  comM Legionella pneumophila str. Paris

50.704

98.221

0.498

  comM Legionella pneumophila strain ERS1305867

50.704

98.221

0.498

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.64

100

0.441