Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   N1249_RS21830 Genome accession   NZ_CP104014
Coordinates   4620050..4621570 (+) Length   506 a.a.
NCBI ID   WP_259833383.1    Uniprot ID   -
Organism   Enterobacter sp. CP102     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4615050..4626570
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  N1249_RS21805 (N1249_21805) ilvE 4616614..4617543 (-) 930 WP_259833378.1 branched-chain-amino-acid transaminase -
  N1249_RS21810 (N1249_21810) ilvM 4617562..4617825 (-) 264 WP_014072310.1 acetolactate synthase 2 small subunit -
  N1249_RS21815 (N1249_21815) ilvG 4617822..4619468 (-) 1647 WP_259833380.1 acetolactate synthase 2 catalytic subunit -
  N1249_RS21820 (N1249_21820) ilvX 4619471..4619521 (-) 51 WP_166792073.1 peptide IlvX -
  N1249_RS21825 (N1249_21825) ilvL 4619608..4619706 (-) 99 WP_023620831.1 ilv operon leader peptide -
  N1249_RS21830 (N1249_21830) comM 4620050..4621570 (+) 1521 WP_259833383.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  N1249_RS21835 (N1249_21835) - 4621603..4621941 (-) 339 WP_259833386.1 DUF413 domain-containing protein -
  N1249_RS21840 (N1249_21840) hdfR 4622060..4622881 (+) 822 WP_259833388.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 55261.50 Da        Isoelectric Point: 8.0218

>NTDB_id=727034 N1249_RS21830 WP_259833383.1 4620050..4621570(+) (comM) [Enterobacter sp. CP102]
MSLSVVYTRAAIGVKAPLISVEVHLSNGLPGLTLVGLPETTVKEARDRVRSAIINSGYTFPARKITINLAPADLPKEGGR
YDLPIAIALLAASEQLNTTRLGSYEFIGELALTGSLRGVPGAISGALEAISAGRQIIVANENSPEVSLIAEKGCLIASHL
QEVCAYLEGRHELAEPREAEDILPDNRDDLRDIIGQEQGKRALEITAAGGHNLLLIGPPGTGKTMLASRLNGLLPPLNNR
EALESAAIFSLVSSTSLQKQWRQRPFRSPHHSASLTAMVGGGSIPGPGEISLAHNGILFLDELPEFERRVLDALREPIES
GQIHISRTRAKISYPARFQLVAAMNPSPTGHYQGNHNRCTPEQTLRYLGKLSGPFLDRFDLSLEIPLPPPGLLSKNNGKG
ITSAEVRERVITAQAKQYARQCKLNAYLDNAEIRRFCHLAPEDALWLEDTLVRFGLSIRAWQRLLKVARTIADVEGCIEI
QRQHLQEALSYRAIDRLLLHLQKMLA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=727034 N1249_RS21830 WP_259833383.1 4620050..4621570(+) (comM) [Enterobacter sp. CP102]
ATGTCACTGTCGGTTGTTTATACACGCGCCGCTATTGGTGTGAAGGCACCACTGATTTCTGTTGAAGTACATCTGAGTAA
TGGCCTGCCAGGGCTCACTCTCGTTGGGCTACCAGAAACCACCGTAAAAGAGGCCAGGGATCGTGTGCGTAGCGCCATCA
TCAATAGCGGTTATACCTTCCCTGCCAGGAAAATCACCATCAACCTGGCACCTGCTGACTTACCCAAAGAGGGGGGGCGA
TATGATTTACCTATCGCTATTGCGCTTCTCGCCGCTTCTGAGCAACTGAATACCACCAGGCTAGGCTCGTATGAGTTCAT
CGGCGAACTTGCTCTTACAGGCTCATTAAGAGGCGTTCCTGGAGCGATATCAGGAGCCCTGGAAGCGATAAGCGCAGGCC
GACAAATCATTGTGGCCAATGAAAATTCTCCTGAAGTGAGCCTGATTGCAGAAAAAGGGTGTCTGATAGCCAGCCATCTA
CAGGAGGTTTGTGCGTACCTGGAAGGGCGGCATGAACTTGCTGAACCACGGGAGGCCGAAGATATCCTGCCGGATAACCG
GGACGATCTCCGCGATATTATTGGGCAGGAACAGGGCAAGCGAGCCCTGGAGATTACCGCCGCAGGTGGCCATAACCTCT
TATTGATTGGCCCGCCAGGAACGGGCAAAACAATGCTGGCAAGCCGCCTGAATGGCTTACTGCCGCCATTAAACAATCGC
GAGGCGCTTGAAAGCGCCGCCATATTTAGCCTCGTTAGCTCAACATCACTGCAAAAGCAGTGGCGGCAGCGCCCCTTTCG
CTCACCGCACCACAGTGCTTCTTTGACAGCAATGGTGGGTGGGGGCTCTATCCCGGGGCCTGGCGAAATCTCGCTGGCGC
ACAATGGCATCCTGTTTCTTGATGAACTGCCTGAATTTGAACGACGGGTTCTGGACGCGCTACGTGAACCGATTGAGTCG
GGACAAATACACATCTCCCGGACACGAGCCAAAATCAGTTACCCTGCGCGGTTCCAGCTGGTTGCTGCGATGAACCCCAG
CCCAACAGGGCACTACCAGGGGAACCATAACCGGTGCACGCCTGAGCAGACGCTTCGCTATCTTGGAAAGCTATCCGGTC
CTTTCCTCGACCGGTTCGACTTATCGCTGGAGATCCCTCTTCCGCCTCCCGGGCTGCTCAGCAAGAACAACGGAAAGGGA
ATAACTTCAGCGGAAGTACGTGAAAGAGTCATCACTGCCCAGGCGAAGCAATATGCCCGTCAGTGCAAACTCAATGCCTA
TCTTGATAATGCGGAGATTCGGCGATTCTGCCATCTGGCGCCAGAAGATGCGCTTTGGCTGGAGGATACCCTGGTACGGT
TTGGGCTTTCCATTCGTGCCTGGCAGCGTTTATTGAAAGTCGCCAGGACTATCGCTGATGTTGAAGGGTGTATTGAAATC
CAGAGGCAGCATCTGCAAGAAGCATTGAGCTATCGTGCAATCGACCGTTTGCTACTGCATCTGCAAAAAATGCTGGCGTA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

59.136

100

0.595

  comM Glaesserella parasuis strain SC1401

59.369

100

0.595

  comM Vibrio campbellii strain DS40M4

58.929

99.605

0.587

  comM Vibrio cholerae strain A1552

58.167

99.209

0.577

  comM Legionella pneumophila str. Paris

50.704

98.221

0.498

  comM Legionella pneumophila strain ERS1305867

50.704

98.221

0.498

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

42.941

100

0.433