Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   MLG_RS00415 Genome accession   NC_008340
Coordinates   89788..91305 (+) Length   505 a.a.
NCBI ID   WP_011627835.1    Uniprot ID   Q0ACJ8
Organism   Alkalilimnicola ehrlichii MLHE-1     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 84788..96305
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MLG_RS00400 (Mlg_0079) - 87516..88766 (-) 1251 WP_011627832.1 ammonium transporter -
  MLG_RS00405 (Mlg_0080) glnK 88829..89167 (-) 339 WP_011627833.1 P-II family nitrogen regulator -
  MLG_RS00410 (Mlg_0081) ubiK 89410..89724 (+) 315 WP_011627834.1 ubiquinone biosynthesis accessory factor UbiK -
  MLG_RS00415 (Mlg_0082) comM 89788..91305 (+) 1518 WP_011627835.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  MLG_RS00420 (Mlg_0083) - 91329..91883 (-) 555 WP_011627836.1 cytochrome c5 family protein -
  MLG_RS00425 (Mlg_0084) argB 92052..92966 (-) 915 WP_011627837.1 acetylglutamate kinase -
  MLG_RS00430 (Mlg_0085) - 93252..93461 (-) 210 WP_011627838.1 DUF2905 domain-containing protein -
  MLG_RS00435 (Mlg_0086) rpiA 93478..94140 (-) 663 WP_041717846.1 ribose-5-phosphate isomerase RpiA -
  MLG_RS00440 (Mlg_0087) ilvA 94115..95740 (+) 1626 WP_269490060.1 threonine ammonia-lyase, biosynthetic -

Sequence


Protein


Download         Length: 505 a.a.        Molecular weight: 52762.69 Da        Isoelectric Point: 8.6750

>NTDB_id=26283 MLG_RS00415 WP_011627835.1 89788..91305(+) (comM) [Alkalilimnicola ehrlichii MLHE-1]
MSLAVVHTRASLGLAAPEVTVEVHIGPGLPALAIVGLPETAVREARERVRSALTMAGFEFPAGRITVNLAPADLPKGGGR
YDLPIALGILRASEQLNAPLGDWEFAGELALSGRLRAIPGLLPLAVQARKAGRGLVVPEACGGEVALVHRNALTAGHLLD
VCRHLGGEACLAPACARASEPAGAPVADLADVRGQAGARRALEIAAAGGHALLLCGPPGTGKSMLAARLPGILPAMTEAE
ALEAAAVASVSHAGFRPEQWARRPFRQPHHSASQAALIGGGRKPGPGEASLAHRGILFLDELPEFSRGALEALREPLETG
EVHISRASARVSYPARFQLIAAMNPCPCGHLGDPAGRCRCTPEQVQRYRGRLSGPLMDRLDMQVAVPRLSATELQVDGAT
GEDSEVVRARVTAARARQQARAGVENAHLQGRALEAHCRLAPTDARLLARAMEQLGLSARAYHRVLRLARTIADLEGADA
IHSPHLTEALALRRGLEPGRGAVGP

Nucleotide


Download         Length: 1518 bp        

>NTDB_id=26283 MLG_RS00415 WP_011627835.1 89788..91305(+) (comM) [Alkalilimnicola ehrlichii MLHE-1]
ATGTCATTGGCTGTGGTTCATACCCGGGCCTCGCTGGGGCTGGCCGCCCCGGAGGTCACCGTGGAGGTACACATCGGGCC
CGGGCTGCCGGCACTGGCCATCGTCGGGCTGCCGGAGACCGCCGTGCGCGAGGCGCGCGAGCGGGTGCGCTCGGCGCTCA
CCATGGCGGGCTTTGAGTTCCCTGCCGGCCGCATCACGGTGAATCTCGCCCCCGCCGACCTGCCCAAGGGCGGCGGTCGT
TACGATCTGCCCATCGCCCTGGGCATCCTCCGCGCCTCTGAACAGCTAAACGCCCCTCTGGGCGACTGGGAGTTCGCCGG
CGAACTGGCCCTCAGCGGCCGCCTGCGGGCCATCCCCGGGCTGCTGCCGCTGGCGGTGCAGGCGCGCAAGGCCGGGCGGG
GGCTGGTGGTGCCCGAGGCCTGCGGGGGCGAGGTGGCGTTGGTCCACCGCAACGCCCTGACCGCGGGGCATCTCCTGGAT
GTCTGTCGGCACCTGGGCGGTGAGGCCTGTCTGGCGCCGGCGTGTGCCCGGGCGTCGGAGCCGGCCGGGGCGCCGGTGGC
GGACCTGGCCGACGTGCGCGGCCAGGCCGGCGCCCGACGGGCGCTGGAGATCGCCGCCGCCGGCGGCCACGCCCTCCTGC
TGTGCGGCCCGCCGGGTACCGGCAAGAGCATGCTGGCGGCGCGCCTGCCGGGGATTCTGCCCGCCATGACCGAGGCCGAG
GCCCTGGAGGCGGCGGCGGTGGCCTCGGTCAGCCATGCGGGGTTTCGTCCCGAGCAATGGGCCCGCCGGCCCTTCCGCCA
GCCGCACCACAGCGCCTCACAGGCGGCGTTGATCGGCGGCGGGCGCAAGCCCGGGCCCGGCGAGGCCTCGTTGGCTCACC
GCGGCATCCTGTTCCTGGACGAGCTGCCGGAGTTCAGTCGCGGCGCGTTGGAGGCCCTGCGTGAGCCGCTGGAGACCGGC
GAGGTGCATATCTCCCGGGCCTCGGCCCGGGTCAGTTACCCGGCCCGGTTCCAGCTCATAGCGGCCATGAACCCCTGCCC
CTGCGGCCACCTGGGGGACCCGGCCGGGCGTTGCCGCTGCACCCCGGAGCAGGTCCAGCGGTATCGCGGCCGGCTCTCCG
GCCCACTGATGGACCGGCTGGACATGCAGGTGGCGGTGCCCCGGTTGAGCGCGACCGAATTACAGGTCGACGGGGCGACG
GGGGAGGACAGTGAGGTGGTGCGGGCTCGGGTGACGGCGGCCCGGGCCCGGCAACAGGCCCGGGCGGGCGTGGAGAATGC
GCACCTCCAGGGGCGGGCCCTGGAGGCCCATTGCCGGCTGGCGCCGACGGACGCCCGGTTGCTGGCCCGGGCCATGGAGC
AGCTGGGGTTGTCCGCTCGGGCCTACCACCGGGTGTTGCGGCTGGCGCGCACCATTGCCGATCTGGAGGGTGCGGACGCG
ATCCACTCCCCGCACCTGACCGAGGCGCTGGCCCTGCGCCGGGGCCTGGAGCCCGGTCGCGGTGCGGTCGGGCCCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q0ACJ8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

52.906

98.812

0.523

  comM Vibrio cholerae strain A1552

53.012

98.614

0.523

  comM Vibrio campbellii strain DS40M4

53.131

98.02

0.521

  comM Glaesserella parasuis strain SC1401

51.703

98.812

0.511

  comM Legionella pneumophila str. Paris

46.707

99.208

0.463

  comM Legionella pneumophila strain ERS1305867

46.707

99.208

0.463

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.936

99.604

0.438


Multiple sequence alignment