Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: GYM74_RS10580
Genome accession: NZ_CP048265
Coordinates: 2437586..2439109 (+)
Length: 507 a.a.
NCBI ID: WP_220218180.1
UniProt ID: -
Organism: Gilliamella sp. ESL0405
Function: required for natural transformation (predicted from homology)
Unclear
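
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch; `span_length` is an illustrative helper, not part of the database, and it assumes the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon.

```python
def span_length(coords: str) -> int:
    """Length in bp of an inclusive, 1-based 'start..end' span."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1

gene_bp = span_length("2437586..2439109")

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_aa = gene_bp // 3 - 1

print(gene_bp, protein_aa)
```

Under these assumptions the span is 1524 bp, which corresponds to 507 amino acids plus one stop codon.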

Genomic Context


Location: 2432586..2444109
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GYM74_RS10560 (GYM74_10535) mepM 2433433..2434794 (-) 1362 WP_220218176.1 murein DD-endopeptidase MepM -
  GYM74_RS10565 (GYM74_10540) znuA 2434823..2435782 (-) 960 WP_220218177.1 zinc ABC transporter substrate-binding protein ZnuA -
  GYM74_RS10570 (GYM74_10545) znuC 2435877..2436659 (+) 783 WP_220218178.1 zinc ABC transporter ATP-binding protein ZnuC -
  GYM74_RS10575 (GYM74_10550) znuB 2436652..2437455 (+) 804 WP_220218179.1 zinc ABC transporter permease subunit ZnuB -
  GYM74_RS10580 (GYM74_10555) comM 2437586..2439109 (+) 1524 WP_220218180.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  GYM74_RS10585 (GYM74_10560) - 2439114..2439452 (-) 339 WP_220218181.1 DUF413 domain-containing protein -
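
The Size (bp) column and the Location window can be checked against the coordinates in the table. This is a sketch under the assumption that coordinates are 1-based and inclusive; the variable names are illustrative, and the observation that the window extends the comM span by the same amount on each side is read off the numbers, not stated by the source.

```python
# (label, start, end, listed size in bp), copied from the table above
rows = [
    ("mepM", 2433433, 2434794, 1362),
    ("znuA", 2434823, 2435782, 960),
    ("znuC", 2435877, 2436659, 783),
    ("znuB", 2436652, 2437455, 804),
    ("comM", 2437586, 2439109, 1524),
    ("GYM74_RS10585", 2439114, 2439452, 339),
]

# Rows whose listed size disagrees with end - start + 1
mismatches = [name for name, start, end, size in rows if end - start + 1 != size]

# Distance from the displayed window edges to the comM gene
window_start, window_end = 2432586, 2444109
gene_start, gene_end = 2437586, 2439109
flank_left = gene_start - window_start
flank_right = window_end - gene_end

print(mismatches, flank_left, flank_right)
```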

Sequence


Protein


Length: 507 a.a.   Molecular weight: 55590.79 Da   Isoelectric point: 9.3527

>NTDB_id=420321 GYM74_RS10580 WP_220218180.1 2437586..2439109(+) (comM) [Gilliamella sp. ESL0405]
MSLAIVSTRASIGIQAPQINVEVHISNGLPGFVLVGLPEATVKEAKDRVRSAIINSGFTFPAKKITVNLSPADLPKEGSR
FDLPIAIAILAATEQIPVDNLAQYEFLGELALSGDIRAVKGAIPAAIASKKNHRVLIISAENQSELSLIHHNNTLITTNL
LQLCQYLYNEINLPMVEYRDHNDNGNQVEKLEDIVGQEHAKRALEIAAAGGHNLLLIGPPGTGKTMLATRLTSLLPPLSD
DEALQSAAITSLVSPNGSIKNWRQRPFRAPHHSASTAALVGGGSIPKPGEISLAHNGVLFLDELPEFNRKVLDALREPIE
SGEIVISRANAKIKFPAKFQLIAAMNPSPTGHYQGTHNRTTPQQTMRYLNRLSGPFLDRFDISIEVPLLPKGTLSQKNIA
VESTQSVKQRVFKAREIQLGRNNKLNSQLSVNEIKHYCLLSDENNEYLEQALIKLGLSARAWHRILKVSRTIADLDSSVN
IERQHISEALSYRSMDRLLIQLHKNIG

Nucleotide


Length: 1524 bp

>NTDB_id=420321 GYM74_RS10580 WP_220218180.1 2437586..2439109(+) (comM) [Gilliamella sp. ESL0405]
ATGTCTCTTGCGATTGTCTCAACCCGTGCTTCTATCGGAATACAAGCACCGCAAATCAATGTCGAAGTACACATTAGTAA
TGGATTGCCTGGTTTTGTACTAGTAGGGCTACCTGAAGCAACGGTGAAAGAAGCAAAAGATCGGGTTAGGAGCGCCATTA
TCAATAGCGGCTTCACATTTCCGGCAAAGAAAATCACCGTTAACCTCTCACCCGCTGATCTTCCTAAAGAAGGAAGTCGT
TTTGATTTACCTATAGCTATCGCTATTTTAGCTGCTACAGAGCAAATTCCGGTAGATAATCTGGCACAGTATGAGTTCTT
AGGTGAGTTAGCGCTTTCGGGCGATATAAGAGCGGTTAAAGGCGCAATTCCGGCGGCAATAGCATCAAAGAAAAATCATC
GTGTTTTAATTATTTCAGCTGAAAATCAATCTGAATTATCTTTGATTCACCATAACAATACATTAATCACCACTAATTTA
TTACAACTATGCCAATATTTATATAATGAAATCAATTTGCCCATGGTTGAATATCGTGATCATAACGATAATGGAAATCA
AGTAGAAAAATTAGAAGACATTGTCGGTCAGGAACATGCTAAAAGAGCGTTGGAAATTGCCGCTGCTGGTGGTCATAATC
TATTATTAATTGGTCCTCCCGGAACAGGTAAAACAATGTTAGCAACACGTTTAACTTCGCTTTTACCGCCGCTATCAGAC
GATGAAGCGTTACAAAGCGCTGCAATAACGAGCCTTGTAAGTCCTAATGGCTCGATAAAAAATTGGCGTCAAAGACCGTT
TAGAGCGCCTCATCATAGCGCATCAACCGCAGCTTTAGTCGGTGGTGGCTCAATACCTAAACCGGGGGAAATTTCATTAG
CACATAATGGCGTACTATTTTTAGATGAACTACCGGAATTTAATCGTAAAGTGCTTGATGCCTTAAGGGAACCGATTGAA
TCGGGCGAAATAGTGATTTCAAGAGCCAATGCGAAAATAAAATTTCCTGCCAAATTTCAACTTATTGCCGCTATGAATCC
AAGCCCAACCGGTCATTATCAAGGTACGCATAATAGAACAACGCCTCAACAAACTATGCGTTATTTAAATCGGCTATCGG
GGCCATTTTTAGATAGGTTTGATATCTCTATCGAAGTGCCATTATTACCTAAAGGGACACTTAGTCAGAAAAATATCGCA
GTTGAATCAACACAGAGCGTAAAACAGCGAGTATTTAAAGCAAGAGAAATACAATTAGGAAGGAATAACAAGCTTAATAG
TCAGCTAAGCGTAAATGAGATAAAACATTATTGTCTGTTATCGGATGAGAATAACGAGTATTTGGAACAAGCTTTGATTA
AATTAGGTCTTTCAGCACGAGCATGGCACCGAATTTTGAAAGTCTCACGTACTATTGCGGATTTAGATTCATCAGTCAAT
ATAGAACGACAACATATTTCAGAAGCATTAAGTTATCGTTCTATGGATAGGTTATTAATTCAATTACATAAAAACATTGG
GTAA
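
As a consistency check between the nucleotide and protein records, the first and last codons of the sequence above can be translated with the standard genetic code. This is a minimal sketch in plain Python; `translate` is an illustrative helper, not a tool used by the database, and only two short fragments copied from the sequence are translated.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

first_18 = "ATGTCTCTTGCGATTGTC"  # first 18 nt of the record
last_15 = "AAAAACATTGGGTAA"      # last 15 nt of the record

print(translate(first_18))
print(translate(last_15))
```

The two fragments translate to MSLAIV and KNIG followed by a stop, which matches the start and end of the protein sequence listed above.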


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Haemophilus influenzae Rd KW20   63.241   99.803   0.631
  comM   Glaesserella parasuis strain SC1401   62.13   100   0.621
  comM   Vibrio cholerae strain A1552   60.558   99.014   0.6
  comM   Vibrio campbellii strain DS40M4   59.96   99.014   0.594
  comM   Legionella pneumophila str. Paris   52.823   97.83   0.517
  comM   Legionella pneumophila strain ERS1305867   52.823   97.83   0.517
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   46.169   97.83   0.452
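
The Ha-value is not defined on this page. The listed numbers are, however, consistent with the product of fractional identity and fractional coverage, and the sketch below checks that reading against the table. This is an inference from the values shown, not a documented formula, and `ha_estimate` is a hypothetical name.

```python
# (organism, identity %, coverage %, listed Ha-value), copied from the table
hits = [
    ("Haemophilus influenzae Rd KW20", 63.241, 99.803, 0.631),
    ("Glaesserella parasuis strain SC1401", 62.13, 100.0, 0.621),
    ("Vibrio cholerae strain A1552", 60.558, 99.014, 0.6),
    ("Vibrio campbellii strain DS40M4", 59.96, 99.014, 0.594),
    ("Legionella pneumophila str. Paris", 52.823, 97.83, 0.517),
    ("Legionella pneumophila strain ERS1305867", 52.823, 97.83, 0.517),
    ("Riemerella anatipestifer ATCC 11845 = DSM 15868", 46.169, 97.83, 0.452),
]

def ha_estimate(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relation: fractional identity times fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)

# Largest gap between the assumed formula and the listed values
max_diff = max(abs(ha_estimate(i, c) - ha) for _, i, c, ha in hits)
print(max_diff)
```

If the assumed relation holds, the largest difference stays within the rounding of the three-decimal values in the table.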


Multiple sequence alignment