Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comM   Type   Machinery gene
Locus tag   O1Q81_RS00470 Genome accession   NZ_CP170385
Coordinates   88177..89712 (+) Length   511 a.a.
NCBI ID   WP_386698589.1    Uniprot ID   -
Organism   Lonepinella sp. MS14436     
Function   required for natural transformation (predicted from homology)
Unclear
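
The coordinates and the two lengths in the overview are related by simple arithmetic, which the following minimal sketch illustrates. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp, has_stop=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is
    True, that the final codon is a stop codon and encodes no residue.
    """
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length_bp = gene_length_bp(88177, 89712)
print(length_bp)                     # 1536
print(protein_length_aa(length_bp))  # 511
```

Under these assumptions the 1536 bp gene corresponds to 512 codons, the last of which is the stop codon, which matches the 511 a.a. reported for the protein.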

Genomic Context


Location: 83177..94712
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  O1Q81_RS00450 (O1Q81_00089) - 84300..84983 (-) 684 WP_386697684.1 metallophosphoesterase -
  O1Q81_RS00455 (O1Q81_00090) nadR 84983..86269 (-) 1287 WP_386697686.1 multifunctional transcriptional regulator/nicotinamide-nucleotide adenylyltransferase/ribosylnicotinamide kinase NadR -
  O1Q81_RS00460 (O1Q81_00091) ribB 86672..87316 (+) 645 WP_386697688.1 3,4-dihydroxy-2-butanone-4-phosphate synthase -
  O1Q81_RS00465 (O1Q81_00092) yihA 87454..88080 (+) 627 WP_386697690.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  O1Q81_RS00470 (O1Q81_00093) comM 88177..89712 (+) 1536 WP_386698589.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  O1Q81_RS00475 (O1Q81_00094) - 89810..90304 (-) 495 WP_386688367.1 YfbU family protein -
  O1Q81_RS00480 (O1Q81_00095) - 90315..91079 (-) 765 WP_386695075.1 VacJ family lipoprotein -
  O1Q81_RS00485 (O1Q81_00096) carA 91344..92471 (+) 1128 WP_386697692.1 glutamine-hydrolyzing carbamoyl-phosphate synthase small subunit -
  O1Q81_RS00490 (O1Q81_00097) - 92478..93134 (+) 657 WP_386695078.1 Fic family protein -
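
The rows above can be checked programmatically. The sketch below parses the coordinate part of a few rows, confirms that each listed size equals end - start + 1, and computes the gaps between neighbouring genes and the flanks of the displayed window. The regular expression and helper names are assumptions made for illustration, and only three rows are copied here to keep the example short.

```python
import re

# Matches the "start..end (strand) size" portion of a Genomic Context row.
ROW_COORDS = re.compile(r"(\d+)\.\.(\d+) \(([+-])\) (\d+)")


def parse_row(row):
    """Return (start, end, strand, size_bp) parsed from one table row."""
    match = ROW_COORDS.search(row)
    if match is None:
        raise ValueError(f"no coordinates found in row: {row!r}")
    start, end, strand, size = match.groups()
    return int(start), int(end), strand, int(size)


def intergenic_gap(upstream_end, downstream_start):
    """Number of bases strictly between two neighbouring genes."""
    return downstream_start - upstream_end - 1


rows = [
    "O1Q81_RS00465 (O1Q81_00092) yihA 87454..88080 (+) 627 WP_386697690.1",
    "O1Q81_RS00470 (O1Q81_00093) comM 88177..89712 (+) 1536 WP_386698589.1",
    "O1Q81_RS00475 (O1Q81_00094) - 89810..90304 (-) 495 WP_386688367.1",
]
parsed = [parse_row(row) for row in rows]

# Each listed size should equal the inclusive span of its coordinates.
sizes_ok = all(end - start + 1 == size for start, end, _, size in parsed)

# Gaps between consecutive genes in the order listed.
gaps = [
    intergenic_gap(parsed[i][1], parsed[i + 1][0])
    for i in range(len(parsed) - 1)
]

# Flanks of the displayed window (Location line) around comM.
window_start, window_end = 83177, 94712
comM_start, comM_end = parsed[1][0], parsed[1][1]
flanks = (comM_start - window_start, window_end - comM_end)

print(sizes_ok)  # True
print(gaps)      # [96, 97]
print(flanks)    # (5000, 5000)
```

For this entry the displayed window extends 5000 bp on each side of comM.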

Sequence


Protein


Length: 511 a.a.        Molecular weight: 56189.68 Da        Isoelectric Point: 9.6455

>NTDB_id=1056483 O1Q81_RS00470 WP_386698589.1 88177..89712(+) (comM) [Lonepinella sp. MS14436]
MSLAIVYSRASIGVQAPLVTIEVHISNGKPNFVLVGLPEKTVKEAQDRVRSALINAQFKYPPKRITVNLAPADLPKEGGR
FDLPIALGILAASGQINAEQLRQFEFLGELALTGNLRGVQGVIPAILAAQKAKRTMIIAPQNANEASLISEHDSYYANSL
LEVVQFLDKQLNLPTASQMADQIRSNAKQSAVQNSKDLTDIIGQHHAKRALTIAAAGQHNLLLLGPPGTGKTMLASRLTD
LLPEMSDQEAIETASVTSLIQNELNFANWKKRPFRCPHHSASLPALVGGGSIPKPGEISLAHNGILFLDELPEFERKVLD
ALRQPLESGEIVISRATAKIQFPAKFQLVAAMNPSPTGHYQGSHNRTSPQQIIRYLNRLSGPFLDRFDLSIEVPLLPQGA
LQSQEDRGESSEQVRLKIVKVRERQFARQGKVNAHLTSKEIERYCPLLEKDAIFLENALTKLGLSVRAYHRILKVSRTIA
DLAESEQITQAHLAEALGYRTMDRLLLKLHE

Nucleotide


Length: 1536 bp

>NTDB_id=1056483 O1Q81_RS00470 WP_386698589.1 88177..89712(+) (comM) [Lonepinella sp. MS14436]
ATGTCGTTAGCCATTGTTTATAGCCGTGCATCTATCGGGGTACAAGCCCCCCTTGTGACCATCGAAGTCCATATTAGCAA
TGGTAAACCAAATTTTGTGCTGGTAGGATTACCAGAAAAAACCGTGAAAGAAGCACAGGATCGCGTGAGAAGTGCCTTGA
TTAATGCACAATTTAAATATCCTCCAAAACGTATTACCGTTAATCTTGCCCCAGCAGATTTACCTAAAGAGGGGGGACGT
TTTGATTTGCCGATAGCGTTGGGGATTTTGGCAGCATCAGGGCAAATTAATGCGGAACAATTACGTCAATTTGAATTTCT
TGGGGAACTGGCATTAACGGGGAATTTACGTGGTGTGCAAGGGGTTATTCCTGCTATTTTAGCGGCTCAGAAAGCGAAAA
GAACCATGATTATTGCCCCGCAAAATGCCAATGAAGCATCTCTCATCTCAGAACACGACAGTTATTACGCAAATTCACTG
TTAGAAGTGGTGCAGTTTCTCGACAAGCAATTAAATTTGCCAACGGCTAGTCAAATGGCAGACCAAATTAGGTCAAATGC
CAAGCAAAGTGCGGTGCAAAATAGCAAAGATTTAACCGATATTATCGGGCAACATCATGCCAAACGAGCCTTAACCATTG
CCGCTGCTGGGCAACATAACTTACTTTTATTAGGTCCCCCAGGGACTGGCAAAACCATGCTTGCCAGTCGATTAACCGAT
CTCTTGCCCGAAATGAGTGATCAAGAAGCCATAGAAACCGCCTCTGTGACTAGCCTGATTCAAAATGAACTGAATTTTGC
GAATTGGAAGAAACGCCCATTCCGTTGCCCACATCATAGTGCATCTTTGCCTGCGTTAGTGGGAGGGGGATCGATCCCAA
AACCTGGCGAAATTTCATTAGCTCACAATGGCATTCTTTTTCTTGATGAGTTGCCCGAGTTTGAACGTAAAGTATTAGAT
GCTTTACGCCAACCCTTAGAAAGTGGTGAAATAGTTATCTCTCGTGCCACTGCTAAAATCCAATTTCCCGCTAAATTTCA
ATTAGTTGCTGCTATGAATCCAAGTCCAACAGGACATTATCAAGGTTCTCATAATCGCACATCACCACAACAAATTATCC
GTTATTTGAATCGGTTATCAGGACCTTTTTTAGATCGCTTTGATTTATCTATTGAAGTCCCATTGTTACCACAGGGTGCA
TTACAAAGCCAAGAAGATCGGGGTGAAAGCAGCGAACAAGTGCGGTTAAAAATTGTTAAGGTACGCGAAAGACAATTCGC
TCGACAGGGTAAAGTGAATGCTCATTTAACCAGTAAAGAAATTGAACGTTATTGCCCGTTATTAGAAAAAGATGCAATTT
TTTTGGAAAATGCCCTAACTAAACTAGGTTTATCAGTACGAGCTTATCATCGTATTCTCAAGGTTTCTCGTACCATTGCA
GATTTGGCAGAGTCGGAACAAATTACGCAAGCCCATTTAGCCGAAGCCTTGGGGTATCGTACTATGGATCGGTTATTATT
GAAATTACATGAATAA
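
Both FASTA records share the same header layout, so its fields can be pulled out with a single pattern. This is a minimal sketch that assumes the layout seen in this entry (NTDB id, locus tag, protein accession, coordinates with strand, gene name in parentheses, organism in square brackets); the field names are hypothetical labels, not names defined by the database.

```python
import re

# Pattern for the header layout used by the two records in this entry.
HEADER = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)


def parse_header(line):
    """Split a FASTA header of this entry into a dict of named fields."""
    match = HEADER.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    # Coordinates are converted to integers; other fields stay as strings.
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1056483 O1Q81_RS00470 WP_386698589.1 "
    "88177..89712(+) (comM) [Lonepinella sp. MS14436]"
)
record = parse_header(header)
print(record["gene"], record["locus_tag"], record["start"], record["end"])
```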


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20 79.568 99.609 0.793
  comM Glaesserella parasuis strain SC1401 75.734 100 0.757
  comM Vibrio cholerae strain A1552 66.012 99.609 0.658
  comM Vibrio campbellii strain DS40M4 64.188 100 0.642
  comM Legionella pneumophila str. Paris 51.491 98.434 0.507
  comM Legionella pneumophila strain ERS1305867 51.491 98.434 0.507
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 46.228 100 0.468


Multiple sequence alignment