Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   LQ983_RS04855 Genome accession   NZ_OV100759
Coordinates   1022222..1023757 (+) Length   511 a.a.
NCBI ID   WP_230621828.1    Uniprot ID   -
Organism   Aggregatibacter sp. Marseille-P9115     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1017222..1028757
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LQ983_RS04825 - 1017673..1018350 (+) 678 WP_230621823.1 methionine ABC transporter permease -
  LQ983_RS04830 - 1018385..1019209 (+) 825 WP_230621824.1 MetQ/NlpA family lipoprotein -
  LQ983_RS04835 - 1019314..1019811 (+) 498 WP_230621825.1 YbhB/YbcL family Raf kinase inhibitor-like protein -
  LQ983_RS04840 trmL 1019927..1020403 (+) 477 WP_005704903.1 tRNA (uridine(34)/cytosine(34)/5- carboxymethylaminomethyluridine(34)-2'-O)- methyltransferase TrmL -
  LQ983_RS04845 - 1020475..1021350 (-) 876 WP_230621826.1 DUF535 family protein -
  LQ983_RS04850 yihA 1021493..1022107 (+) 615 WP_230621827.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  LQ983_RS04855 comM 1022222..1023757 (+) 1536 WP_230621828.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  LQ983_RS04860 - 1023771..1024598 (-) 828 WP_230621829.1 divergent polysaccharide deacetylase family protein -
  LQ983_RS04865 envC 1024595..1025842 (-) 1248 WP_230622606.1 murein hydrolase activator EnvC -
  LQ983_RS04870 - 1026044..1027840 (+) 1797 WP_230621830.1 ShlB/FhaC/HecB family hemolysin secretion/activation protein -

Sequence


Protein


Download         Length: 511 a.a.        Molecular weight: 56069.56 Da        Isoelectric Point: 9.6300

>NTDB_id=1151816 LQ983_RS04855 WP_230621828.1 1022222..1023757(+) (comM) [Aggregatibacter sp. Marseille-P9115]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALLNAEFRYPAKRITVNLAPADLPKEGGR
FDLPIAIGMLAASGYIDAEKLKQFEFIGELALTGQLRAVHGVIPAILAAKQAKRKCIIAQGNANEASLVSEQETYYANSL
LDVVQFLNQQGELPLAGDIKTQSTVDFFPENPKDLTDIIGQQHAKRALMIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMSDQEAIETASVASLVQNELNFHNWKQRPFRAPHHSASTPALVGGGSIPKPGEISLAHNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLIAAMNPSPTGHYQGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQH
SGDRGESSAMMREKVLKTRAIQLQRAGKINAHLTSKEIERDCKLEEKDALFLENALTKLGLSVRAYHRILKVSRTIADLE
GEKQIHQRHLAEALGYRAMDRLLQKLSKASA

Nucleotide


Download         Length: 1536 bp        

>NTDB_id=1151816 LQ983_RS04855 WP_230621828.1 1022222..1023757(+) (comM) [Aggregatibacter sp. Marseille-P9115]
ATGTCTTTAGCCATCGTTTACAGCCGCGCCTCCATGGGCGTTCAAGCACCACTGGTCACCATTGAAGTACATTTAAGCAA
CGGCAAACCGGGCTTTACCCTTGTTGGCTTGCCGGAAAAAACGGTAAAAGAAGCTCAAGATCGCGTACGCAGTGCGTTGC
TCAATGCCGAATTTCGCTACCCGGCTAAACGCATTACCGTCAATCTTGCCCCGGCAGATTTACCCAAAGAAGGCGGACGT
TTTGATTTGCCTATTGCCATCGGCATGCTGGCTGCCTCCGGTTATATTGATGCAGAAAAGTTAAAACAATTTGAATTTAT
CGGCGAACTTGCCCTGACCGGTCAACTGCGCGCGGTGCATGGTGTCATCCCTGCGATTTTGGCGGCGAAGCAAGCCAAGC
GAAAATGCATTATTGCACAAGGCAATGCCAATGAGGCATCGTTGGTTTCCGAGCAAGAAACCTATTACGCCAATTCCTTA
TTAGATGTCGTGCAATTTCTCAATCAACAGGGCGAATTACCTCTTGCAGGCGACATCAAAACCCAAAGTACGGTGGATTT
TTTCCCTGAAAATCCTAAAGATCTGACGGATATCATCGGCCAACAACATGCCAAGCGTGCCTTAATGATTGCCGCGGCAG
GGCAGCATAATTTGTTATTTTTAGGCCCACCCGGCACAGGAAAGACCATGCTTGCCAGTCGCTTAACCGGATTACTGCCG
GAAATGAGCGACCAAGAAGCCATTGAAACCGCCTCTGTGGCCAGTTTGGTGCAAAATGAACTAAATTTTCACAACTGGAA
ACAACGCCCCTTCCGTGCGCCTCACCACAGTGCCTCCACGCCAGCATTGGTGGGTGGCGGTTCTATTCCAAAACCTGGGG
AAATTTCCCTCGCACACAATGGCGTGTTGTTTTTGGATGAATTGCCTGAATTCGAACGCAAGGTGTTGGATGCGCTCCGC
CAGCCTTTAGAAAGTGGTGAAATCATCATTTCTCGCGCCAATGCCAAGATTCAATTTCCGGCAAGATTTCAGCTGATTGC
CGCCATGAATCCCAGCCCCACAGGGCATTATCAAGGCACCCATAACCGTACGTCACCACAACAAATCATGCGCTATTTAA
ATCGCCTATCAGGCCCCTTTTTAGATCGTTTCGATTTATCCATTGAGGTGCCGTTATTGCCGCAAGGGAGTCTACAACAT
AGCGGTGACCGTGGCGAATCCAGTGCAATGATGAGAGAAAAAGTCTTAAAAACACGCGCTATTCAACTACAACGTGCCGG
CAAAATTAACGCCCATTTAACCAGCAAAGAAATTGAACGAGATTGCAAACTGGAAGAGAAAGACGCGTTATTTTTAGAAA
ATGCGCTGACCAAGCTAGGGCTGTCTGTGCGGGCATACCACCGCATACTGAAAGTTTCCCGCACCATTGCGGATTTAGAA
GGGGAAAAGCAAATTCATCAGCGACACTTAGCAGAAGCGTTAGGCTATCGGGCGATGGACAGGCTATTACAGAAACTTTC
TAAAGCATCCGCATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

86.391

99.217

0.857

  comM Glaesserella parasuis strain SC1401

78.474

100

0.785

  comM Vibrio cholerae strain A1552

65.437

100

0.659

  comM Vibrio campbellii strain DS40M4

66.075

99.217

0.656

  comM Legionella pneumophila str. Paris

50.197

99.413

0.499

  comM Legionella pneumophila strain ERS1305867

50.197

99.413

0.499

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.287

100

0.477


Multiple sequence alignment