Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   WGK59_RS08745 Genome accession   NZ_CP148002
Coordinates   1740792..1742321 (+) Length   509 a.a.
NCBI ID   WP_005647634.1    Uniprot ID   A0A0Y7HS02
Organism   Haemophilus influenzae strain AR2427     
Function   required for natural transformation (predicted from homology)
Unclear
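
The coordinates and the protein length above are related by simple arithmetic, sketched below in Python. The sketch assumes 1-based inclusive coordinates and a complete coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
import re


def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def gene_length_bp(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residue count, assuming a complete CDS with one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


start, end, strand = parse_coordinates("1740792..1742321 (+)")
length_bp = gene_length_bp(start, end)
print(length_bp)                     # 1530
print(protein_length_aa(length_bp))  # 509
```

Both values agree with the 1530 bp and 509 a.a. lengths reported in this record.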

Genomic Context


Location: 1735792..1747321
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  WGK59_RS08720 (WGK59_08715) oppC 1736138..1737073 (+) 936 WP_005651799.1 oligopeptide ABC transporter permease OppC -
  WGK59_RS08725 (WGK59_08720) - 1737083..1738054 (+) 972 WP_013526077.1 ABC transporter ATP-binding protein -
  WGK59_RS08730 (WGK59_08725) oppF 1738051..1739049 (+) 999 WP_005657282.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  WGK59_RS08735 (WGK59_08730) - 1739089..1739955 (-) 867 WP_005670794.1 VirK/YbjX family protein -
  WGK59_RS08740 (WGK59_08735) yihA 1740060..1740677 (+) 618 WP_015702069.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  WGK59_RS08745 (WGK59_08740) comM 1740792..1742321 (+) 1530 WP_005647634.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  WGK59_RS08750 (WGK59_08745) deoC 1742355..1743026 (-) 672 WP_015702068.1 deoxyribose-phosphate aldolase -
  WGK59_RS08755 (WGK59_08750) - 1743137..1743640 (+) 504 WP_005651783.1 protein disulfide oxidoreductase -
  WGK59_RS08760 (WGK59_08755) rfaD 1743693..1744619 (+) 927 WP_015702067.1 ADP-glyceromanno-heptose 6-epimerase -
  WGK59_RS08765 (WGK59_08760) xylB 1744670..1746151 (-) 1482 WP_040034663.1 xylulokinase -
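
The location window above matches the comM coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers in this record, not from any documentation, so it is treated as an assumption in the sketch below. The sketch also checks that each listed size equals end - start + 1; the tuple layout and function names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from this record, not a documented setting

# (locus tag, start, end, reported size in bp), copied from the table above
GENES = [
    ("WGK59_RS08720", 1736138, 1737073, 936),
    ("WGK59_RS08725", 1737083, 1738054, 972),
    ("WGK59_RS08730", 1738051, 1739049, 999),
    ("WGK59_RS08735", 1739089, 1739955, 867),
    ("WGK59_RS08740", 1740060, 1740677, 618),
    ("WGK59_RS08745", 1740792, 1742321, 1530),
    ("WGK59_RS08750", 1742355, 1743026, 672),
    ("WGK59_RS08755", 1743137, 1743640, 504),
    ("WGK59_RS08760", 1743693, 1744619, 927),
    ("WGK59_RS08765", 1744670, 1746151, 1482),
]


def context_window(start, end, flank=FLANK_BP):
    """Extend a gene's coordinates by `flank` bp on each side (lower bound 1)."""
    return max(1, start - flank), end + flank


def size_mismatches(genes):
    """Return locus tags whose reported size differs from end - start + 1."""
    return [locus for locus, start, end, size in genes if end - start + 1 != size]


def genes_inside(window, genes):
    """Return locus tags of genes lying entirely inside the window."""
    low, high = window
    return [locus for locus, start, end, _ in genes if low <= start and end <= high]


window = context_window(1740792, 1742321)
print(window)                            # (1735792, 1747321)
print(size_mismatches(GENES))            # []
print(len(genes_inside(window, GENES)))  # 10
```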

Sequence


Protein


Length: 509 a.a.        Molecular weight: 55918.45 Da        Isoelectric Point: 9.8668

>NTDB_id=949990 WGK59_RS08745 WP_005647634.1 1740792..1742321(+) (comM) [Haemophilus influenzae strain AR2427]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASRLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPLATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSVQVREKVLKVREIQMERAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
GEQQISQPHLAEALGYRAMDRLLQKLSNM
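
The FASTA header above packs several fields into one line. A minimal Python sketch for splitting it is shown below. The field layout is inferred from this single record and may not hold for every entry; the function name and dictionary keys are hypothetical.

```python
import re

# Layout inferred from this record only:
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_PATTERN = re.compile(
    r">NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>.+)\]"
)


def parse_header(line):
    """Split a header line of the form shown above into a dictionary."""
    match = HEADER_PATTERN.fullmatch(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


HEADER = (">NTDB_id=949990 WGK59_RS08745 WP_005647634.1 "
          "1740792..1742321(+) (comM) [Haemophilus influenzae strain AR2427]")
record = parse_header(HEADER)
print(record["gene"], record["locus_tag"], record["organism"])
```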

Nucleotide


Length: 1530 bp

>NTDB_id=949990 WGK59_RS08745 WP_005647634.1 1740792..1742321(+) (comM) [Haemophilus influenzae strain AR2427]
ATGTCCCTTGCTATTGTTTACAGCCGTGCTTCTATGGGTGTGCAAGCACCGCTAGTCACCATTGAGGTGCATTTAAGCAA
CGGAAAACCAGGATTTACACTTGTTGGCTTGCCAGAAAAAACCGTGAAAGAGGCACAAGATCGGGTGCGTAGTGCGTTGA
TGAATGCACAATTTAAATACCCAGCCAAGCGTATCACAGTGAATCTCGCCCCTGCGGATTTACCGAAAGAAGGCGGAAGA
TTTGATTTGCCTATTGCTATCGGGATTTTAGCTGCATCGGATCAGCTTGATGCGAGCCGCTTAAAGCAATTTGAATTTGT
GGCTGAACTTGCGCTGACGGGTCAATTGCGTGGCGTACATGGTGTGATTCCCGCTATTCTTGCAGCGCAAAAGTCAAAGC
GAGAGTTAATTATCGCAAAACAAAATGCTAATGAAGCCTCGCTCGTGTCTGATCAAAATACTTATTTTGCTCAAACCCTT
TTAGATGTAGTGCAATTTCTCAACGGTCAGGAAAAATTACCACTCGCCACTGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCAGGTAAAAATACGTTAGATTTAACGGATATTATCGGACAACAGCACGCTAAACGAGCATTGACCATTGCCGCAGCAG
GGCAGCATAATCTGCTCTTTCTTGGCCCACCGGGTACAGGAAAAACCATGTTAGCCAGCCGTTTGACAGGGCTTTTACCT
GAAATGACGGATTTAGAAGCGATAGAAACGGCATCTGTAACGAGTTTAGTTCAAAACGAGTTAAATTTTCATAATTGGAA
ACAACGTCCTTTTCGCGCACCACATCATAGTGCATCAATGCCAGCTTTAGTTGGTGGCGGAACGATCCCTAAACCTGGCG
AAATATCCTTAGCAACAAATGGGGTACTTTTTCTTGATGAACTTCCAGAGTTTGAACGAAAAGTGTTAGATGCACTACGT
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTAGAGCCAATGCCAAAATTCAATTTCCGGCTCGTTTTCAATTGGTGGC
TGCAATGAATCCAAGCCCCACAGGCCATTATACTGGGACACATAACCGTACTTCACCGCAGCAAATTATGCGTTATTTGA
ATCGACTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTGCCTTTATTGCCACAAGGCAGCTTACAAAAT
ACAGGTGATCGTGGCGAAACCAGCGTGCAAGTTCGTGAAAAAGTGTTAAAAGTGCGTGAGATTCAAATGGAAAGAGCAGG
GAAAATTAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAATGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTGGGGCTTTCTGTTCGGGCTTATCATCGTATTTTGAAAGTTTCTCGAACCATTGCCGATCTACAA
GGAGAACAACAAATTTCTCAACCGCACTTGGCGGAGGCCTTGGGCTATCGAGCAATGGATCGTTTGCTGCAGAAATTGTC
GAATATGTAA
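
The nucleotide and protein sequences above can be cross-checked by translating the coding sequence. The sketch below is a minimal pure-Python translation that uses the standard codon assignments, which the bacterial code (NCBI translation table 11) shares for all non-start codons. It assumes the sequence is given on the coding strand and in frame, as it is for this (+) strand gene; the function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the comM sequence above
print(translate("ATGTCCCTTGCTATTGTTTAC"))  # MSLAIVY
```

Applied to the full 1530 bp sequence, the result can be compared with the 509-residue protein sequence listed in this record.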


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0Y7HS02

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Haemophilus influenzae Rd KW20   99.214   100   0.992
  comM   Glaesserella parasuis strain SC1401   79.093   99.607   0.788
  comM   Vibrio cholerae strain A1552   67.258   99.607   0.67
  comM   Vibrio campbellii strain DS40M4   65.878   99.607   0.656
  comM   Legionella pneumophila str. Paris   51.093   98.821   0.505
  comM   Legionella pneumophila strain ERS1305867   51.093   98.821   0.505
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   46.99   100   0.475