Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DV365_RS05795 Genome accession   NZ_CP031237
Coordinates   1179447..1180976 (+) Length   509 a.a.
NCBI ID   WP_112103115.1    Uniprot ID   -
Organism   Haemophilus influenzae strain M12125     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1174447..1185976
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV365_RS05770 oppC 1174793..1175728 (+) 936 WP_005651799.1 oligopeptide ABC transporter permease OppC -
  DV365_RS05775 - 1175738..1176709 (+) 972 WP_005651797.1 ABC transporter ATP-binding protein -
  DV365_RS05780 oppF 1176706..1177704 (+) 999 WP_005657282.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  DV365_RS05785 - 1177744..1178610 (-) 867 WP_162697402.1 VirK/YbjX family protein -
  DV365_RS05790 yihA 1178715..1179332 (+) 618 WP_065245503.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  DV365_RS05795 comM 1179447..1180976 (+) 1530 WP_112103115.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DV365_RS05800 deoC 1181010..1181681 (-) 672 WP_112103116.1 deoxyribose-phosphate aldolase -
  DV365_RS05805 - 1181792..1182295 (+) 504 WP_105877596.1 protein disulfide oxidoreductase -
  DV365_RS05810 rfaD 1182348..1183274 (+) 927 WP_012055407.1 ADP-glyceromanno-heptose 6-epimerase -
  DV365_RS05815 xylB 1183325..1184806 (-) 1482 WP_162790525.1 xylulokinase -

Sequence


Protein


Download         Length: 509 a.a.        Molecular weight: 55876.37 Da        Isoelectric Point: 9.8668

>NTDB_id=304366 DV365_RS05795 WP_112103115.1 1179447..1180976(+) (comM) [Haemophilus influenzae strain M12125]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASRLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPLASEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSAQVREKVLKVREIQMERAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
GEQQISQPHLAEALGYRAMDRLLQKLSNM

Nucleotide


Download         Length: 1530 bp        

>NTDB_id=304366 DV365_RS05795 WP_112103115.1 1179447..1180976(+) (comM) [Haemophilus influenzae strain M12125]
ATGTCCCTTGCTATTGTTTACAGCCGTGCTTCTATGGGTGTGCAAGCGCCGCTTGTCACCATTGAGGTGCATTTAAGCAA
CGGAAAACCAGGATTTACACTTGTTGGTTTGCCAGAAAAAACCGTGAAAGAGGCACAAGATCGGGTGCGTAGTGCGTTGA
TGAATGCACAATTTAAATACCCAGCCAAGCGTATCACAGTGAATCTCGCCCCTGCGGATTTACCGAAAGAAGGCGGAAGA
TTTGATTTGCCTATTGCCATCGGAATTTTAGCCGCATCGGATCAGCTTGATGCTAGTCGCTTAAAGCAATTTGAATTTGT
GGCGGAGCTTGCGCTGACGGGTCAATTGCGTGGCGTACATGGTGTGATTCCCGCTATTCTTGCAGCGCAAAAGTCAAAGC
GAGAGTTAATTATCGCGAAACAAAATGCTAATGAAGCCTCGCTCGTGTCTGATCAAAATACTTATTTTGCTCAAACGCTT
TTAGATGTGGTGCAATTTCTCAATGGTCAGGAAAAATTACCGCTCGCCAGCGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCAGGTAAAAATACGTTAGATTTAACGGATATTATCGGACAACAGCACGCTAAACGAGCATTGACCATTGCCGCAGCGG
GGCAGCATAATTTGCTCTTTCTTGGCCCACCGGGTACAGGGAAAACCATGTTAGCCAGCCGTTTGACAGGGCTTTTGCCT
GAGATGACAGATTTAGAAGCGATAGAAACAGCATCTGTAACGAGTTTAGTTCAAAACGAGTTAAATTTTCATAATTGGAA
GCAACGTCCTTTTCGCGCACCACATCATAGTGCATCAATGCCTGCTTTAGTTGGTGGCGGAACGATCCCTAAACCTGGCG
AAATATCCTTAGCAACAAATGGGGTGCTTTTTCTTGATGAACTTCCAGAATTTGAGCGAAAAGTGTTAGATGCACTACGC
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTAGAGCCAATGCCAAAATTCAATTTCCGGCTCGTTTTCAATTAGTGGC
AGCAATGAATCCAAGCCCCACAGGCCATTATACTGGGACACATAACCGCACTTCACCGCAACAAATTATGCGTTATTTGA
ATCGACTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTGCCTTTACTGCCACAAGGTAGCTTGCAAAAT
ACGGGCGATCGTGGCGAAACCAGCGCGCAAGTTCGTGAAAAAGTGTTAAAAGTACGTGAGATTCAAATGGAAAGAGCGGG
GAAAATTAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAATGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTGGGGCTTTCTGTTCGGGCTTATCATCGTATTTTGAAAGTTTCTCGAACCATTGCCGATCTACAA
GGAGAACAACAAATTTCTCAACCGCACTTGGCGGAGGCCTTGGGCTATCGAGCAATGGATCGTTTGCTGCAGAAATTGTC
GAATATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

99.214

100

0.992

  comM Glaesserella parasuis strain SC1401

79.093

99.607

0.788

  comM Vibrio cholerae strain A1552

67.387

100

0.674

  comM Vibrio campbellii strain DS40M4

66.075

99.607

0.658

  comM Legionella pneumophila str. Paris

51.093

98.821

0.505

  comM Legionella pneumophila strain ERS1305867

51.093

98.821

0.505

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.99

100

0.475


Multiple sequence alignment