Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DV389_RS00135 Genome accession   NZ_CP031250
Coordinates   25039..26568 (+) Length   509 a.a.
NCBI ID   WP_114892777.1    Uniprot ID   -
Organism   Haemophilus influenzae strain M21384     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 20039..31568
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV389_RS00110 (DV389_00110) oppC 20385..21320 (+) 936 WP_114892775.1 oligopeptide ABC transporter permease OppC -
  DV389_RS00115 (DV389_00115) - 21330..22301 (+) 972 WP_114892776.1 ABC transporter ATP-binding protein -
  DV389_RS00120 (DV389_00120) oppF 22298..23296 (+) 999 WP_042601994.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  DV389_RS00125 (DV389_00125) - 23336..24202 (-) 867 WP_162816703.1 VirK/YbjX family protein -
  DV389_RS00130 (DV389_00130) yihA 24307..24924 (+) 618 WP_005626810.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  DV389_RS00135 (DV389_00135) comM 25039..26568 (+) 1530 WP_114892777.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DV389_RS00140 (DV389_00140) deoC 26602..27273 (-) 672 WP_114892778.1 deoxyribose-phosphate aldolase -
  DV389_RS00145 (DV389_00145) - 27384..27887 (+) 504 WP_042611685.1 protein disulfide oxidoreductase -
  DV389_RS00150 (DV389_00150) rfaD 27940..28866 (+) 927 WP_114892779.1 ADP-glyceromanno-heptose 6-epimerase -
  DV389_RS00155 (DV389_00155) waaF 28952..29992 (+) 1041 WP_114892780.1 lipopolysaccharide heptosyltransferase II -
  DV389_RS00160 (DV389_00160) - 30101..31324 (-) 1224 WP_114892781.1 MFS transporter -

Sequence


Protein


Download         Length: 509 a.a.        Molecular weight: 55947.53 Da        Isoelectric Point: 10.0225

>NTDB_id=304931 DV389_RS00135 WP_114892777.1 25039..26568(+) (comM) [Haemophilus influenzae strain M21384]
MSLAIVYSRASMGVQAPLITIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASRLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPLATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSTQVRKKVLKVREIQMARAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
EEQQISQPHLAEALGYRAMDRLLQKLSNM

Nucleotide


Download         Length: 1530 bp        

>NTDB_id=304931 DV389_RS00135 WP_114892777.1 25039..26568(+) (comM) [Haemophilus influenzae strain M21384]
ATGTCTCTTGCTATTGTTTACAGCCGTGCCTCTATGGGGGTGCAAGCACCACTTATCACTATTGAGGTGCATTTAAGCAA
CGGAAAACCCGGATTTACACTTGTTGGTTTGCCCGAAAAAACCGTGAAAGAGGCACAAGATCGGGTGCGTAGTGCATTGA
TGAATGCACAATTTAAATACCCAGCCAAACGCATTACCGTGAATCTCGCACCAGCAGATTTACCGAAAGAAGGCGGACGA
TTTGATTTGCCTATTGCCATCGGAATTTTAGCCGCATCGGATCAGCTTGATGCGAGCCGCTTAAAGCAATTTGAATTTGT
GGCAGAGCTTGCGCTTACTGGGCAATTGCGTGGCGTTCACGGCGTAATTCCCGCTATTCTTGCAGCACAAAAGTCAAAGC
GAGAATTAATTATCGCAAAGCAAAATGCCAATGAAGCCTCGCTTGTTTCTGATCAAAATACTTATTTTGCACAAACACTT
TTAGATGTGGTGCAATTTCTCAATGGTCAAGAAAAATTACCTCTCGCCACTGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCGGGTAAAAATACATTAGATTTAACGGATATTATCGGACAACAACATGCCAAACGAGCATTGACCATTGCTGCAGCAG
GGCAGCATAATTTACTCTTTCTTGGCCCTCCGGGCACAGGGAAAACGATGTTAGCTAGCCGATTAACAGGGCTTTTACCT
GAAATGACAGATTTAGAAGCGATTGAAACAGCATCTGTAACGAGTTTAGTACAAAACGAGTTAAATTTTCATAATTGGAA
ACAACGTCCTTTTCGTGCCCCGCACCATAGTGCATCAATGCCTGCTTTAGTTGGTGGTGGGACGATCCCTAAACCTGGTG
AAATATCCTTAGCAACAAATGGCGTACTTTTTCTTGATGAACTTCCAGAGTTTGAACGAAAAGTATTGGATGCACTACGT
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTCGTGCTAATGCAAAAATTCAATTCCCTGCTCGTTTTCAATTAGTCGC
AGCGATGAATCCAAGCCCCACAGGTCATTATACTGGAACACATAACCGCACTTCGCCGCAACAAATTATGCGTTATTTAA
ATCGACTTTCAGGGCCCTTTTTAGATCGCTTTGACTTGTCTATTGAAGTACCTTTATTGCCACAAGGTAGCTTACAAAAT
ACGGGCGATCGTGGCGAAACCAGCACACAAGTTCGTAAAAAAGTGTTAAAAGTGCGTGAGATTCAAATGGCAAGAGCGGG
GAAAATTAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAACGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTTGGGCTTTCTGTTCGGGCCTATCATCGTATTTTGAAAGTATCTCGAACCATTGCCGATCTACAA
GAAGAACAACAAATTTCTCAACCGCACTTGGCTGAGGCCTTGGGCTATCGAGCAATGGATCGTTTGTTGCAGAAATTGTC
GAATATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

98.428

100

0.984

  comM Glaesserella parasuis strain SC1401

78.895

99.607

0.786

  comM Vibrio cholerae strain A1552

67.061

99.607

0.668

  comM Vibrio campbellii strain DS40M4

65.483

99.607

0.652

  comM Legionella pneumophila str. Paris

50.895

98.821

0.503

  comM Legionella pneumophila strain ERS1305867

50.895

98.821

0.503

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.99

100

0.475


Multiple sequence alignment