Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DV400_RS06790 Genome accession   NZ_CP031236
Coordinates   1367256..1368785 (+) Length   509 a.a.
NCBI ID   WP_050948569.1    Uniprot ID   -
Organism   Haemophilus influenzae strain M25267     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1362256..1373785
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV400_RS06765 oppC 1362602..1363537 (+) 936 WP_005661282.1 oligopeptide ABC transporter permease OppC -
  DV400_RS06770 - 1363547..1364518 (+) 972 WP_005647625.1 ABC transporter ATP-binding protein -
  DV400_RS06775 oppF 1364515..1365513 (+) 999 WP_005657282.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  DV400_RS06780 - 1365553..1366419 (-) 867 WP_021034921.1 VirK/YbjX family protein -
  DV400_RS06785 yihA 1366524..1367141 (+) 618 WP_005690705.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  DV400_RS06790 comM 1367256..1368785 (+) 1530 WP_050948569.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DV400_RS06795 deoC 1368819..1369490 (-) 672 WP_050948570.1 deoxyribose-phosphate aldolase -
  DV400_RS06800 - 1369600..1370103 (+) 504 WP_050948571.1 protein disulfide oxidoreductase -
  DV400_RS06805 rfaD 1370156..1371082 (+) 927 WP_050848338.1 ADP-glyceromanno-heptose 6-epimerase -
  DV400_RS06810 xylB 1371133..1372614 (-) 1482 WP_050948572.1 xylulokinase -

Sequence


Protein


Download         Length: 509 a.a.        Molecular weight: 55822.38 Da        Isoelectric Point: 9.8613

>NTDB_id=304333 DV400_RS06790 WP_050948569.1 1367256..1368785(+) (comM) [Haemophilus influenzae strain M25267]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFILVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASCLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPIATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSAQVRKKVLKVREIQMERAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
GEQQISQPHLAEALGYRAMDRLLQKSSNM

Nucleotide


Download         Length: 1530 bp        

>NTDB_id=304333 DV400_RS06790 WP_050948569.1 1367256..1368785(+) (comM) [Haemophilus influenzae strain M25267]
ATGTCCCTTGCTATTGTTTACAGCCGTGCCTCTATGGGGGTGCAAGCGCCGCTTGTTACTATTGAGGTACATTTAAGCAA
CGGAAAACCCGGATTTATACTTGTTGGTTTGCCCGAAAAAACCGTGAAAGAGGCACAAGATCGGGTGCGTAGTGCATTGA
TGAATGCACAATTTAAATACCCAGCCAAACGCATTACCGTGAATCTCGCACCTGCGGATTTGCCTAAAGAAGGCGGACGA
TTTGATTTGCCTATTGCCATCGGAATTTTAGCTGCATCGGATCAGCTTGATGCGAGCTGCTTAAAGCAATTTGAATTTGT
AGCGGAGCTTGCACTTACGGGGCAATTACGTGGCGTACACGGTGTAATTCCCGCTATTCTTGCAGCACAAAAATCAAAGC
GAGAGTTAATCATTGCGAAGCAAAATGCCAATGAAGCCTCACTTGTTTCTGATCAAAATACTTATTTTGCGCAAACACTT
TTAGATGTGGTGCAATTTCTCAATGGTCAGGAAAAATTACCTATCGCCACTGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCGGGTAAAAATACATTAGATTTAACGGATATTATTGGACAACAGCACGCTAAACGTGCATTAACCATTGCCGCAGCGG
GGCAGCATAATTTACTCTTTCTTGGCCCACCGGGTACAGGGAAAACCATGTTAGCCAGCCGATTAACAGGGCTTTTACCT
GAAATGACTGATTTAGAAGCGATTGAAACGGCATCTGTAACGAGTTTAGTGCAAAACGAGTTAAATTTTCATAATTGGAA
ACAACGTCCTTTTCGCGCCCCGCACCATAGTGCATCAATGCCAGCTTTAGTTGGTGGTGGAACGATCCCAAAACCTGGTG
AAATATCCTTAGCAACAAATGGGGTACTTTTTCTTGATGAACTTCCAGAGTTTGAACGAAAAGTGTTAGATGCGCTACGT
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTCGTGCTAATGCAAAAATTCAATTTCCAGCTCGTTTTCAATTGGTGGC
AGCGATGAATCCAAGCCCCACAGGTCATTATACTGGAACACATAACCGCACTTCACCGCAACAAATTATGCGTTATTTAA
ATCGACTTTCAGGGCCCTTTTTAGATCGCTTTGACTTGTCTATTGAAGTGCCTTTATTGCCACAAGGTAGCTTACAAAAT
ACGGGCGATCGTGGCGAAACCAGCGCACAAGTTCGTAAAAAAGTGTTAAAAGTGCGAGAGATTCAAATGGAAAGAGCGGG
AAAAATTAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAACGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTTGGGCTTTCTGTTCGGGCTTATCATCGTATTTTGAAAGTGTCTCGAACCATTGCCGATCTACAA
GGAGAACAACAAATTTCTCAACCGCACTTAGCGGAAGCCTTGGGCTATCGAGCAATGGATCGTTTGTTGCAGAAATCGTC
GAATATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

98.625

100

0.986

  comM Glaesserella parasuis strain SC1401

79.249

99.411

0.788

  comM Vibrio cholerae strain A1552

66.864

99.607

0.666

  comM Vibrio campbellii strain DS40M4

65.545

99.214

0.65

  comM Legionella pneumophila str. Paris

50.696

98.821

0.501

  comM Legionella pneumophila strain ERS1305867

50.696

98.821

0.501

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.887

100

0.473


Multiple sequence alignment