Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DV369_RS04035 Genome accession   NZ_CP031239
Coordinates   805244..806773 (-) Length   509 a.a.
NCBI ID   WP_050948569.1    Uniprot ID   -
Organism   Haemophilus influenzae strain M13034     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 800244..811773
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV369_RS04015 xylB 801415..802896 (+) 1482 WP_050948572.1 xylulokinase -
  DV369_RS04020 rfaD 802947..803873 (-) 927 WP_050848338.1 ADP-glyceromanno-heptose 6-epimerase -
  DV369_RS04025 - 803926..804429 (-) 504 WP_050948571.1 protein disulfide oxidoreductase -
  DV369_RS04030 deoC 804539..805210 (+) 672 WP_050948570.1 deoxyribose-phosphate aldolase -
  DV369_RS04035 comM 805244..806773 (-) 1530 WP_050948569.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DV369_RS04040 yihA 806888..807505 (-) 618 WP_050948568.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  DV369_RS04045 - 807610..808476 (+) 867 WP_050948567.1 VirK/YbjX family protein -
  DV369_RS04050 oppF 808516..809514 (-) 999 WP_050948566.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  DV369_RS04055 - 809511..810482 (-) 972 WP_050948565.1 ABC transporter ATP-binding protein -
  DV369_RS04060 oppC 810492..811427 (-) 936 WP_005651799.1 oligopeptide ABC transporter permease OppC -

Sequence


Protein


Download         Length: 509 a.a.        Molecular weight: 55822.38 Da        Isoelectric Point: 9.8613

>NTDB_id=304438 DV369_RS04035 WP_050948569.1 805244..806773(-) (comM) [Haemophilus influenzae strain M13034]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFILVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASCLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPIATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSAQVRKKVLKVREIQMERAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
GEQQISQPHLAEALGYRAMDRLLQKSSNM

Nucleotide


Download         Length: 1530 bp        

>NTDB_id=304438 DV369_RS04035 WP_050948569.1 805244..806773(-) (comM) [Haemophilus influenzae strain M13034]
ATGTCTCTTGCTATTGTTTACAGCCGTGCCTCTATGGGGGTGCAAGCGCCGCTTGTTACTATTGAGGTACATTTAAGCAA
CGGAAAACCCGGATTTATACTTGTTGGTTTGCCCGAAAAAACCGTGAAAGAGGCACAAGATCGGGTGCGTAGTGCATTGA
TGAATGCACAATTTAAATACCCAGCCAAACGCATTACCGTGAATCTCGCACCTGCGGATTTGCCTAAAGAAGGCGGACGA
TTTGATTTGCCTATTGCCATCGGAATTTTAGCTGCATCGGATCAGCTTGATGCGAGCTGCTTAAAGCAATTTGAATTTGT
AGCGGAGCTTGCACTTACGGGGCAATTACGTGGCGTACACGGTGTAATTCCCGCTATTCTTGCAGCACAAAAATCAAAGC
GAGAGTTAATCATTGCGAAGCAAAATGCCAATGAAGCCTCACTTGTTTCTGATCAAAATACTTATTTTGCGCAAACACTT
TTAGATGTGGTGCAATTTCTCAATGGTCAGGAAAAATTACCTATCGCCACTGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCGGGTAAAAATACATTAGATTTAACGGATATTATTGGACAACAGCACGCTAAACGTGCATTAACCATTGCCGCAGCGG
GGCAGCATAATTTACTCTTTCTTGGCCCACCGGGTACAGGGAAAACCATGTTAGCCAGCCGATTAACAGGGCTTTTACCT
GAAATGACTGATTTAGAAGCGATTGAAACGGCATCTGTAACGAGTTTAGTGCAAAACGAGTTAAATTTTCATAATTGGAA
ACAACGTCCTTTTCGCGCCCCGCACCATAGTGCATCAATGCCAGCTTTAGTTGGTGGTGGAACGATCCCAAAACCTGGTG
AAATATCCTTAGCAACAAATGGGGTACTTTTTCTTGATGAACTTCCAGAGTTTGAACGAAAAGTGTTAGATGCGCTACGT
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTCGTGCTAATGCAAAAATTCAATTTCCAGCTCGTTTTCAATTGGTGGC
AGCGATGAATCCAAGCCCCACAGGTCATTATACTGGAACACATAACCGCACTTCACCGCAACAAATTATGCGTTATTTAA
ATCGACTTTCAGGGCCCTTTTTAGATCGCTTTGACTTGTCTATTGAAGTGCCTTTATTGCCACAAGGTAGCTTACAAAAT
ACGGGCGATCGTGGCGAAACCAGCGCACAAGTTCGTAAAAAAGTGTTAAAAGTGCGAGAGATTCAAATGGAAAGAGCGGG
AAAAATTAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAACGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTTGGGCTTTCTGTTCGGGCTTATCATCGTATTTTGAAAGTGTCTCGAACCATTGCCGATCTACAA
GGAGAACAACAAATTTCTCAACCGCACTTAGCGGAAGCCTTGGGCTATCGAGCAATGGATCGTTTGTTGCAGAAATCGTC
GAATATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

98.625

100

0.986

  comM Glaesserella parasuis strain SC1401

79.249

99.411

0.788

  comM Vibrio cholerae strain A1552

66.864

99.607

0.666

  comM Vibrio campbellii strain DS40M4

65.545

99.214

0.65

  comM Legionella pneumophila str. Paris

50.696

98.821

0.501

  comM Legionella pneumophila strain ERS1305867

50.696

98.821

0.501

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.887

100

0.473


Multiple sequence alignment