Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DQN24_RS08215 Genome accession   NZ_LS483411
Coordinates   1678628..1680157 (+) Length   509 a.a.
NCBI ID   WP_111695689.1    Uniprot ID   -
Organism   Haemophilus influenzae strain NCTC11426     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1673628..1685157
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQN24_RS08190 (NCTC11426_01642) oppC 1673974..1674909 (+) 936 WP_005651799.1 oligopeptide ABC transporter permease OppC -
  DQN24_RS08195 (NCTC11426_01643) - 1674919..1675890 (+) 972 WP_050948565.1 ABC transporter ATP-binding protein -
  DQN24_RS08200 (NCTC11426_01644) oppF 1675887..1676885 (+) 999 WP_050948566.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  DQN24_RS08205 (NCTC11426_01645) - 1676925..1677791 (-) 867 WP_172453990.1 VirK/YbjX family protein -
  DQN24_RS08210 (NCTC11426_01646) yihA 1677896..1678513 (+) 618 WP_005626810.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  DQN24_RS08215 (NCTC11426_01647) comM 1678628..1680157 (+) 1530 WP_111695689.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DQN24_RS08220 (NCTC11426_01648) deoC 1680191..1680862 (-) 672 WP_111695690.1 deoxyribose-phosphate aldolase -
  DQN24_RS08225 (NCTC11426_01649) - 1680972..1681475 (+) 504 WP_111695691.1 protein disulfide oxidoreductase -
  DQN24_RS08230 (NCTC11426_01650) rfaD 1681528..1682454 (+) 927 WP_111695692.1 ADP-glyceromanno-heptose 6-epimerase -
  DQN24_RS08235 (NCTC11426_01651) xylB 1682505..1683986 (-) 1482 WP_172453991.1 xylulokinase -

Sequence


Protein


Download         Length: 509 a.a.        Molecular weight: 55933.51 Da        Isoelectric Point: 10.0225

>NTDB_id=1140672 DQN24_RS08215 WP_111695689.1 1678628..1680157(+) (comM) [Haemophilus influenzae strain NCTC11426]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASRLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPLATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSTQVRKKVLKVREIQMARAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
EEQQISQPHLAEALGYRAMDRLLQKLSNM

Nucleotide


Download         Length: 1530 bp        

>NTDB_id=1140672 DQN24_RS08215 WP_111695689.1 1678628..1680157(+) (comM) [Haemophilus influenzae strain NCTC11426]
ATGTCCCTTGCTATTGTTTACAGCCGTGCTTCTATGGGTGTGCAAGCACCGCTTGTCACTATTGAGGTGCATTTAAGTAA
CGGAAAACCCGGATTTACGCTTGTTGGTTTGCCCGAAAAAACCGTGAAAGAGGCACAAGATCGGGTGCGTAGTGCATTGA
TGAATGCACAATTTAAATACCCAGCCAAACGCATTACCGTGAATCTCGCACCAGCAGATTTACCGAAAGAAGGCGGACGA
TTTGATTTGCCTATTGCCATCGGAATTTTAGCCGCATCGGATCAGCTTGATGCGAGCCGCTTAAAGCAATTTGAATTTGT
GGCAGAGCTTGCGCTTACTGGGCAATTGCGTGGCGTTCACGGCGTAATTCCCGCTATTCTTGCAGCACAAAAGTCAAAGC
GAGAATTAATTATCGCAAAGCAAAATGCCAATGAAGCCTCGCTTGTTTCTGATCAAAATACTTATTTTGCACAAACACTT
TTAGATGTGGTGCAATTTCTCAATGGTCAAGAAAAATTACCTCTCGCCACTGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCGGGTAAAAATACATTAGATTTAACGGATATTATCGGACAACAACATGCCAAACGAGCATTGACCATTGCTGCAGCAG
GGCAGCATAATTTACTCTTTCTTGGCCCTCCGGGCACAGGGAAAACGATGTTAGCTAGCCGATTAACAGGGCTTTTACCT
GAAATGACAGATTTAGAAGCGATTGAAACAGCATCTGTAACGAGTTTAGTACAAAACGAGTTAAATTTTCATAATTGGAA
ACAACGTCCTTTTCGTGCCCCGCACCATAGTGCATCAATGCCTGCTTTAGTTGGTGGTGGGACGATCCCTAAACCTGGTG
AAATATCCTTAGCAACAAATGGCGTACTTTTTCTTGATGAACTTCCAGAGTTTGAACGAAAAGTATTGGATGCACTACGT
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTCGTGCTAATGCAAAAATTCAATTCCCTGCTCGTTTTCAATTAGTCGC
AGCGATGAATCCAAGCCCCACAGGTCATTATACTGGAACACATAACCGCACTTCGCCGCAACAAATTATGCGTTATTTAA
ATCGACTTTCAGGGCCCTTTTTAGATCGCTTTGACTTGTCTATTGAAGTGCCTTTATTGCCACAAGGTAGCTTACAAAAT
ACGGGCGATCGTGGCGAAACCAGCACACAAGTTCGTAAAAAAGTGTTAAAAGTGCGTGAGATTCAAATGGCAAGAGCGGG
GAAAATTAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAACGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTTGGGCTTTCTGTTCGGGCTTATCATCGTATTTTGAAAGTATCTCGAACCATTGCCGATCTACAA
GAAGAACAACAAATTTCTCAACCGCACTTGGCTGAGGCCTTGGGCTATCGAGCAATGGACCGTTTGTTGCAGAAATTGTC
GAATATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

98.625

100

0.986

  comM Glaesserella parasuis strain SC1401

79.093

99.607

0.788

  comM Vibrio cholerae strain A1552

67.258

99.607

0.67

  comM Vibrio campbellii strain DS40M4

65.68

99.607

0.654

  comM Legionella pneumophila str. Paris

51.093

98.821

0.505

  comM Legionella pneumophila strain ERS1305867

51.093

98.821

0.505

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.796

100

0.473


Multiple sequence alignment