Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   DQM65_RS09920 Genome accession   NZ_LS483392
Coordinates   1927254..1928783 (+) Length   509 a.a.
NCBI ID   WP_111691920.1    Uniprot ID   -
Organism   Haemophilus influenzae strain NCTC11931     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1922254..1933783
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQM65_RS09895 (NCTC11931_02003) oppC 1922600..1923535 (+) 936 WP_005661282.1 oligopeptide ABC transporter permease OppC -
  DQM65_RS09900 (NCTC11931_02004) - 1923545..1924516 (+) 972 WP_111691918.1 ABC transporter ATP-binding protein -
  DQM65_RS09905 (NCTC11931_02005) oppF 1924513..1925511 (+) 999 WP_111691919.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  DQM65_RS09910 (NCTC11931_02006) - 1925551..1926417 (-) 867 WP_172454844.1 VirK/YbjX family protein -
  DQM65_RS09915 (NCTC11931_02007) yihA 1926522..1927139 (+) 618 WP_005626810.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  DQM65_RS09920 (NCTC11931_02008) comM 1927254..1928783 (+) 1530 WP_111691920.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  DQM65_RS09925 (NCTC11931_02009) deoC 1928817..1929488 (-) 672 WP_111691921.1 deoxyribose-phosphate aldolase -
  DQM65_RS09930 (NCTC11931_02010) - 1929598..1930101 (+) 504 WP_111691922.1 protein disulfide oxidoreductase -
  DQM65_RS09935 (NCTC11931_02011) rfaD 1930154..1931080 (+) 927 WP_050846779.1 ADP-glyceromanno-heptose 6-epimerase -
  DQM65_RS09940 (NCTC11931_02012) xylB 1931131..1932612 (-) 1482 WP_172454845.1 xylulokinase -

Sequence


Protein


Download         Length: 509 a.a.        Molecular weight: 55973.51 Da        Isoelectric Point: 9.9440

>NTDB_id=1140048 DQM65_RS09920 WP_111691920.1 1927254..1928783(+) (comM) [Haemophilus influenzae strain NCTC11931]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASRLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPLATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSTQVRKKVLKVREIQIERAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
EEQQISQPHLAEALGYRAMDRLLQKLSNM

Nucleotide


Download         Length: 1530 bp        

>NTDB_id=1140048 DQM65_RS09920 WP_111691920.1 1927254..1928783(+) (comM) [Haemophilus influenzae strain NCTC11931]
ATGTCCCTTGCTATTGTTTACAGCCGTGCTTCTATGGGTGTGCAAGCACCGCTTGTCACTATTGAGGTGCATTTAAGCAA
CGGAAAACCAGGATTTACACTTGTTGGTTTGCCCGAAAAAACCGTGAAAGAGGCACAAGATCGGGTGCGTAGTGCGTTGA
TGAATGCACAATTTAAATACCCAGCCAAACGCATTACCGTGAATCTCGCACCAGCAGATTTACCGAAAGAAGGCGGACGA
TTTGATTTGCCTATTGCCATCGGAATTTTAGCTGCATCGGATCAGCTTGATGCGAGCCGCTTAAAGCAATTTGAATTTGT
GGCAGAGCTTGCGCTTACTGGGCAATTGCGTGGCGTTCACGGCGTAATTCCCGCTATTCTTGCAGCACAAAAGTCAAAGC
GAGAATTAATTATCGCAAAACAAAATGCCAATGAAGCCTCACTTGTTTCTGATCAAAATACTTATTTTGCGCAAACACTT
TTAGATGTGGTGCAATTTCTCAATGGTCAAGAAAAATTACCTCTCGCCACTGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCGGGTAAAAATACATTAGATTTAACGGATATTATCGGACAACAACATGCCAAACGAGCATTGACCATTGCTGCAGCAG
GGCAGCATAATTTACTCTTTCTTGGCCCTCCGGGCACAGGGAAAACGATGTTAGCTAGCCGATTAACAGGGCTTTTACCT
GAAATGACAGATTTAGAAGCGATTGAAACAGCATCTGTAACGAGTTTAGTACAAAACGAGTTAAATTTTCATAATTGGAA
ACAACGTCCTTTTCGTGCCCCGCACCATAGTGCATCAATGCCTGCTTTAGTTGGTGGTGGGACGATCCCTAAACCTGGTG
AAATATCCTTAGCAACAAATGGCGTACTTTTTCTTGATGAACTTCCAGAGTTTGAACGAAAAGTATTGGATGCACTACGT
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTCGTGCTAATGCAAAAATTCAATTCCCTGCTCGTTTTCAATTAGTCGC
AGCGATGAATCCAAGCCCCACAGGTCATTATACTGGAACACATAACCGCACTTCGCCGCAACAAATTATGCGTTATTTAA
ATCGACTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTGCCTTTATTGCCACAAGGTAGCTTACAAAAT
ACGGGCGATCGTGGCGAAACCAGCACACAAGTTCGTAAAAAAGTGTTAAAAGTGCGAGAGATTCAAATAGAAAGAGCGGG
GAAAATTAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAACGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTTGGGCTTTCTGTTCGGGCTTATCATCGTATTTTGAAAGTATCTCGAACCATTGCCGATCTACAA
GAAGAACAACAAATTTCTCAACCGCACTTAGCGGAAGCCTTGGGCTATCGAGCAATGGATCGTTTGTTGCAGAAATTGTC
GAATATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

98.625

100

0.986

  comM Glaesserella parasuis strain SC1401

79.093

99.607

0.788

  comM Vibrio cholerae strain A1552

67.061

99.607

0.668

  comM Vibrio campbellii strain DS40M4

65.483

99.607

0.652

  comM Legionella pneumophila str. Paris

50.696

98.821

0.501

  comM Legionella pneumophila strain ERS1305867

50.696

98.821

0.501

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.99

100

0.475


Multiple sequence alignment