Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   HICON_RS09395 Genome accession   NC_014922
Coordinates   1886351..1887880 (+) Length   509 a.a.
NCBI ID   WP_013527838.1    Uniprot ID   A0AAV2U5A5
Organism   Haemophilus influenzae F3047     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1881351..1892880
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HICON_RS09370 (HICON_01750) oppC 1881697..1882632 (+) 936 WP_005651799.1 oligopeptide ABC transporter permease OppC -
  HICON_RS09375 (HICON_01760) - 1882642..1883613 (+) 972 WP_006996544.1 ABC transporter ATP-binding protein -
  HICON_RS09380 (HICON_01770) oppF 1883610..1884608 (+) 999 WP_006996545.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  HICON_RS09385 (HICON_01780) - 1884648..1885514 (-) 867 WP_013527836.1 VirK/YbjX family protein -
  HICON_RS09390 (HICON_01790) yihA 1885619..1886236 (+) 618 WP_013527837.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  HICON_RS09395 (HICON_01800) comM 1886351..1887880 (+) 1530 WP_013527838.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  HICON_RS09400 (HICON_01810) deoC 1887914..1888585 (-) 672 WP_006996549.1 deoxyribose-phosphate aldolase -
  HICON_RS09405 (HICON_01820) - 1888699..1889202 (+) 504 WP_013527839.1 protein disulfide oxidoreductase -
  HICON_RS09410 (HICON_01830) rfaD 1889255..1890181 (+) 927 WP_006996551.1 ADP-glyceromanno-heptose 6-epimerase -
  HICON_RS09415 (HICON_01840) waaF 1890267..1891307 (+) 1041 WP_013527840.1 lipopolysaccharide heptosyltransferase II -
  HICON_RS09420 (HICON_01850) - 1891417..1892640 (-) 1224 WP_013527841.1 MFS transporter -

Sequence


Protein


Download         Length: 509 a.a.        Molecular weight: 55856.38 Da        Isoelectric Point: 9.7683

>NTDB_id=39467 HICON_RS09395 WP_013527838.1 1886351..1887880(+) (comM) [Haemophilus influenzae F3047]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASHLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPLATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKLRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSAQVREKVLKVREIQMERAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
GEQQISQPHLAEALGYRAMDRLLQKLSNM

Nucleotide


Download         Length: 1530 bp        

>NTDB_id=39467 HICON_RS09395 WP_013527838.1 1886351..1887880(+) (comM) [Haemophilus influenzae F3047]
ATGTCCCTTGCTATTGTTTACAGCCGTGCTTCTATGGGTGTGCAAGCACCGCTAGTCACCATTGAGGTGCATTTAAGCAA
CGGAAAACCAGGATTTACACTTGTTGGTTTGCCAGAAAAAACGGTGAAAGAGGCACAAGATCGGGTGCGTAGTGCATTGA
TGAATGCACAATTTAAATACCCAGCCAAGCGTATCACTGTAAATCTTGCACCTGCGGATTTACCGAAAGAAGGCGGAAGA
TTTGATTTGCCTATCGCCATCGGAATTTTAGCCGCATCAGATCAGCTTGATGCGAGCCACTTAAAGCAATTTGAATTTGT
GGCAGAGCTTGCGCTGACGGGCCAATTGCGTGGCGTACATGGTGTGATTCCCGCTATCCTTGCAGCGCAAAAGTCAAAGC
GAGAGTTAATTATTGCGAAACAAAATGCTAATGAAGCCTCGCTCGTGTCTGATCAAAATACTTATTTTGCTCAAACGCTT
TTAGACGTGGTGCAATTTCTCAATGGTCAGGAAAAATTACCCCTCGCCACTGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCAGGTAAAAATACGTTAGATTTAACGGATATTATCGGACAACAGCACGCTAAACGAGCATTGACCATTGCCGCAGCAG
GGCAGCATAATTTGCTCTTTCTTGGTCCGCCAGGTACAGGGAAAACCATGCTCGCTAGCCGCTTGACGGGGCTTTTACCT
GAAATGACAGATTTAGAAGCGATAGAAACGGCATCTGTAACGAGTTTAGTTCAAAACGAGTTAAATTTTCATAATTGGAA
GCTACGTCCTTTTCGCGCACCACATCATAGTGCATCAATGCCTGCTTTAGTTGGTGGCGGAACGATCCCTAAACCTGGCG
AAATATCCTTAGCAACAAATGGGGTGCTTTTTCTTGATGAACTTCCAGAATTTGAGCGAAAAGTGTTAGATGCACTACGC
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTCGTGCCAATGCAAAAATTCAATTCCCAGCTCGTTTTCAATTAGTGGC
AGCGATGAATCCAAGCCCCACAGGCCATTATACTGGGACACATAACCGCACTTCACCACAGCAAATTATGCGTTATTTGA
ATCGACTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTGCCTTTACTGCCACAAGGTAGCTTGCAAAAT
ACGGGCGATCGTGGCGAAACCAGCGCGCAAGTTCGTGAAAAAGTGTTAAAAGTACGTGAGATTCAAATGGAAAGAGCAGG
GAAAATTAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAATGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTGGGGCTTTCTGTTCGGGCTTATCATCGTATTTTGAAAGTTTCTCGAACCATTGCCGATCTACAA
GGAGAACAACAAATTTCTCAACCGCACTTGGCGGAGGCCTTGGGCTATCGAGCAATGGATCGTTTGCTGCAGAAATTGTC
GAATATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

99.411

100

0.994

  comM Glaesserella parasuis strain SC1401

79.093

99.607

0.788

  comM Vibrio cholerae strain A1552

67.653

99.607

0.674

  comM Vibrio campbellii strain DS40M4

66.075

99.607

0.658

  comM Legionella pneumophila str. Paris

51.292

98.821

0.507

  comM Legionella pneumophila strain ERS1305867

51.292

98.821

0.507

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.99

100

0.475


Multiple sequence alignment