Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   I6H57_RS07995 Genome accession   NZ_CP065991
Coordinates   1708273..1709655 (-) Length   460 a.a.
NCBI ID   WP_005694710.1    Uniprot ID   -
Organism   Haemophilus parainfluenzae strain FDAARGOS_1000     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1703273..1714655
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6H57_RS07970 (I6H57_07970) - 1703565..1704434 (-) 870 WP_005694704.1 DUF535 family protein -
  I6H57_RS07975 (I6H57_07975) yihA 1704547..1705161 (+) 615 WP_032822378.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  I6H57_RS07980 (I6H57_07980) comM 1705291..1706820 (+) 1530 WP_005694706.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  I6H57_RS07985 (I6H57_07985) nfuA 1706864..1707448 (-) 585 WP_005694707.1 Fe-S biogenesis protein NfuA -
  I6H57_RS07990 (I6H57_07990) - 1707564..1708253 (-) 690 WP_032822376.1 ComF family protein -
  I6H57_RS07995 (I6H57_07995) comE 1708273..1709655 (-) 1383 WP_005694710.1 type IV pilus secretin PilQ Machinery gene
  I6H57_RS08000 (I6H57_08000) - 1709657..1710040 (-) 384 WP_005694711.1 hypothetical protein -
  I6H57_RS08005 (I6H57_08005) - 1710040..1710582 (-) 543 WP_005694712.1 hypothetical protein -
  I6H57_RS08010 (I6H57_08010) - 1710579..1711094 (-) 516 WP_005694713.1 hypothetical protein -
  I6H57_RS08015 (I6H57_08015) - 1711078..1711926 (-) 849 WP_005694714.1 pilus assembly protein PilM -
  I6H57_RS08020 (I6H57_08020) - 1712026..1714638 (+) 2613 WP_005694715.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 460 a.a.        Molecular weight: 51136.71 Da        Isoelectric Point: 7.1908

>NTDB_id=516036 I6H57_RS07995 WP_005694710.1 1708273..1709655(-) (comE) [Haemophilus parainfluenzae strain FDAARGOS_1000]
MVKQKIKTKFGQFLMCFLILWTTYSVAENRVFSLRLKQAPIVATLQQLALEQNANLMIDDELEGTLSLQLDNVDFDRLLR
SVAKIKGLSFYQENDIYYLGKPSQHEQYSEKITEPMAISGESLPSETPLVSTTVKLHFAKASDVMKSLTTGSGSLLSPSG
TITFDDRSNVLLIQDDARSLKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGVRWGIFNPTEAAHRVGGSLDANGFS
NISNNLNVNFATTVTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISKDNNILLDLLVSQNSPGNRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKGV
DKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMARKEKHFKQVEKVKK

Nucleotide


Download         Length: 1383 bp        

>NTDB_id=516036 I6H57_RS07995 WP_005694710.1 1708273..1709655(-) (comE) [Haemophilus parainfluenzae strain FDAARGOS_1000]
ATGGTAAAGCAGAAAATAAAAACAAAGTTTGGTCAGTTTTTAATGTGTTTTCTGATCCTATGGACAACTTATTCAGTGGC
AGAAAATCGCGTATTTTCACTTCGCTTAAAACAAGCTCCCATAGTAGCGACACTCCAACAACTTGCCCTTGAGCAAAATG
CCAATTTAATGATTGATGATGAGTTAGAAGGAACACTTTCATTACAATTAGATAACGTAGATTTTGATCGTTTATTGCGT
TCTGTTGCAAAAATCAAAGGGCTCTCTTTTTATCAAGAAAATGATATTTATTATTTAGGTAAGCCTTCTCAACATGAACA
ATATTCAGAGAAAATAACAGAACCTATGGCGATTAGCGGAGAAAGTTTGCCTAGTGAAACACCACTTGTGAGTACAACGG
TTAAACTGCATTTTGCTAAGGCTTCTGATGTGATGAAATCTTTAACCACAGGGAGCGGTTCTTTGCTTTCACCTAGCGGC
ACAATTACATTTGATGATCGAAGCAATGTATTACTGATTCAGGATGATGCACGTTCACTTAAAAATATCAAAAAATTAAT
TGCAGAGCTGGATAAACCTATTGAGCAAATTGTCATTGAAGCACGTATTGTGACGATTACCGATGAAAGCCTAAAAGAAT
TAGGTGTGCGTTGGGGCATTTTTAATCCTACTGAGGCAGCCCATCGAGTGGGTGGCAGTTTAGATGCGAATGGGTTTAGC
AATATCAGTAATAATTTAAATGTGAATTTTGCGACAACGGTCACGCCAGCTGGCTCATTAGCACTTCAAGTAGCCAAAAT
TAATGGTCGATTGTTAGATTTAGAATTGACCGCACTTGAACGTGAAAATAACGTAGAAATTATTGCAAGCCCTCGCTTAC
TCACGACCAATAAGAAAAGTGCAAGCATCAAACAAGGGACAGAAATTCCTTATGTGGTGACAAATGGGAAAAATGACACC
CAATCAGTAGAGTTTCGCGAGGCTGTCTTAGGATTGGAAGTCACGCCGCATATTTCGAAGGATAATAATATTTTATTGGA
TTTATTAGTGAGTCAAAATTCCCCAGGGAATCGCGTGGCTTACGGGCAAAATGAAGTCGTATCTATTGATAAACAAGAAA
TCAATACGCAAGTTTTTGCCAAAGATGGTGAAACAATTGTATTGGGTGGTGTATTCCACGATACGATCACGAAAGGTGTC
GATAAAGTACCATTATTGGGCGATATTCCAGGTATTAAGCGCTTATTCAGTAAGGAAAGTGAACGTCATCAAAAACGAGA
ACTCGTCATTTTTGTGACACCTCATATTTTAAAACAAGGTGAAAGAATGGAAATGGCTAGAAAAGAAAAGCATTTTAAGC
AAGTTGAAAAAGTGAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

72.829

97.609

0.711

  comE Haemophilus influenzae 86-028NP

71.938

97.609

0.702

  comE Glaesserella parasuis strain SC1401

54.374

91.957

0.5

  pilQ Vibrio campbellii strain DS40M4

41.667

93.913

0.391

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

41.981

92.174

0.387

  pilQ Vibrio cholerae strain A1552

41.981

92.174

0.387