Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   INP92_RS03565 Genome accession   NZ_CP063122
Coordinates   726731..728113 (+) Length   460 a.a.
NCBI ID   WP_111387967.1    Uniprot ID   -
Organism   Haemophilus parainfluenzae strain M1C125_4     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 721731..733113
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  INP92_RS09410 - 721742..722434 (-) 693 Protein_685 penicillin-binding protein 1A -
  INP92_RS03540 (INP92_03540) - 722510..724360 (-) 1851 Protein_686 penicillin-binding protein 1A -
  INP92_RS03545 (INP92_03545) - 724460..725308 (+) 849 WP_111387975.1 competence protein ComA -
  INP92_RS03550 (INP92_03550) - 725292..725807 (+) 516 WP_232088003.1 PilN domain-containing protein -
  INP92_RS03555 (INP92_03555) - 725804..726346 (+) 543 WP_111387971.1 competence protein ComC -
  INP92_RS03560 (INP92_03560) - 726346..726729 (+) 384 WP_111387969.1 pilus assembly protein PilP -
  INP92_RS03565 (INP92_03565) comE 726731..728113 (+) 1383 WP_111387967.1 type IV pilus secretin PilQ Machinery gene
  INP92_RS03570 (INP92_03570) - 728133..728822 (+) 690 WP_111387965.1 ComF family protein -
  INP92_RS03575 (INP92_03575) nfuA 728938..729522 (+) 585 WP_005694707.1 Fe-S biogenesis protein NfuA -
  INP92_RS03580 (INP92_03580) comM 729566..731095 (-) 1530 WP_111387963.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  INP92_RS03585 (INP92_03585) yihA 731225..731839 (-) 615 WP_197554855.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  INP92_RS03590 (INP92_03590) - 731953..732822 (+) 870 WP_197554857.1 VirK/YbjX family protein -

Sequence


Protein


Download         Length: 460 a.a.        Molecular weight: 51131.75 Da        Isoelectric Point: 7.9498

>NTDB_id=493141 INP92_RS03565 WP_111387967.1 726731..728113(+) (comE) [Haemophilus parainfluenzae strain M1C125_4]
MLNQKIKTKCGQFLMCFLILWTTYSAAENRVFSLRLKQAPMVATLQQLALEQNANLMIDDELEGKLSLQLDNVDFDRLLR
SVAKIKGFSFYQENNIYYLGKPSQHEQYAEKMTEPMAISGESLPSETPLVSTTVKLHFAKASDVMKSLTTGSGSLLSPSG
TITFDDRSNVLLIQDDARSVKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGVRWGIFNPTEAAHRVSGSLDANGFS
NISNNLNVNFATTVTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISKNNNILLDLLVSQNSPGNRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKGV
DKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMAKKEKHFKQVEKVKK

Nucleotide


Download         Length: 1383 bp        

>NTDB_id=493141 INP92_RS03565 WP_111387967.1 726731..728113(+) (comE) [Haemophilus parainfluenzae strain M1C125_4]
ATGCTAAACCAGAAAATAAAAACAAAGTGCGGTCAGTTTTTAATGTGTTTTTTGATCCTGTGGACAACTTACTCAGCGGC
AGAAAATCGCGTCTTTTCACTTCGGTTAAAACAGGCGCCAATGGTAGCAACTCTCCAGCAACTTGCTCTTGAGCAAAATG
CTAATTTAATGATTGATGATGAGCTAGAAGGAAAACTTTCATTGCAATTAGATAACGTAGATTTTGATCGCTTATTACGT
TCCGTGGCAAAAATCAAAGGGTTCTCTTTTTATCAAGAAAATAATATTTATTATCTGGGTAAGCCTTCTCAACATGAACA
ATATGCAGAGAAAATGACAGAACCTATGGCGATTAGCGGAGAAAGTTTGCCTAGCGAAACACCACTGGTGAGTACAACGG
TTAAACTTCATTTTGCCAAGGCCTCTGATGTGATGAAATCGTTAACAACTGGTAGTGGTTCTTTGCTTTCACCTAGCGGC
ACGATTACCTTTGATGACCGAAGCAATGTATTACTGATTCAGGATGATGCACGTTCTGTCAAAAATATCAAAAAGTTAAT
CGCAGAGTTGGATAAACCCATTGAGCAAATCGTGATTGAAGCACGTATTGTGACGATTACTGATGAAAGCCTAAAAGAAT
TAGGTGTGCGTTGGGGCATTTTTAATCCTACTGAGGCAGCCCATCGAGTGAGTGGCAGTTTAGATGCGAATGGATTTAGT
AATATCAGTAATAATTTAAATGTGAATTTTGCGACAACCGTCACGCCAGCTGGCTCATTAGCTCTTCAAGTAGCTAAAAT
TAATGGTCGATTATTAGACCTAGAATTGACCGCACTTGAACGTGAAAATAACGTAGAAATTATTGCAAGCCCTCGCTTAC
TCACAACCAATAAGAAAAGCGCAAGCATTAAACAAGGGACAGAAATTCCTTATGTGGTGACGAATGGGAAAAATGACACC
CAATCAGTGGAATTTAGAGAGGCGGTGTTGGGATTAGAAGTGACACCGCATATTTCGAAGAATAATAATATTTTATTGGA
TTTATTAGTGAGTCAAAATTCCCCGGGAAATCGCGTGGCTTACGGGCAAAATGAAGTGGTGTCCATTGATAAACAAGAAA
TTAACACGCAAGTTTTTGCCAAAGATGGGGAAACAATTGTATTGGGTGGTGTATTCCACGATACGATCACGAAAGGTGTC
GATAAAGTACCATTATTGGGCGATATTCCTGGTATTAAGCGCTTATTTAGTAAGGAAAGTGAACGACATCAAAAGCGAGA
ACTCGTCATTTTTGTGACCCCTCATATTTTGAAACAAGGTGAAAGAATGGAGATGGCTAAGAAAGAAAAGCATTTTAAGC
AAGTTGAAAAAGTGAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

73.497

97.609

0.717

  comE Haemophilus influenzae 86-028NP

72.383

97.609

0.707

  comE Glaesserella parasuis strain SC1401

53.901

91.957

0.496

  pilQ Vibrio campbellii strain DS40M4

41.204

93.913

0.387

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

41.509

92.174

0.383

  pilQ Vibrio cholerae strain A1552

41.509

92.174

0.383