Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   PARA_RS04075 Genome accession   NC_015964
Coordinates   797828..799210 (+) Length   460 a.a.
NCBI ID   WP_014064657.1    Uniprot ID   -
Organism   Haemophilus parainfluenzae T3T1     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 792828..804210
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PARA_RS10515 - 792839..793531 (-) 693 Protein_798 penicillin-binding protein 1A -
  PARA_RS04050 (PARA_08040) - 793607..795457 (-) 1851 Protein_799 penicillin-binding protein 1A -
  PARA_RS04055 (PARA_08050) - 795557..796399 (+) 843 WP_014064653.1 hypothetical protein -
  PARA_RS04060 (PARA_08060) - 796392..796904 (+) 513 WP_014064654.1 hypothetical protein -
  PARA_RS04065 (PARA_08070) - 796901..797443 (+) 543 WP_014064655.1 hypothetical protein -
  PARA_RS04070 (PARA_08080) - 797443..797826 (+) 384 WP_014064656.1 pilus assembly protein PilP -
  PARA_RS04075 (PARA_08090) comE 797828..799210 (+) 1383 WP_014064657.1 type IV pilus secretin PilQ Machinery gene
  PARA_RS04080 (PARA_08100) - 799230..799919 (+) 690 WP_014064658.1 ComF family protein -
  PARA_RS04085 (PARA_08110) nfuA 800033..800617 (+) 585 WP_014064659.1 Fe-S biogenesis protein NfuA -
  PARA_RS04090 (PARA_08120) comM 800661..802190 (-) 1530 WP_014064660.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  PARA_RS04095 (PARA_08130) yihA 802320..802934 (-) 615 WP_014064661.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  PARA_RS04100 (PARA_08140) - 803047..803916 (+) 870 WP_014064662.1 VirK/YbjX family protein -

Sequence


Protein


Download         Length: 460 a.a.        Molecular weight: 51070.78 Da        Isoelectric Point: 7.1773

>NTDB_id=42239 PARA_RS04075 WP_014064657.1 797828..799210(+) (comE) [Haemophilus parainfluenzae T3T1]
MVKQKIKTKCGQFLMCFLILWTTYSAAENRIFSLRLKQAPMVATLQQLALEQNTNLMIDDELEGTLSLQLDSVDFDRLLR
SVAKIKGLSFYQEKDIYYLGKPSQHEQYAEKMVEPMTISGESLPSETPLVSATVKLHFAKAADVMKSLTTGSGSLLSPSG
TITFDDRSNVLLIQDDARSVKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGVRWGIFNPTEAAHRVSGSLDANGFS
NIGDNLNVNFATTVTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISKDNNILLDLLVSQNSPGNRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKGI
DKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMAKKEKHFKQVEKVKK

Nucleotide


Download         Length: 1383 bp        

>NTDB_id=42239 PARA_RS04075 WP_014064657.1 797828..799210(+) (comE) [Haemophilus parainfluenzae T3T1]
ATGGTAAAGCAGAAAATAAAAACAAAGTGCGGTCAGTTTTTAATGTGTTTTTTGATCCTATGGACAACTTACTCAGCGGC
AGAAAATCGCATATTTTCACTTCGCTTAAAACAGGCGCCAATGGTGGCGACACTCCAGCAACTTGCTCTTGAGCAAAATA
CCAATTTAATGATTGATGATGAATTAGAAGGAACACTTTCATTACAATTAGATAGCGTCGATTTTGATCGTTTATTGCGT
TCTGTCGCAAAAATCAAAGGGCTCTCTTTTTATCAAGAAAAGGATATTTATTATTTGGGTAAGCCTTCTCAACATGAACA
ATATGCAGAGAAAATGGTAGAACCGATGACGATTAGCGGAGAAAGTTTGCCTAGTGAAACACCACTTGTGAGTGCAACGG
TTAAACTGCATTTTGCCAAGGCCGCTGATGTGATGAAATCGTTAACAACAGGGAGTGGTTCTTTGCTTTCACCTAGCGGC
ACAATTACATTTGATGACCGAAGCAATGTGTTACTGATTCAGGATGATGCACGTTCTGTCAAAAATATTAAAAAATTGAT
TGCAGAGCTGGATAAACCCATTGAGCAAATTGTCATTGAAGCACGTATTGTGACGATTACCGATGAAAGCCTAAAAGAAT
TAGGTGTGCGTTGGGGTATTTTTAATCCTACTGAGGCTGCTCATCGAGTGAGTGGCAGTCTAGATGCGAATGGCTTTAGT
AATATCGGTGATAATTTAAACGTGAATTTTGCGACAACGGTCACGCCAGCTGGCTCATTAGCACTTCAAGTGGCTAAAAT
TAATGGTCGATTATTAGATTTAGAATTGACCGCACTTGAACGTGAAAATAACGTAGAAATTATTGCAAGCCCTCGTTTAC
TCACAACTAATAAGAAAAGCGCAAGCATTAAACAAGGGACAGAAATTCCTTATGTAGTGACAAATGGTAAAAATGATACA
CAGTCAGTAGAGTTTCGCGAGGCAGTCTTAGGATTGGAAGTCACACCACATATTTCGAAGGATAATAATATTTTATTGGA
TTTATTAGTCAGTCAAAATTCCCCGGGAAATCGCGTGGCTTACGGGCAAAATGAAGTGGTGTCCATTGATAAACAAGAAA
TTAATACACAAGTTTTTGCCAAAGATGGCGAAACAATTGTATTGGGAGGGGTATTCCACGACACAATCACGAAAGGTATC
GATAAAGTACCATTATTGGGCGATATTCCCGGTATTAAGCGTCTATTTAGTAAGGAAAGTGAACGACATCAAAAACGCGA
ACTCGTTATTTTTGTGACCCCTCATATTTTGAAACAAGGTGAAAGAATGGAAATGGCTAAGAAAGAAAAGCATTTTAAGC
AAGTTGAAAAAGTGAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

73.274

97.609

0.715

  comE Haemophilus influenzae 86-028NP

72.383

97.609

0.707

  comE Glaesserella parasuis strain SC1401

53.428

91.957

0.491

  pilQ Vibrio campbellii strain DS40M4

41.204

93.913

0.387

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

41.038

92.174

0.378

  pilQ Vibrio cholerae strain A1552

41.038

92.174

0.378


Multiple sequence alignment