Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   RDV53_RS02890 Genome accession   NZ_CP133470
Coordinates   592653..594035 (-) Length   460 a.a.
NCBI ID   WP_005694710.1    Uniprot ID   -
Organism   Haemophilus parainfluenzae ATCC 33392 strain DSM 8978     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 587653..599035
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  RDV53_RS02865 (RDV53_02865) - 587945..588814 (-) 870 WP_005694704.1 DUF535 family protein -
  RDV53_RS02870 (RDV53_02870) yihA 588927..589541 (+) 615 WP_032822378.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  RDV53_RS02875 (RDV53_02875) comM 589671..591200 (+) 1530 WP_005694706.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  RDV53_RS02880 (RDV53_02880) nfuA 591244..591828 (-) 585 WP_005694707.1 Fe-S biogenesis protein NfuA -
  RDV53_RS02885 (RDV53_02885) - 591944..592633 (-) 690 WP_032822376.1 ComF family protein -
  RDV53_RS02890 (RDV53_02890) comE 592653..594035 (-) 1383 WP_005694710.1 type IV pilus secretin PilQ Machinery gene
  RDV53_RS02895 (RDV53_02895) - 594037..594420 (-) 384 WP_005694711.1 hypothetical protein -
  RDV53_RS02900 (RDV53_02900) - 594420..594962 (-) 543 WP_005694712.1 hypothetical protein -
  RDV53_RS02905 (RDV53_02905) - 594959..595474 (-) 516 WP_005694713.1 hypothetical protein -
  RDV53_RS02910 (RDV53_02910) - 595458..596306 (-) 849 WP_005694714.1 pilus assembly protein PilM -
  RDV53_RS02915 (RDV53_02915) - 596406..599018 (+) 2613 WP_005694715.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 460 a.a.        Molecular weight: 51136.71 Da        Isoelectric Point: 7.1908

>NTDB_id=874433 RDV53_RS02890 WP_005694710.1 592653..594035(-) (comE) [Haemophilus parainfluenzae ATCC 33392 strain DSM 8978]
MVKQKIKTKFGQFLMCFLILWTTYSVAENRVFSLRLKQAPIVATLQQLALEQNANLMIDDELEGTLSLQLDNVDFDRLLR
SVAKIKGLSFYQENDIYYLGKPSQHEQYSEKITEPMAISGESLPSETPLVSTTVKLHFAKASDVMKSLTTGSGSLLSPSG
TITFDDRSNVLLIQDDARSLKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGVRWGIFNPTEAAHRVGGSLDANGFS
NISNNLNVNFATTVTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISKDNNILLDLLVSQNSPGNRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKGV
DKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMARKEKHFKQVEKVKK

Nucleotide


Download         Length: 1383 bp        

>NTDB_id=874433 RDV53_RS02890 WP_005694710.1 592653..594035(-) (comE) [Haemophilus parainfluenzae ATCC 33392 strain DSM 8978]
ATGGTAAAGCAGAAAATAAAAACAAAGTTTGGTCAGTTTTTAATGTGTTTTCTGATCCTATGGACAACTTATTCAGTGGC
AGAAAATCGCGTATTTTCACTTCGCTTAAAACAAGCTCCCATAGTAGCGACACTCCAACAACTTGCCCTTGAGCAAAATG
CCAATTTAATGATTGATGATGAGTTAGAAGGAACACTTTCATTACAATTAGATAACGTAGATTTTGATCGTTTATTGCGT
TCTGTTGCAAAAATCAAAGGGCTCTCTTTTTATCAAGAAAATGATATTTATTATTTAGGTAAGCCTTCTCAACATGAACA
ATATTCAGAGAAAATAACAGAACCTATGGCGATTAGCGGAGAAAGTTTGCCTAGTGAAACACCACTTGTGAGTACAACGG
TTAAACTGCATTTTGCTAAGGCTTCTGATGTGATGAAATCTTTAACCACAGGGAGCGGTTCTTTGCTTTCACCTAGCGGC
ACAATTACATTTGATGATCGAAGCAATGTATTACTGATTCAGGATGATGCACGTTCACTTAAAAATATCAAAAAATTAAT
TGCAGAGCTGGATAAACCTATTGAGCAAATTGTCATTGAAGCACGTATTGTGACGATTACCGATGAAAGCCTAAAAGAAT
TAGGTGTGCGTTGGGGCATTTTTAATCCTACTGAGGCAGCCCATCGAGTGGGTGGCAGTTTAGATGCGAATGGGTTTAGC
AATATCAGTAATAATTTAAATGTGAATTTTGCGACAACGGTCACGCCAGCTGGCTCATTAGCACTTCAAGTAGCCAAAAT
TAATGGTCGATTGTTAGATTTAGAATTGACCGCACTTGAACGTGAAAATAACGTAGAAATTATTGCAAGCCCTCGCTTAC
TCACGACCAATAAGAAAAGTGCAAGCATCAAACAAGGGACAGAAATTCCTTATGTGGTGACAAATGGGAAAAATGACACC
CAATCAGTAGAGTTTCGCGAGGCTGTCTTAGGATTGGAAGTCACGCCGCATATTTCGAAGGATAATAATATTTTATTGGA
TTTATTAGTGAGTCAAAATTCCCCAGGGAATCGCGTGGCTTACGGGCAAAATGAAGTCGTATCTATTGATAAACAAGAAA
TCAATACGCAAGTTTTTGCCAAAGATGGTGAAACAATTGTATTGGGTGGTGTATTCCACGATACGATCACGAAAGGTGTC
GATAAAGTACCATTATTGGGCGATATTCCAGGTATTAAGCGCTTATTCAGTAAGGAAAGTGAACGTCATCAAAAACGAGA
ACTCGTCATTTTTGTGACACCTCATATTTTAAAACAAGGTGAAAGAATGGAAATGGCTAGAAAAGAAAAGCATTTTAAGC
AAGTTGAAAAAGTGAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

72.829

97.609

0.711

  comE Haemophilus influenzae 86-028NP

71.938

97.609

0.702

  comE Glaesserella parasuis strain SC1401

54.374

91.957

0.5

  pilQ Vibrio campbellii strain DS40M4

41.667

93.913

0.391

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

41.981

92.174

0.387

  pilQ Vibrio cholerae strain A1552

41.981

92.174

0.387