Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   QQS40_RS04235 Genome accession   NZ_CP127167
Coordinates   828170..829555 (+) Length   461 a.a.
NCBI ID   WP_329506361.1    Uniprot ID   -
Organism   Haemophilus parainfluenzae strain HP01     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 823170..834555
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QQS40_RS11220 - 823181..823873 (-) 693 Protein_821 penicillin-binding protein 1A -
  QQS40_RS04210 - 823949..825799 (-) 1851 Protein_822 penicillin-binding protein 1A -
  QQS40_RS04215 - 825899..826747 (+) 849 WP_289902111.1 competence protein ComA -
  QQS40_RS04220 - 826734..827246 (+) 513 WP_289902110.1 competence protein ComB -
  QQS40_RS04225 - 827243..827785 (+) 543 WP_289902109.1 competence protein ComC -
  QQS40_RS04230 - 827824..828168 (+) 345 WP_329506359.1 pilus assembly protein PilP -
  QQS40_RS04235 comE 828170..829555 (+) 1386 WP_329506361.1 type IV pilus secretin PilQ Machinery gene
  QQS40_RS04240 - 829575..830264 (+) 690 WP_289902165.1 ComF family protein -
  QQS40_RS04245 nfuA 830378..830962 (+) 585 WP_005694707.1 Fe-S biogenesis protein NfuA -
  QQS40_RS04250 comM 831006..832535 (-) 1530 WP_128786785.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  QQS40_RS04255 yihA 832665..833279 (-) 615 WP_049357183.1 ribosome biogenesis GTP-binding protein YihA/YsxC -

Sequence


Protein


Download         Length: 461 a.a.        Molecular weight: 51318.98 Da        Isoelectric Point: 7.5108

>NTDB_id=842316 QQS40_RS04235 WP_329506361.1 828170..829555(+) (comE) [Haemophilus parainfluenzae strain HP01]
MIKQKIKTKCGQFLMCFLILWTTYSAAENRVFSLRLKQAPMVATLQQLALEQNANLMIDDELEGTLSLQLENVDFDRLLR
SVAKIKRLSFYQENDIYYLGKPSQHEQYAEKMTEPMAISGESLPSETPLVSTTIKLHFAKASDVMKSLTTGSGSLLSPSG
TITFDDRSNVLLIQDDARSIKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGVRWGIFNPTEAAHRVSGSLDANGFS
NISNNLNVNFATTVTPAGSLAFQVAKINGRLLDLELTALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVTNGKNDT
QSVEFREAVLGLEVTPHISKDNNILLDLLVSQNSPGNRVAYGQGNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKG
IDKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMAKKEKHFKQIEKAKK

Nucleotide


Download         Length: 1386 bp        

>NTDB_id=842316 QQS40_RS04235 WP_329506361.1 828170..829555(+) (comE) [Haemophilus parainfluenzae strain HP01]
ATGATAAAGCAGAAAATAAAAACAAAGTGCGGTCAGTTTTTAATGTGTTTTTTGATCCTATGGACAACTTACTCGGCGGC
AGAAAATCGCGTATTTTCACTTCGCTTAAAACAAGCTCCCATGGTGGCGACACTCCAGCAACTTGCTCTTGAGCAAAATG
CCAATTTAATGATTGATGATGAGTTAGAAGGAACGCTTTCATTGCAATTAGAGAACGTAGATTTTGATCGTTTATTGCGT
TCTGTGGCAAAAATCAAAAGGCTCTCTTTTTATCAAGAAAATGATATTTATTATTTGGGTAAGCCTTCTCAACATGAACA
ATATGCAGAGAAAATGACAGAACCTATGGCGATTAGCGGAGAAAGTTTGCCTAGTGAAACACCACTTGTGAGTACAACGA
TTAAACTGCATTTTGCTAAGGCCTCTGATGTGATGAAATCTTTAACAACTGGTAGTGGTTCTTTGCTTTCACCTAGCGGC
ACAATTACCTTTGATGATCGAAGCAATGTATTACTGATTCAGGATGATGCACGTTCTATCAAAAATATCAAAAAATTGAT
TGCAGAGCTGGATAAACCCATTGAACAAATTGTCATCGAAGCACGTATTGTGACGATTACTGATGAAAGCCTGAAAGAGT
TAGGTGTACGTTGGGGTATTTTTAATCCGACTGAGGCTGCCCATCGAGTGAGTGGCAGTCTAGATGCGAATGGTTTTAGT
AATATCAGTAATAATTTAAATGTGAATTTTGCGACAACGGTCACGCCAGCTGGCTCATTAGCATTTCAAGTCGCTAAAAT
TAATGGCCGATTATTAGACTTAGAATTGACTGCACTTGAACGTGAAAATAACGTAGAAATTATTGCGAGCCCTCGCTTAC
TCACGACCAATAAGAAAAGTGCAAGTATCAAACAAGGGACGGAAATTCCTTATGTAGTGACGAACGGGAAAAATGACACC
CAATCAGTGGAGTTTCGAGAAGCTGTGTTGGGATTGGAAGTCACACCGCATATTTCAAAGGATAATAATATCTTATTGGA
TTTATTAGTCAGTCAAAATTCCCCGGGAAACCGTGTGGCTTATGGGCAAGGTAACGAAGTCGTGTCTATTGATAAACAAG
AAATTAATACACAAGTTTTTGCTAAAGATGGGGAAACGATTGTATTAGGTGGTGTATTCCACGATACGATCACAAAAGGA
ATTGATAAAGTGCCGCTATTGGGTGATATTCCAGGTATTAAGCGCTTATTTAGTAAGGAAAGTGAACGTCATCAAAAGCG
AGAGCTTGTAATTTTTGTGACTCCTCATATTTTAAAACAAGGTGAAAGAATGGAAATGGCTAAGAAAGAAAAGCATTTTA
AGCAAATTGAAAAAGCGAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

72.889

97.614

0.711

  comE Haemophilus influenzae 86-028NP

72

97.614

0.703

  comE Glaesserella parasuis strain SC1401

54.245

91.974

0.499

  pilQ Vibrio campbellii strain DS40M4

41.57

93.926

0.39

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

41.882

92.191

0.386

  pilQ Vibrio cholerae strain A1552

41.882

92.191

0.386