Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   EL215_RS03830 Genome accession   NZ_LR134481
Coordinates   758963..760180 (+) Length   405 a.a.
NCBI ID   WP_420026300.1    Uniprot ID   -
Organism   Haemophilus parainfluenzae strain NCTC10665     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 753963..765180
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL215_RS03805 (NCTC10665_00750) - 754573..756423 (-) 1851 Protein_732 penicillin-binding protein 1A -
  EL215_RS03810 (NCTC10665_00751) - 756523..757359 (+) 837 WP_126470287.1 pilus assembly protein PilM -
  EL215_RS03815 (NCTC10665_00752) - 757359..757871 (+) 513 WP_126470289.1 competence protein ComB -
  EL215_RS03820 (NCTC10665_00753) - 757868..758410 (+) 543 WP_126470291.1 competence protein ComC -
  EL215_RS03825 (NCTC10665_00754) - 758410..758793 (+) 384 WP_049357176.1 pilus assembly protein PilP -
  EL215_RS03830 (NCTC10665_00755) comE 758963..760180 (+) 1218 WP_420026300.1 type IV pilus secretin PilQ Machinery gene
  EL215_RS03835 (NCTC10665_00756) - 760200..760889 (+) 690 WP_126472006.1 ComF family protein -
  EL215_RS03840 (NCTC10665_00757) nfuA 761003..761587 (+) 585 WP_005694707.1 Fe-S biogenesis protein NfuA -
  EL215_RS03845 (NCTC10665_00758) comM 761631..763160 (-) 1530 WP_126470293.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  EL215_RS03850 (NCTC10665_00759) yihA 763291..763905 (-) 615 WP_126470295.1 ribosome biogenesis GTP-binding protein YihA/YsxC -

Sequence


Protein


Download         Length: 405 a.a.        Molecular weight: 44741.09 Da        Isoelectric Point: 5.9234

>NTDB_id=1123036 EL215_RS03830 WP_420026300.1 758963..760180(+) (comE) [Haemophilus parainfluenzae strain NCTC10665]
MIDDELEGTLSLQLDNVDFDRLLRAVAKIKGLSFYQENDIYYLGKASQHEQYTEKMDEPVAIVGESLPSEMPLVSTTVKL
HFAKASDVMKSLTTGSGSLLSPSGTITFDDRSNVLLIQDDARSIKNIKKLIAELDKPIEQIVIEARIVTITDESLKELGV
RWGIFNPTEAAHRVSGSLDANGFSNISNNLNVNFATTVTPAGSLALQVAKINGRLLDLELTALERENNVEIIASPRLLTT
NKKSASIKQGTEIPYVVTNGKNDTQSVEFREAVLGLEVTPHISKDNNILLDLLVSQNSPGNRVAYGQGNEVVSIDKQEIN
TQVFAKDGETIVLGGVFHDTITKGIDKVPLLGDIPGIKRLFSKESERHQKRELVIFVTPHILKQGERMEMAKKEKHFKQI
EKAKK

Nucleotide


Download         Length: 1218 bp        

>NTDB_id=1123036 EL215_RS03830 WP_420026300.1 758963..760180(+) (comE) [Haemophilus parainfluenzae strain NCTC10665]
ATGATTGATGATGAATTAGAAGGAACGCTTTCATTGCAATTAGATAACGTAGATTTTGATCGTTTATTGCGTGCTGTCGC
AAAAATCAAAGGACTCTCTTTTTATCAAGAAAATGATATTTATTATTTAGGCAAGGCTTCTCAACATGAACAATATACTG
AAAAAATGGATGAACCAGTAGCAATCGTCGGCGAAAGTTTGCCTAGTGAAATGCCACTTGTGAGTACAACGGTTAAATTG
CATTTTGCCAAAGCTTCTGATGTGATGAAATCGTTAACAACAGGGAGTGGTTCTTTGCTTTCACCTAGCGGCACAATTAC
ATTTGATGATCGAAGCAATGTGTTACTGATTCAGGATGATGCACGTTCTATCAAAAATATCAAAAAATTAATTGCAGAGC
TGGATAAACCCATTGAACAAATTGTCATCGAAGCACGTATTGTGACGATTACTGATGAAAGCCTGAAAGAGTTAGGTGTA
CGTTGGGGCATTTTTAATCCGACTGAGGCTGCCCATCGAGTGAGTGGCAGTCTAGATGCGAATGGTTTTAGTAATATCAG
TAATAATTTAAATGTGAATTTTGCGACAACAGTCACGCCAGCTGGCTCATTAGCACTTCAAGTCGCTAAAATTAATGGCC
GATTATTAGACTTAGAATTGACTGCACTTGAACGTGAAAATAACGTAGAAATTATTGCGAGCCCTCGCTTACTCACGACC
AATAAGAAAAGTGCAAGTATCAAACAAGGGACGGAAATTCCTTATGTAGTGACGAACGGGAAAAATGACACCCAATCAGT
GGAGTTTCGAGAAGCTGTGTTGGGATTGGAAGTCACACCGCATATTTCAAAGGATAATAATATCTTATTGGATTTATTAG
TCAGTCAAAATTCCCCGGGAAACCGTGTGGCTTATGGGCAAGGTAACGAAGTCGTGTCTATTGATAAACAAGAAATTAAT
ACACAAGTTTTTGCTAAAGATGGGGAAACGATTGTATTAGGTGGTGTATTCCACGATACGATCACAAAAGGAATTGATAA
AGTGCCGCTATTGGGTGATATTCCAGGTATTAAGCGCTTATTTAGTAAGGAAAGTGAACGTCATCAAAAGCGAGAGCTTG
TAATTTTTGTGACTCCTCATATTTTAAAACAAGGTGAAAGAATGGAAATGGCTAAGAAAGAAAAGCATTTTAAGCAAATT
GAAAAAGCGAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

76.75

98.765

0.758

  comE Haemophilus influenzae 86-028NP

75.75

98.765

0.748

  comE Glaesserella parasuis strain SC1401

57.068

94.321

0.538

  pilQ Vibrio campbellii strain DS40M4

42.574

99.753

0.425

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

42.317

98.025

0.415

  pilQ Vibrio cholerae strain A1552

42.317

98.025

0.415

  comQ Acinetobacter baylyi ADP1

35.731

100

0.368


Multiple sequence alignment