Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   ELZ62_RS03805 Genome accession   NZ_LR134168
Coordinates   734272..735666 (+) Length   464 a.a.
NCBI ID   WP_126507583.1    Uniprot ID   -
Organism   Haemophilus influenzae strain NCTC11394     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 729272..740666
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ELZ62_RS03780 (NCTC11394_00751) rsmE 729288..730025 (-) 738 WP_126507580.1 16S rRNA (uracil(1498)-N(3))-methyltransferase -
  ELZ62_RS03785 (NCTC11394_00752) lnt 730075..731604 (-) 1530 WP_126507581.1 apolipoprotein N-acyltransferase -
  ELZ62_RS03790 (NCTC11394_00753) corC 731627..732526 (-) 900 WP_126507582.1 CNNM family magnesium/cobalt transport protein CorC -
  ELZ62_RS03795 (NCTC11394_00754) ampD 733158..733712 (-) 555 WP_110432055.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  ELZ62_RS03800 (NCTC11394_00755) pilA 733826..734275 (+) 450 WP_110432056.1 prepilin peptidase-dependent pilin Machinery gene
  ELZ62_RS03805 (NCTC11394_00756) pilB 734272..735666 (+) 1395 WP_126507583.1 GspE/PulE family protein Machinery gene
  ELZ62_RS03810 (NCTC11394_00757) pilC 735663..736880 (+) 1218 WP_126507584.1 type II secretion system F family protein Machinery gene
  ELZ62_RS03815 (NCTC11394_00758) pilD 736877..737569 (+) 693 WP_041175037.1 A24 family peptidase Machinery gene
  ELZ62_RS03820 (NCTC11394_00759) rho 737625..738887 (-) 1263 WP_015701557.1 transcription termination factor Rho -
  ELZ62_RS03825 (NCTC11394_00760) metJ 739135..739452 (+) 318 WP_005631186.1 met regulon transcriptional regulator MetJ -
  ELZ62_RS03830 (NCTC11394_00761) cueR 739466..739852 (-) 387 WP_005648963.1 Cu(I)-responsive transcriptional regulator -
  ELZ62_RS03835 (NCTC11394_00762) - 739929..740135 (+) 207 WP_005686717.1 heavy-metal-associated domain-containing protein -
  ELZ62_RS03840 (NCTC11394_00763) - 740221..740427 (+) 207 WP_005666693.1 heavy-metal-associated domain-containing protein -

Sequence


Protein


Download         Length: 464 a.a.        Molecular weight: 52881.39 Da        Isoelectric Point: 5.8385

>NTDB_id=1118684 ELZ62_RS03805 WP_126507583.1 734272..735666(+) (pilB) [Haemophilus influenzae strain NCTC11394]
MTSYALLHTQRVIAQNGEVFTISPNLWERNQQQQSLLLRYFALPLKEENNRLWLGVDSLSNLSACETIAFITGKPVEPIL
LESSQLKELLQQLTPCQMQVEEQVKFYQHQETHFEQENDEPVIRLLNQIFESALQKNASDIHLETLADQFQVRFRIDGVL
QPQPLISKIFANRIISRLKLLAKLDISENRLPQDGRFQFKTTFSDILDFRLSTLPTHWGEKIVLRAQQNKPVELSFSELG
MTENQQQAFQRTLSQPQGLILVTGPTGSGKSISLYTALQWLNTPDKHIMTAEDPIEIELDGIIQSQINPQIGLDFNRLLR
AFLRQDPDIIMLGEIRDEESAMIALRAAQTGHLVLSTLHTNDAISAISRLQQLGIQQYEIENSLLLVIAQRLVRKICPKC
GGNLINSCDCNQGYRGRIGVYQFLHWQQNGYQTDFKNLHASGLEKVSQGITDEKEIERVLGKNS

Nucleotide


Download         Length: 1395 bp        

>NTDB_id=1118684 ELZ62_RS03805 WP_126507583.1 734272..735666(+) (pilB) [Haemophilus influenzae strain NCTC11394]
ATGACGAGCTATGCTTTACTTCATACTCAGCGTGTAATTGCTCAAAATGGCGAGGTATTTACGATCTCGCCAAATTTATG
GGAACGCAATCAGCAGCAACAATCCTTGCTTTTGCGGTATTTTGCTTTGCCACTTAAAGAAGAAAATAATCGTCTTTGGC
TAGGGGTTGATTCTCTTTCCAATCTTTCAGCTTGTGAAACCATTGCGTTTATAACAGGAAAACCTGTCGAACCAATTTTG
TTAGAAAGCAGCCAACTCAAAGAACTGTTACAGCAACTTACTCCGTGCCAAATGCAAGTGGAAGAGCAAGTTAAATTCTA
TCAACATCAAGAAACCCATTTTGAACAAGAAAATGATGAACCTGTTATCCGCTTACTTAATCAGATTTTTGAATCTGCCT
TACAAAAAAATGCCTCTGATATTCATTTAGAAACCTTGGCTGATCAGTTTCAAGTGCGGTTTAGAATTGATGGTGTTTTA
CAACCACAACCCTTAATAAGCAAAATATTCGCCAATCGTATTATTTCACGCTTAAAATTACTGGCTAAATTAGATATTAG
TGAAAATCGACTTCCACAAGATGGGCGATTTCAATTTAAAACGACTTTTTCCGATATTCTTGATTTTCGCCTTTCAACCT
TACCAACCCATTGGGGCGAAAAAATCGTGTTGCGAGCGCAACAAAATAAACCAGTAGAACTTAGCTTTTCTGAATTAGGT
ATGACAGAAAATCAGCAACAAGCATTTCAACGTACGCTTAGCCAGCCACAAGGATTAATTTTAGTCACTGGTCCAACAGG
AAGTGGAAAAAGTATCTCACTTTACACCGCACTTCAGTGGCTAAATACGCCTGATAAACATATTATGACTGCTGAAGATC
CCATTGAAATTGAGCTGGATGGCATTATTCAAAGCCAAATTAACCCACAGATTGGATTAGATTTTAACCGCCTATTGCGC
GCTTTTTTACGCCAAGATCCCGATATTATTATGTTAGGCGAAATTCGCGATGAAGAAAGTGCAATGATTGCACTACGTGC
CGCCCAAACCGGGCATTTAGTACTCTCAACTTTACATACCAATGATGCGATATCTGCCATTTCTCGATTACAACAACTTG
GTATTCAACAGTATGAAATTGAAAACAGCTTACTGCTCGTTATTGCGCAGCGTCTTGTACGAAAAATCTGTCCAAAGTGC
GGTGGAAATTTAATAAATTCTTGTGATTGCAATCAAGGTTATCGAGGTCGAATCGGCGTGTATCAATTTCTACATTGGCA
GCAGAATGGCTATCAAACGGATTTTAAAAATTTACATGCGAGTGGTTTAGAAAAAGTTAGCCAAGGCATAACAGATGAGA
AAGAAATTGAACGTGTGCTAGGTAAAAACTCATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Haemophilus influenzae 86-028NP

96.983

100

0.97

  pilB Haemophilus influenzae Rd KW20

96.976

99.784

0.968

  pilB Glaesserella parasuis strain SC1401

57.174

99.138

0.567

  pilB Vibrio cholerae strain A1552

38.899

100

0.442

  pilB Vibrio campbellii strain DS40M4

39.877

100

0.42

  pilB Legionella pneumophila strain ERS1305867

38.416

100

0.418

  pilB Vibrio parahaemolyticus RIMD 2210633

39.752

100

0.414

  pilB Acinetobacter baylyi ADP1

37.903

100

0.405

  pilB Acinetobacter baumannii D1279779

38.144

100

0.399

  pilF Thermus thermophilus HB27

36.345

100

0.373

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

36.966

100

0.373

  pilF Neisseria gonorrhoeae MS11

36.134

100

0.371


Multiple sequence alignment