Detailed information    

In silico (bioinformatically predicted)

Overview


Name: pilB
Type: Machinery gene
Locus tag: ELZ63_RS02020
Genome accession: NZ_LR134171
Coordinates: 394935..396332 (+)
Length: 465 a.a.
NCBI ID: WP_065244997.1
UniProt ID: -
Organism: Haemophilus influenzae strain NCTC12699
Function: type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake
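
The coordinates and the protein length in this record are consistent with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates (the GenBank convention) and that the final codon of the coding sequence is a stop codon. The function names are illustrative and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(394935, 396332)
print(cds_bp, protein_length(cds_bp))  # 1398 465
```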

Genomic Context


Location: 389935..401332
Locus tag (old locus tag) | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
ELZ63_RS01995 (NCTC12699_00399) | rsmE | 389950..390687 (-) | 738 | WP_065245002.1 | 16S rRNA (uracil(1498)-N(3))-methyltransferase | -
ELZ63_RS02000 (NCTC12699_00400) | lnt | 390737..392266 (-) | 1530 | WP_080474365.1 | apolipoprotein N-acyltransferase | -
ELZ63_RS02005 (NCTC12699_00401) | corC | 392289..393188 (-) | 900 | WP_065245000.1 | CNNM family magnesium/cobalt transport protein CorC | -
ELZ63_RS02010 (NCTC12699_00402) | ampD | 393820..394374 (-) | 555 | WP_065244999.1 | 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD | -
ELZ63_RS02015 (NCTC12699_00403) | pilA | 394489..394938 (+) | 450 | WP_065244998.1 | prepilin peptidase-dependent pilin | Machinery gene
ELZ63_RS02020 (NCTC12699_00404) | pilB | 394935..396332 (+) | 1398 | WP_065244997.1 | GspE/PulE family protein | Machinery gene
ELZ63_RS02025 (NCTC12699_00405) | pilC | 396329..397549 (+) | 1221 | WP_065244996.1 | type II secretion system F family protein | Machinery gene
ELZ63_RS02030 (NCTC12699_00406) | pilD | 397546..398238 (+) | 693 | WP_065244995.1 | A24 family peptidase | Machinery gene
ELZ63_RS02035 (NCTC12699_00407) | rho | 398291..399553 (-) | 1263 | WP_005648966.1 | transcription termination factor Rho | -
ELZ63_RS02040 (NCTC12699_00408) | metJ | 399801..400118 (+) | 318 | WP_005631186.1 | met regulon transcriptional regulator MetJ | -
ELZ63_RS02045 (NCTC12699_00409) | cueR | 400132..400518 (-) | 387 | WP_005648963.1 | Cu(I)-responsive transcriptional regulator | -
ELZ63_RS02050 (NCTC12699_00410) | - | 400595..400801 (+) | 207 | WP_065244994.1 | heavy-metal-associated domain-containing protein | -
ELZ63_RS02055 (NCTC12699_00411) | - | 400876..401082 (+) | 207 | WP_005666693.1 | heavy-metal-associated domain-containing protein | -
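
The coordinates listed above show that pilA, pilB, pilC and pilD lie on the same strand and that each gene starts 4 bp before the previous one ends. This arrangement is consistent with a single operon, though it does not prove one. The displayed window also extends 5,000 bp on either side of pilB. Below is a minimal sketch of both checks, assuming 1-based inclusive coordinates; the variable and function names are illustrative.

```python
# (gene, start, end) for the four adjacent plus-strand genes in the table
genes = [
    ("pilA", 394489, 394938),
    ("pilB", 394935, 396332),
    ("pilC", 396329, 397549),
    ("pilD", 397546, 398238),
]


def overlap_bp(upstream, downstream):
    """Bases shared by two adjacent genes (a negative value means a gap)."""
    return upstream[2] - downstream[1] + 1


overlaps = {
    f"{a[0]}/{b[0]}": overlap_bp(a, b) for a, b in zip(genes, genes[1:])
}
print(overlaps)  # {'pilA/pilB': 4, 'pilB/pilC': 4, 'pilC/pilD': 4}

# Flanking window shown under "Location", measured relative to pilB
window = (389935, 401332)
flank = (genes[1][1] - window[0], window[1] - genes[1][2])
print(flank)  # (5000, 5000)
```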

Sequence


Protein


Length: 465 a.a.; Molecular weight: 52849.30 Da; Isoelectric point: 5.8360

>NTDB_id=1118735 ELZ63_RS02020 WP_065244997.1 394935..396332(+) (pilB) [Haemophilus influenzae strain NCTC12699]
MTSYALLHTQRVIAQNGEVFTISPDLWERNRQQQSLLLRYFALPLKEENNRLWLGVDSLSNLSACETIAFITGKPVEPIL
LESSQLKELLQQLAPNQPKVEEQVKYYQHQEQSHLEQQDDEPVIRLLNQIFESALQKNASDIHLETLADQFQVRFRIDGV
LQPQPLISKIFANRIISRLKLLAKLDISENRLPQDGRFQFKTTFSDILDFRLSTLPTHWGEKIVLRAQQNKPVELSFAEL
GMTESQQQAFQCALSQPQGLILVTGPTGSGKSISLYTALQWLNTPDKHIMTAEDPIEIELDGIIQSQINPQIGLDFNRLL
RTFLRQDPDIIMLGEIRDEESAMIALRAAQTGHLVLSTLHTNDAISAISRLQQLGIQQYEIENSLLLVIAQRLVRKLCSK
CGGNLINSCDCNQGYQGRIGVYQFLHWQQNGYQTDFKNLHASGLEKVNQGMTDNKELERVLGKNS

Nucleotide


Length: 1398 bp

>NTDB_id=1118735 ELZ63_RS02020 WP_065244997.1 394935..396332(+) (pilB) [Haemophilus influenzae strain NCTC12699]
ATGACGAGCTATGCTTTACTTCATACTCAGCGTGTAATTGCTCAAAATGGCGAAGTATTTACGATCTCGCCAGATTTATG
GGAACGCAATCGGCAGCAACAATCCTTGCTCTTACGTTATTTTGCTTTGCCACTTAAAGAAGAAAATAATCGTCTTTGGC
TAGGGGTTGATTCTCTCTCCAATCTTTCAGCCTGTGAAACCATTGCGTTTATAACAGGAAAACCTGTCGAACCAATTTTG
TTAGAAAGCAGCCAACTCAAAGAACTATTACAGCAACTTGCTCCTAATCAACCTAAAGTGGAAGAGCAAGTTAAATATTA
CCAACATCAAGAACAGTCTCATCTTGAACAACAAGATGATGAACCTGTTATCCGCTTACTTAATCAGATTTTTGAATCTG
CCTTACAAAAAAATGCCTCTGATATTCATTTAGAAACCTTGGCTGATCAATTTCAAGTGCGGTTTAGAATTGATGGTGTT
TTACAACCACAACCCTTAATAAGCAAAATATTCGCCAATCGTATTATTTCACGCTTAAAATTACTGGCTAAATTAGATAT
TAGTGAAAATCGACTTCCACAAGATGGGCGATTTCAATTTAAAACGACTTTTTCCGATATTCTTGATTTTCGCCTTTCAA
CCTTACCAACCCATTGGGGCGAAAAAATCGTGTTGCGAGCGCAACAAAATAAACCTGTAGAACTTAGCTTTGCTGAACTG
GGCATGACTGAAAGCCAACAACAGGCATTTCAATGCGCGCTTAGCCAACCCCAAGGATTAATTTTAGTCACTGGCCCAAC
AGGAAGTGGAAAAAGTATCTCACTTTACACCGCACTTCAGTGGCTAAATACGCCTGATAAACATATTATGACTGCTGAAG
ATCCCATTGAAATTGAGCTGGATGGCATTATTCAAAGCCAAATTAATCCACAGATTGGATTAGATTTTAACCGCCTATTG
CGTACTTTTTTACGCCAAGATCCCGATATCATTATGTTAGGCGAAATTCGCGATGAAGAAAGTGCAATGATTGCACTACG
TGCCGCCCAAACCGGGCATTTAGTACTCTCAACTTTACATACCAATGATGCGATATCTGCCATTTCTCGATTACAACAAC
TTGGTATTCAACAGTATGAAATTGAAAACAGCTTACTGCTCGTTATTGCGCAACGTCTTGTACGAAAGCTTTGTTCAAAG
TGCGGTGGAAATTTAATAAATTCTTGTGATTGCAATCAAGGTTATCAAGGTCGAATCGGCGTGTATCAATTTCTACATTG
GCAGCAGAATGGCTATCAAACGGATTTTAAAAATTTACATGCGAGTGGTTTAGAAAAAGTTAATCAAGGAATGACGGATA
ATAAAGAACTTGAACGTGTGCTAGGTAAAAACTCATGA
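
The protein sequence in this record corresponds to the translation of the nucleotide sequence above. Below is a minimal sketch of that check on the first 30 and last 15 nucleotides, using the standard codon assignments. Whether the database used this exact table is an assumption; the bacterial table (NCBI translation table 11) assigns the same amino acids to each codon and differs only in permitted start codons. The helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest)
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGACGAGCTATGCTTTACTTCATACTCAG"))  # MTSYALLHTQ
print(translate("GGTAAAAACTCATGA"))                 # GKNS*
```

The two outputs match the start (MTSYALLHTQ) and the end (GKNS, followed by the TGA stop codon) of the listed protein.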


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
pilB | Haemophilus influenzae Rd KW20 | 94.612 | 99.785 | 0.944
pilB | Haemophilus influenzae 86-028NP | 93.548 | 100 | 0.935
pilB | Glaesserella parasuis strain SC1401 | 58.114 | 98.065 | 0.57
pilB | Vibrio cholerae strain A1552 | 40.607 | 100 | 0.46
pilB | Vibrio campbellii strain DS40M4 | 40.9 | 100 | 0.43
pilB | Vibrio parahaemolyticus RIMD 2210633 | 40.58 | 100 | 0.422
pilB | Legionella pneumophila strain ERS1305867 | 38.742 | 100 | 0.411
pilB | Acinetobacter baylyi ADP1 | 37.827 | 100 | 0.404
pilB | Acinetobacter baumannii D1279779 | 38.462 | 100 | 0.398
pilF | Neisseria gonorrhoeae MS11 | 36.975 | 100 | 0.378
pilB | Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 | 37.313 | 100 | 0.376
pilF | Thermus thermophilus HB27 | 35.477 | 100 | 0.368


Multiple sequence alignment