Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   CH624_RS04020 Genome accession   NZ_CP031682
Coordinates   765766..767157 (+) Length   463 a.a.
NCBI ID   WP_112109934.1    Uniprot ID   -
Organism   Haemophilus influenzae strain P665-7858     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 760766..772157
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CH624_RS03995 (CH624_03995) rsmE 760780..761517 (-) 738 WP_042611225.1 16S rRNA (uracil(1498)-N(3))-methyltransferase -
  CH624_RS04000 (CH624_04000) lnt 761567..763096 (-) 1530 WP_112074650.1 apolipoprotein N-acyltransferase -
  CH624_RS04005 (CH624_04005) corC 763119..764018 (-) 900 WP_005662987.1 CNNM family magnesium/cobalt transport protein CorC -
  CH624_RS04010 (CH624_04010) ampD 764654..765214 (-) 561 WP_005648974.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  CH624_RS04015 (CH624_04015) pilA 765329..765769 (+) 441 WP_112109935.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  CH624_RS04020 (CH624_04020) pilB 765766..767157 (+) 1392 WP_112109934.1 GspE/PulE family protein Machinery gene
  CH624_RS04025 (CH624_04025) pilC 767154..768374 (+) 1221 WP_112109933.1 type II secretion system F family protein Machinery gene
  CH624_RS04030 (CH624_04030) pilD 768371..769063 (+) 693 WP_041175383.1 prepilin peptidase Machinery gene
  CH624_RS04035 (CH624_04035) rho 769117..770379 (-) 1263 WP_005648966.1 transcription termination factor Rho -
  CH624_RS04040 (CH624_04040) metJ 770627..770944 (+) 318 WP_005631186.1 met regulon transcriptional regulator MetJ -
  CH624_RS04045 (CH624_04045) cueR 770958..771344 (-) 387 WP_005648963.1 Cu(I)-responsive transcriptional regulator -
  CH624_RS04050 (CH624_04050) - 771421..771627 (+) 207 WP_005666693.1 heavy-metal-associated domain-containing protein -
  CH624_RS04055 (CH624_04055) - 771701..771907 (+) 207 WP_005631184.1 heavy-metal-associated domain-containing protein -

Sequence


Protein


Download         Length: 463 a.a.        Molecular weight: 52685.18 Da        Isoelectric Point: 5.9800

>NTDB_id=309924 CH624_RS04020 WP_112109934.1 765766..767157(+) (pilB) [Haemophilus influenzae strain P665-7858]
MIDLTSAKPRVKAQNGEIFTISPDLWVRNQQQHSLLLRYFALPLKEENNRLWLGVDSLSNLSACETIAFITGKPVEPILL
ESSQLKELLQQLTPRQMQVEEQVKFYQHQETHFEQEDDEPVIRLLNQIFESALQKNASDIHLETLADQFQVRFRIDGVLQ
PQPLISKIFANRIISRLKLLAKLDISENRLPQDGRFQFKTTFSDILDFRLSTLPTHWGEKVVLRAQQNKPVELSFAELGM
TENQQQAFQHALSQPQGLILVTGPTGSGKSISLYTALQWLNTPDKHIMTAEDPIEIELDGIIQSQINSQIGLDFSRLLRT
FLRQDPDIIMLGEIRDEESAMIALRAAQTGHLILSTLHTNDAISAISRLQQLGIQQYEIENSLLLVIAQRLVRKICPKCG
GNLINSCDCNQGYQGRIGVYQFLHWQQNGYQTDFKNLHASGLEKVNQGMTDNKELERVLGKNS

Nucleotide


Download         Length: 1392 bp        

>NTDB_id=309924 CH624_RS04020 WP_112109934.1 765766..767157(+) (pilB) [Haemophilus influenzae strain P665-7858]
ATGATCGACTTAACTAGTGCAAAACCTCGTGTAAAAGCACAAAATGGCGAGATATTTACGATCTCGCCTGATTTATGGGT
GCGTAATCAACAACAGCACTCCTTGCTCTTGCGGTATTTTGCTTTGCCACTTAAAGAAGAAAATAATCGTCTTTGGCTAG
GGGTTGATTCTCTCTCCAATCTTTCAGCTTGTGAAACCATTGCGTTTATAACAGGAAAACCTGTCGAACCAATTTTGTTA
GAAAGCAGCCAACTCAAAGAACTGTTACAACAACTTACTCCGCGCCAAATGCAAGTGGAAGAGCAAGTTAAATTCTATCA
ACATCAAGAAACCCATTTTGAACAAGAAGATGATGAACCTGTTATCCGCTTACTTAATCAGATTTTTGAATCTGCCTTAC
AAAAAAATGCCTCTGATATTCATTTAGAAACCTTGGCTGATCAGTTTCAAGTGCGGTTTAGAATTGATGGTGTTTTACAA
CCACAACCCTTAATAAGCAAAATATTCGCCAATCGTATTATTTCACGCTTAAAATTACTGGCTAAATTAGATATTAGTGA
AAATCGACTTCCACAAGATGGGCGATTTCAATTTAAAACCACTTTTTCCGATATTCTTGATTTCCGTCTTTCAACGTTAC
CAACTCATTGGGGAGAGAAAGTTGTCCTAAGAGCACAGCAAAATAAACCTGTAGAACTTAGTTTTGCTGAACTGGGTATG
ACCGAAAATCAGCAACAAGCATTTCAACACGCACTTAGCCAGCCACAAGGATTAATTTTAGTAACCGGCCCCACAGGAAG
TGGGAAAAGTATCTCACTTTACACTGCACTTCAGTGGCTAAATACGCCTGATAAACATATTATGACAGCGGAAGATCCCA
TTGAAATTGAGCTTGATGGCATTATTCAAAGCCAAATTAATTCACAGATTGGATTAGATTTTAGCCGCCTATTGCGTACT
TTTTTACGCCAAGATCCCGATATCATTATGTTAGGCGAAATTCGCGATGAAGAAAGTGCAATGATTGCACTACGTGCCGC
CCAAACCGGGCATTTAATACTCTCAACTTTACATACCAATGATGCGATATCTGCCATTTCTCGATTACAACAACTTGGTA
TTCAACAGTATGAAATTGAAAACAGCTTACTGCTCGTTATTGCGCAACGTCTTGTACGAAAAATCTGTCCAAAGTGCGGT
GGAAATTTAATAAATTCTTGTGATTGCAATCAAGGTTATCAAGGTCGAATCGGCGTGTATCAATTTCTACATTGGCAGCA
GAATGGCTATCAAACGGATTTTAAAAATTTACATGCGAGTGGTTTAGAAAAAGTTAATCAAGGAATGACGGATAATAAAG
AACTTGAACGTGTGCTAGGTAAAAACTCATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Haemophilus influenzae 86-028NP

95.374

98.056

0.935

  pilB Haemophilus influenzae Rd KW20

94.481

97.84

0.924

  pilB Glaesserella parasuis strain SC1401

57.018

98.488

0.562

  pilB Vibrio cholerae strain A1552

40.369

100

0.425

  pilB Vibrio campbellii strain DS40M4

40.041

100

0.421

  pilB Vibrio parahaemolyticus RIMD 2210633

39.752

100

0.415

  pilB Legionella pneumophila strain ERS1305867

38.945

100

0.415

  pilB Acinetobacter baylyi ADP1

37.298

100

0.4

  pilB Acinetobacter baumannii D1279779

38.205

100

0.395

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

36.232

100

0.378

  pilF Neisseria gonorrhoeae MS11

36.345

100

0.374

  pilF Thermus thermophilus HB27

35.197

100

0.367


Multiple sequence alignment