Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   ACDK45_RS09375 Genome accession   NZ_CP167879
Coordinates   1877130..1878524 (-) Length   464 a.a.
NCBI ID   WP_044364868.1    Uniprot ID   -
Organism   Haemophilus influenzae strain NTHi52     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1872130..1883524
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACDK45_RS09340 (ACDK45_09340) - 1872369..1872575 (-) 207 WP_005648955.1 heavy-metal-associated domain-containing protein -
  ACDK45_RS09345 (ACDK45_09345) - 1872661..1872867 (-) 207 WP_012054944.1 heavy-metal-associated domain-containing protein -
  ACDK45_RS09350 (ACDK45_09350) cueR 1872944..1873330 (+) 387 WP_005663006.1 Cu(I)-responsive transcriptional regulator -
  ACDK45_RS09355 (ACDK45_09355) metJ 1873344..1873661 (-) 318 WP_005631186.1 met regulon transcriptional regulator MetJ -
  ACDK45_RS09360 (ACDK45_09360) rho 1873909..1875170 (+) 1262 Protein_1785 transcription termination factor Rho -
  ACDK45_RS09365 (ACDK45_09365) pilD 1875224..1875916 (-) 693 WP_038440789.1 A24 family peptidase Machinery gene
  ACDK45_RS09370 (ACDK45_09370) pilC 1875913..1877133 (-) 1221 WP_011271971.1 type II secretion system F family protein Machinery gene
  ACDK45_RS09375 (ACDK45_09375) pilB 1877130..1878524 (-) 1395 WP_044364868.1 GspE/PulE family protein Machinery gene
  ACDK45_RS09380 (ACDK45_09380) pilA 1878521..1878970 (-) 450 WP_044364865.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  ACDK45_RS09385 (ACDK45_09385) ampD 1879085..1879639 (+) 555 WP_044364862.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  ACDK45_RS09390 (ACDK45_09390) corC 1880272..1881171 (+) 900 WP_044364861.1 CNNM family magnesium/cobalt transport protein CorC -
  ACDK45_RS09395 (ACDK45_09395) lnt 1881194..1882723 (+) 1530 WP_080334596.1 apolipoprotein N-acyltransferase -
  ACDK45_RS09400 (ACDK45_09400) rsmE 1882773..1883510 (+) 738 WP_374930614.1 16S rRNA (uracil(1498)-N(3))-methyltransferase -

Sequence


Protein


Download         Length: 464 a.a.        Molecular weight: 52984.41 Da        Isoelectric Point: 5.6445

>NTDB_id=1040098 ACDK45_RS09375 WP_044364868.1 1877130..1878524(-) (pilB) [Haemophilus influenzae strain NTHi52]
MTSYALLHTQRVIAQNGEIFTISPDLWERNRQQQSLLLRYFALPLKEENNRLWLGVDSLSNLSACETIAFITGKPVEPIL
LESSQLKELLQQLTPCQMQVEEQVKFYQHQETHFEQEDDEPVIRLLNQIFESALQKNASDIHLETLADQFQVRFRIDGVL
QPQPLISKIFANRIISRLKLLAKLDISENRLPQDGRFQFKTTFSDILDFRLSTLPTHWGEKIVLRAQQNKPVELSFSELG
MTENQQQAFQRSLSQPQGLILVTGPTGSGKSISLYTALQWLNTPDKHIMTAEDPIEIELDGIIQSQINPQIGLDFSRLLR
AFLRQDPDIIMLGEIRDEESARIALRAAQTGHLVLSTLHTNDAISAISRLQQLGIQQHEIENSLLLVIAQRLVRKICPKC
GGNLINSCDCHQGYRGRIGVYQFLHWQQNGYQTDFENLRESGLEKVSQGITDEKEIERVLGKNS

Nucleotide


Download         Length: 1395 bp        

>NTDB_id=1040098 ACDK45_RS09375 WP_044364868.1 1877130..1878524(-) (pilB) [Haemophilus influenzae strain NTHi52]
ATGACGAGCTATGCTTTACTTCATACTCAGCGTGTAATTGCTCAAAATGGCGAGATCTTTACGATCTCGCCAGATTTATG
GGAACGCAATCGGCAGCAACAATCCTTGCTTTTGCGGTATTTTGCTTTGCCACTTAAAGAAGAAAATAATCGTCTTTGGC
TAGGGGTTGATTCTCTCTCCAATCTTTCAGCTTGTGAAACCATTGCGTTTATAACAGGAAAACCTGTCGAACCAATTTTG
TTAGAAAGCAGCCAACTCAAAGAACTGTTACAGCAACTTACTCCGTGCCAAATGCAAGTGGAAGAACAAGTTAAATTCTA
TCAACATCAAGAAACCCATTTTGAACAAGAAGATGATGAACCTGTTATCCGCTTACTTAATCAGATTTTTGAATCTGCCT
TACAAAAAAATGCCTCTGATATTCATTTAGAAACCTTGGCTGATCAGTTTCAAGTGCGGTTTAGAATTGATGGTGTTTTA
CAACCACAACCCTTAATAAGCAAAATATTCGCCAATCGTATTATTTCACGCTTAAAATTACTGGCTAAATTAGATATTAG
TGAAAATCGACTTCCACAAGATGGGCGATTTCAATTTAAAACGACTTTTTCCGATATTCTTGATTTTCGCCTTTCAACCT
TACCAACCCATTGGGGCGAAAAAATCGTGTTGCGAGCGCAACAAAATAAACCAGTAGAACTTAGCTTTTCTGAACTGGGT
ATGACCGAAAATCAGCAACAAGCATTTCAACGCTCACTTAGCCAGCCACAAGGATTAATTTTAGTAACCGGCCCCACAGG
AAGTGGGAAAAGTATCTCGCTTTACACCGCACTTCAGTGGCTAAATACGCCTGATAAACATATTATGACCGCTGAAGATC
CCATTGAAATTGAACTTGATGGTATTATTCAAAGCCAAATTAATCCGCAGATTGGATTAGATTTTAGCCGTCTATTGCGT
GCTTTTTTACGTCAAGATCCCGACATCATTATGCTAGGTGAAATTCGAGATGAAGAAAGTGCAAGGATTGCACTACGTGC
CGCTCAAACGGGACATTTGGTGCTTTCAACTTTACATACCAATGATGCAATATCTGCCATTTCTCGCTTACAACAACTCG
GTATTCAACAACATGAAATTGAAAACAGTTTACTACTCGTCATTGCACAGCGTCTTGTACGAAAAATCTGTCCAAAGTGC
GGTGGAAATTTAATAAATTCTTGTGATTGCCATCAAGGTTATCGAGGGCGAATCGGCGTGTATCAATTTCTACATTGGCA
ACAGAATGGCTATCAAACGGATTTTGAGAATTTACGAGAGAGTGGTTTGGAAAAAGTTAGCCAAGGCATAACAGATGAGA
AAGAAATTGAACGTGTGTTAGGTAAAAACTCATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Haemophilus influenzae 86-028NP

99.138

100

0.991

  pilB Haemophilus influenzae Rd KW20

96.328

99.784

0.961

  pilB Glaesserella parasuis strain SC1401

57.826

99.138

0.573

  pilB Vibrio cholerae strain A1552

39.469

100

0.448

  pilB Vibrio campbellii strain DS40M4

40.082

100

0.422

  pilB Vibrio parahaemolyticus RIMD 2210633

39.959

100

0.416

  pilB Legionella pneumophila strain ERS1305867

39.059

100

0.412

  pilB Acinetobacter baylyi ADP1

38.105

100

0.407

  pilB Acinetobacter baumannii D1279779

38.557

100

0.403

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

37.393

100

0.377

  pilF Neisseria gonorrhoeae MS11

36.555

100

0.375

  pilF Thermus thermophilus HB27

36.134

100

0.371


Multiple sequence alignment