Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilB
Type: Machinery gene
Locus tag: HD_RS04630
Genome accession: NC_002940
Coordinates: 892976..894370 (+)
Length: 464 a.a.
NCBI ID: WP_010945036.1
UniProt ID: Q7VM73
Organism: [Haemophilus] ducreyi 35000HP
Function: type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake
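
The coordinates, the nucleotide length, and the protein length listed in this record can be checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates (GenBank-style `start..end`) and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


pilb_bp = gene_length_bp(892976, 894370)
pilb_aa = protein_length_aa(pilb_bp)
print(pilb_bp, pilb_aa)  # 1395 464
```

Both values agree with the 1395 bp and 464 a.a. reported for pilB.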

Genomic Context


Location: 887976..899370
Locus tag            Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                    Description
HD_RS04600 (HD1115)  nth        888520..889155 (-)    636        WP_010945029.1  endonuclease III                           -
HD_RS04605 (HD1117)  -          889185..889934 (-)    750        WP_041603673.1  TSUP family transporter                    -
HD_RS04610 (HD1118)  -          890053..890538 (+)    486        WP_010945031.1  dihydrofolate reductase                    -
HD_RS04615 (HD1120)  -          890660..890875 (+)    216        WP_010945033.1  YgjV family protein                        -
HD_RS04620 (HD1121)  radA       890942..892321 (-)    1380       WP_010945034.1  DNA repair protein RadA                    -
HD_RS04625 (HD1123)  pilA       892515..892964 (+)    450        WP_010945035.1  prepilin peptidase-dependent pilin         Machinery gene
HD_RS04630 (HD1124)  pilB       892976..894370 (+)    1395       WP_010945036.1  GspE/PulE family protein                   Machinery gene
HD_RS04635 (HD1125)  pilC       894354..895547 (+)    1194       WP_010945037.1  type II secretion system F family protein  Machinery gene
HD_RS04640 (HD1126)  -          895547..896239 (+)    693        WP_041603480.1  prepilin peptidase                         -
HD_RS04645 (HD1127)  coaE       896257..896874 (+)    618        WP_010945039.1  dephospho-CoA kinase                       -
HD_RS04650 (HD1129)  yacG       896877..897065 (+)    189        WP_010945040.1  DNA gyrase inhibitor YacG                  -
HD_RS04655 (HD1130)  holA       897099..898124 (-)    1026       WP_010945041.1  DNA polymerase III subunit delta           -
HD_RS04660 (HD1131)  lptE       898130..898627 (-)    498        WP_010945042.1  LPS assembly lipoprotein LptE              -
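
The table shows how tightly the pil genes are packed. The sketch below computes the spacing between adjacent genes and reproduces the Location window. It assumes the window is the pilB coordinates extended by 5000 bp on each side, which matches the numbers shown but is not stated by the source; `FLANK`, `context_window`, and `spacing` are hypothetical names.

```python
# (gene, start, end, strand) copied from the Genomic Context table.
genes = [
    ("pilA", 892515, 892964, "+"),
    ("pilB", 892976, 894370, "+"),
    ("pilC", 894354, 895547, "+"),
]

FLANK = 5000  # assumption: 5 kb of flanking sequence on each side


def context_window(start, end, flank=FLANK):
    """Window around a gene, clipped at the first genome position."""
    return max(1, start - flank), end + flank


def spacing(upstream, downstream):
    """Positive value: intergenic gap in bp. Negative value: overlap in bp."""
    return downstream[1] - upstream[2] - 1


print(context_window(892976, 894370))  # (887976, 899370)
print(spacing(genes[0], genes[1]))     # 11  (gap between pilA and pilB)
print(spacing(genes[1], genes[2]))     # -17 (pilB and pilC overlap)
```

The short gap and the overlap on the same strand are consistent with pilA, pilB, and pilC lying in one cluster, although co-transcription is not asserted by this record.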

Sequence


Protein


Length: 464 a.a.        Molecular weight: 52913.08 Da        Isoelectric point: 6.6454

>NTDB_id=21073 HD_RS04630 WP_010945036.1 892976..894370(+) (pilB) [[Haemophilus] ducreyi 35000HP]
MQYAVFDLHRQQNFIISAERWQKNCQEKDLLLRYLALPVQENEDRLWLAIDDMKNLNACEIFAFMAHKTIEPVLVSAEEL
KYLLNQLAPEQQAVYEESELTYQVQTESETLNLNDPIIQLLDNLFKFCLRQNASDIHFEPQKAQLQIRLRIDGVLHIYKT
LAIHLAARIISRIKLLARLDISETRLPQDGQFHFKTIVSETLDFRVSSLPTQWGGKIVLRLQKNKPTHFDFVELGLLPAQ
KTTLLHLLHQPQGLILVTGPTGSGKSITLYSALSHLNQPDKHILTAEDPIEIEIEGIIQTQVNRAIQLDFSHLLRTFLRQ
DPDIIMLGEIRDPESAEIALRAAQTGHLVLSTLHTNDAPSAIERLLQLGIKEYELKNALLLVIAQRLVRKLCTQCFGKGC
QACYQGYRGRIGIYQLLSRTGKIFEKETAYLDFASLHHSAKHKLEADITSLAEINRVLGDAENI
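
A molecular weight such as the one reported above is usually obtained by summing average residue masses and adding one water molecule for the free termini. The sketch below illustrates that calculation. The mass table holds commonly cited average values, and the method the database actually uses is not stated, so small differences from the reported 52913.08 Da are possible.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight(seq: str) -> float:
    """Average molecular weight of an unmodified peptide, in Da."""
    seq = "".join(seq.split()).upper()  # tolerate FASTA line breaks
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


print(round(molecular_weight("GG"), 2))  # 132.12 (glycylglycine)
```

Passing the sequence lines of the FASTA record above, without the header line, gives the estimate for pilB.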

Nucleotide


Length: 1395 bp

>NTDB_id=21073 HD_RS04630 WP_010945036.1 892976..894370(+) (pilB) [[Haemophilus] ducreyi 35000HP]
ATGCAGTATGCGGTATTTGATCTTCATCGCCAACAGAATTTTATTATTTCAGCGGAGAGATGGCAAAAGAACTGTCAAGA
AAAAGATCTTTTATTACGTTATCTTGCATTACCGGTTCAAGAAAATGAGGACAGGTTATGGCTAGCCATTGATGATATGA
AAAATTTAAATGCGTGTGAAATTTTTGCATTTATGGCTCATAAAACGATAGAACCGGTGTTGGTTTCTGCGGAAGAATTG
AAATATTTATTGAATCAACTGGCGCCAGAGCAACAAGCGGTATATGAAGAAAGTGAGCTAACTTATCAAGTACAAACAGA
AAGTGAAACATTGAATTTAAATGATCCTATTATTCAATTACTTGATAATTTATTTAAATTTTGTTTAAGGCAAAATGCTT
CCGATATTCATTTTGAGCCACAAAAAGCACAGTTACAGATTCGGTTGCGGATTGATGGTGTGTTACATATCTACAAAACA
TTGGCAATTCATTTGGCTGCTCGCATTATTTCACGTATCAAATTGTTGGCTAGACTTGATATTAGCGAAACACGGCTACC
GCAAGATGGGCAATTCCATTTTAAAACAATCGTATCTGAAACCCTTGATTTCCGTGTATCGAGTTTGCCTACTCAATGGG
GCGGAAAAATTGTATTACGCTTACAAAAAAATAAACCGACCCATTTTGATTTTGTTGAATTGGGGTTATTGCCTGCGCAA
AAAACTACGTTATTACACTTATTGCATCAACCACAAGGTCTAATTTTAGTTACTGGCCCCACCGGCAGTGGCAAAAGTAT
CACATTGTATAGTGCATTAAGTCATCTGAATCAACCGGATAAGCATATTTTAACGGCAGAAGATCCAATAGAAATTGAGA
TCGAAGGAATCATTCAAACACAAGTTAATCGAGCGATTCAGCTAGATTTTAGCCATCTATTACGAACATTTTTGCGCCAA
GATCCTGATATTATTATGTTAGGTGAGATTCGTGATCCGGAGAGTGCCGAAATTGCATTACGCGCTGCACAAACAGGGCA
TCTTGTGCTTTCTACATTACATACTAATGATGCGCCCTCTGCAATTGAACGGCTATTACAATTAGGTATTAAAGAATATG
AGCTTAAAAACGCCTTATTATTAGTCATTGCACAACGGCTTGTCCGCAAGCTATGCACGCAATGTTTTGGTAAAGGCTGT
CAGGCTTGCTATCAGGGCTATAGAGGACGCATTGGCATTTATCAATTATTAAGCCGTACGGGAAAAATCTTTGAAAAAGA
GACCGCTTACCTTGATTTTGCTAGTTTACATCATAGTGCGAAGCATAAGTTAGAGGCTGATATTACGTCCTTAGCAGAAA
TTAACAGAGTATTAGGCGATGCTGAAAATATATGA
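
The nucleotide and protein records can be cross-checked by translating the coding sequence. This is a minimal sketch using the standard codon assignments, which the bacterial genetic code (NCBI translation table 11) shares for the codons that matter here. `translate` is an illustrative helper, not a database function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' = stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate FASTA line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last seven codons of the pilB sequence above.
print(translate("ATGCAGTATGCGGTATTTGATCTTCATCGC"))  # MQYAVFDLHR
print(translate("GGCGATGCTGAAAATATATGA"))           # GDAENI
```

Both fragments match the start (MQYAVFDLHR) and end (GDAENI) of the protein sequence listed above.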


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source        ID
AlphaFold DB  Q7VM73

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                  Identities (%)  Coverage (%)  Ha-value
pilB     Glaesserella parasuis strain SC1401       74.017          98.707        0.731
pilB     Haemophilus influenzae Rd KW20            58.482          96.552        0.565
pilB     Haemophilus influenzae 86-028NP           57.906          96.767        0.56
pilB     Vibrio campbellii strain DS40M4           40.408          100           0.427
pilB     Vibrio cholerae strain A1552              39.919          100           0.422
pilB     Vibrio parahaemolyticus RIMD 2210633      38.386          100           0.42
pilB     Legionella pneumophila strain ERS1305867  38.618          100           0.409
pilB     Acinetobacter baylyi ADP1                 38.493          100           0.407
pilB     Acinetobacter baumannii D1279779          37.654          100           0.394
pilF     Neisseria gonorrhoeae MS11                36.076          100           0.369


Multiple sequence alignment