Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   ACHZ97_RS03765 Genome accession   NZ_CP172310
Coordinates   827170..828894 (-) Length   574 a.a.
NCBI ID   WP_396616715.1    Uniprot ID   -
Organism   Lysobacter soli strain P18     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 818862..842214 827170..828894 within 0


Gene organization within MGE regions


Location: 818862..842214
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACHZ97_RS03730 (ACHZ97_03730) - 818862..819317 (-) 456 WP_115841424.1 DUF721 domain-containing protein -
  ACHZ97_RS03735 (ACHZ97_03735) - 819316..820239 (+) 924 WP_115841425.1 M23 family metallopeptidase -
  ACHZ97_RS03740 (ACHZ97_03740) secA 820541..823270 (+) 2730 WP_396616714.1 preprotein translocase subunit SecA -
  ACHZ97_RS03745 (ACHZ97_03745) - 823412..824353 (+) 942 WP_396617500.1 Nudix family hydrolase -
  ACHZ97_RS03750 (ACHZ97_03750) coaE 824366..824983 (-) 618 WP_115841428.1 dephospho-CoA kinase -
  ACHZ97_RS03755 (ACHZ97_03755) - 825030..825893 (-) 864 WP_157029412.1 A24 family peptidase -
  ACHZ97_RS03760 (ACHZ97_03760) pilC 825910..827109 (-) 1200 WP_396617501.1 type II secretion system F family protein Machinery gene
  ACHZ97_RS03765 (ACHZ97_03765) pilB 827170..828894 (-) 1725 WP_396616715.1 type IV-A pilus assembly ATPase PilB Machinery gene
  ACHZ97_RS03770 (ACHZ97_03770) - 829027..830214 (-) 1188 WP_396616716.1 glycosyltransferase family 4 protein -
  ACHZ97_RS03775 (ACHZ97_03775) - 830207..830917 (-) 711 WP_396616717.1 class I SAM-dependent methyltransferase -
  ACHZ97_RS03780 (ACHZ97_03780) - 830951..831355 (+) 405 WP_396616718.1 GtrA family protein -
  ACHZ97_RS03785 (ACHZ97_03785) - 831340..831915 (-) 576 WP_396616719.1 class I SAM-dependent methyltransferase -
  ACHZ97_RS03790 (ACHZ97_03790) - 831912..833096 (-) 1185 WP_396616720.1 glycosyltransferase family 2 protein -
  ACHZ97_RS03795 (ACHZ97_03795) - 833093..835072 (-) 1980 WP_396616721.1 glycosyltransferase 87 family protein -
  ACHZ97_RS03800 (ACHZ97_03800) - 835166..835633 (-) 468 WP_396616722.1 pilin -
  ACHZ97_RS03805 (ACHZ97_03805) pilR 836117..837580 (-) 1464 WP_396616723.1 sigma-54-dependent transcriptional regulator Regulator
  ACHZ97_RS03810 (ACHZ97_03810) - 837664..839289 (-) 1626 WP_396616724.1 sensor histidine kinase -
  ACHZ97_RS03815 (ACHZ97_03815) sucC 839506..840672 (+) 1167 WP_115841435.1 ADP-forming succinate--CoA ligase subunit beta -
  ACHZ97_RS03820 (ACHZ97_03820) sucD 840685..841560 (+) 876 WP_115841436.1 succinate--CoA ligase subunit alpha -
  ACHZ97_RS03825 (ACHZ97_03825) - 841831..842214 (+) 384 WP_396616725.1 endonuclease domain-containing protein -

Sequence


Protein


Download         Length: 574 a.a.        Molecular weight: 62427.87 Da        Isoelectric Point: 6.2707

>NTDB_id=1066213 ACHZ97_RS03765 WP_396616715.1 827170..828894(-) (pilB) [Lysobacter soli strain P18]
MSTVATANLVGVTGIARRLVMDGALEEHVAREAMRAASAERKPIAVYLTEKRLVTSRAMAAANSVEFGVPLLDAAALDPA
QMAVRAVSEDLVRKHQVLPLFKRGSKLFLGIADPTNSHALDEIKFQTNLIVEPILVDQDTITRGIERALEQTDALSDAVG
DDGEGLETLDVGKDDDLSGDSGVDAKGDDTPVVKFVNKALVDAIRRGASDIHFEPYETEYRVRLRIDGILKTVAKVPVKL
QARIAARLKVMAQLDIAEKRVPQDGRIKLNLSKTKQIDFRVSTLPTLFGEKVVLRVLDGSAAKLGIDKLGYEDDQRSLFL
AAIQKPYGMVLVTGPTGSGKTVSLYTALNILNDDQRNISTVEDPVEIRVPGINQVQMNVKRGMTFAAALRSFLRQDPDVI
MVGEIRDLETAEIGIKAAQTGHMVLSTLHTNDAPQTIARLMNMGVAPYNITSSVTLVIAQRLARRLHDCKREMELPEHAL
LAEGFSHEDVAGGMKIYEPVGCPDCTEGYKGRTGIYQVMPMSEQIQAIILEGGNAMQIAEAAQKAGIRDLRQSALLKVKN
GVTSLAEINRVTKD

Nucleotide


Download         Length: 1725 bp        

>NTDB_id=1066213 ACHZ97_RS03765 WP_396616715.1 827170..828894(-) (pilB) [Lysobacter soli strain P18]
ATGTCCACGGTCGCCACGGCCAACCTGGTGGGGGTCACCGGTATTGCACGACGCTTGGTGATGGACGGCGCGCTCGAAGA
GCACGTCGCCCGCGAAGCCATGCGTGCCGCCAGTGCCGAACGTAAACCGATCGCCGTCTATCTCACCGAGAAGCGGCTGG
TCACGTCCAGGGCGATGGCCGCGGCGAACTCGGTGGAGTTCGGCGTGCCGCTGCTCGACGCCGCCGCGCTCGATCCGGCG
CAGATGGCCGTTCGCGCCGTCAGCGAGGACCTGGTGCGCAAGCACCAGGTGCTGCCGCTGTTCAAGCGTGGCAGCAAGCT
CTTCCTGGGAATCGCCGATCCCACCAATTCGCACGCGCTGGACGAGATCAAGTTCCAGACGAACCTGATCGTCGAGCCCA
TTCTGGTCGACCAGGACACGATCACGCGCGGCATCGAACGCGCGCTGGAACAGACCGACGCGCTCAGCGATGCCGTCGGC
GACGACGGTGAAGGGCTCGAAACGCTCGACGTCGGCAAGGACGACGACCTGTCCGGCGACAGCGGCGTCGACGCGAAGGG
CGACGACACGCCGGTCGTCAAGTTCGTCAACAAGGCGCTGGTGGATGCGATCCGCCGCGGCGCCTCCGACATCCACTTCG
AGCCGTACGAAACCGAGTACCGCGTGCGCTTGCGCATCGACGGCATCCTGAAGACCGTCGCCAAGGTGCCGGTGAAGCTG
CAGGCGCGCATCGCTGCGCGACTGAAGGTGATGGCGCAGTTGGACATCGCCGAGAAGCGAGTCCCGCAGGACGGTCGCAT
CAAGCTCAACCTGTCCAAGACCAAGCAGATCGACTTTCGTGTCAGCACGCTGCCGACCCTCTTCGGCGAAAAGGTGGTGC
TGCGCGTACTGGACGGCAGCGCGGCCAAGCTGGGCATCGACAAGCTCGGTTACGAGGACGACCAGCGCAGCCTGTTCCTG
GCGGCGATCCAGAAGCCTTACGGCATGGTGCTGGTCACCGGTCCGACCGGTTCGGGCAAGACGGTGTCGCTGTACACGGC
GTTGAATATCCTCAACGACGATCAACGCAACATCTCCACGGTCGAAGACCCTGTCGAAATCCGCGTGCCCGGCATCAACC
AGGTGCAGATGAACGTCAAGCGCGGCATGACCTTCGCCGCCGCGCTGCGCAGCTTCCTGCGTCAGGACCCCGACGTGATC
ATGGTCGGCGAAATCCGCGACCTGGAAACGGCCGAGATCGGCATCAAGGCTGCGCAGACCGGCCACATGGTGCTGTCGAC
CCTGCACACCAACGACGCGCCGCAGACCATCGCCCGACTGATGAACATGGGCGTGGCGCCGTACAACATCACCTCGTCGG
TGACACTGGTGATCGCCCAGCGCCTTGCGCGTCGCCTGCACGACTGCAAGCGCGAAATGGAACTGCCCGAGCACGCGCTG
CTGGCCGAAGGCTTCTCGCACGAAGACGTCGCCGGCGGCATGAAGATCTACGAGCCCGTTGGTTGCCCCGACTGCACCGA
GGGCTACAAGGGCCGCACCGGCATCTATCAGGTCATGCCGATGTCGGAACAAATCCAGGCCATCATCCTGGAAGGCGGCA
ACGCCATGCAGATCGCCGAAGCGGCGCAGAAGGCGGGCATCCGCGACCTGCGTCAGTCCGCGCTGCTGAAAGTAAAGAAC
GGCGTCACCAGCCTCGCCGAAATCAACCGCGTCACCAAGGACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Acinetobacter baylyi ADP1

55.477

98.606

0.547

  pilB Acinetobacter baumannii D1279779

54.159

98.432

0.533

  pilB Legionella pneumophila strain ERS1305867

52.297

98.606

0.516

  pilF Neisseria gonorrhoeae MS11

46.914

98.78

0.463

  pilB Vibrio cholerae strain A1552

50.984

88.502

0.451

  pilB Vibrio parahaemolyticus RIMD 2210633

48.679

92.334

0.449

  pilB Vibrio campbellii strain DS40M4

50

89.199

0.446

  pilB/pilB1 Synechocystis sp. PCC 6803

38.519

94.077

0.362


Multiple sequence alignment