Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilB   Type   Machinery gene
Locus tag   ACLWA7_RS17175 Genome accession   NZ_LR962646
Coordinates   3967060..3968796 (-) Length   578 a.a.
NCBI ID   WP_053051303.1    Uniprot ID   A0A8E4ENF3
Organism   Xanthomonas arboricola isolate Xanthomonas sp. CPBF 1483     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3968831..3988090 3967060..3968796 flank 35


Gene organization within MGE regions


Location: 3967060..3988090
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACLWA7_RS17175 (X12_003412) pilB 3967060..3968796 (-) 1737 WP_053051303.1 type IV-A pilus assembly ATPase PilB Machinery gene
  ACLWA7_RS17180 (X12_003413) - 3968831..3970357 (-) 1527 WP_053051304.1 hypothetical protein -
  ACLWA7_RS17185 (X12_003414) - 3970327..3970896 (-) 570 WP_231103946.1 hypothetical protein -
  ACLWA7_RS17190 (X12_003415) - 3970967..3972556 (-) 1590 WP_053051306.1 phosphoethanolamine transferase -
  ACLWA7_RS17195 (X12_003416) pilA 3972692..3973117 (-) 426 WP_053051307.1 pilin Machinery gene
  ACLWA7_RS17200 (X12_003417) pilC 3973471..3974730 (+) 1260 WP_053051318.1 type II secretion system F family protein Machinery gene
  ACLWA7_RS17205 (X12_003418) - 3974737..3975600 (+) 864 WP_016849610.1 prepilin peptidase -
  ACLWA7_RS17210 (X12_003419) coaE 3975614..3976237 (+) 624 WP_053051308.1 dephospho-CoA kinase -
  ACLWA7_RS17215 (X12_003420) - 3976247..3976485 (+) 239 Protein_3363 SymE family type I addiction module toxin -
  ACLWA7_RS17220 (X12_003421) - 3976589..3977923 (-) 1335 WP_016904089.1 sensor histidine kinase -
  ACLWA7_RS17225 (X12_003422) - 3977916..3978593 (-) 678 WP_006448355.1 response regulator transcription factor -
  ACLWA7_RS17230 (X12_003423) - 3978621..3979098 (-) 478 Protein_3366 hypothetical protein -
  ACLWA7_RS17235 (X12_003424) rimK 3979289..3980194 (-) 906 WP_166767257.1 30S ribosomal protein S6--L-glutamate ligase -
  ACLWA7_RS17240 (X12_003425) glgX 3980685..3982817 (+) 2133 WP_053051310.1 glycogen debranching protein GlgX -
  ACLWA7_RS17245 (X12_003426) - 3983489..3983881 (-) 393 WP_023904915.1 H-NS family nucleoid-associated regulatory protein -
  ACLWA7_RS17250 (X12_003427) - 3983970..3984356 (-) 387 WP_026064907.1 hypothetical protein -
  ACLWA7_RS17255 (X12_003428) - 3984931..3985443 (-) 513 WP_049805061.1 ImmA/IrrE family metallo-endopeptidase -
  ACLWA7_RS17260 (X12_003429) - 3985428..3985634 (-) 207 WP_374057769.1 ImmA/IrrE family metallo-endopeptidase -
  ACLWA7_RS17265 (X12_003430) - 3986178..3986738 (+) 561 WP_102596783.1 hypothetical protein -
  ACLWA7_RS17270 (X12_003431) - 3986810..3988090 (+) 1281 WP_080591431.1 retropepsin-like aspartic protease -

Sequence


Protein


Download         Length: 578 a.a.        Molecular weight: 62620.72 Da        Isoelectric Point: 5.6044

>NTDB_id=1134888 ACLWA7_RS17175 WP_053051303.1 3967060..3968796(-) (pilB) [Xanthomonas arboricola isolate Xanthomonas sp. CPBF 1483]
MNSVATANLVGITGIARRLVQDGALEEAAARTAMEQAAAAKVPLPQWFAERKLVSAAQLAAANAVEFGMPLMDVSVFDAN
QNAVKLVSEELLQKYQVLPLFKRGNRLFVGVSNPTQTRALDDIKFHTNLVVEPILVDEDQIRRTLEQWQASNASLGSSLG
DDDEGMGDLDVSAGDEDMGAGGDSGVDAKGDDTPVVKFVNKVLVDAIRRGASDIHFEPYEDDYRVRLRIDGLLKNVAKAP
VKLNQRIAARLKVMSQLDIAEKRVPQDGRIKLNLSKTKQIDFRVSTLPTLFGEKVVLRILDGSAAKLGIEKLGYEADQQK
LFLEAIHKPYGMVLVTGPTGSGKTVSLYTALGILNDETRNISTAEDPVEIRLPGVNQVQQNNKRGMTFAAALRSFLRQDP
DIIMVGEIRDLETAEIAIKAAQTGHMVLSTLHTNDAPQTIARLMNMGIAPYNITSSVTLVIAQRLARRLCNNCKRKSSLP
EHALLAEGFTPAQIAAGIELYEAVGCDECTEGYKGRTGIYQVMPMTDEIGAIVLEGGNAMQIAEAAQSIGIRDLRQSALV
KAAHGVTSLAEINRVTKD

Nucleotide


Download         Length: 1737 bp        

>NTDB_id=1134888 ACLWA7_RS17175 WP_053051303.1 3967060..3968796(-) (pilB) [Xanthomonas arboricola isolate Xanthomonas sp. CPBF 1483]
ATGAATTCAGTAGCTACCGCCAACCTCGTTGGTATTACCGGCATCGCTCGTCGCCTTGTGCAGGATGGTGCCCTGGAGGA
AGCCGCTGCGCGGACGGCAATGGAACAGGCGGCCGCTGCCAAGGTGCCGCTTCCTCAGTGGTTTGCCGAAAGAAAACTGG
TGTCGGCGGCACAACTGGCGGCAGCCAATGCCGTGGAATTCGGTATGCCGCTGATGGATGTGTCGGTGTTCGACGCCAAT
CAGAACGCGGTCAAGCTGGTCAGTGAAGAGTTGCTCCAGAAGTACCAGGTGCTGCCGCTGTTCAAGCGCGGCAACCGGTT
GTTCGTGGGGGTGAGCAACCCGACCCAGACCCGGGCGCTGGACGACATCAAGTTTCATACCAACCTGGTGGTCGAGCCGA
TCCTGGTCGACGAGGACCAGATCCGTCGCACCTTGGAGCAGTGGCAGGCCAGCAACGCGTCGTTGGGCTCGTCGCTAGGG
GACGATGACGAGGGGATGGGCGACCTGGACGTGTCGGCCGGTGACGAGGACATGGGCGCCGGCGGGGATTCCGGGGTCGA
TGCCAAGGGCGACGACACGCCGGTGGTGAAGTTCGTCAACAAGGTGCTGGTGGATGCGATCCGGCGGGGAGCCTCCGACA
TCCATTTCGAGCCCTATGAAGACGACTACCGGGTGCGGCTGCGGATCGATGGGCTGCTGAAGAATGTGGCCAAGGCACCG
GTAAAGCTGAACCAGCGCATCGCGGCGCGTCTGAAGGTGATGTCGCAGCTGGACATTGCCGAGAAGCGGGTGCCGCAGGA
CGGGCGCATCAAGCTCAACCTGTCCAAGACCAAGCAGATCGACTTCCGCGTGAGCACCTTGCCGACGTTGTTCGGCGAGA
AGGTGGTGCTGCGTATCCTGGACGGCAGCGCGGCCAAGCTGGGCATCGAGAAGCTGGGCTACGAGGCGGACCAGCAGAAG
CTGTTCCTGGAGGCGATCCATAAGCCGTACGGGATGGTGCTGGTGACCGGGCCGACCGGTTCGGGCAAGACGGTGTCGTT
GTACACGGCACTCGGCATCCTCAACGACGAGACGCGCAACATCTCCACCGCCGAGGACCCGGTGGAAATCCGTTTGCCCG
GCGTCAATCAGGTGCAGCAGAACAACAAGCGCGGCATGACCTTTGCCGCGGCGCTGCGCTCGTTCCTGCGCCAGGACCCG
GACATCATCATGGTCGGCGAAATCCGCGACCTGGAGACGGCCGAGATTGCGATCAAGGCCGCGCAGACCGGTCACATGGT
GCTGTCCACGCTGCACACCAACGATGCGCCGCAGACCATCGCGCGTTTGATGAACATGGGCATCGCGCCTTACAACATCA
CCTCGTCGGTGACCCTGGTGATCGCGCAGCGTCTGGCGCGGCGCCTGTGCAACAACTGCAAGCGCAAATCGAGCCTGCCC
GAGCATGCGTTGCTGGCCGAAGGCTTCACGCCTGCGCAGATCGCTGCCGGGATCGAGCTGTATGAGGCGGTCGGTTGCGA
CGAGTGTACCGAAGGCTACAAGGGCCGTACCGGTATCTACCAGGTGATGCCGATGACCGACGAGATCGGCGCGATCGTGC
TGGAAGGCGGTAACGCGATGCAGATCGCCGAGGCGGCACAGAGTATCGGCATCCGCGACCTACGCCAGTCGGCGCTGGTC
AAGGCCGCGCACGGGGTAACCAGCCTGGCCGAGATCAACCGGGTGACGAAGGACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilB Acinetobacter baumannii D1279779

55.634

98.27

0.547

  pilB Acinetobacter baylyi ADP1

55.634

98.27

0.547

  pilB Legionella pneumophila strain ERS1305867

52.548

98.443

0.517

  pilB Vibrio cholerae strain A1552

49.132

99.654

0.49

  pilF Neisseria gonorrhoeae MS11

48.415

98.27

0.476

  pilB Vibrio campbellii strain DS40M4

45.87

98.443

0.452

  pilB Vibrio parahaemolyticus RIMD 2210633

46.429

96.886

0.45

  pilF Thermus thermophilus HB27

40.192

89.965

0.362


Multiple sequence alignment