Detailed information    

In silico (bioinformatically predicted)

Overview


Name   pilC
Type   Machinery gene
Locus tag   ACLQZY_RS15540
Genome accession   NZ_LR877306
Coordinates   3635895..3637157 (+)
Length   420 a.a.
NCBI ID   WP_047124749.1
UniProt ID   -
Organism   Xanthomonas arboricola isolate Xanthomonas sp. CPBF 796
Function   assembly of type IV pilus (predicted from homology); DNA binding and uptake
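The coordinates and the protein length reported for this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the database.

```python
from typing import Tuple


def cds_lengths(start: int, end: int) -> Tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # drop the stop codon
    return nt_len, aa_len


nt_len, aa_len = cds_lengths(3635895, 3637157)
print(nt_len, aa_len)  # 1263 420
```

The result matches the 1263 bp and 420 a.a. lengths given in the Sequence section of this record.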

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type         MGE coordinates    Gene coordinates    Relative position   Distance (bp)
Genomic island   3621645..3653515   3635895..3637157    within              0
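The "within" relation and the distance of 0 bp follow directly from comparing the two coordinate ranges. The sketch below shows one way such a comparison could be made. Only the label "within" appears in this record; the labels "upstream", "downstream", and "overlapping", the gap-based distance definition, and the function name `relate` are assumptions made for illustration and are not taken from VRprofile2.

```python
from typing import Tuple


def relate(gene: Tuple[int, int], mge: Tuple[int, int]) -> Tuple[str, int]:
    """Classify a gene interval relative to an MGE interval.

    Both intervals are (start, end) with 1-based inclusive coordinates.
    Labels other than "within" are hypothetical.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end  # gap before the MGE
    if g_start > m_end:
        return "downstream", g_start - m_end  # gap after the MGE
    return "overlapping", 0  # partial overlap


print(relate((3635895, 3637157), (3621645, 3653515)))  # ('within', 0)
```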


Gene organization within MGE regions


Location: 3621645..3653515
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACLQZY_RS15475 (X12_003082) - 3621645..3622037 (-) 393 WP_026064065.1 endonuclease domain-containing protein -
  ACLQZY_RS15480 (X12_003083) - 3622216..3622593 (+) 378 WP_411757940.1 hypothetical protein -
  ACLQZY_RS15485 (X12_003084) sucD 3622670..3623545 (-) 876 WP_011038208.1 succinate--CoA ligase subunit alpha -
  ACLQZY_RS15490 (X12_003085) sucC 3623570..3624739 (-) 1170 WP_053045679.1 ADP-forming succinate--CoA ligase subunit beta -
  ACLQZY_RS15495 (X12_003086) - 3624971..3626584 (+) 1614 WP_016901263.1 sensor histidine kinase -
  ACLQZY_RS15500 (X12_003087) pilR 3626777..3628171 (+) 1395 WP_053045681.1 sigma-54-dependent transcriptional regulator Regulator
  ACLQZY_RS15505 (X12_003088) pilB 3628300..3630033 (-) 1734 WP_047124753.1 type IV-A pilus assembly ATPase PilB Machinery gene
  ACLQZY_RS15510 (X12_003089) - 3630183..3630782 (-) 600 WP_047124752.1 class I SAM-dependent methyltransferase -
  ACLQZY_RS15515 (X12_003090) - 3630793..3631824 (-) 1032 WP_080959838.1 glycosyltransferase -
  ACLQZY_RS15520 (X12_003091) - 3631821..3632615 (-) 795 WP_052767998.1 glycosyltransferase family A protein -
  ACLQZY_RS15525 (X12_003092) - 3632620..3633000 (-) 381 WP_052767997.1 SMR family transporter -
  ACLQZY_RS15530 (X12_003093) - 3633346..3633762 (+) 417 WP_047124751.1 pilin -
  ACLQZY_RS15535 (X12_003094) - 3633818..3635785 (+) 1968 WP_146091317.1 hypothetical protein -
  ACLQZY_RS15540 (X12_003095) pilC 3635895..3637157 (+) 1263 WP_047124749.1 type II secretion system F family protein Machinery gene
  ACLQZY_RS15545 (X12_003096) - 3637164..3638027 (+) 864 WP_002812278.1 prepilin peptidase -
  ACLQZY_RS15550 (X12_003097) coaE 3638041..3638664 (+) 624 WP_047124748.1 dephospho-CoA kinase -
  ACLQZY_RS15555 (X12_003098) - 3639190..3639588 (+) 399 WP_016904088.1 SymE family type I addiction module toxin -
  ACLQZY_RS15560 (X12_003099) - 3639693..3641027 (-) 1335 WP_016904089.1 sensor histidine kinase -
  ACLQZY_RS15565 (X12_003100) - 3641020..3641697 (-) 678 WP_006448355.1 response regulator transcription factor -
  ACLQZY_RS15570 (X12_003101) - 3641729..3642199 (-) 471 WP_026064719.1 hypothetical protein -
  ACLQZY_RS15575 (X12_003102) rimK 3642426..3643331 (-) 906 WP_166767257.1 30S ribosomal protein S6--L-glutamate ligase -
  ACLQZY_RS15580 (X12_003103) glgX 3643821..3645953 (+) 2133 WP_016904092.1 glycogen debranching protein GlgX -
  ACLQZY_RS15585 (X12_003104) - 3646625..3647017 (-) 393 WP_023904915.1 H-NS family nucleoid-associated regulatory protein -
  ACLQZY_RS15590 (X12_003105) - 3647106..3647492 (-) 387 WP_026064907.1 hypothetical protein -
  ACLQZY_RS15595 (X12_003106) - 3648067..3648579 (-) 513 WP_049805061.1 ImmA/IrrE family metallo-endopeptidase -
  ACLQZY_RS15600 (X12_003107) - 3648564..3648770 (-) 207 WP_374057769.1 ImmA/IrrE family metallo-endopeptidase -
  ACLQZY_RS15605 (X12_003108) - 3649314..3649883 (+) 570 WP_087962438.1 hypothetical protein -
  ACLQZY_RS15610 (X12_003109) - 3649955..3651235 (+) 1281 WP_080591431.1 retropepsin-like aspartic protease -
  ACLQZY_RS15615 (X12_003110) - 3651725..3652087 (+) 363 WP_016904888.1 hypothetical protein -
  ACLQZY_RS15620 (X12_003111) - 3652184..3653515 (+) 1332 WP_016905121.1 IS4 family transposase -
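Each row of the table above follows the same field order, so it can be split with a regular expression. This is a minimal sketch, assuming whitespace-separated fields as they appear in this text. The Product and Description columns are kept together in one field (`rest`) because the flattened text does not mark the boundary between them. The names `ROW` and `parse_row` are hypothetical.

```python
import re

# Locus tag, old locus tag in parentheses, gene name, coordinates, strand,
# size, protein ID, then product + description as one unsplit field.
ROW = re.compile(
    r"^\s*(?P<locus>\S+)\s+\((?P<old_locus>[^)]+)\)\s+(?P<gene>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\s+\((?P<strand>[+-])\)\s+(?P<size>\d+)\s+"
    r"(?P<protein>\S+)\s+(?P<rest>.+)$"
)


def parse_row(line: str) -> dict:
    """Parse one gene row into a dict; numeric fields become ints."""
    match = ROW.match(line)
    if match is None:
        raise ValueError("unrecognized row: " + line)
    row = match.groupdict()
    for key in ("start", "end", "size"):
        row[key] = int(row[key])
    return row


row = parse_row(
    "  ACLQZY_RS15540 (X12_003095) pilC 3635895..3637157 (+) 1263 "
    "WP_047124749.1 type II secretion system F family protein Machinery gene"
)
# The listed size should equal the span of the inclusive coordinates.
assert row["end"] - row["start"] + 1 == row["size"]
print(row["gene"], row["strand"], row["size"])  # pilC + 1263
```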

Sequence


Protein


Length: 420 a.a.   Molecular weight: 45956.70 Da   Isoelectric point: 10.3627

>NTDB_id=1132557 ACLQZY_RS15540 WP_047124749.1 3635895..3637157(+) (pilC) [Xanthomonas arboricola isolate Xanthomonas sp. CPBF 796]
MSAVSRTVKKGTKPVSRANAMTMFVWEGTDKRGIKMKGDELARNANMLRAELRRRGIIPTVVKTKPKPLFGAAGKPIKAK
EIAFFSRQMATMMKSGVPIVGSLEIIGEGAKNPRMRKMVGEIRTDIEGGLSLNEAVSKHPVQFDELYRNLVRAGESAGVL
DTVLDTVATYKENIEALKGKIKKALFYPAMVMAVALIVSSILLIWVVPQFETVFSSFGAELPAFTQMIVNLSRFMVSWWF
PMLLVAIGSAIGLVMAYKRSPKMQHLFDRLILKVPVIGKIMHDSAIARFARTTAVTFKAGVPLVEALSIVAGATGNKVYE
EAVLRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDAMLFKVADYYEQDVNNAVDALSSLLEPMIMIFIGTI
VGGMVIGMYLPIFKLGAVVG

Nucleotide


Length: 1263 bp

>NTDB_id=1132557 ACLQZY_RS15540 WP_047124749.1 3635895..3637157(+) (pilC) [Xanthomonas arboricola isolate Xanthomonas sp. CPBF 796]
ATGTCCGCAGTTAGCAGAACGGTGAAAAAGGGGACCAAGCCGGTCAGCCGCGCCAACGCGATGACCATGTTCGTGTGGGA
GGGGACAGACAAACGCGGCATCAAGATGAAGGGAGATGAGCTCGCTCGCAATGCCAACATGTTGCGTGCAGAGCTACGGC
GGCGTGGGATCATACCAACTGTCGTCAAGACCAAGCCCAAGCCACTATTCGGTGCGGCAGGCAAACCCATCAAAGCCAAG
GAGATCGCGTTCTTCAGTCGACAGATGGCCACCATGATGAAGTCTGGCGTGCCCATCGTGGGCTCCCTCGAAATCATCGG
CGAGGGTGCCAAGAACCCTCGCATGAGAAAAATGGTTGGCGAAATCCGCACCGACATTGAGGGAGGCTTATCCCTCAACG
AGGCTGTCAGCAAGCATCCAGTACAGTTTGACGAGCTCTACCGCAATTTGGTGAGGGCGGGCGAGAGCGCAGGTGTGCTT
GATACTGTCTTAGACACGGTGGCGACCTATAAGGAGAACATTGAGGCGCTCAAAGGGAAGATCAAGAAGGCGCTGTTCTA
TCCTGCAATGGTGATGGCCGTAGCTTTGATAGTCAGTTCTATCTTGCTGATTTGGGTGGTCCCGCAGTTCGAAACCGTGT
TCTCTAGCTTCGGTGCGGAACTACCGGCGTTTACCCAAATGATCGTTAACCTGTCGCGATTCATGGTCTCCTGGTGGTTC
CCGATGCTGCTGGTTGCCATCGGGTCGGCCATTGGCTTGGTCATGGCTTACAAGCGTTCGCCGAAGATGCAGCATCTATT
CGACCGGTTGATCCTGAAAGTGCCAGTCATTGGCAAGATCATGCATGACAGTGCCATTGCAAGATTCGCACGCACTACTG
CGGTGACCTTCAAGGCCGGTGTTCCATTGGTGGAAGCGTTGAGCATCGTCGCCGGGGCCACAGGCAATAAGGTGTATGAA
GAGGCTGTGCTGCGCATGCGCGATGACGTGTCTGTCGGTTATCCCGTTAACATGGCAATGAAACAGGTCAATCTCTTTCC
ACACATGGTGATCCAGATGACCGCAATCGGCGAAGAGGCTGGCGCACTCGACGCCATGTTGTTCAAGGTGGCTGATTATT
ACGAGCAAGATGTGAACAATGCAGTGGATGCCTTGAGTAGCTTGCTTGAGCCGATGATCATGATCTTCATCGGCACCATC
GTAGGCGGCATGGTCATCGGCATGTATCTTCCGATCTTCAAACTCGGCGCAGTGGTTGGATAA
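The nucleotide and protein records above correspond codon by codon. The sketch below illustrates this with a small translator, assuming the standard codon-to-amino-acid assignments (which bacterial translation table 11 shares). Alternative start codons are not handled, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; trailing stop codons are dropped."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First five codons of the nucleotide record.
print(translate("ATGTCCGCAGTTAGC"))  # MSAVS
# Final line of the nucleotide record, which ends with the TAA stop codon.
print(translate("GTAGGCGGCATGGTCATCGGCATGTATCTTCCGATCTTCAAACTCGGCGCAGTGGTTGGATAA"))
# VGGMVIGMYLPIFKLGAVVG
```

Both outputs agree with the start and the final line of the protein record.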


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Figure: visualization of predicted transmembrane helix probability.


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                   Identities (%)   Coverage (%)   Ha-value
pilC      Pseudomonas stutzeri DSM 10701             53.266           94.762         0.505
pilC      Legionella pneumophila strain ERS1305867   52.273           94.286         0.493
pilC      Acinetobacter baylyi ADP1                  50.617           96.429         0.488
pilC      Acinetobacter baumannii D1279779           48.522           96.667         0.469
pilG      Neisseria gonorrhoeae MS11                 43.284           95.714         0.414
pilG      Neisseria meningitidis 44/76-A             43.284           95.714         0.414
pilC      Vibrio campbellii strain DS40M4            39.85            95             0.379
pilC      Vibrio cholerae strain A1552               39.547           94.524         0.374
pilC      Thermus thermophilus HB27                  38.653           95.476         0.369
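The record does not define the Ha-value, but every listed value is consistent with the product of identity and coverage expressed as fractions. The sketch below checks that relationship against the rows above. The formula is inferred from the numbers in this table, not stated by the source, and the names `HITS` and `ha_value` are hypothetical.

```python
# (protein, organism, identity %, coverage %, listed Ha-value)
HITS = [
    ("pilC", "Pseudomonas stutzeri DSM 10701", 53.266, 94.762, 0.505),
    ("pilC", "Legionella pneumophila strain ERS1305867", 52.273, 94.286, 0.493),
    ("pilC", "Acinetobacter baylyi ADP1", 50.617, 96.429, 0.488),
    ("pilC", "Acinetobacter baumannii D1279779", 48.522, 96.667, 0.469),
    ("pilG", "Neisseria gonorrhoeae MS11", 43.284, 95.714, 0.414),
    ("pilG", "Neisseria meningitidis 44/76-A", 43.284, 95.714, 0.414),
    ("pilC", "Vibrio campbellii strain DS40M4", 39.85, 95.0, 0.379),
    ("pilC", "Vibrio cholerae strain A1552", 39.547, 94.524, 0.374),
    ("pilC", "Thermus thermophilus HB27", 38.653, 95.476, 0.369),
]


def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Inferred relationship: identity fraction times coverage fraction."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


for protein, organism, identity, coverage, listed in HITS:
    computed = ha_value(identity, coverage)
    # Allow for rounding differences in the third decimal place.
    assert abs(computed - listed) < 0.001, (organism, computed, listed)

print(ha_value(53.266, 94.762))  # 0.505
```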


Multiple sequence alignment