Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilA/pilAI   Type   Machinery gene
Locus tag   NUG20_RS16220 Genome accession   NZ_CP102592
Coordinates   3884219..3884626 (-) Length   135 a.a.
NCBI ID   WP_263395459.1    Uniprot ID   -
Organism   Xanthomonas sp. CFBP 8443     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3875316..3883700 3884219..3884626 flank 519


Gene organization within MGE regions


Location: 3875316..3884626
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NUG20_RS16175 (NUG20_16180) - 3875316..3876518 (+) 1203 WP_263395452.1 glycosyltransferase -
  NUG20_RS16180 (NUG20_16185) - 3876515..3877765 (+) 1251 WP_263395453.1 hypothetical protein -
  NUG20_RS16185 (NUG20_16190) - 3877740..3878831 (-) 1092 WP_263395454.1 glycosyltransferase -
  NUG20_RS16190 (NUG20_16195) - 3878902..3879771 (-) 870 WP_263395455.1 FkbM family methyltransferase -
  NUG20_RS16195 (NUG20_16200) - 3879768..3880580 (-) 813 WP_263395456.1 class I SAM-dependent methyltransferase -
  NUG20_RS16200 (NUG20_16205) - 3880693..3881808 (+) 1116 WP_263395457.1 glycosyltransferase family 4 protein -
  NUG20_RS16205 (NUG20_16210) - 3881841..3883640 (-) 1800 WP_263395458.1 hypothetical protein -
  NUG20_RS16210 (NUG20_16215) - 3883741..3884151 (-) 411 WP_286038011.1 pilin -
  NUG20_RS16220 (NUG20_16225) pilA/pilAI 3884219..3884626 (-) 408 WP_263395459.1 pilin Machinery gene

Sequence


Protein


Download         Length: 135 a.a.        Molecular weight: 14103.16 Da        Isoelectric Point: 8.8652

>NTDB_id=717979 NUG20_RS16220 WP_263395459.1 3884219..3884626(-) (pilA/pilAI) [Xanthomonas sp. CFBP 8443]
MKKQQGFTLIELMIVVAIIAILAAIALPAYQNYVRKSQVTAALSDITPGKTQFEVLTNEGKSIASASDIGLTSATSRCSA
ITPTAGDTGNINCVMIGGDGVQGKSIRWTRNASGNWACSTDATAKYRPSTCTTAL

Nucleotide


Download         Length: 408 bp        

>NTDB_id=717979 NUG20_RS16220 WP_263395459.1 3884219..3884626(-) (pilA/pilAI) [Xanthomonas sp. CFBP 8443]
ATGAAGAAGCAGCAGGGCTTTACCCTGATCGAACTGATGATCGTCGTCGCGATCATCGCCATCCTGGCCGCCATCGCGCT
GCCGGCCTACCAGAACTACGTGCGCAAGTCGCAGGTCACCGCGGCCCTGTCCGACATCACCCCGGGCAAGACGCAGTTCG
AAGTGCTGACCAATGAAGGCAAGAGCATCGCTAGCGCTAGCGATATTGGTCTGACTTCTGCCACGAGCCGTTGCAGTGCC
ATCACCCCGACCGCGGGTGATACCGGCAATATCAACTGCGTCATGATCGGTGGCGATGGCGTCCAAGGGAAGTCGATTCG
GTGGACTCGTAATGCGTCCGGCAACTGGGCCTGCTCGACCGATGCTACCGCAAAGTACCGTCCGAGCACCTGCACCACGG
CGCTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilA/pilAI Pseudomonas stutzeri DSM 10701

52.593

100

0.526

  pilA Acinetobacter baumannii strain A118

46.377

100

0.474

  pilA/pilAII Pseudomonas stutzeri DSM 10701

45.865

98.519

0.452

  pilA Pseudomonas aeruginosa PAK

40.268

100

0.444

  pilA Vibrio cholerae O1 biovar El Tor strain E7946

40.559

100

0.43

  pilA Vibrio cholerae strain A1552

40.559

100

0.43

  pilA Vibrio cholerae C6706

40.559

100

0.43

  comP Acinetobacter baylyi ADP1

37.671

100

0.407

  pilA2 Legionella pneumophila strain ERS1305867

36.986

100

0.4

  pilA2 Legionella pneumophila str. Paris

36.986

100

0.4

  pilE Neisseria gonorrhoeae MS11

32.468

100

0.37

  pilA Vibrio parahaemolyticus RIMD 2210633

40.984

90.37

0.37

  pilA Ralstonia pseudosolanacearum GMI1000

42.609

85.185

0.363