Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilA/pilAI   Type   Machinery gene
Locus tag   CR152_RS11150 Genome accession   NZ_CP024608
Coordinates   2460263..2460688 (+) Length   141 a.a.
NCBI ID   WP_307718570.1    Uniprot ID   -
Organism   Massilia violaceinigra strain B2     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 2439042..2459763 2460263..2460688 flank 500


Gene organization within MGE regions


Location: 2439042..2460688
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CR152_RS11040 (CR152_10985) - 2439267..2439833 (+) 567 WP_099882194.1 DUF2889 domain-containing protein -
  CR152_RS11045 (CR152_10990) sucC 2439909..2441081 (+) 1173 WP_099874963.1 ADP-forming succinate--CoA ligase subunit beta -
  CR152_RS11050 (CR152_10995) sucD 2441107..2441988 (+) 882 WP_099874964.1 succinate--CoA ligase subunit alpha -
  CR152_RS11055 (CR152_11000) - 2442077..2442670 (+) 594 WP_099874965.1 FHA domain-containing protein -
  CR152_RS34110 (CR152_11005) - 2442884..2443327 (+) 444 WP_099874966.1 pilin -
  CR152_RS11065 (CR152_11010) - 2443672..2445081 (+) 1410 WP_099874967.1 IS66 family transposase -
  CR152_RS11070 (CR152_11015) - 2445171..2445377 (+) 207 WP_208640110.1 hypothetical protein -
  CR152_RS11075 (CR152_11020) - 2445423..2446511 (+) 1089 WP_099882196.1 IS630 family transposase -
  CR152_RS34995 (CR152_11025) - 2446549..2446695 (+) 147 WP_370663813.1 hypothetical protein -
  CR152_RS11085 (CR152_11030) pilA/pilAII 2447032..2447457 (+) 426 WP_307718567.1 pilin Regulator
  CR152_RS11090 (CR152_11035) - 2447655..2448287 (+) 633 Protein_2179 IS5 family transposase -
  CR152_RS11095 (CR152_11040) - 2448434..2448649 (+) 216 WP_370663873.1 hypothetical protein -
  CR152_RS11105 (CR152_11050) - 2450220..2451737 (+) 1518 Protein_2182 IS1634 family transposase -
  CR152_RS11110 (CR152_11055) - 2451801..2452172 (+) 372 Protein_2183 transposase -
  CR152_RS11115 (CR152_11060) - 2452340..2453860 (-) 1521 WP_099874335.1 IS66 family transposase -
  CR152_RS11120 (CR152_11065) tnpB 2453916..2454287 (-) 372 WP_208640063.1 IS66 family insertion sequence element accessory protein TnpB -
  CR152_RS11125 (CR152_11070) - 2454257..2454550 (-) 294 WP_099874336.1 hypothetical protein -
  CR152_RS11130 (CR152_11075) pilA/pilAII 2455285..2455710 (+) 426 WP_307718568.1 pilin Regulator
  CR152_RS11135 (CR152_11080) pilA/pilAII 2456580..2457005 (+) 426 WP_307718569.1 pilin Regulator
  CR152_RS11140 (CR152_11085) istB 2457587..2458372 (-) 786 WP_099874971.1 IS21-like element helper ATPase IstB -
  CR152_RS11145 (CR152_11090) istA 2458369..2459861 (-) 1493 Protein_2190 IS21 family transposase -
  CR152_RS11150 (CR152_11095) pilA/pilAI 2460263..2460688 (+) 426 WP_307718570.1 pilin Machinery gene

Sequence


Protein


Download         Length: 141 a.a.        Molecular weight: 14418.11 Da        Isoelectric Point: 9.5447

>NTDB_id=254203 CR152_RS11150 WP_307718570.1 2460263..2460688(+) (pilA/pilAI) [Massilia violaceinigra strain B2]
MKSMKMMKKAQAGFTLIELMIVVAIIGILAAVALPAYQDYVAKSQVAASLAEITPGKVQVETKLAEGTAVAAPTDIGVAA
VTKRCSAISAAVAVTGAGSIECTMLGNSAVLGKTIKWARVATGAWACTSSADPKYVPKECK

Nucleotide


Download         Length: 426 bp        

>NTDB_id=254203 CR152_RS11150 WP_307718570.1 2460263..2460688(+) (pilA/pilAI) [Massilia violaceinigra strain B2]
ATGAAATCGATGAAAATGATGAAAAAAGCACAAGCTGGTTTTACCCTGATCGAACTGATGATTGTTGTTGCAATTATCGG
TATCTTGGCTGCTGTCGCACTGCCGGCGTACCAGGACTATGTTGCCAAGTCGCAAGTCGCTGCTTCGCTGGCTGAAATCA
CCCCAGGTAAAGTCCAAGTAGAAACCAAACTGGCTGAAGGCACTGCTGTTGCTGCTCCTACCGACATCGGCGTCGCCGCT
GTCACGAAACGCTGCTCCGCAATCTCCGCAGCCGTTGCTGTGACCGGCGCTGGTAGCATCGAGTGCACGATGCTCGGCAA
TTCCGCTGTGTTGGGCAAAACCATCAAGTGGGCGCGTGTTGCTACTGGCGCTTGGGCTTGCACTTCGAGCGCCGACCCGA
AATACGTTCCAAAAGAATGCAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilA/pilAI Pseudomonas stutzeri DSM 10701

51.449

97.872

0.504

  pilA/pilAII Pseudomonas stutzeri DSM 10701

46.853

100

0.475

  pilA Pseudomonas aeruginosa PAK

43.537

100

0.454

  pilA Acinetobacter baumannii strain A118

45.926

95.745

0.44

  pilA Ralstonia pseudosolanacearum GMI1000

35.503

100

0.426

  pilA Vibrio cholerae O1 biovar El Tor strain E7946

40

100

0.426

  pilA Vibrio cholerae strain A1552

40

100

0.426

  pilA Vibrio cholerae C6706

40

100

0.426

  pilE Neisseria gonorrhoeae strain FA1090

35.849

100

0.404

  comP Acinetobacter baylyi ADP1

39.31

100

0.404

  pilA Vibrio parahaemolyticus RIMD 2210633

41.176

96.454

0.397

  pilE Neisseria gonorrhoeae MS11

41.86

91.489

0.383

  pilA2 Legionella pneumophila str. Paris

37.681

97.872

0.369

  pilA2 Legionella pneumophila strain ERS1305867

37.226

97.163

0.362


Multiple sequence alignment