Detailed information    

in silico   Bioinformatically predicted

Overview


Name   pilU   Type   Machinery gene
Locus tag   G4G31_RS08575 Genome accession   NZ_CP050451
Coordinates   1853958..1855145 (+) Length   395 a.a.
NCBI ID   WP_182991062.1    Uniprot ID   A0A7G5ZB92
Organism   Massilia sp. Se16.2.3     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake
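The coordinates and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def parse_location(loc: str) -> tuple[int, int, str]:
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = loc.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the CDS includes exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


start, end, strand = parse_location("1853958..1855145 (+)")
length_bp = gene_length_bp(start, end)
print(length_bp, protein_length_aa(length_bp))  # 1188 395
```

Under these assumptions the 1188 bp gene corresponds to 396 codons, that is, 395 residues plus the stop codon, which agrees with the lengths reported in this record.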

Genomic Context


Location: 1848958..1860145
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G4G31_RS08545 (G4G31_08550) proC 1849462..1850277 (+) 816 WP_182991056.1 pyrroline-5-carboxylate reductase -
  G4G31_RS08550 (G4G31_08555) - 1850598..1850894 (-) 297 WP_182991057.1 YqjK family protein -
  G4G31_RS08555 (G4G31_08560) - 1850887..1851264 (-) 378 WP_182991058.1 phage holin family protein -
  G4G31_RS08560 (G4G31_08565) - 1851278..1851595 (-) 318 WP_182991059.1 YqjD family protein -
  G4G31_RS08565 (G4G31_08570) ubiA 1851666..1852520 (-) 855 WP_182991060.1 4-hydroxybenzoate octaprenyltransferase -
  G4G31_RS08570 (G4G31_08575) - 1852843..1853784 (-) 942 WP_182991061.1 hydrogen peroxide-inducible genes activator -
  G4G31_RS08575 (G4G31_08580) pilU 1853958..1855145 (+) 1188 WP_182991062.1 PilT/PilU family type 4a pilus ATPase Machinery gene
  G4G31_RS08580 (G4G31_08585) - 1855166..1855639 (+) 474 WP_182991063.1 adenylyltransferase/cytidyltransferase family protein -
  G4G31_RS08585 (G4G31_08590) - 1855652..1856347 (+) 696 WP_182991064.1 YjjG family noncanonical pyrimidine nucleotidase -
  G4G31_RS08590 (G4G31_08595) - 1856370..1857131 (+) 762 WP_202033731.1 alpha/beta hydrolase -
  G4G31_RS08595 (G4G31_08600) - 1857261..1857695 (+) 435 WP_182991066.1 DUF2892 domain-containing protein -
  G4G31_RS08600 (G4G31_08605) - 1857692..1858231 (+) 540 WP_182991067.1 RNA polymerase sigma factor -
  G4G31_RS08605 (G4G31_08610) trmL 1858748..1859218 (-) 471 WP_182991068.1 tRNA (uridine(34)/cytosine(34)/5-carboxymethylaminomethyluridine(34)-2'-O)-methyltransferase TrmL -
  G4G31_RS08610 (G4G31_08615) - 1859291..1859998 (-) 708 WP_229425462.1 ComF family protein -
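The displayed location appears to be the pilU coordinates extended by 5,000 bp on each side. This is inferred from the numbers in this record, not stated by the source, so the flank size below is an assumption and the function name is hypothetical.

```python
def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a feature by `flank` bp on both sides, clamped at position 1.

    The 5,000 bp default is an assumption inferred from this record.
    """
    return max(1, start - flank), end + flank


# pilU coordinates taken from the Overview section
print(flanking_window(1853958, 1855145))  # (1848958, 1860145)
```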

Sequence


Protein


Length: 395 a.a.        Molecular weight: 43818.08 Da        Isoelectric Point: 6.3705

>NTDB_id=432833 G4G31_RS08575 WP_182991062.1 1853958..1855145(+) (pilU) [Massilia sp. Se16.2.3]
MSTPFGPADAQAYIHKLLTVMHQQGGSDLFISADFPPSMKHQGAMKPMSQQRLTGEVTRALALSLMNERQRAEFEAEMEC
NFAISLPNVCRFRVNVFVQQQSVGMVVRTIASEIPNFEKLELPEVLKDVVMTKRGLVLVVGGTGSGKSTTLAAMIDYRNS
NSAGHIITVEDPVEYVHKNKNCLVTHREVGVDTHSWHNALKNTLRQAPDVILIGEIRDTETMEHAIAFAETGHLCLGTLH
ANNANQTMDRIINFFPEERRNQLLMDLSANLRAIVSQRLVRTADGLGRKAAIEILLNTPTISEMILKGNFHSIKEIMQKS
RELGMCTFDQALYELYNKGDITYDEAIRNADSANGLRLQIKLSGDRREADTGTSAKGPGLSMMLEEEPQDPANPT

Nucleotide


Length: 1188 bp

>NTDB_id=432833 G4G31_RS08575 WP_182991062.1 1853958..1855145(+) (pilU) [Massilia sp. Se16.2.3]
ATGTCTACCCCCTTCGGTCCGGCCGATGCCCAGGCCTATATCCATAAACTGCTCACCGTCATGCACCAGCAGGGCGGCTC
CGACCTCTTCATCTCTGCCGACTTTCCGCCCAGCATGAAGCACCAGGGCGCGATGAAACCGATGAGCCAGCAGCGCCTGA
CGGGCGAAGTCACGCGTGCGCTTGCGCTGTCGCTGATGAACGAACGCCAGCGCGCCGAGTTCGAAGCCGAAATGGAGTGC
AATTTCGCGATCTCGCTGCCGAACGTCTGCCGTTTTCGCGTGAATGTATTCGTGCAGCAGCAGAGCGTCGGCATGGTGGT
GCGCACGATCGCCTCGGAAATTCCGAACTTCGAGAAGCTCGAGCTGCCCGAGGTATTGAAGGACGTCGTCATGACCAAGC
GCGGCCTGGTGCTGGTCGTCGGCGGCACGGGGTCCGGCAAGTCGACCACGCTGGCGGCGATGATCGACTACCGCAACAGC
AACTCGGCCGGCCACATCATCACGGTGGAAGACCCTGTCGAGTACGTGCACAAGAACAAGAACTGCCTGGTGACGCACCG
CGAAGTCGGCGTCGACACCCATTCCTGGCACAACGCGCTGAAGAACACGCTGCGCCAGGCGCCGGACGTGATCCTGATCG
GCGAGATCCGCGACACCGAGACGATGGAGCACGCGATTGCCTTTGCCGAAACCGGCCACCTGTGCCTCGGCACGCTGCAC
GCGAACAATGCCAACCAGACAATGGACCGCATCATCAACTTCTTCCCGGAAGAGCGGCGCAACCAGTTGCTGATGGACCT
GTCGGCGAACCTGCGCGCGATCGTCTCGCAGCGCCTGGTGCGCACCGCGGACGGCCTTGGCCGCAAGGCTGCCATCGAAA
TCCTGCTCAACACCCCGACCATCAGCGAGATGATCCTGAAGGGGAATTTCCATAGCATCAAGGAAATCATGCAGAAGTCG
CGCGAACTGGGCATGTGCACCTTCGACCAGGCGCTGTACGAGCTCTACAACAAGGGCGACATCACGTATGACGAGGCGAT
CCGCAACGCCGACTCGGCCAACGGCCTGCGCCTGCAGATCAAGCTCTCCGGCGACCGCCGCGAGGCCGATACGGGCACGA
GTGCAAAAGGCCCGGGCCTGTCGATGATGCTCGAAGAAGAACCACAAGACCCTGCCAATCCGACCTGA
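The nucleotide and protein records describe the same gene, so translating the coding sequence should reproduce the protein. The following is a minimal sketch using only the standard library. It assumes the standard codon assignments (which the bacterial translation table shares for elongation codons) and a sequence that is already in frame on the coding strand; `translate` is an illustrative helper, not a database function.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop."""
    cds = "".join(cds.split()).upper()  # tolerate FASTA line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nt and last 12 nt of the pilU sequence above
print(translate("ATGTCTACCCCCTTCGGTCCGGCCGATGCC"))  # MSTPFGPADA
print(translate("AATCCGACCTGA"))                    # NPT
```

Passing the full 1188 bp sequence with its line breaks intact should work the same way, since whitespace is removed before translation.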


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7G5ZB92

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  pilU   Pseudomonas stutzeri DSM 10701   57.56   95.443   0.549
  pilU   Acinetobacter baylyi ADP1   53.736   88.101   0.473
  pilU   Vibrio cholerae strain A1552   49.448   91.646   0.453
  pilT   Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539   44.214   85.316   0.377
  pilT   Legionella pneumophila strain ERS1305867   44.985   83.291   0.375
  pilT   Legionella pneumophila strain Lp02   44.985   83.291   0.375
  pilT   Pseudomonas aeruginosa PAK   43.62   85.316   0.372
  pilT   Pseudomonas stutzeri DSM 10701   43.027   85.316   0.367
  pilT   Acinetobacter baylyi ADP1   43.598   83.038   0.362
  pilT   Acinetobacter baumannii D1279779   43.465   83.291   0.362
  pilT   Acinetobacter nosocomialis M2   43.465   83.291   0.362
  pilT   Acinetobacter baumannii strain A118   43.465   83.291   0.362
  pilT   Vibrio cholerae strain A1552   43.072   84.051   0.362
  pilT   Vibrio cholerae O1 biovar El Tor strain E7946   43.072   84.051   0.362
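The Ha-values listed here are numerically consistent with the product of identity and coverage, each expressed as a fraction. This relationship is inferred from the figures in this record and is not a definition given by the source, so the sketch below should be read as an assumption; `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# (protein, organism, identity %, coverage %, reported Ha-value) from this record
hits = [
    ("pilU", "Pseudomonas stutzeri DSM 10701", 57.56, 95.443, 0.549),
    ("pilU", "Acinetobacter baylyi ADP1", 53.736, 88.101, 0.473),
    ("pilT", "Pseudomonas aeruginosa PAK", 43.62, 85.316, 0.372),
]

for protein, organism, identity, coverage, reported in hits:
    computed = ha_value(identity, coverage)
    # Compare to three decimals, the precision used in the record
    print(f"{protein} {organism}: computed {computed:.3f}, reported {reported}")
```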