Detailed information    

In silico (bioinformatically predicted)

Overview


Name: pilU
Type: Machinery gene
Locus tag: UNDKW_RS04575
Genome accession: NZ_AP018439
Coordinates: 1013773..1014909 (-)
Length: 378 a.a.
NCBI ID: WP_162057759.1
UniProt ID: A0A6N4SYV0
Organism: Undibacterium sp. KW1
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake
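The coordinates and lengths in this overview can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def parse_coordinates(text):
    """Parse a string such as '1013773..1014909 (-)' into (start, end, strand)."""
    # Turn the parentheses into spaces so the span and the strand split cleanly.
    span, strand = text.replace("(", " ").replace(")", " ").split()
    start, end = (int(part) for part in span.split(".."))
    return start, end, strand


def gene_length_bp(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residue count, assuming the gene length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


start, end, strand = parse_coordinates("1013773..1014909 (-)")
length_bp = gene_length_bp(start, end)
print(length_bp, protein_length_aa(length_bp))  # prints: 1137 378
```

Under these assumptions the 1137 bp gene corresponds to 379 codons, one of which is the stop codon, which agrees with the listed length of 378 a.a.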

Genomic Context


Location: 1008773..1019909
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  UNDKW_RS04545 (UNDKW_0915) glcE 1009024..1010088 (+) 1065 WP_162057754.1 glycolate oxidase subunit GlcE -
  UNDKW_RS04550 (UNDKW_0916) glcF 1010092..1011390 (+) 1299 WP_162057755.1 glycolate oxidase subunit GlcF -
  UNDKW_RS04555 (UNDKW_0917) - 1011394..1011684 (+) 291 WP_162057756.1 DUF2867 domain-containing protein -
  UNDKW_RS04560 (UNDKW_0918) - 1011699..1012832 (-) 1134 Protein_913 IS4 family transposase -
  UNDKW_RS04565 (UNDKW_0919) - 1013008..1013265 (+) 258 WP_162057757.1 DUF2867 domain-containing protein -
  UNDKW_RS04570 (UNDKW_0920) - 1013294..1013776 (-) 483 WP_162057758.1 glutathione peroxidase -
  UNDKW_RS04575 (UNDKW_0921) pilU 1013773..1014909 (-) 1137 WP_162057759.1 PilT/PilU family type 4a pilus ATPase Machinery gene
  UNDKW_RS04580 (UNDKW_0922) pilT 1014929..1015972 (-) 1044 WP_162057760.1 type IV pilus twitching motility protein PilT Machinery gene
  UNDKW_RS04585 (UNDKW_0923) - 1016107..1016811 (+) 705 WP_162057761.1 YggS family pyridoxal phosphate-dependent enzyme -
  UNDKW_RS04590 (UNDKW_0924) proC 1016801..1017625 (+) 825 WP_232063235.1 pyrroline-5-carboxylate reductase -
  UNDKW_RS04595 (UNDKW_0925) - 1017646..1017981 (-) 336 WP_162057762.1 YqjK family protein -
  UNDKW_RS04600 (UNDKW_0926) - 1017985..1018374 (-) 390 WP_162039957.1 phage holin family protein -
  UNDKW_RS04605 (UNDKW_0927) - 1018390..1018701 (-) 312 WP_162039958.1 YqjD family protein -
  UNDKW_RS04610 (UNDKW_0928) ubiA 1018832..1019686 (-) 855 WP_370529123.1 4-hydroxybenzoate octaprenyltransferase -
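The spacing between pilU and its immediate neighbours can be read off the coordinates in the table. The sketch below is illustrative only: it assumes 1-based inclusive coordinates, copies three rows from the table by hand, and uses hypothetical helper names.

```python
# (locus_tag, start, end, strand, size_bp), copied from the table above.
GENES = [
    ("UNDKW_RS04570", 1013294, 1013776, "-", 483),   # glutathione peroxidase
    ("UNDKW_RS04575", 1013773, 1014909, "-", 1137),  # pilU
    ("UNDKW_RS04580", 1014929, 1015972, "-", 1044),  # pilT
]


def size_matches(gene):
    """Check that the listed size equals end - start + 1."""
    _, start, end, _, size_bp = gene
    return end - start + 1 == size_bp


def spacing_bp(left, right):
    """Bases between two genes in coordinate order; negative means overlap."""
    return right[1] - left[2] - 1


for gene in GENES:
    print(gene[0], "size consistent:", size_matches(gene))

for left, right in zip(GENES, GENES[1:]):
    print(left[0], "->", right[0], spacing_bp(left, right), "bp")
```

With these rows, pilU overlaps the glutathione peroxidase gene by 4 bp and is separated from pilT by 19 bp.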

Sequence


Protein


Length: 378 a.a.        Molecular weight: 41902.26 Da        Isoelectric point: 6.7721

>NTDB_id=69607 UNDKW_RS04575 WP_162057759.1 1013773..1014909(-) (pilU) [Undibacterium sp. KW1]
MERDQATKFMNDLLRLMLSKNGSDLFITADFPPAFKIDGRVTPVSNQPLSPAHTVDLARSIMNDKQSSEFESTKECNFAI
SPAGLGRFRVSAFMQQGRVGLVLRTITTAIPKLEDLGLPENLKEIAMTKRGLVIMVGATGSGKSTSLAAMLGYRNANSYG
HIITIEDPIEYVHPHMNCIITQREIGIDTDDWGAALKNSLRQAPDVIQIGEIRDRETMDFAVAFAETGHLCLATLHANSS
NQALDRIINFFPEERRQQLLMDLSLNLKAVISQRLVPLKGRKGRAAAVEIMLNTPLVSDLIFKGAVHEIKEIMKKSRELG
MQTFDQSLFDLHEADQITYEDALRNADSVNELRLAIMLKGKDAKDRDLTAGTKHLGIV

Nucleotide


Length: 1137 bp

>NTDB_id=69607 UNDKW_RS04575 WP_162057759.1 1013773..1014909(-) (pilU) [Undibacterium sp. KW1]
ATGGAACGCGATCAGGCCACCAAATTCATGAATGATTTGCTGAGACTCATGCTCAGCAAAAACGGTTCAGACTTGTTTAT
TACTGCAGACTTCCCGCCTGCCTTCAAGATAGACGGGCGGGTAACGCCAGTGTCGAACCAACCTTTGTCCCCTGCACATA
CTGTCGATCTGGCCCGTTCTATCATGAACGATAAACAGTCTTCAGAATTTGAATCGACCAAGGAATGTAATTTCGCGATC
AGTCCTGCTGGGCTCGGGCGCTTCCGCGTCTCGGCCTTCATGCAGCAAGGCCGCGTTGGCCTGGTCTTGCGGACAATCAC
CACGGCCATTCCCAAACTTGAAGACCTGGGCTTGCCAGAAAATCTCAAGGAAATTGCCATGACCAAGCGCGGTCTGGTCA
TCATGGTAGGTGCTACCGGTTCAGGTAAATCCACCTCGCTGGCGGCCATGCTGGGGTACAGAAACGCAAACAGCTACGGT
CACATCATCACGATCGAAGACCCTATAGAATATGTGCACCCGCACATGAATTGCATCATCACCCAGCGTGAGATCGGCAT
TGATACTGATGACTGGGGCGCGGCATTGAAAAACTCCCTGCGTCAGGCGCCAGACGTTATCCAGATCGGTGAGATCCGTG
ACCGCGAAACCATGGACTTTGCCGTTGCCTTTGCAGAAACCGGCCATCTTTGCCTGGCTACCCTGCATGCGAATAGTTCC
AACCAGGCGCTGGACCGTATCATCAACTTCTTCCCGGAAGAACGTCGCCAGCAATTGCTGATGGATTTGTCACTGAACTT
GAAAGCCGTTATTTCTCAGCGCCTGGTACCACTGAAAGGTCGCAAGGGTCGTGCTGCTGCTGTAGAGATCATGCTCAATA
CACCTCTGGTGTCTGACCTGATCTTCAAAGGTGCGGTACATGAAATCAAAGAGATCATGAAGAAGTCCCGTGAGCTGGGC
ATGCAGACCTTCGATCAATCTTTGTTTGATTTACATGAAGCAGACCAGATCACGTATGAAGATGCACTGCGCAATGCCGA
CTCTGTCAATGAACTGCGTCTGGCCATCATGTTAAAGGGCAAAGACGCTAAAGACCGCGACCTGACTGCAGGTACTAAAC
ATTTGGGTATCGTATGA
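Although the gene lies on the minus strand, the nucleotide record starts with ATG and ends with TGA, so it appears to be given in coding orientation already. The sketch below translates such a sequence with the standard codon assignments (the amino-acid assignments of bacterial translation table 11 are the same as the standard table; only the start codons differ). It is an illustration, not the pipeline used to build this record, and `translate` is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last four codons of the nucleotide record above.
print(translate("ATGGAACGCGATCAGGCCACC"))  # prints: MERDQAT
print(translate("GGTATCGTATGA"))           # prints: GIV
```

Both fragments agree with the start (MERDQAT) and end (GIV) of the protein sequence listed above.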


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6N4SYV0

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.

Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                              Identities (%)  Coverage (%)  Ha-value
  pilU   Pseudomonas stutzeri DSM 10701                        63.085          96.032        0.606
  pilU   Acinetobacter baylyi ADP1                             59.331          94.974        0.563
  pilU   Vibrio cholerae strain A1552                          54.213          94.18         0.511
  pilT   Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539   44.38           91.799        0.407
  pilT   Pseudomonas aeruginosa PAK                            41.84           89.153        0.373
  pilT   Vibrio cholerae O1 biovar El Tor strain E7946         44.304          83.598        0.37
  pilT   Vibrio cholerae strain A1552                          44.304          83.598        0.37
  pilT   Pseudomonas stutzeri DSM 10701                        41.246          89.153        0.368
  pilT   Acinetobacter baylyi ADP1                             42.547          85.185        0.362
  pilT   Legionella pneumophila strain ERS1305867              42.812          84.656        0.362
  pilT   Legionella pneumophila strain Lp02                    42.812          84.656        0.362
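The page does not define the Ha-value. The listed numbers are, however, consistent with the product of the identity and coverage fractions rounded to three decimals; this is an observation from the table, not a documented formula. A minimal sketch under that assumption, with a hypothetical function name:

```python
def ha_like_score(identity_pct, coverage_pct):
    """Product of identity and coverage fractions, rounded to 3 decimals.

    Assumption: this reproduces the Ha-value column of the table above;
    the source page itself gives no definition.
    """
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (identity %, coverage %, listed Ha-value) for three rows of the table.
ROWS = [
    (63.085, 96.032, 0.606),  # pilU, Pseudomonas stutzeri DSM 10701
    (59.331, 94.974, 0.563),  # pilU, Acinetobacter baylyi ADP1
    (44.38, 91.799, 0.407),   # pilT, Deinococcus radiodurans R1
]

for identity, coverage, listed in ROWS:
    print(ha_like_score(identity, coverage), listed)
```

Read this way, the score rewards hits that are both similar and cover most of the query, which is why the three pilU hits rank above the pilT hits.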


Multiple sequence alignment