Detailed information    

In silico (bioinformatically predicted)

Overview


Name   pilU
Type   Machinery gene
Locus tag   CAP31_RS02420
Genome accession   NZ_CP021138
Coordinates   480092..481219 (-)
Length   375 a.a.
NCBI ID   WP_087446079.1
Uniprot ID   A0A1Y0G5I1
Organism   Sulfuriferula sp. AH1
Function   assembly of type IV pilus (predicted from homology); DNA binding and uptake
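
The coordinates and the protein length above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (both are assumptions, not stated on this page). The helper names are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


gene_bp = span_length(480092, 481219)  # pilU coordinates from the overview
aa = protein_length(gene_bp)
print(gene_bp, aa)
```

Under these assumptions the result agrees with the 1128 bp and 375 a.a. values reported elsewhere in this entry.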

Genomic Context


Location: 475092..486219
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CAP31_RS02385 (CAP31_02385) - 475329..476075 (-) 747 WP_087446072.1 type 1 glutamine amidotransferase domain-containing protein -
  CAP31_RS02390 (CAP31_02390) - 476079..476795 (-) 717 WP_223247340.1 DUF4197 domain-containing protein -
  CAP31_RS02395 (CAP31_02395) - 476905..477414 (+) 510 WP_087446074.1 peroxiredoxin -
  CAP31_RS02400 (CAP31_02400) - 477411..477956 (+) 546 WP_087446075.1 DinB family protein -
  CAP31_RS02405 (CAP31_02405) - 477946..478482 (-) 537 WP_087446076.1 peroxiredoxin -
  CAP31_RS02410 (CAP31_02410) - 478495..478911 (+) 417 WP_087446077.1 CopD family protein -
  CAP31_RS02415 (CAP31_02415) - 478908..480059 (+) 1152 WP_087446078.1 class I SAM-dependent RNA methyltransferase -
  CAP31_RS02420 (CAP31_02420) pilU 480092..481219 (-) 1128 WP_087446079.1 PilT/PilU family type 4a pilus ATPase Machinery gene
  CAP31_RS02425 (CAP31_02425) pilT 481224..482267 (-) 1044 WP_087446080.1 type IV pilus twitching motility protein PilT Machinery gene
  CAP31_RS02430 (CAP31_02430) - 482301..482990 (+) 690 WP_087446081.1 YggS family pyridoxal phosphate-dependent enzyme -
  CAP31_RS02435 (CAP31_02435) proC 482996..483802 (+) 807 WP_087448224.1 pyrroline-5-carboxylate reductase -
  CAP31_RS02440 (CAP31_02440) - 483820..484395 (+) 576 WP_087446082.1 YggT family protein -
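
The Location range appears to be the pilU span extended by 5,000 bp on each side; that flank size is inferred from the numbers on this page and is not documented here. The sketch below, in Python with hypothetical helper names, reproduces the window and also computes the spacing between neighbouring genes in the table, again assuming 1-based inclusive coordinates.

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a gene span by `flank` bp on both sides (flank size is an inferred assumption)."""
    return max(1, start - flank), end + flank


def gap_between(upstream_end: int, downstream_start: int) -> int:
    """Bases separating two features; a negative value means they overlap."""
    return downstream_start - upstream_end - 1


window = context_window(480092, 481219)        # pilU
pilU_pilT_gap = gap_between(481219, 481224)    # pilU end, pilT start
overlap = gap_between(478911, 478908)          # CAP31_RS02410 end, CAP31_RS02415 start
print(window, pilU_pilT_gap, overlap)
```

With the table values, pilU and pilT sit on the same strand and are separated by only a few bases, while CAP31_RS02410 and CAP31_RS02415 overlap slightly.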

Sequence


Protein


Length: 375 a.a.        Molecular weight: 41951.24 Da        Isoelectric Point: 7.0820

>NTDB_id=228499 CAP31_RS02420 WP_087446079.1 480092..481219(-) (pilU) [Sulfuriferula sp. AH1]
MDALNTIHDLLRLMLSKRASDLFITAGFPPAIKVDGKIAPVSNQTLTPSHTRELARSIMNEKQWKDFEAHHEANFAISPA
EIGRFRVNAFVQQGRIGVVMRTITTQIPKLDQLNLPPVLRDVAMVKRGLVIFVGGTGSGKSTSLAAMVGHRNENAQDHII
TIEDPVEYVHEHKKSIVTQREVGVDTENWFAALKNTLRQAPDVILIGEVRDRETMEYAIAFAETGHLCLTTLHANSANQA
LDRIINFFPDDRRQQLLMDLSLNLKAFISQRLVARKGTTGRIAAVEVLLNSPLIADLIFKGQVHEIKEIMAKSRELGMQT
FDQSLFDLYEAGLITYEDALRNADSLNDLRLKIKLQGQESKDRDIMSGINNLNIV

Nucleotide


Length: 1128 bp

>NTDB_id=228499 CAP31_RS02420 WP_087446079.1 480092..481219(-) (pilU) [Sulfuriferula sp. AH1]
ATGGACGCACTGAATACCATACACGACCTGCTGCGCCTGATGCTGAGTAAACGCGCATCCGACCTGTTCATCACTGCCGG
GTTTCCGCCTGCCATCAAGGTCGACGGCAAAATCGCTCCGGTTTCCAATCAGACGCTGACGCCTTCGCATACCCGCGAAC
TCGCGCGCAGCATCATGAACGAGAAGCAATGGAAGGATTTCGAGGCCCATCACGAAGCCAACTTCGCCATCTCACCGGCA
GAAATCGGACGCTTCCGGGTCAACGCCTTCGTGCAGCAGGGACGCATCGGCGTCGTCATGCGGACCATCACTACCCAGAT
ACCCAAACTGGACCAGCTCAATCTGCCGCCGGTATTACGCGATGTCGCCATGGTAAAACGCGGTCTGGTCATTTTTGTCG
GCGGTACCGGTTCCGGCAAATCCACTTCGCTGGCAGCCATGGTCGGCCATCGCAACGAGAATGCGCAAGACCATATCATT
ACCATCGAAGATCCGGTCGAGTACGTACACGAACACAAAAAATCCATCGTCACTCAGCGTGAAGTAGGCGTCGATACGGA
AAACTGGTTTGCCGCGCTGAAAAACACCTTGCGCCAGGCCCCCGACGTCATCCTCATCGGCGAGGTACGCGACCGCGAAA
CCATGGAATACGCGATTGCTTTCGCAGAAACCGGGCACCTGTGTCTGACCACGCTGCACGCCAACTCGGCCAACCAGGCA
CTGGACCGTATCATCAATTTCTTCCCGGATGATCGCCGTCAGCAATTACTGATGGATTTGTCGCTCAACCTCAAGGCTTT
CATCTCGCAGCGCCTGGTTGCGCGCAAGGGCACGACCGGCCGTATCGCGGCGGTCGAAGTGCTGCTCAATTCGCCGCTTA
TCGCCGATCTGATCTTCAAGGGACAGGTACACGAAATCAAGGAGATCATGGCCAAGTCGCGCGAACTGGGCATGCAGACT
TTCGACCAGTCGTTATTCGACTTGTACGAAGCCGGCCTGATTACCTACGAAGACGCATTGCGCAATGCCGACTCGCTCAA
TGATCTCCGGCTCAAGATCAAATTGCAGGGTCAGGAATCCAAGGACCGCGACATCATGTCCGGGATCAACAATCTCAATA
TCGTTTGA
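
Although the gene lies on the minus strand, the nucleotide record above appears to be given in coding orientation: it begins with ATG and ends with TGA. As a hedged illustration, the Python sketch below translates the first and last few codons of that record with the standard codon assignments (an assumption; the page does not name the genetic code used). The `translate` function is a hypothetical helper written for this example, not part of any tool cited here.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA string; optionally stop at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGACGCACTGAATACCATACACGACCTG"  # first 30 nt of the record
last_codons = "AATATCGTTTGA"                     # last 12 nt of the record
print(translate(first_codons))  # MDALNTIHDL
print(translate(last_codons))   # NIV
```

The two outputs match the start (MDALNTIHDL) and the end (NIV) of the protein sequence listed in this entry. Passing the full 1128 bp sequence to the same function would be the natural next check.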


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1Y0G5I1

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  pilU   Pseudomonas stutzeri DSM 10701   62.849   95.467   0.6
  pilU   Acinetobacter baylyi ADP1   58.543   95.2   0.557
  pilU   Vibrio cholerae strain A1552   55.132   90.933   0.501
  pilT   Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539   44.807   89.867   0.403
  pilT   Acinetobacter baylyi ADP1   43.068   90.4   0.389
  pilT   Pseudomonas aeruginosa PAK   42.73   89.867   0.384
  pilT   Pseudomonas stutzeri DSM 10701   42.136   89.867   0.379
  pilT   Acinetobacter baumannii D1279779   41.742   88.8   0.371
  pilT   Acinetobacter nosocomialis M2   41.742   88.8   0.371
  pilT   Acinetobacter baumannii strain A118   41.742   88.8   0.371
  pilT   Vibrio cholerae O1 biovar El Tor strain E7946   41.566   88.533   0.368
  pilT   Vibrio cholerae strain A1552   41.566   88.533   0.368
  pilT   Legionella pneumophila strain ERS1305867   40.541   88.8   0.36
  pilT   Legionella pneumophila strain Lp02   40.541   88.8   0.36
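
The page does not define the Ha-value. In every row listed here, however, it matches the product of the identity and coverage fractions to three decimal places. The Python sketch below checks that relationship on a few rows; treat the formula as an observation from these numbers, not as the database's documented definition. The function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage fractions (inferred relationship, not a documented formula)."""
    return (identity_pct / 100) * (coverage_pct / 100)


# (protein, organism, identity %, coverage %, reported Ha-value) copied from the table
rows = [
    ("pilU", "Pseudomonas stutzeri DSM 10701", 62.849, 95.467, 0.6),
    ("pilU", "Acinetobacter baylyi ADP1", 58.543, 95.2, 0.557),
    ("pilT", "Pseudomonas aeruginosa PAK", 42.73, 89.867, 0.384),
    ("pilT", "Legionella pneumophila strain Lp02", 40.541, 88.8, 0.36),
]

# Largest difference between the computed product and the reported value
max_diff = max(abs(ha_value(i, c) - reported) for _, _, i, c, reported in rows)
print(f"largest difference from reported values: {max_diff:.4f}")
```

A difference below 0.0005 is what rounding to three decimals would produce, which is consistent with the inferred relationship.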


Multiple sequence alignment