Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilU   Type   Machinery gene
Locus tag   KFB96_RS16695 Genome accession   NZ_CP073760
Coordinates   3668670..3669908 (+) Length   412 a.a.
NCBI ID   WP_213457262.1    Uniprot ID   -
Organism   MAG: Thiocapsa sp. isolate M50B4     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3663670..3674908
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KFB96_RS16690 (KFB96_16690) - 3667110..3668576 (+) 1467 WP_213457264.1 glutamate synthase subunit beta -
  KFB96_RS16695 (KFB96_16695) pilU 3668670..3669908 (+) 1239 WP_213457262.1 PilT/PilU family type 4a pilus ATPase Machinery gene
  KFB96_RS16700 (KFB96_16700) pilU 3669916..3670971 (+) 1056 WP_213457260.1 PilT/PilU family type 4a pilus ATPase Machinery gene
  KFB96_RS16705 (KFB96_16705) gspD 3671003..3673186 (-) 2184 WP_300970415.1 type II secretion system secretin GspD -
  KFB96_RS16710 (KFB96_16710) - 3673230..3673823 (-) 594 WP_300970416.1 hypothetical protein -
  KFB96_RS16715 (KFB96_16715) gspM 3673820..3674434 (-) 615 WP_213457256.1 type II secretion system protein GspM -

Sequence


Protein


Download         Length: 412 a.a.        Molecular weight: 45334.75 Da        Isoelectric Point: 5.4355

>NTDB_id=560547 KFB96_RS16695 WP_213457262.1 3668670..3669908(+) (pilU) [MAG: Thiocapsa sp. isolate M50B4]
MDRQQAAHFMLDCLRKMMAKKGSDLFVSAGFPPACKIDGRMTPISEQALTSAQTNVLVRSIMNDRQIRDFDASKECNFAI
SPQSIGRFRVNAFIQQGHCGAVLRTINATIPELDGLALPPVLKDIVMGERGLVLVVGGTGSGKSTSLAAMLGHRNEHSYG
HIITIEDPVEYVHPHRHCLITQREVGVDTESWEIALMNTLRQAPDVILIGEIRSRETMEHAINFSETGHLCLSTLHANNA
NQALDRIINLFPEERRAQLLMDLSLNLKAVISQRLLPCTDGGRIAAVEVLINTPLIQDLIFKGDVGSIKAIMKRSREQGM
QTFDMSLFDLYEAGKISYEDALRSADSMNELRLAIKLEGHEAQGRDPFAETDGLTILGDDDGDSFDSTLFSETLTSSTMI
DGLPGSDRGRKS

Nucleotide


Download         Length: 1239 bp        

>NTDB_id=560547 KFB96_RS16695 WP_213457262.1 3668670..3669908(+) (pilU) [MAG: Thiocapsa sp. isolate M50B4]
ATGGATCGTCAGCAAGCCGCCCATTTCATGCTCGATTGTCTGCGCAAGATGATGGCCAAAAAGGGGTCGGACCTCTTCGT
CTCCGCGGGTTTTCCGCCGGCCTGCAAGATCGACGGGCGCATGACGCCCATCAGCGAGCAAGCCTTGACCTCCGCCCAGA
CCAACGTGCTGGTGCGCTCCATCATGAACGACCGGCAGATTCGCGACTTCGACGCCTCCAAGGAGTGCAATTTCGCGATC
AGCCCGCAGTCGATCGGTCGGTTTCGGGTCAACGCCTTCATCCAGCAAGGTCATTGCGGCGCGGTCCTGCGCACCATCAA
TGCGACCATCCCGGAGCTCGACGGACTCGCGCTTCCCCCGGTGTTGAAGGACATCGTGATGGGCGAGCGCGGCCTGGTGT
TGGTCGTCGGCGGCACGGGCTCCGGAAAATCGACGTCGCTCGCGGCCATGCTGGGTCATCGCAACGAACACAGCTACGGA
CATATCATCACGATCGAGGACCCGGTCGAGTACGTCCATCCCCACCGCCATTGTCTCATCACCCAGCGCGAGGTCGGCGT
CGATACCGAGAGCTGGGAAATCGCGCTGATGAATACGCTGCGCCAGGCGCCCGATGTCATCCTGATCGGCGAGATCCGCA
GCCGAGAGACGATGGAGCATGCCATCAACTTCTCCGAAACCGGCCATCTGTGCCTCTCCACACTCCATGCCAACAACGCC
AACCAGGCCCTCGACCGCATCATCAATCTGTTTCCCGAGGAGCGGCGTGCTCAGTTGCTGATGGATCTTTCCCTGAATCT
CAAGGCGGTGATCTCGCAACGTCTATTGCCCTGCACCGACGGTGGTCGCATCGCCGCAGTCGAGGTCTTGATCAACACGC
CCTTGATCCAGGATCTGATCTTCAAGGGCGATGTCGGGTCCATCAAGGCGATCATGAAGCGCTCGCGCGAGCAGGGCATG
CAGACCTTCGACATGTCGCTCTTCGATCTTTACGAGGCCGGGAAGATCAGCTACGAGGATGCGTTGCGCAGCGCCGATTC
CATGAACGAGCTTCGCTTGGCGATCAAGTTGGAGGGCCACGAAGCCCAGGGCAGGGATCCCTTCGCCGAGACGGACGGAC
TGACCATTCTCGGGGACGACGATGGCGATTCGTTCGACTCGACCCTGTTCAGCGAGACTCTGACGAGCTCCACGATGATC
GACGGATTGCCTGGTTCGGACCGCGGGCGAAAATCTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilU Pseudomonas stutzeri DSM 10701

58.933

91.019

0.536

  pilU Acinetobacter baylyi ADP1

58.333

84.466

0.493

  pilU Vibrio cholerae strain A1552

55.718

82.767

0.461

  pilT Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

43.768

83.738

0.367