Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   TCCBUS3UF1_RS10690 Genome accession   NC_017278
Coordinates   2116700..2119366 (+) Length   888 a.a.
NCBI ID   WP_014516520.1    Uniprot ID   -
Organism   Thermus sp. CCB_US3_UF1     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2111700..2124366
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  TCCBUS3UF1_RS10645 (TCCBUS3UF1_21250) - 2112056..2112859 (+) 804 WP_014516511.1 histidinol-phosphatase HisJ family protein -
  TCCBUS3UF1_RS10650 (TCCBUS3UF1_21260) - 2112835..2113104 (-) 270 WP_014516512.1 hypothetical protein -
  TCCBUS3UF1_RS10655 (TCCBUS3UF1_21270) - 2113132..2113734 (-) 603 WP_014516513.1 sulfite oxidase-like oxidoreductase -
  TCCBUS3UF1_RS10660 (TCCBUS3UF1_21280) - 2113781..2114113 (-) 333 WP_014516514.1 DUF190 domain-containing protein -
  TCCBUS3UF1_RS10665 (TCCBUS3UF1_21290) crcB 2114118..2114495 (-) 378 WP_041434054.1 fluoride efflux transporter CrcB -
  TCCBUS3UF1_RS10670 (TCCBUS3UF1_21300) ribH 2114516..2114989 (+) 474 WP_014516516.1 6,7-dimethyl-8-ribityllumazine synthase -
  TCCBUS3UF1_RS10675 (TCCBUS3UF1_21310) - 2114976..2115380 (-) 405 WP_014516517.1 DUF4395 family protein -
  TCCBUS3UF1_RS10680 (TCCBUS3UF1_21320) pgeF 2115458..2116186 (+) 729 WP_014516518.1 peptidoglycan editing factor PgeF -
  TCCBUS3UF1_RS10685 (TCCBUS3UF1_21330) - 2116221..2116703 (+) 483 WP_014516519.1 YqeG family HAD IIIA-type phosphatase -
  TCCBUS3UF1_RS10690 (TCCBUS3UF1_21340) pilF 2116700..2119366 (+) 2667 WP_014516520.1 type IV pilus assembly ATPase PilB Machinery gene
  TCCBUS3UF1_RS10695 (TCCBUS3UF1_21350) pilT 2119378..2120463 (+) 1086 WP_014516521.1 type IV pilus twitching motility protein PilT Machinery gene
  TCCBUS3UF1_RS10700 (TCCBUS3UF1_21360) gatB 2120486..2121895 (+) 1410 WP_014516522.1 Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB -
  TCCBUS3UF1_RS10705 (TCCBUS3UF1_21370) purM 2121901..2122902 (+) 1002 WP_014516523.1 phosphoribosylformylglycinamidine cyclo-ligase -
  TCCBUS3UF1_RS10710 (TCCBUS3UF1_21380) - 2122899..2123528 (+) 630 WP_014516524.1 histidine phosphatase family protein -

Sequence


Protein


Download         Length: 888 a.a.        Molecular weight: 97600.74 Da        Isoelectric Point: 5.2132

>NTDB_id=45835 TCCBUS3UF1_RS10690 WP_014516520.1 2116700..2119366(+) (pilF) [Thermus sp. CCB_US3_UF1]
MSVLTIGDKRLGAILLDAGLLTDEELQMALEKHREVGGSLAEVIVDSGLLSERRIAQAIEDHFGIPLVELHTLEIPPKVK
ALLPAEKAKELQAIPFALDEEAGVVRVAFVNPLDTLSLEEVEDLTGLVVEPYQATRSSFLYALAKGYPELNLPLPPLPTG
PTQGELKLGELLLEKGLLDRATLEEALVEQEKTGDLLGRILVKKGLPEGALYQTLAEQKGLEFLPSTEGISPDPAATALL
LRSDALRFSAVPVAFREGKVEVVLADPRHKEAVEELLGRPGRFYLTLPKEWEALFHRAYPEKGRLGEVLVQEGRLSREHL
REALEVQKRLPKAKPLGEILVELGLARPEDVEEALKKQRQGGGRLEDTLVQSGKLKPEALAQAVAAQLGYAYIDPAENPP
DPGAALLIPEDLARRYGIFPHHLEGRNLVLLMKDPRNILALDDVRLALKRKGQTYELVPAVATEAAITKLIERFYGKEEL
GELAKELSKGYQEEEATPTELDESAAQKFVKQVIREAYLQDASDIHVEPRQADVLVRLRIDGALRQYTTLPKGALNAIIS
VIKIMGGLNIAEKRLPQDGRVRYREGSIDLDLRLSTLPTVYGEKAVMRLLKKASDIPEIEGLGFAPGVFERFQEVISKPY
GIFLITGPTGSGKSFTTFSILKRIATPDKNTQTIEDPVEYEIPGINQTQVNPQAGLTFARALRAFLRQDPDIIMVGEIRD
SETAKIATEAALTGHLVIATLHTNDAAQAVTRLDEMGVELFNISAALIGVLSQRLVRRICDHCKVEVKPDPEVLRRLGLT
EGEIAGAKLYKGMGCERCSGTGYKGRYAIHELLVVDDEIRHAIVAGKSATEIKEIARKKGMKTLREDGIYKALLGITTLE
EVLARTIE

Nucleotide


Download         Length: 2667 bp        

>NTDB_id=45835 TCCBUS3UF1_RS10690 WP_014516520.1 2116700..2119366(+) (pilF) [Thermus sp. CCB_US3_UF1]
ATGAGCGTGCTCACCATTGGCGACAAACGGCTGGGGGCCATCCTTTTGGACGCCGGGCTCCTCACGGACGAGGAGCTGCA
GATGGCCCTGGAGAAGCACCGGGAAGTGGGGGGAAGCCTGGCCGAGGTGATCGTGGACTCGGGCCTCCTTTCGGAAAGGC
GCATCGCCCAGGCCATTGAGGACCACTTCGGCATCCCCCTGGTGGAGCTGCACACCCTGGAGATCCCCCCCAAGGTCAAG
GCCCTGCTTCCGGCGGAGAAGGCCAAGGAGCTCCAGGCCATCCCCTTCGCCCTGGACGAGGAGGCCGGGGTGGTGCGGGT
GGCCTTCGTCAACCCCTTGGACACCCTGAGCCTCGAGGAGGTGGAGGACCTCACCGGGTTGGTGGTGGAGCCCTACCAGG
CCACCCGCAGCTCCTTCCTCTACGCCCTGGCCAAGGGGTACCCGGAGCTCAACCTCCCCCTGCCCCCCTTACCCACAGGC
CCCACGCAAGGGGAGCTGAAACTGGGCGAGCTCCTTCTGGAAAAGGGCCTCCTGGACCGGGCCACCCTGGAAGAGGCCCT
GGTGGAGCAGGAGAAGACGGGGGACCTCCTGGGGCGGATCCTGGTGAAGAAAGGCCTGCCCGAGGGAGCCCTCTACCAGA
CCCTGGCCGAACAGAAGGGCCTGGAGTTCCTCCCCTCCACCGAAGGGATCTCCCCCGACCCCGCAGCCACCGCCCTCCTC
CTCCGCTCCGACGCCCTGCGCTTTAGCGCCGTTCCCGTGGCCTTCCGCGAGGGCAAGGTGGAGGTGGTCCTGGCCGACCC
CCGGCACAAGGAGGCGGTGGAGGAGCTCCTGGGGCGGCCGGGCCGCTTTTACCTCACCCTGCCCAAGGAGTGGGAGGCCC
TGTTCCACCGGGCCTACCCGGAGAAGGGCCGCTTGGGGGAGGTCCTGGTGCAGGAGGGCCGCCTGAGCCGGGAGCACCTG
CGGGAAGCTTTGGAGGTGCAAAAGCGCCTTCCCAAGGCCAAACCCCTGGGGGAGATCCTGGTGGAGCTGGGCCTGGCCCG
GCCCGAAGACGTGGAAGAGGCCCTGAAGAAGCAGCGCCAGGGCGGGGGGCGTCTGGAGGACACCCTGGTCCAGTCGGGCA
AGCTCAAGCCCGAGGCCCTGGCCCAGGCCGTGGCCGCCCAGCTGGGCTACGCCTACATCGACCCCGCGGAGAACCCCCCT
GACCCCGGGGCGGCCCTCCTCATCCCCGAGGACCTGGCCCGCCGCTACGGCATCTTCCCCCACCACCTGGAAGGGAGGAA
CCTGGTCCTCCTGATGAAAGACCCCAGGAACATCCTGGCCCTGGACGACGTGCGCCTGGCCCTCAAGCGCAAGGGGCAAA
CCTACGAGCTGGTGCCGGCGGTGGCCACCGAGGCCGCCATCACCAAGCTCATCGAGCGCTTCTACGGCAAGGAGGAGCTG
GGGGAGCTGGCCAAGGAGCTCTCCAAGGGGTACCAGGAGGAGGAGGCCACCCCCACCGAGCTGGACGAAAGCGCCGCCCA
GAAGTTCGTCAAGCAGGTGATCCGGGAGGCCTACCTGCAGGACGCCTCGGACATCCACGTGGAACCCCGGCAGGCCGACG
TCCTGGTGCGCCTCCGCATCGATGGGGCCCTGCGCCAGTACACCACCTTGCCCAAGGGGGCCCTGAACGCCATCATCAGC
GTCATCAAGATCATGGGCGGGCTCAACATCGCCGAAAAGCGCCTGCCCCAGGACGGGCGGGTGCGGTACCGCGAGGGCTC
CATTGACCTGGACCTGCGCCTTTCCACCCTGCCCACGGTGTATGGGGAGAAGGCGGTGATGCGCCTCCTCAAAAAGGCCA
GCGACATCCCGGAGATCGAGGGCCTGGGCTTTGCCCCCGGGGTCTTTGAGCGCTTCCAGGAGGTGATCTCCAAGCCCTAC
GGGATCTTCCTCATCACCGGGCCCACGGGAAGCGGCAAGAGCTTCACCACCTTCTCCATCCTGAAGCGGATCGCCACCCC
CGATAAAAACACCCAGACCATCGAAGACCCCGTGGAGTACGAGATCCCCGGGATCAACCAGACCCAGGTAAACCCCCAGG
CTGGCCTCACCTTCGCCCGGGCCCTTAGGGCCTTCCTGCGCCAGGACCCGGACATCATCATGGTGGGGGAGATCCGGGAC
TCGGAAACGGCCAAGATCGCCACCGAGGCCGCCCTCACCGGCCACCTGGTCATCGCCACCCTGCACACCAACGACGCCGC
CCAGGCCGTGACCCGCCTGGACGAGATGGGGGTGGAGCTTTTCAACATCTCCGCCGCCCTCATCGGCGTTCTCTCCCAGC
GCCTGGTGCGGCGGATCTGCGACCACTGCAAGGTGGAGGTCAAACCAGACCCCGAGGTCCTGCGCCGCCTGGGCCTCACG
GAAGGGGAGATCGCGGGGGCCAAGCTCTACAAGGGCATGGGGTGCGAGCGATGCAGCGGCACCGGGTACAAGGGCCGCTA
CGCCATCCACGAGCTCCTGGTGGTGGACGACGAGATCCGCCACGCCATCGTGGCGGGAAAGTCGGCCACGGAGATCAAGG
AGATCGCCCGCAAAAAGGGCATGAAAACCCTGCGGGAGGACGGGATCTACAAGGCCCTCCTGGGGATTACCACCCTCGAG
GAGGTCCTGGCGCGTACCATTGAGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Thermus thermophilus HB27

86.727

100

0.868

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

57.351

100

0.575


Multiple sequence alignment