Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   THEOS_RS09760 Genome accession   NC_019386
Coordinates   1864214..1866880 (+) Length   888 a.a.
NCBI ID   WP_016330152.1    Uniprot ID   K7R7J1
Organism   Thermus oshimai JL-2     
Function   power the assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1859214..1871880
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  THEOS_RS09715 (Theos_1961) - 1859600..1860403 (+) 804 WP_016330143.1 histidinol-phosphatase HisJ family protein -
  THEOS_RS09720 (Theos_1962) - 1860379..1860636 (-) 258 WP_016330144.1 hypothetical protein -
  THEOS_RS09725 (Theos_1963) - 1860661..1861266 (-) 606 WP_016330145.1 sulfite oxidase-like oxidoreductase -
  THEOS_RS09730 (Theos_1964) - 1861311..1861640 (-) 330 WP_016330146.1 DUF190 domain-containing protein -
  THEOS_RS09735 (Theos_1965) crcB 1861645..1862022 (-) 378 WP_016330147.1 fluoride efflux transporter CrcB -
  THEOS_RS09740 (Theos_1966) ribH 1862044..1862520 (+) 477 WP_041436569.1 6,7-dimethyl-8-ribityllumazine synthase -
  THEOS_RS09745 (Theos_1967) - 1862504..1862908 (-) 405 WP_016330149.1 DUF4395 domain-containing protein -
  THEOS_RS09750 (Theos_1968) pgeF 1862979..1863707 (+) 729 WP_016330150.1 peptidoglycan editing factor PgeF -
  THEOS_RS09755 (Theos_1969) - 1863732..1864217 (+) 486 WP_016330151.1 YqeG family HAD IIIA-type phosphatase -
  THEOS_RS09760 (Theos_1970) pilF 1864214..1866880 (+) 2667 WP_016330152.1 type IV pilus assembly ATPase PilB Machinery gene
  THEOS_RS09765 (Theos_1971) pilT 1866892..1867983 (+) 1092 WP_016330153.1 type IV pilus twitching motility protein PilT Machinery gene
  THEOS_RS09770 (Theos_1972) gatB 1868001..1869431 (+) 1431 WP_016330154.1 Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB -
  THEOS_RS09775 (Theos_1973) purM 1869436..1870437 (+) 1002 WP_016330155.1 phosphoribosylformylglycinamidine cyclo-ligase -
  THEOS_RS09780 (Theos_1974) - 1870434..1871063 (+) 630 WP_016330156.1 histidine phosphatase family protein -

Sequence


Protein


Download         Length: 888 a.a.        Molecular weight: 97983.17 Da        Isoelectric Point: 5.0923

>NTDB_id=54367 THEOS_RS09760 WP_016330152.1 1864214..1866880(+) (pilF) [Thermus oshimai JL-2]
MSVLTIGDKRLGAILLDAGLLTDEELQMALERHREVGGSLAEVLVDMGLLSERRIAQAIEDHFGIPLVELHTLEIPPKVR
ALLPAEKAKELQAIPFALDEEAGVVRVAFVNPLDTLALEEVEDLTGLVVEPYQTTKSAFLYTLAKAYPELDLPVPPPPSG
PSGAEMRLGELLVEKGLISRDTLEEALVEQEKTGDLLGRILVRKGLDEKALYRVLAEQNGLEFLEDTAGLTPALEATRLL
LRSDALRYSAVPVGLKDGKVEVVLSDPRHKAQVEELLGKPTHFYLTLPRAWEELFHRVYPEKGRLGEVLVQEGKLSREAL
KEALAVQETLPKAKPLGEILVELGLVRPEDVEEALKKQRQGGGRLEDTLIQSGKLKPEALAQAMAAQLGYPYINPEESPP
DPGVALLLPEDLARRYGVFPHHLEGNSLVLLMREPRNIIVLDDLKQYFKRKGLNYTLAPAVAPEAAITKLIERFYGKAEL
GEIAKELSRGFKEEEAPSLDLDESAAQRFVKQVIREAYLQEASDIHIEPRQGDVLVRLRVDGTLRQYTTLPKGALGPVIS
VVKILGGLDIAERRLPQDGRVRYREGGVDLDLRLSTLPTVYGEKAVMRLLKKASDIPEIEGLGFAPEVFQRFKEVIEKPY
GIFLITGPTGSGKSFTTFSILKRIATPDKNTQTIEDPVEYEIPGINQTQVNPQAGLTFARALRAFLRQDPDIIMVGEIRD
SETAKIATEAALTGHLVIATLHTNDAAQAITRLDEMGVELFNISAALIGVLSQRLVRKICEGCKQEVKPDPEVLRRLGLS
EEEIRGAKLYKGLGCERCGGTGYKGRYAIHELLVVDDEIRHAIVAGKSATEIKEIARRKGMKTLREDGIYKALQGITTLE
EVLARTIE

Nucleotide


Download         Length: 2667 bp        

>NTDB_id=54367 THEOS_RS09760 WP_016330152.1 1864214..1866880(+) (pilF) [Thermus oshimai JL-2]
ATGAGCGTCCTGACCATCGGCGACAAGCGGCTTGGGGCCATCCTTTTGGACGCGGGCCTCCTCACGGACGAGGAGCTCCA
GATGGCCTTAGAGCGGCACCGGGAGGTGGGGGGGTCCTTGGCCGAGGTCCTGGTGGACATGGGCCTCCTCTCCGAGCGCC
GCATCGCCCAGGCCATAGAGGACCACTTCGGCATCCCCCTGGTGGAGCTCCACACCCTGGAGATCCCCCCCAAGGTGCGG
GCCCTTCTCCCCGCGGAGAAGGCCAAGGAGCTCCAGGCCATCCCCTTCGCCCTAGACGAGGAGGCGGGGGTGGTGCGGGT
GGCCTTCGTGAACCCCCTGGACACCCTCGCCCTGGAAGAGGTGGAGGACCTCACCGGACTGGTGGTGGAACCCTACCAGA
CCACCAAGAGCGCCTTCCTCTACACCCTGGCCAAGGCCTACCCCGAGCTGGACCTCCCCGTGCCCCCACCCCCTTCCGGG
CCCAGCGGGGCGGAGATGCGCCTGGGGGAGCTTCTGGTAGAGAAGGGCCTCATAAGCCGGGACACCCTGGAGGAGGCCCT
GGTGGAGCAGGAAAAGACGGGGGACCTTTTGGGGCGGATCCTGGTGCGAAAGGGGCTGGACGAGAAGGCCCTTTACCGGG
TCCTGGCGGAGCAGAACGGCCTGGAGTTCCTGGAGGACACCGCAGGGCTCACCCCCGCCCTCGAGGCCACCCGCCTCCTC
CTCCGCTCGGATGCCCTCCGCTACAGCGCGGTGCCCGTGGGCCTTAAGGACGGGAAGGTGGAGGTGGTCCTCTCTGACCC
CCGGCACAAGGCCCAGGTGGAGGAGCTTTTGGGCAAGCCGACCCATTTCTACCTCACCCTGCCCAGGGCCTGGGAAGAGC
TCTTCCACCGGGTCTACCCGGAGAAGGGGCGGCTTGGGGAGGTGTTGGTCCAGGAGGGGAAGCTCTCCCGGGAGGCCCTG
AAGGAGGCCCTGGCGGTGCAGGAAACCCTCCCCAAGGCCAAGCCCCTGGGGGAAATCCTGGTGGAGCTGGGCCTCGTCCG
CCCCGAGGACGTGGAGGAGGCCCTGAAGAAGCAGCGGCAGGGCGGGGGGCGCCTGGAGGACACCCTGATCCAGTCCGGCA
AGCTCAAGCCCGAGGCCCTGGCCCAGGCCATGGCCGCCCAGCTGGGCTACCCCTACATCAACCCCGAGGAAAGCCCCCCC
GACCCCGGGGTGGCCCTCCTCCTCCCCGAGGACCTGGCCCGGCGCTACGGGGTCTTCCCCCACCACCTGGAGGGGAACTC
CCTGGTCCTCCTCATGCGGGAACCCCGGAACATCATCGTTCTGGACGACCTCAAGCAGTACTTCAAGCGCAAGGGGCTGA
ACTACACCCTGGCCCCCGCGGTGGCCCCGGAGGCGGCCATCACCAAGCTCATCGAACGCTTCTACGGCAAGGCGGAGCTG
GGGGAGATCGCCAAGGAGCTCTCCCGGGGCTTCAAGGAGGAGGAGGCCCCCAGCCTGGACCTGGACGAGAGCGCCGCCCA
GCGCTTCGTCAAGCAGGTGATCCGGGAGGCCTACCTCCAAGAGGCCTCGGACATCCACATCGAACCCCGCCAGGGCGACG
TCCTGGTGCGCCTCCGGGTTGACGGCACCCTGCGCCAGTACACCACCCTGCCCAAAGGGGCCTTGGGGCCGGTGATCAGC
GTGGTCAAGATCCTGGGGGGTCTGGACATCGCGGAGAGGCGTCTCCCCCAGGACGGCCGCGTGCGCTACCGGGAAGGGGG
GGTGGACCTGGACCTCCGCCTCTCCACCCTGCCCACGGTGTACGGGGAGAAGGCCGTCATGCGCCTTTTGAAAAAAGCCT
CGGACATCCCCGAGATTGAGGGGCTCGGGTTCGCCCCTGAGGTCTTCCAGCGCTTTAAGGAGGTCATCGAAAAACCCTAC
GGCATCTTCCTCATCACCGGGCCCACAGGGTCAGGGAAGAGCTTCACCACCTTCTCCATCCTGAAGCGCATCGCCACCCC
CGACAAGAACACCCAGACCATCGAGGACCCCGTGGAGTACGAGATCCCGGGCATCAACCAGACCCAGGTGAACCCGCAGG
CCGGGCTCACCTTCGCCCGCGCGCTTAGGGCCTTCCTCCGGCAGGACCCGGACATCATCATGGTGGGGGAGATCCGGGAC
TCGGAAACGGCCAAGATCGCCACCGAGGCCGCCCTCACCGGCCACCTGGTCATCGCCACCCTGCACACCAACGATGCCGC
CCAGGCCATCACCCGCCTGGACGAGATGGGGGTGGAGCTCTTCAACATCTCCGCGGCCCTCATCGGGGTGCTCTCCCAGA
GGCTGGTGCGGAAGATCTGCGAGGGGTGCAAGCAGGAGGTGAAGCCCGACCCCGAGGTCCTAAGGCGCCTGGGGCTTAGC
GAAGAGGAGATCCGGGGGGCCAAGCTCTACAAGGGCCTGGGGTGCGAGCGGTGCGGGGGCACGGGGTACAAGGGCCGCTA
CGCCATCCACGAGCTTTTGGTGGTGGACGACGAGATCCGCCACGCCATCGTGGCGGGGAAGTCGGCCACGGAGATCAAGG
AGATCGCCCGCAGGAAGGGCATGAAGACCCTGCGGGAAGACGGGATCTACAAGGCCCTCCAGGGGATCACCACCCTCGAG
GAGGTCCTGGCCCGTACCATTGAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB K7R7J1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Thermus thermophilus HB27

85.264

100

0.854

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

56.726

100

0.57


Multiple sequence alignment