Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilF
Type: Machinery gene
Locus tag: KY496_RS23835
Genome accession: NZ_CP080379
Coordinates: 5384414..5386141 (-)
Length: 575 a.a.
NCBI ID: WP_219862732.1
Uniprot ID: -
Organism: Massilia sp. NP310
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake
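
The coordinates and the protein length above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the reported protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(5384414, 5386141)
print(cds_bp, protein_length(cds_bp))  # prints: 1728 575
```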

Genomic Context


Location: 5379414..5391141
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                     Description
KY496_RS23815 (KY496_23815)  -          5380561..5381769 (+)  1209       WP_219862725.1  serine protease                             -
KY496_RS23820 (KY496_23820)  -          5381999..5382400 (-)  402        WP_219862727.1  hypothetical protein                        -
KY496_RS23825 (KY496_23825)  -          5382442..5382789 (-)  348        WP_219862729.1  helix-turn-helix transcriptional regulator  -
KY496_RS23830 (KY496_23830)  pilC       5383159..5384391 (-)  1233       WP_219862731.1  type II secretion system F family protein   Machinery gene
KY496_RS23835 (KY496_23835)  pilF       5384414..5386141 (-)  1728       WP_219862732.1  type IV-A pilus assembly ATPase PilB        Machinery gene
KY496_RS23840 (KY496_23840)  -          5386344..5387912 (-)  1569       WP_219862734.1  methyl-accepting chemotaxis protein         -
KY496_RS23845 (KY496_23845)  -          5388007..5388411 (-)  405        WP_219862736.1  hypothetical protein                        -
KY496_RS23850 (KY496_23850)  -          5388422..5389663 (-)  1242       WP_219862738.1  hypothetical protein                        -
KY496_RS23855 (KY496_23855)  -          5389854..5390411 (-)  558        WP_184235350.1  ClpP family protease                        -
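
The Location range appears to be the pilF coordinates extended by 5000 bp on each side. That flank size is inferred from the numbers and is not stated by the source. The sketch below recomputes the window under that assumption and checks that each listed size matches its coordinates; all names are illustrative.

```python
FLANK_BP = 5000  # assumption: inferred from the numbers, not stated by the source


def context_window(start, end, flank=FLANK_BP):
    """Extend a 1-based, inclusive range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


# (locus tag, start, end, size in bp), copied from the table above
genes = [
    ("KY496_RS23815", 5380561, 5381769, 1209),
    ("KY496_RS23820", 5381999, 5382400, 402),
    ("KY496_RS23825", 5382442, 5382789, 348),
    ("KY496_RS23830", 5383159, 5384391, 1233),
    ("KY496_RS23835", 5384414, 5386141, 1728),
    ("KY496_RS23840", 5386344, 5387912, 1569),
    ("KY496_RS23845", 5388007, 5388411, 405),
    ("KY496_RS23850", 5388422, 5389663, 1242),
    ("KY496_RS23855", 5389854, 5390411, 558),
]

window = context_window(5384414, 5386141)
sizes_match = all(end - start + 1 == size for _, start, end, size in genes)
all_inside = all(window[0] <= start and end <= window[1] for _, start, end, _ in genes)
print(window, sizes_match, all_inside)  # prints: (5379414, 5391141) True True
```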

Sequence


Protein


Length: 575 a.a.        Molecular weight: 62029.33 Da        Isoelectric point: 5.0994

>NTDB_id=592441 KY496_RS23835 WP_219862732.1 5384414..5386141(-) (pilF) [Massilia sp. NP310]
MAAVLPNTTTGTALSGLARALVQANRLSAAQADMLHKKAIQEKTAFIDAVLASGAIEPRALAAFCAETFGYPLFDLSAFA
PEFLPTSAIDARLMQAQRVIALAKRGNKLSVALSDPTNNQALDQIKFQSEATVEPVIVPHDALLNLLAAIAKGAEQELSE
LAGDDAEIEFAEEDQQAATNPEAASDVEDAPIVRFLNKILVDAVQMGASDIHFEPYEKYYRIRLRVDGVLRDHASPPLSI
REKLVSRIKVLARLDIAEKRVPQDGRMRLIMSATRTIDFRVSTLPTLFGEKTVMRILDATQAQMGIDALGYDPDQKALLL
EAITRPYGMVLVTGPTGSGKTVSLYSCLNLLNKPGINISTAEDPAEINLPGVNQVNVNEKAGLTFPVALKSFLRQDPDII
MVGEIRDLETADIAVKAAQTGHMVFSTLHTNDAPSTLTRLMNMGVAPFNIASSVILITAQRLARRLCGCKQPLEISREAL
LAAGYRDSDLDGDWRPYGPVGCDRCLGSGYKGRVGIYQIMPISPSIEALILANGNSMEIAAQAEKEGVNSLRRSGLLKVK
QGLTSLEEVLGCTNE

Nucleotide


Length: 1728 bp

>NTDB_id=592441 KY496_RS23835 WP_219862732.1 5384414..5386141(-) (pilF) [Massilia sp. NP310]
ATGGCAGCAGTCCTCCCCAACACGACCACCGGCACCGCGCTGTCGGGCCTGGCGCGCGCGCTGGTGCAGGCCAATCGCCT
GAGCGCCGCACAGGCGGACATGCTGCACAAGAAGGCCATCCAGGAAAAGACCGCCTTCATCGACGCCGTGCTCGCCAGCG
GCGCGATCGAGCCGCGCGCGCTGGCCGCCTTCTGCGCCGAGACCTTCGGCTATCCGCTGTTCGACCTGTCGGCCTTCGCG
CCGGAGTTCCTGCCCACCAGCGCCATCGACGCCAGGCTGATGCAGGCCCAGCGCGTGATCGCGCTGGCCAAGCGCGGCAA
CAAGCTGTCGGTCGCGCTGTCCGACCCGACCAATAACCAGGCCCTGGACCAGATCAAGTTCCAGAGCGAGGCGACGGTCG
AGCCGGTGATCGTGCCGCACGACGCGCTGCTCAATCTGCTGGCCGCGATCGCCAAGGGCGCCGAACAGGAGTTGAGCGAG
CTGGCCGGCGACGACGCCGAGATCGAGTTCGCCGAGGAAGACCAGCAGGCGGCCACGAACCCGGAGGCCGCCAGCGACGT
CGAGGACGCGCCGATCGTGCGCTTCCTGAACAAGATCCTGGTGGACGCGGTACAGATGGGCGCGTCGGACATCCACTTCG
AGCCCTACGAAAAGTATTACCGGATCCGCCTGCGGGTCGACGGCGTGCTGCGCGACCACGCCTCGCCGCCGCTGTCGATC
CGCGAAAAACTGGTGTCGCGCATCAAGGTGCTGGCGCGGCTGGACATCGCCGAGAAACGCGTGCCGCAAGATGGCCGCAT
GCGGCTCATCATGTCGGCCACCCGTACCATCGATTTCCGTGTCAGCACCTTGCCCACGCTGTTCGGCGAAAAGACCGTGA
TGCGTATCCTGGACGCGACCCAGGCGCAAATGGGCATCGACGCGCTCGGCTACGATCCGGACCAGAAGGCGCTGCTGCTG
GAGGCGATCACCCGCCCCTACGGCATGGTGCTGGTGACGGGCCCGACCGGTTCCGGCAAGACCGTGTCGCTGTACAGCTG
CCTGAATCTGCTGAACAAGCCTGGCATTAATATTTCGACCGCCGAAGACCCGGCCGAGATCAACCTGCCGGGCGTGAACC
AGGTCAACGTCAACGAAAAGGCCGGCCTGACCTTCCCGGTGGCGCTGAAGTCCTTCCTGCGCCAGGACCCCGACATCATC
ATGGTCGGCGAGATCCGCGACCTGGAAACCGCCGACATCGCGGTCAAGGCGGCCCAGACCGGCCACATGGTGTTCTCGAC
GCTGCACACCAACGATGCGCCGTCGACCCTGACGCGCCTGATGAACATGGGCGTGGCGCCGTTCAACATCGCCTCGTCGG
TGATCCTCATCACGGCCCAGCGCCTGGCGCGGCGCCTGTGCGGCTGCAAGCAGCCGCTCGAGATCAGCCGCGAGGCCCTG
CTGGCCGCCGGCTACCGCGACAGCGACCTGGACGGCGACTGGCGGCCCTACGGTCCGGTCGGCTGCGACCGCTGCCTGGG
CTCGGGCTACAAGGGCCGGGTCGGCATCTACCAGATCATGCCGATCTCCCCCAGTATCGAGGCGCTGATCCTGGCCAACG
GCAATTCCATGGAAATCGCGGCCCAGGCCGAGAAGGAAGGCGTCAACTCGTTGCGCCGCTCGGGTTTGCTGAAAGTGAAA
CAGGGGCTGACCAGCCTTGAAGAAGTGCTTGGCTGCACCAACGAATAA
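
The nucleotide record above is already given in the coding orientation (it starts with ATG and ends with TAA), so it can be translated directly to compare against the protein record. This is a minimal sketch, assuming the standard codon assignments apply to this gene; the `translate` helper is illustrative and not a tool used by the database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from FASTA wrapping
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 12 nt of the record above
print(translate("ATGGCAGCAGTCCTCCCCAACACGACCACC"))  # prints: MAAVLPNTTT
print(translate("ACCAACGAATAA"))                    # prints: TNE
```

Passing the full 1728 bp sequence to `translate` should give a 575-residue string that can be compared with the protein record.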


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                             Identities (%)  Coverage (%)  Ha-value
pilF     Neisseria gonorrhoeae MS11                           54.529          97.913        0.534
pilB     Acinetobacter baylyi ADP1                            53.779          98.957        0.532
pilB     Acinetobacter baumannii D1279779                     53.697          98.783        0.53
pilB     Legionella pneumophila strain ERS1305867             48.404          98.087        0.475
pilB     Vibrio cholerae strain A1552                         45.989          97.565        0.449
pilB     Vibrio parahaemolyticus RIMD 2210633                 45.989          97.565        0.449
pilB     Vibrio campbellii strain DS40M4                      44.484          97.739        0.435
pilB     Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539  41.509          92.174        0.383
pilF     Thermus thermophilus HB27                            40.637          92.87         0.377
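
This page does not define the Ha-value. In every row listed here it equals identity multiplied by coverage, both taken as fractions and rounded to three decimals. The sketch below checks that observation against the listed rows; it is an inferred relationship, not the database's stated definition, and the function name is illustrative.

```python
# (protein, organism, identities %, coverage %, Ha-value), copied from the rows above
rows = [
    ("pilF", "Neisseria gonorrhoeae MS11", 54.529, 97.913, 0.534),
    ("pilB", "Acinetobacter baylyi ADP1", 53.779, 98.957, 0.532),
    ("pilB", "Acinetobacter baumannii D1279779", 53.697, 98.783, 0.53),
    ("pilB", "Legionella pneumophila strain ERS1305867", 48.404, 98.087, 0.475),
    ("pilB", "Vibrio cholerae strain A1552", 45.989, 97.565, 0.449),
    ("pilB", "Vibrio parahaemolyticus RIMD 2210633", 45.989, 97.565, 0.449),
    ("pilB", "Vibrio campbellii strain DS40M4", 44.484, 97.739, 0.435),
    ("pilB", "Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539", 41.509, 92.174, 0.383),
    ("pilF", "Thermus thermophilus HB27", 40.637, 92.87, 0.377),
]


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage as fractions, rounded to 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# True if the inferred relationship reproduces every listed Ha-value
matches = all(
    abs(identity_times_coverage(ident, cov) - ha) < 1e-9
    for _, _, ident, cov, ha in rows
)
print(matches)  # prints: True
```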