Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilF   Type   Machinery gene
Locus tag   HPQ68_RS02995 Genome accession   NZ_CP053748
Coordinates   670222..671952 (+) Length   576 a.a.
NCBI ID   WP_255756395.1    Uniprot ID   -
Organism   Massilia sp. erpn     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 665222..676952
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HPQ68_RS02965 (HPQ68_02960) uraH 665632..665982 (+) 351 WP_255756390.1 hydroxyisourate hydrolase -
  HPQ68_RS02970 (HPQ68_02965) - 666046..666285 (+) 240 WP_255756391.1 type II toxin-antitoxin system prevent-host-death family antitoxin -
  HPQ68_RS02975 (HPQ68_02970) - 666323..666817 (-) 495 WP_255758210.1 NADH-quinone oxidoreductase subunit B family protein -
  HPQ68_RS02980 (HPQ68_02975) - 666927..667694 (-) 768 WP_255756392.1 ankyrin repeat domain-containing protein -
  HPQ68_RS02985 (HPQ68_02980) - 667773..668591 (-) 819 WP_255756393.1 AraC family transcriptional regulator -
  HPQ68_RS02990 (HPQ68_02985) - 668838..670127 (+) 1290 WP_255756394.1 HlyC/CorC family transporter -
  HPQ68_RS02995 (HPQ68_02990) pilF 670222..671952 (+) 1731 WP_255756395.1 type IV-A pilus assembly ATPase PilB Machinery gene
  HPQ68_RS03000 (HPQ68_02995) pilC 671999..673216 (+) 1218 WP_255756396.1 type II secretion system F family protein Machinery gene
  HPQ68_RS03005 (HPQ68_03000) - 673324..675507 (+) 2184 WP_255756397.1 serine/threonine-protein kinase -
  HPQ68_RS03010 (HPQ68_03005) - 675504..676064 (+) 561 WP_255756398.1 hypothetical protein -

Sequence


Protein


Download         Length: 576 a.a.        Molecular weight: 62197.82 Da        Isoelectric Point: 5.0989

>NTDB_id=446874 HPQ68_RS02995 WP_255756395.1 670222..671952(+) (pilF) [Massilia sp. erpn]
MAAVQPSAVPGAPMPGLGRALIQAGRLTAPQAEALQKKSLNDKQAFIDALLGSGMMDARELAAFCSATFGYPLMDLQALN
PDALPPKLIEPRLMHGQRVLALARRGNKIAVALSDPTNTQALDQIKFQTESSVEPVIVPHDALLRLLTELGKDSDQAMNE
LAGEEGEIQFAEEEEAAAAAPDAAANEVEDAPIVRFLNKMLMDAVGMGASDLHFEPFEKFYRIRFRVDGVLIEHAQPPVS
IKDKLVSRIKVLAKLDISEKRVPQDGRMRLIVSPTKTIDLRISTLPTLFGEKTVMRILDATQAQMGIDSLGYEPDQRQLL
LDAIQRPYGMVLVTGPTGSGKTVSLYTCLNILNKPGINISTAEDPAEINLPGVNQVNVNDKAGLTFPVALKSFLRQDPDI
IMVGEIRDLETADIAIKAAQTGHMVFSTLHTNDAPSTLTRLMNMGVAPFNIASSVILITAQRLGRRLCSCKQPVEIADEL
LLRAGYQQEELDGSWKPYGPVGCERCNGTGYKGRVGIYEIMPITPAIESLILAHGNAMQIAAQAQADGVKSLRQSGLVKV
KAGLTSLEEVLGCTNE

Nucleotide


Download         Length: 1731 bp        

>NTDB_id=446874 HPQ68_RS02995 WP_255756395.1 670222..671952(+) (pilF) [Massilia sp. erpn]
ATGGCAGCAGTCCAACCCAGTGCGGTCCCTGGCGCCCCCATGCCGGGCCTGGGACGGGCTTTGATCCAGGCCGGGCGCCT
CACCGCGCCGCAGGCCGAAGCGCTGCAAAAAAAATCCCTCAACGATAAACAGGCGTTCATCGATGCACTGCTGGGCAGCG
GCATGATGGATGCGCGCGAGCTGGCCGCCTTCTGCTCGGCCACTTTCGGCTATCCGCTGATGGACTTGCAGGCGCTGAAC
CCGGACGCCCTGCCGCCCAAGCTGATCGAACCGCGCCTGATGCACGGCCAGCGCGTGCTGGCCCTGGCGCGGCGCGGCAA
CAAGATCGCCGTCGCCCTTTCCGACCCCACCAATACCCAGGCCCTGGACCAGATCAAGTTCCAGACCGAGTCGTCGGTGG
AACCGGTGATCGTGCCACACGACGCCTTGCTGCGCCTGCTGACGGAACTGGGCAAGGACAGCGACCAGGCGATGAACGAG
CTGGCCGGCGAGGAAGGCGAGATCCAGTTCGCCGAGGAGGAGGAAGCAGCGGCGGCAGCGCCGGACGCCGCCGCCAACGA
GGTCGAGGACGCGCCCATCGTGCGCTTCCTGAACAAGATGTTGATGGATGCGGTGGGCATGGGCGCCTCCGACCTGCATT
TCGAGCCGTTTGAAAAGTTTTACCGCATCCGCTTCCGCGTCGACGGGGTGCTGATCGAGCACGCGCAGCCGCCTGTGTCG
ATCAAGGACAAGCTGGTGTCGCGCATCAAGGTGCTGGCCAAGCTGGATATCTCGGAAAAGCGCGTGCCGCAGGATGGCCG
CATGCGCCTGATCGTCTCGCCCACCAAGACCATCGACCTGCGCATCTCCACCTTGCCCACCCTGTTCGGCGAAAAGACCG
TGATGCGCATTCTCGACGCCACCCAGGCGCAGATGGGCATCGATTCCCTCGGCTACGAGCCGGACCAGCGGCAGCTGCTG
CTGGACGCCATCCAGCGTCCCTACGGCATGGTGCTGGTGACCGGGCCGACCGGCTCGGGCAAGACGGTGTCGCTGTACAC
CTGCCTGAATATCCTGAACAAGCCGGGCATCAATATCTCGACGGCGGAAGACCCGGCCGAGATCAACCTGCCCGGCGTCA
ACCAGGTCAACGTCAACGACAAGGCGGGCCTGACCTTCCCGGTGGCGCTGAAATCCTTCCTGCGGCAAGACCCGGACATC
ATCATGGTGGGCGAAATCCGCGACCTGGAGACGGCCGATATCGCCATCAAGGCGGCGCAGACCGGGCATATGGTGTTCTC
CACCCTGCACACCAACGACGCGCCGTCGACCCTGACGCGCCTGATGAATATGGGCGTGGCGCCGTTCAATATCGCCTCTT
CCGTGATCCTGATCACGGCCCAGCGCCTGGGCCGCCGCCTGTGCAGCTGCAAGCAGCCGGTGGAGATCGCGGACGAATTG
CTGCTGCGCGCCGGCTACCAGCAGGAAGAGCTGGACGGCAGCTGGAAGCCGTATGGCCCGGTGGGCTGCGAGCGCTGCAA
CGGTACCGGCTACAAGGGGCGCGTCGGCATTTACGAGATCATGCCGATCACGCCCGCCATCGAGTCGCTGATTCTGGCGC
ATGGCAATGCGATGCAGATCGCCGCCCAGGCCCAGGCCGACGGCGTGAAGTCGCTGCGCCAATCGGGACTGGTCAAGGTC
AAGGCCGGCCTGACCAGCCTGGAGGAAGTGCTGGGCTGCACCAACGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilF Neisseria gonorrhoeae MS11

52.993

98.611

0.523

  pilB Acinetobacter baumannii D1279779

51.463

100

0.519

  pilB Acinetobacter baylyi ADP1

52.753

97.743

0.516

  pilB Legionella pneumophila strain ERS1305867

47.08

98.09

0.462

  pilB Vibrio parahaemolyticus RIMD 2210633

46.809

97.917

0.458

  pilB Vibrio cholerae strain A1552

47.122

96.528

0.455

  pilB Vibrio campbellii strain DS40M4

45.583

98.264

0.448

  pilF Thermus thermophilus HB27

39.789

98.611

0.392

  pilB Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

39.062

100

0.391