Detailed information
Overview
| Name | pilF | Type | Machinery gene |
| Locus tag | HPQ68_RS02995 | Genome accession | NZ_CP053748 |
| Coordinates | 670222..671952 (+) | Length | 576 a.a. |
| NCBI ID | WP_255756395.1 | Uniprot ID | - |
| Organism | Massilia sp. erpn | | |
| Function | assembly of type IV pilus (predicted from homology); DNA binding and uptake | | |
Genomic Context
Location: 665222..676952
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HPQ68_RS02965 (HPQ68_02960) | uraH | 665632..665982 (+) | 351 | WP_255756390.1 | hydroxyisourate hydrolase | - |
| HPQ68_RS02970 (HPQ68_02965) | - | 666046..666285 (+) | 240 | WP_255756391.1 | type II toxin-antitoxin system prevent-host-death family antitoxin | - |
| HPQ68_RS02975 (HPQ68_02970) | - | 666323..666817 (-) | 495 | WP_255758210.1 | NADH-quinone oxidoreductase subunit B family protein | - |
| HPQ68_RS02980 (HPQ68_02975) | - | 666927..667694 (-) | 768 | WP_255756392.1 | ankyrin repeat domain-containing protein | - |
| HPQ68_RS02985 (HPQ68_02980) | - | 667773..668591 (-) | 819 | WP_255756393.1 | AraC family transcriptional regulator | - |
| HPQ68_RS02990 (HPQ68_02985) | - | 668838..670127 (+) | 1290 | WP_255756394.1 | HlyC/CorC family transporter | - |
| HPQ68_RS02995 (HPQ68_02990) | pilF | 670222..671952 (+) | 1731 | WP_255756395.1 | type IV-A pilus assembly ATPase PilB | Machinery gene |
| HPQ68_RS03000 (HPQ68_02995) | pilC | 671999..673216 (+) | 1218 | WP_255756396.1 | type II secretion system F family protein | Machinery gene |
| HPQ68_RS03005 (HPQ68_03000) | - | 673324..675507 (+) | 2184 | WP_255756397.1 | serine/threonine-protein kinase | - |
| HPQ68_RS03010 (HPQ68_03005) | - | 675504..676064 (+) | 561 | WP_255756398.1 | hypothetical protein | - |
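The sizes in the table above follow from the inclusive coordinates (size = end - start + 1), and for a protein-coding gene the protein length is the number of codons minus the stop codon. The following is a minimal sketch of that check in Python; the helper names `parse_coords`, `span_bp`, and `protein_length` are hypothetical and not part of any database tool, and the sketch assumes a single uninterrupted coding region that ends in one stop codon.

```python
import re


def parse_coords(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)


def span_bp(start, end):
    """Length in bp of an interval whose two endpoints are both included."""
    return end - start + 1


def protein_length(bp):
    """Residue count for a CDS of `bp` bases, assuming one trailing stop codon."""
    if bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return bp // 3 - 1


# Values taken from the pilF row (HPQ68_RS02995) of the table above.
start, end, strand = parse_coords("670222..671952 (+)")
size = span_bp(start, end)
print(size, protein_length(size), strand)
```

For the pilF row this reproduces the listed 1731 bp and 576 a.a.; the same check can be run on any other row of the table.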
Sequence
Protein
Length: 576 a.a.; Molecular weight: 62197.82 Da; Isoelectric point: 5.0989
>NTDB_id=446874 HPQ68_RS02995 WP_255756395.1 670222..671952(+) (pilF) [Massilia sp. erpn]
MAAVQPSAVPGAPMPGLGRALIQAGRLTAPQAEALQKKSLNDKQAFIDALLGSGMMDARELAAFCSATFGYPLMDLQALN
PDALPPKLIEPRLMHGQRVLALARRGNKIAVALSDPTNTQALDQIKFQTESSVEPVIVPHDALLRLLTELGKDSDQAMNE
LAGEEGEIQFAEEEEAAAAAPDAAANEVEDAPIVRFLNKMLMDAVGMGASDLHFEPFEKFYRIRFRVDGVLIEHAQPPVS
IKDKLVSRIKVLAKLDISEKRVPQDGRMRLIVSPTKTIDLRISTLPTLFGEKTVMRILDATQAQMGIDSLGYEPDQRQLL
LDAIQRPYGMVLVTGPTGSGKTVSLYTCLNILNKPGINISTAEDPAEINLPGVNQVNVNDKAGLTFPVALKSFLRQDPDI
IMVGEIRDLETADIAIKAAQTGHMVFSTLHTNDAPSTLTRLMNMGVAPFNIASSVILITAQRLGRRLCSCKQPVEIADEL
LLRAGYQQEELDGSWKPYGPVGCERCNGTGYKGRVGIYEIMPITPAIESLILAHGNAMQIAAQAQADGVKSLRQSGLVKV
KAGLTSLEEVLGCTNE
Nucleotide
Length: 1731 bp
>NTDB_id=446874 HPQ68_RS02995 WP_255756395.1 670222..671952(+) (pilF) [Massilia sp. erpn]
ATGGCAGCAGTCCAACCCAGTGCGGTCCCTGGCGCCCCCATGCCGGGCCTGGGACGGGCTTTGATCCAGGCCGGGCGCCT
CACCGCGCCGCAGGCCGAAGCGCTGCAAAAAAAATCCCTCAACGATAAACAGGCGTTCATCGATGCACTGCTGGGCAGCG
GCATGATGGATGCGCGCGAGCTGGCCGCCTTCTGCTCGGCCACTTTCGGCTATCCGCTGATGGACTTGCAGGCGCTGAAC
CCGGACGCCCTGCCGCCCAAGCTGATCGAACCGCGCCTGATGCACGGCCAGCGCGTGCTGGCCCTGGCGCGGCGCGGCAA
CAAGATCGCCGTCGCCCTTTCCGACCCCACCAATACCCAGGCCCTGGACCAGATCAAGTTCCAGACCGAGTCGTCGGTGG
AACCGGTGATCGTGCCACACGACGCCTTGCTGCGCCTGCTGACGGAACTGGGCAAGGACAGCGACCAGGCGATGAACGAG
CTGGCCGGCGAGGAAGGCGAGATCCAGTTCGCCGAGGAGGAGGAAGCAGCGGCGGCAGCGCCGGACGCCGCCGCCAACGA
GGTCGAGGACGCGCCCATCGTGCGCTTCCTGAACAAGATGTTGATGGATGCGGTGGGCATGGGCGCCTCCGACCTGCATT
TCGAGCCGTTTGAAAAGTTTTACCGCATCCGCTTCCGCGTCGACGGGGTGCTGATCGAGCACGCGCAGCCGCCTGTGTCG
ATCAAGGACAAGCTGGTGTCGCGCATCAAGGTGCTGGCCAAGCTGGATATCTCGGAAAAGCGCGTGCCGCAGGATGGCCG
CATGCGCCTGATCGTCTCGCCCACCAAGACCATCGACCTGCGCATCTCCACCTTGCCCACCCTGTTCGGCGAAAAGACCG
TGATGCGCATTCTCGACGCCACCCAGGCGCAGATGGGCATCGATTCCCTCGGCTACGAGCCGGACCAGCGGCAGCTGCTG
CTGGACGCCATCCAGCGTCCCTACGGCATGGTGCTGGTGACCGGGCCGACCGGCTCGGGCAAGACGGTGTCGCTGTACAC
CTGCCTGAATATCCTGAACAAGCCGGGCATCAATATCTCGACGGCGGAAGACCCGGCCGAGATCAACCTGCCCGGCGTCA
ACCAGGTCAACGTCAACGACAAGGCGGGCCTGACCTTCCCGGTGGCGCTGAAATCCTTCCTGCGGCAAGACCCGGACATC
ATCATGGTGGGCGAAATCCGCGACCTGGAGACGGCCGATATCGCCATCAAGGCGGCGCAGACCGGGCATATGGTGTTCTC
CACCCTGCACACCAACGACGCGCCGTCGACCCTGACGCGCCTGATGAATATGGGCGTGGCGCCGTTCAATATCGCCTCTT
CCGTGATCCTGATCACGGCCCAGCGCCTGGGCCGCCGCCTGTGCAGCTGCAAGCAGCCGGTGGAGATCGCGGACGAATTG
CTGCTGCGCGCCGGCTACCAGCAGGAAGAGCTGGACGGCAGCTGGAAGCCGTATGGCCCGGTGGGCTGCGAGCGCTGCAA
CGGTACCGGCTACAAGGGGCGCGTCGGCATTTACGAGATCATGCCGATCACGCCCGCCATCGAGTCGCTGATTCTGGCGC
ATGGCAATGCGATGCAGATCGCCGCCCAGGCCCAGGCCGACGGCGTGAAGTCGCTGCGCCAATCGGGACTGGTCAAGGTC
AAGGCCGGCCTGACCAGCCTGGAGGAAGTGCTGGGCTGCACCAACGAATAA
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| pilF | Neisseria gonorrhoeae MS11 | 52.993 | 98.611 | 0.523 |
| pilB | Acinetobacter baumannii D1279779 | 51.463 | 100 | 0.519 |
| pilB | Acinetobacter baylyi ADP1 | 52.753 | 97.743 | 0.516 |
| pilB | Legionella pneumophila strain ERS1305867 | 47.08 | 98.09 | 0.462 |
| pilB | Vibrio parahaemolyticus RIMD 2210633 | 46.809 | 97.917 | 0.458 |
| pilB | Vibrio cholerae strain A1552 | 47.122 | 96.528 | 0.455 |
| pilB | Vibrio campbellii strain DS40M4 | 45.583 | 98.264 | 0.448 |
| pilF | Thermus thermophilus HB27 | 39.789 | 98.611 | 0.392 |
| pilB | Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 | 39.062 | 100 | 0.391 |