Detailed information
Overview
| Name | pilB | Type | Machinery gene |
| Locus tag | HICON_RS04830 | Genome accession | NC_014922 |
| Coordinates | 949327..950721 (+) | Length | 464 a.a. |
| NCBI ID | WP_013525533.1 | Uniprot ID | A0AAV2U5G9 |
| Organism | Haemophilus influenzae F3047 | ||
| Function | type IV pilus biogenesis and function (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 944327..955721
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| HICON_RS04805 (HICON_11720) | rsmE | 944345..945082 (-) | 738 | WP_006996180.1 | 16S rRNA (uracil(1498)-N(3))-methyltransferase | - |
| HICON_RS04810 (HICON_11730) | lnt | 945132..946661 (-) | 1530 | WP_013525535.1 | apolipoprotein N-acyltransferase | - |
| HICON_RS04815 (HICON_11740) | corC | 946684..947583 (-) | 900 | WP_013525534.1 | CNNM family magnesium/cobalt transport protein CorC | - |
| HICON_RS04820 (HICON_11750) | ampD | 948206..948766 (-) | 561 | WP_006996183.1 | 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD | - |
| HICON_RS04825 (HICON_11760) | pilA | 948881..949330 (+) | 450 | WP_006996184.1 | prepilin-type N-terminal cleavage/methylation domain-containing protein | Machinery gene |
| HICON_RS04830 (HICON_11770) | pilB | 949327..950721 (+) | 1395 | WP_013525533.1 | GspE/PulE family protein | Machinery gene |
| HICON_RS04835 (HICON_11780) | pilC | 950718..951935 (+) | 1218 | WP_013525532.1 | type II secretion system F family protein | Machinery gene |
| HICON_RS04840 (HICON_11790) | pilD | 951932..952624 (+) | 693 | WP_013525531.1 | prepilin peptidase | Machinery gene |
| HICON_RS04845 (HICON_11800) | rho | 952679..953941 (-) | 1263 | WP_005666690.1 | transcription termination factor Rho | - |
| HICON_RS04850 (HICON_11810) | metJ | 954189..954506 (+) | 318 | WP_005631186.1 | met regulon transcriptional regulator MetJ | - |
| HICON_RS04855 (HICON_11820) | cueR | 954520..954906 (-) | 387 | WP_005648963.1 | Cu(I)-responsive transcriptional regulator | - |
| HICON_RS04860 (HICON_11830) | - | 954983..955189 (+) | 207 | WP_013525530.1 | heavy-metal-associated domain-containing protein | - |
| HICON_RS04865 (HICON_11840) | - | 955264..955470 (+) | 207 | WP_013525530.1 | heavy-metal-associated domain-containing protein | - |
Sequence
Protein
Download Length: 464 a.a. Molecular weight: 52905.44 Da Isoelectric Point: 5.8398
>NTDB_id=39461 HICON_RS04830 WP_013525533.1 949327..950721(+) (pilB) [Haemophilus influenzae F3047]
MTSYALLHTQRVTAQNGEIFTISPDLWVRNQQQQSLLLRYFALPLKEENNRLWLGVDSLSNLSACETIAFITGKPVEPIL
LESSQLKELLQQLTPRQMQVEEQVKFYQHQETHFEQEDDEPVIRLLNQIFESALQKNASDIHLETLADQFQVRFRIDGVL
QPQPLISKIFANRIISRLKLLAKLDISENRLPQDGRFQFKTTFSDILDFRLSTLPTHWGEKIVLRAQQNKPVELSFAELG
MTENQQQAFQRALSQPQGLILVTGPTGSGKSISLYTALQWLNTPDKHIMTAEDPIEIELDGIIQSQINPQIGLDFSRLLR
AFLRQDPDIIMLGEIRDEESAMIALRAAQTGHLVLSTLHTNDAISAISRLQQLGIQQYEIENSLLLVIAQRLVRKICPKC
GGNLINSCDCHQGYRGRIGVYQFLHWQQNGYQTDFQNLRVSGLEKVSQGITDEKEIERVLGKNS
MTSYALLHTQRVTAQNGEIFTISPDLWVRNQQQQSLLLRYFALPLKEENNRLWLGVDSLSNLSACETIAFITGKPVEPIL
LESSQLKELLQQLTPRQMQVEEQVKFYQHQETHFEQEDDEPVIRLLNQIFESALQKNASDIHLETLADQFQVRFRIDGVL
QPQPLISKIFANRIISRLKLLAKLDISENRLPQDGRFQFKTTFSDILDFRLSTLPTHWGEKIVLRAQQNKPVELSFAELG
MTENQQQAFQRALSQPQGLILVTGPTGSGKSISLYTALQWLNTPDKHIMTAEDPIEIELDGIIQSQINPQIGLDFSRLLR
AFLRQDPDIIMLGEIRDEESAMIALRAAQTGHLVLSTLHTNDAISAISRLQQLGIQQYEIENSLLLVIAQRLVRKICPKC
GGNLINSCDCHQGYRGRIGVYQFLHWQQNGYQTDFQNLRVSGLEKVSQGITDEKEIERVLGKNS
Nucleotide
Download Length: 1395 bp
>NTDB_id=39461 HICON_RS04830 WP_013525533.1 949327..950721(+) (pilB) [Haemophilus influenzae F3047]
ATGACGAGCTATGCTTTACTTCATACTCAGCGTGTAACCGCTCAAAATGGCGAGATCTTTACGATCTCGCCTGATTTATG
GGTGCGTAATCAACAACAGCAATCCTTGCTCTTGCGGTATTTTGCTTTGCCACTTAAAGAAGAAAATAATCGTCTTTGGC
TAGGGGTTGATTCTCTCTCCAATCTTTCAGCTTGTGAAACCATTGCGTTTATAACAGGAAAACCTGTCGAACCAATTTTG
TTAGAAAGCAGCCAACTCAAAGAACTGTTACAACAACTTACTCCGCGCCAAATGCAAGTGGAAGAGCAAGTTAAATTCTA
TCAACATCAAGAAACCCATTTTGAACAAGAAGATGATGAACCTGTTATCCGCTTACTTAATCAGATTTTTGAATCTGCCT
TACAAAAAAATGCCTCTGATATTCATTTAGAAACCTTGGCTGATCAGTTTCAAGTGCGGTTTAGAATTGATGGTGTTTTA
CAACCACAACCCTTAATAAGCAAAATATTCGCCAATCGTATTATTTCACGCTTAAAATTACTGGCTAAATTAGATATTAG
TGAAAATCGACTTCCACAAGATGGACGATTTCAATTTAAAACCACTTTTTCCGATATTCTTGATTTTCGCCTTTCAACCT
TACCAACCCATTGGGGCGAAAAAATAGTGTTGCGAGCGCAACAAAATAAACCTGTAGAACTTAGCTTTGCTGAACTGGGT
ATGACCGAAAATCAGCAACAAGCATTTCAACGCGCACTTAGCCAGCCACAAGGATTAATTTTAGTAACTGGCCCAACAGG
AAGTGGAAAAAGTATCTCACTTTACACCGCACTTCAGTGGCTAAATACGCCTGATAAACATATTATGACCGCTGAGGATC
CCATTGAAATTGAGCTTGATGGCATTATTCAAAGCCAAATTAACCCACAGATTGGATTAGATTTTAGCCGTCTATTGCGC
GCTTTTTTACGTCAAGATCCCGACATCATTATGCTAGGTGAAATTCGTGATGAAGAAAGTGCGATGATTGCACTACGTGC
CGCTCAAACGGGGCATTTGGTGCTTTCAACTTTACATACCAATGATGCAATTTCTGCCATTTCTCGCTTACAACAACTCG
GTATTCAGCAGTATGAAATTGAAAACAGTTTACTACTTGTCATTGCACAGCGTCTTGTACGAAAAATCTGTCCAAAGTGC
GGTGGAAATTTAATAAATTCTTGTGATTGCCATCAAGGTTATCGAGGGCGAATCGGCGTGTATCAATTTCTACATTGGCA
ACAGAATGGCTATCAAACAGATTTTCAAAATTTACGTGTAAGTGGTTTAGAAAAAGTTAGCCAAGGCATAACAGATGAGA
AAGAAATTGAACGTGTGTTAGGTAAAAACTCATGA
ATGACGAGCTATGCTTTACTTCATACTCAGCGTGTAACCGCTCAAAATGGCGAGATCTTTACGATCTCGCCTGATTTATG
GGTGCGTAATCAACAACAGCAATCCTTGCTCTTGCGGTATTTTGCTTTGCCACTTAAAGAAGAAAATAATCGTCTTTGGC
TAGGGGTTGATTCTCTCTCCAATCTTTCAGCTTGTGAAACCATTGCGTTTATAACAGGAAAACCTGTCGAACCAATTTTG
TTAGAAAGCAGCCAACTCAAAGAACTGTTACAACAACTTACTCCGCGCCAAATGCAAGTGGAAGAGCAAGTTAAATTCTA
TCAACATCAAGAAACCCATTTTGAACAAGAAGATGATGAACCTGTTATCCGCTTACTTAATCAGATTTTTGAATCTGCCT
TACAAAAAAATGCCTCTGATATTCATTTAGAAACCTTGGCTGATCAGTTTCAAGTGCGGTTTAGAATTGATGGTGTTTTA
CAACCACAACCCTTAATAAGCAAAATATTCGCCAATCGTATTATTTCACGCTTAAAATTACTGGCTAAATTAGATATTAG
TGAAAATCGACTTCCACAAGATGGACGATTTCAATTTAAAACCACTTTTTCCGATATTCTTGATTTTCGCCTTTCAACCT
TACCAACCCATTGGGGCGAAAAAATAGTGTTGCGAGCGCAACAAAATAAACCTGTAGAACTTAGCTTTGCTGAACTGGGT
ATGACCGAAAATCAGCAACAAGCATTTCAACGCGCACTTAGCCAGCCACAAGGATTAATTTTAGTAACTGGCCCAACAGG
AAGTGGAAAAAGTATCTCACTTTACACCGCACTTCAGTGGCTAAATACGCCTGATAAACATATTATGACCGCTGAGGATC
CCATTGAAATTGAGCTTGATGGCATTATTCAAAGCCAAATTAACCCACAGATTGGATTAGATTTTAGCCGTCTATTGCGC
GCTTTTTTACGTCAAGATCCCGACATCATTATGCTAGGTGAAATTCGTGATGAAGAAAGTGCGATGATTGCACTACGTGC
CGCTCAAACGGGGCATTTGGTGCTTTCAACTTTACATACCAATGATGCAATTTCTGCCATTTCTCGCTTACAACAACTCG
GTATTCAGCAGTATGAAATTGAAAACAGTTTACTACTTGTCATTGCACAGCGTCTTGTACGAAAAATCTGTCCAAAGTGC
GGTGGAAATTTAATAAATTCTTGTGATTGCCATCAAGGTTATCGAGGGCGAATCGGCGTGTATCAATTTCTACATTGGCA
ACAGAATGGCTATCAAACAGATTTTCAAAATTTACGTGTAAGTGGTTTAGAAAAAGTTAGCCAAGGCATAACAGATGAGA
AAGAAATTGAACGTGTGTTAGGTAAAAACTCATGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| pilB | Haemophilus influenzae 86-028NP |
98.491 |
100 |
0.985 |
| pilB | Haemophilus influenzae Rd KW20 |
96.76 |
99.784 |
0.966 |
| pilB | Glaesserella parasuis strain SC1401 |
57.675 |
98.276 |
0.567 |
| pilB | Vibrio cholerae strain A1552 |
39.848 |
100 |
0.453 |
| pilB | Legionella pneumophila strain ERS1305867 |
39.757 |
100 |
0.422 |
| pilB | Vibrio parahaemolyticus RIMD 2210633 |
40.373 |
100 |
0.42 |
| pilB | Vibrio campbellii strain DS40M4 |
40.206 |
100 |
0.42 |
| pilB | Acinetobacter baylyi ADP1 |
38.306 |
100 |
0.409 |
| pilB | Acinetobacter baumannii D1279779 |
38.877 |
100 |
0.403 |
| pilB | Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 |
36.853 |
100 |
0.384 |
| pilF | Neisseria gonorrhoeae MS11 |
36.975 |
100 |
0.379 |
| pilF | Thermus thermophilus HB27 |
35.924 |
100 |
0.369 |