Detailed information
Overview
| Name | pilB | Type | Machinery gene |
| Locus tag | LZZ50_RS17375 | Genome accession | NZ_CP090441 |
| Coordinates | 4123653..4125389 (-) | Length | 578 a.a. |
| NCBI ID | WP_245102984.1 | Uniprot ID | - |
| Organism | Xanthomonas arboricola strain YchA | ||
| Function | power the assembly of type IV pilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 4125424..4135694 | 4123653..4125389 | flank | 35 |
Gene organization within MGE regions
Location: 4123653..4135694
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| LZZ50_RS17375 (LZZ50_17375) | pilB | 4123653..4125389 (-) | 1737 | WP_245102984.1 | type IV-A pilus assembly ATPase PilB | Machinery gene |
| LZZ50_RS17380 (LZZ50_17380) | - | 4125424..4126950 (-) | 1527 | WP_342370390.1 | hypothetical protein | - |
| LZZ50_RS17385 (LZZ50_17385) | - | 4126920..4127549 (-) | 630 | WP_245102989.1 | hypothetical protein | - |
| LZZ50_RS17390 (LZZ50_17390) | - | 4127560..4129149 (-) | 1590 | WP_245102991.1 | phosphoethanolamine transferase | - |
| LZZ50_RS17395 (LZZ50_17395) | pilA2 | 4129285..4129710 (-) | 426 | WP_054592196.1 | pilin | Machinery gene |
| LZZ50_RS17400 (LZZ50_17400) | pilC | 4130064..4131323 (+) | 1260 | WP_184362404.1 | type II secretion system F family protein | Machinery gene |
| LZZ50_RS17405 (LZZ50_17405) | - | 4131330..4132193 (+) | 864 | WP_016849610.1 | A24 family peptidase | - |
| LZZ50_RS17410 (LZZ50_17410) | coaE | 4132207..4132830 (+) | 624 | WP_184608693.1 | dephospho-CoA kinase | - |
| LZZ50_RS17415 (LZZ50_17415) | - | 4132930..4133076 (+) | 147 | Protein_3410 | SymE family type I addiction module toxin | - |
| LZZ50_RS17420 (LZZ50_17420) | - | 4133188..4134522 (-) | 1335 | WP_016904089.1 | HAMP domain-containing sensor histidine kinase | - |
| LZZ50_RS17425 (LZZ50_17425) | - | 4134515..4135192 (-) | 678 | WP_006448355.1 | response regulator transcription factor | - |
| LZZ50_RS17430 (LZZ50_17430) | - | 4135224..4135694 (-) | 471 | WP_245102993.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 578 a.a. Molecular weight: 62708.82 Da Isoelectric Point: 5.6063
>NTDB_id=642476 LZZ50_RS17375 WP_245102984.1 4123653..4125389(-) (pilB) [Xanthomonas arboricola strain YchA]
MNSVATTNLVGITGIARRLVQDGALEEAAARTAMEQAAAAKVPLPQWFAERKLVSAAQLAAANAVEFGMPLMDVSVFDAN
QNAVKLVSEELLQKYQVLPLFKRGNRLFVGVSNPTQTRALDDIKFHTNLVVEPILVDEDQIRRTLEQWQASNTSLGSSLG
DDDEGMGDLDVSAGEEDMGAGGDSGVDAKGDDTPVVKFVNKVLVDAIRRGASDIHFEPYEDDYRVRLRIDGLLKNVAKAP
VKLNQRIAARLKVMSQLDIAEKRVPQDGRIKLNLSKTKQIDFRVSTLPTLFGEKVVLRILDGSAAKLGIEKLGYEADQQK
LFLEAIHKPYGMVLVTGPTGSGKTVSLYTALGILNDETRNISTAEDPVEIRLPGVNQVQQNNKRGMTFAAALRSFLRQDP
DIIMVGEIRDLETAEIAIKAAQTGHMVLSTLHTNDAPQTIARLMNMGIAPYNITSSVTLVIAQRLARRLCNNCKRKSTLP
EHALLAEGFTPAQIAAGIELYEAVGCDECTEGYKGRTGIYQVMPMTDEIGAIVLEGGNAMQIAEAAQSIGIRDLRQSALV
KAAHGVTSLAEINRVTKD
MNSVATTNLVGITGIARRLVQDGALEEAAARTAMEQAAAAKVPLPQWFAERKLVSAAQLAAANAVEFGMPLMDVSVFDAN
QNAVKLVSEELLQKYQVLPLFKRGNRLFVGVSNPTQTRALDDIKFHTNLVVEPILVDEDQIRRTLEQWQASNTSLGSSLG
DDDEGMGDLDVSAGEEDMGAGGDSGVDAKGDDTPVVKFVNKVLVDAIRRGASDIHFEPYEDDYRVRLRIDGLLKNVAKAP
VKLNQRIAARLKVMSQLDIAEKRVPQDGRIKLNLSKTKQIDFRVSTLPTLFGEKVVLRILDGSAAKLGIEKLGYEADQQK
LFLEAIHKPYGMVLVTGPTGSGKTVSLYTALGILNDETRNISTAEDPVEIRLPGVNQVQQNNKRGMTFAAALRSFLRQDP
DIIMVGEIRDLETAEIAIKAAQTGHMVLSTLHTNDAPQTIARLMNMGIAPYNITSSVTLVIAQRLARRLCNNCKRKSTLP
EHALLAEGFTPAQIAAGIELYEAVGCDECTEGYKGRTGIYQVMPMTDEIGAIVLEGGNAMQIAEAAQSIGIRDLRQSALV
KAAHGVTSLAEINRVTKD
Nucleotide
Download Length: 1737 bp
>NTDB_id=642476 LZZ50_RS17375 WP_245102984.1 4123653..4125389(-) (pilB) [Xanthomonas arboricola strain YchA]
ATGAATTCAGTAGCTACCACCAACCTCGTTGGTATTACCGGCATCGCTCGTCGCCTTGTGCAGGATGGTGCCCTGGAGGA
AGCCGCTGCGCGGACGGCAATGGAACAGGCGGCCGCTGCCAAGGTGCCGCTTCCTCAGTGGTTTGCCGAAAGAAAACTGG
TGTCGGCGGCACAACTGGCGGCAGCCAATGCCGTGGAATTCGGTATGCCGCTGATGGATGTGTCGGTGTTCGACGCCAAT
CAGAACGCGGTCAAGCTGGTCAGCGAGGAGTTGCTCCAGAAGTACCAGGTGCTGCCGCTGTTCAAGCGCGGTAACCGGTT
GTTCGTGGGGGTGAGCAACCCGACCCAGACCCGGGCGCTGGACGACATCAAGTTCCATACCAACCTGGTGGTCGAGCCGA
TCCTGGTAGACGAGGACCAGATCCGTCGCACCTTGGAGCAATGGCAAGCCAGCAATACGTCGCTTGGCTCGTCGCTCGGT
GACGACGATGAGGGGATGGGCGACCTGGACGTGTCGGCCGGGGAAGAGGACATGGGCGCCGGCGGGGATTCCGGGGTCGA
TGCCAAGGGCGACGACACGCCGGTGGTGAAGTTCGTCAACAAGGTGCTGGTGGATGCGATCCGGCGGGGAGCCTCCGACA
TCCACTTCGAGCCGTACGAAGACGACTATCGGGTGCGCTTGCGCATCGATGGGCTGCTGAAGAATGTGGCCAAGGCACCG
GTAAAGCTGAACCAGCGCATCGCGGCGCGTCTGAAGGTGATGTCGCAGCTGGATATCGCCGAGAAGCGGGTGCCGCAGGA
CGGGCGCATCAAGCTCAACCTGTCCAAGACCAAGCAGATCGACTTCCGCGTGAGCACCTTGCCGACGTTGTTCGGCGAGA
AGGTGGTGCTGCGTATCCTGGACGGCAGCGCGGCCAAGCTGGGCATCGAGAAGCTGGGCTACGAGGCGGACCAGCAGAAG
CTGTTCCTGGAGGCGATCCACAAGCCGTACGGGATGGTGTTGGTGACCGGGCCGACCGGTTCGGGCAAGACGGTGTCGCT
GTACACGGCACTGGGCATCCTCAACGACGAGACGCGCAACATCTCCACCGCCGAGGACCCGGTGGAAATCCGCCTGCCTG
GCGTCAATCAGGTGCAGCAGAACAACAAGCGCGGCATGACCTTCGCCGCGGCGCTGCGCTCGTTCCTGCGCCAGGACCCG
GACATCATCATGGTCGGCGAAATCCGCGACCTGGAGACGGCCGAGATTGCGATCAAGGCCGCGCAGACCGGTCACATGGT
GCTGTCCACGCTGCACACCAACGATGCGCCGCAGACCATCGCGCGTTTGATGAACATGGGCATCGCGCCTTACAACATCA
CCTCGTCGGTGACCCTGGTGATCGCGCAGCGTCTGGCGCGGCGCCTGTGCAACAACTGCAAGCGCAAATCGACCTTGCCC
GAGCATGCGTTGCTGGCCGAAGGCTTCACGCCTGCGCAGATCGCTGCCGGGATCGAGCTATATGAGGCGGTCGGTTGCGA
CGAGTGCACCGAAGGCTACAAGGGCCGTACCGGTATCTACCAGGTGATGCCGATGACCGACGAGATCGGCGCGATCGTGC
TGGAAGGCGGTAACGCGATGCAGATCGCCGAGGCGGCACAGAGTATCGGCATCCGCGACCTGCGCCAGTCGGCGTTGGTC
AAGGCCGCGCACGGGGTAACCAGCCTGGCCGAGATCAACCGAGTGACGAAGGACTAA
ATGAATTCAGTAGCTACCACCAACCTCGTTGGTATTACCGGCATCGCTCGTCGCCTTGTGCAGGATGGTGCCCTGGAGGA
AGCCGCTGCGCGGACGGCAATGGAACAGGCGGCCGCTGCCAAGGTGCCGCTTCCTCAGTGGTTTGCCGAAAGAAAACTGG
TGTCGGCGGCACAACTGGCGGCAGCCAATGCCGTGGAATTCGGTATGCCGCTGATGGATGTGTCGGTGTTCGACGCCAAT
CAGAACGCGGTCAAGCTGGTCAGCGAGGAGTTGCTCCAGAAGTACCAGGTGCTGCCGCTGTTCAAGCGCGGTAACCGGTT
GTTCGTGGGGGTGAGCAACCCGACCCAGACCCGGGCGCTGGACGACATCAAGTTCCATACCAACCTGGTGGTCGAGCCGA
TCCTGGTAGACGAGGACCAGATCCGTCGCACCTTGGAGCAATGGCAAGCCAGCAATACGTCGCTTGGCTCGTCGCTCGGT
GACGACGATGAGGGGATGGGCGACCTGGACGTGTCGGCCGGGGAAGAGGACATGGGCGCCGGCGGGGATTCCGGGGTCGA
TGCCAAGGGCGACGACACGCCGGTGGTGAAGTTCGTCAACAAGGTGCTGGTGGATGCGATCCGGCGGGGAGCCTCCGACA
TCCACTTCGAGCCGTACGAAGACGACTATCGGGTGCGCTTGCGCATCGATGGGCTGCTGAAGAATGTGGCCAAGGCACCG
GTAAAGCTGAACCAGCGCATCGCGGCGCGTCTGAAGGTGATGTCGCAGCTGGATATCGCCGAGAAGCGGGTGCCGCAGGA
CGGGCGCATCAAGCTCAACCTGTCCAAGACCAAGCAGATCGACTTCCGCGTGAGCACCTTGCCGACGTTGTTCGGCGAGA
AGGTGGTGCTGCGTATCCTGGACGGCAGCGCGGCCAAGCTGGGCATCGAGAAGCTGGGCTACGAGGCGGACCAGCAGAAG
CTGTTCCTGGAGGCGATCCACAAGCCGTACGGGATGGTGTTGGTGACCGGGCCGACCGGTTCGGGCAAGACGGTGTCGCT
GTACACGGCACTGGGCATCCTCAACGACGAGACGCGCAACATCTCCACCGCCGAGGACCCGGTGGAAATCCGCCTGCCTG
GCGTCAATCAGGTGCAGCAGAACAACAAGCGCGGCATGACCTTCGCCGCGGCGCTGCGCTCGTTCCTGCGCCAGGACCCG
GACATCATCATGGTCGGCGAAATCCGCGACCTGGAGACGGCCGAGATTGCGATCAAGGCCGCGCAGACCGGTCACATGGT
GCTGTCCACGCTGCACACCAACGATGCGCCGCAGACCATCGCGCGTTTGATGAACATGGGCATCGCGCCTTACAACATCA
CCTCGTCGGTGACCCTGGTGATCGCGCAGCGTCTGGCGCGGCGCCTGTGCAACAACTGCAAGCGCAAATCGACCTTGCCC
GAGCATGCGTTGCTGGCCGAAGGCTTCACGCCTGCGCAGATCGCTGCCGGGATCGAGCTATATGAGGCGGTCGGTTGCGA
CGAGTGCACCGAAGGCTACAAGGGCCGTACCGGTATCTACCAGGTGATGCCGATGACCGACGAGATCGGCGCGATCGTGC
TGGAAGGCGGTAACGCGATGCAGATCGCCGAGGCGGCACAGAGTATCGGCATCCGCGACCTGCGCCAGTCGGCGTTGGTC
AAGGCCGCGCACGGGGTAACCAGCCTGGCCGAGATCAACCGAGTGACGAAGGACTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| pilB | Acinetobacter baylyi ADP1 |
55.478 |
99.481 |
0.552 |
| pilB | Acinetobacter baumannii D1279779 |
55.634 |
98.27 |
0.547 |
| pilB | Legionella pneumophila strain ERS1305867 |
52.373 |
98.443 |
0.516 |
| pilB | Vibrio cholerae strain A1552 |
49.135 |
100 |
0.491 |
| pilF | Neisseria gonorrhoeae MS11 |
49.12 |
98.27 |
0.483 |
| pilB | Vibrio campbellii strain DS40M4 |
45.87 |
98.443 |
0.452 |
| pilB | Vibrio parahaemolyticus RIMD 2210633 |
46.429 |
96.886 |
0.45 |