Detailed information
Overview
| Name | pilC | Type | Machinery gene |
| Locus tag | MM141_RS23820 | Genome accession | NZ_CP093012 |
| Coordinates | 5113518..5114738 (+) | Length | 406 a.a. |
| NCBI ID | WP_016253888.1 | Uniprot ID | - |
| Organism | Pseudomonas aeruginosa strain H20 | ||
| Function | assembly of type IV pilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 5113518..5122222 | 5113518..5114738 | within | 0 |
Gene organization within MGE regions
Location: 5113518..5122222
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| MM141_RS23820 (MM141_23830) | pilC | 5113518..5114738 (+) | 1221 | WP_016253888.1 | type 4a pilus biogenesis protein PilC | Machinery gene |
| MM141_RS23825 (MM141_23835) | pilD | 5114742..5115614 (+) | 873 | WP_009875878.1 | type IV prepilin peptidase/methyltransferase PilD | Machinery gene |
| MM141_RS23830 (MM141_23840) | coaE | 5115611..5116222 (+) | 612 | WP_003112838.1 | dephospho-CoA kinase | - |
| MM141_RS23835 (MM141_23845) | yacG | 5116219..5116419 (+) | 201 | WP_003094656.1 | DNA gyrase inhibitor YacG | - |
| MM141_RS23840 (MM141_23850) | - | 5116456..5116665 (-) | 210 | WP_003094660.1 | hypothetical protein | - |
| MM141_RS23845 (MM141_23855) | - | 5116771..5117460 (-) | 690 | WP_003103868.1 | energy-coupling factor ABC transporter permease | - |
| MM141_RS23850 (MM141_23860) | - | 5117457..5117927 (-) | 471 | WP_003094664.1 | hypothetical protein | - |
| MM141_RS23855 (MM141_23865) | - | 5117924..5118349 (-) | 426 | WP_009875877.1 | GNAT family N-acetyltransferase | - |
| MM141_RS23860 (MM141_23870) | - | 5118482..5119111 (+) | 630 | WP_003094668.1 | DUF1780 domain-containing protein | - |
| MM141_RS23865 (MM141_23875) | - | 5119108..5119557 (+) | 450 | WP_003094670.1 | MOSC domain-containing protein | - |
| MM141_RS23870 (MM141_23880) | - | 5119583..5119753 (+) | 171 | WP_003094672.1 | DUF3094 family protein | - |
| MM141_RS23875 (MM141_23885) | - | 5119817..5121124 (+) | 1308 | WP_003112837.1 | NAD(P)/FAD-dependent oxidoreductase | - |
Sequence
Protein
Download Length: 406 a.a. Molecular weight: 44549.37 Da Isoelectric Point: 9.6983
>NTDB_id=661335 MM141_RS23820 WP_016253888.1 5113518..5114738(+) (pilC) [Pseudomonas aeruginosa strain H20]
MADKALKTSVFIWEGTDKKGAKVKGELTGQNPMLVKAHLRKQGINPLKVRKKGISLLGAGKKVKPMDIALFTRQMATMMG
AGVPLLQSFDIIGEGFDNPNMRKLVDEIKQEVSSGNSLANSLRKKPQYFDELYCNLVDAGEQSGALENLLDRVATYKEKT
ESLKAKIKKAMTYPIAVIIVALIVSAILLIKVVPQFQSVFEGFGAELPAFTQMIVNLSEFMQEWWFFIILAIAIFGFAFK
ELHKRSQKFRDTLDRTILKLPIFGGIVYKSAVARYARTLSTTFAAGVPLVDALDSVSGATGNIVFKNAVSKIKQDVSTGM
QLNFSMRTTSVFPNMAIQMTAIGEESGSLDEMLSKVASYYEEEVDNAVDNLTTLMEPMIMAVLGVLVGGLIVAMYLPIFQ
LGNVVG
MADKALKTSVFIWEGTDKKGAKVKGELTGQNPMLVKAHLRKQGINPLKVRKKGISLLGAGKKVKPMDIALFTRQMATMMG
AGVPLLQSFDIIGEGFDNPNMRKLVDEIKQEVSSGNSLANSLRKKPQYFDELYCNLVDAGEQSGALENLLDRVATYKEKT
ESLKAKIKKAMTYPIAVIIVALIVSAILLIKVVPQFQSVFEGFGAELPAFTQMIVNLSEFMQEWWFFIILAIAIFGFAFK
ELHKRSQKFRDTLDRTILKLPIFGGIVYKSAVARYARTLSTTFAAGVPLVDALDSVSGATGNIVFKNAVSKIKQDVSTGM
QLNFSMRTTSVFPNMAIQMTAIGEESGSLDEMLSKVASYYEEEVDNAVDNLTTLMEPMIMAVLGVLVGGLIVAMYLPIFQ
LGNVVG
Nucleotide
Download Length: 1221 bp
>NTDB_id=661335 MM141_RS23820 WP_016253888.1 5113518..5114738(+) (pilC) [Pseudomonas aeruginosa strain H20]
ATGGCGGACAAAGCGTTAAAAACCAGCGTTTTCATCTGGGAGGGCACCGACAAGAAAGGCGCCAAGGTCAAGGGCGAACT
GACCGGGCAGAATCCCATGCTGGTGAAAGCCCATCTGCGCAAGCAAGGCATCAATCCGCTCAAGGTACGCAAGAAAGGTA
TCTCCCTGCTGGGCGCAGGCAAGAAAGTGAAACCCATGGACATCGCCCTGTTCACCCGGCAGATGGCGACCATGATGGGC
GCTGGCGTTCCCCTCCTGCAATCGTTCGACATCATCGGCGAGGGCTTCGACAACCCCAACATGCGCAAGCTTGTGGATGA
AATCAAACAGGAAGTTTCCTCAGGTAACAGCCTAGCCAACTCCTTGAGAAAAAAGCCCCAGTATTTTGACGAGCTTTATT
GCAACCTGGTAGATGCAGGGGAACAGTCTGGCGCCTTGGAAAACCTTCTCGATCGGGTGGCAACCTATAAAGAAAAGACG
GAATCACTGAAAGCCAAGATCAAAAAGGCGATGACCTATCCCATTGCCGTCATCATTGTCGCACTGATTGTATCTGCGAT
CCTCCTGATTAAAGTGGTTCCACAATTTCAGTCGGTCTTTGAAGGTTTCGGCGCGGAACTTCCCGCCTTTACCCAGATGA
TTGTCAATCTATCGGAGTTCATGCAGGAGTGGTGGTTCTTCATCATACTGGCGATAGCGATATTTGGCTTTGCATTCAAA
GAATTGCATAAACGCTCACAAAAATTCCGTGACACACTCGATAGAACGATCCTCAAACTTCCCATTTTCGGAGGCATCGT
CTACAAATCTGCGGTCGCCCGTTATGCACGGACCTTGTCCACGACCTTCGCCGCGGGTGTTCCCCTGGTCGATGCGCTCG
ACTCCGTCTCCGGAGCGACCGGCAATATCGTGTTCAAGAACGCGGTCAGCAAGATCAAGCAAGACGTTTCCACCGGCATG
CAGCTCAACTTCTCCATGCGCACCACCAGCGTCTTTCCCAACATGGCGATCCAGATGACCGCCATCGGCGAGGAGTCCGG
TTCGCTCGATGAGATGCTGAGCAAAGTCGCCAGCTACTACGAAGAGGAAGTCGACAACGCCGTGGACAACCTCACCACGC
TCATGGAACCGATGATCATGGCCGTTCTCGGCGTACTGGTTGGCGGTCTGATCGTGGCCATGTACCTTCCGATCTTCCAA
CTCGGCAACGTCGTCGGATAA
ATGGCGGACAAAGCGTTAAAAACCAGCGTTTTCATCTGGGAGGGCACCGACAAGAAAGGCGCCAAGGTCAAGGGCGAACT
GACCGGGCAGAATCCCATGCTGGTGAAAGCCCATCTGCGCAAGCAAGGCATCAATCCGCTCAAGGTACGCAAGAAAGGTA
TCTCCCTGCTGGGCGCAGGCAAGAAAGTGAAACCCATGGACATCGCCCTGTTCACCCGGCAGATGGCGACCATGATGGGC
GCTGGCGTTCCCCTCCTGCAATCGTTCGACATCATCGGCGAGGGCTTCGACAACCCCAACATGCGCAAGCTTGTGGATGA
AATCAAACAGGAAGTTTCCTCAGGTAACAGCCTAGCCAACTCCTTGAGAAAAAAGCCCCAGTATTTTGACGAGCTTTATT
GCAACCTGGTAGATGCAGGGGAACAGTCTGGCGCCTTGGAAAACCTTCTCGATCGGGTGGCAACCTATAAAGAAAAGACG
GAATCACTGAAAGCCAAGATCAAAAAGGCGATGACCTATCCCATTGCCGTCATCATTGTCGCACTGATTGTATCTGCGAT
CCTCCTGATTAAAGTGGTTCCACAATTTCAGTCGGTCTTTGAAGGTTTCGGCGCGGAACTTCCCGCCTTTACCCAGATGA
TTGTCAATCTATCGGAGTTCATGCAGGAGTGGTGGTTCTTCATCATACTGGCGATAGCGATATTTGGCTTTGCATTCAAA
GAATTGCATAAACGCTCACAAAAATTCCGTGACACACTCGATAGAACGATCCTCAAACTTCCCATTTTCGGAGGCATCGT
CTACAAATCTGCGGTCGCCCGTTATGCACGGACCTTGTCCACGACCTTCGCCGCGGGTGTTCCCCTGGTCGATGCGCTCG
ACTCCGTCTCCGGAGCGACCGGCAATATCGTGTTCAAGAACGCGGTCAGCAAGATCAAGCAAGACGTTTCCACCGGCATG
CAGCTCAACTTCTCCATGCGCACCACCAGCGTCTTTCCCAACATGGCGATCCAGATGACCGCCATCGGCGAGGAGTCCGG
TTCGCTCGATGAGATGCTGAGCAAAGTCGCCAGCTACTACGAAGAGGAAGTCGACAACGCCGTGGACAACCTCACCACGC
TCATGGAACCGATGATCATGGCCGTTCTCGGCGTACTGGTTGGCGGTCTGATCGTGGCCATGTACCTTCCGATCTTCCAA
CTCGGCAACGTCGTCGGATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| pilC | Pseudomonas stutzeri DSM 10701 |
76.543 |
99.754 |
0.764 |
| pilC | Acinetobacter baumannii D1279779 |
61.386 |
99.507 |
0.611 |
| pilC | Acinetobacter baylyi ADP1 |
60.837 |
100 |
0.608 |
| pilC | Legionella pneumophila strain ERS1305867 |
55.051 |
97.537 |
0.537 |
| pilG | Neisseria gonorrhoeae MS11 |
46.173 |
99.754 |
0.461 |
| pilG | Neisseria meningitidis 44/76-A |
45.409 |
99.261 |
0.451 |
| pilC | Vibrio cholerae strain A1552 |
42.611 |
100 |
0.426 |
| pilC | Vibrio campbellii strain DS40M4 |
42.065 |
97.783 |
0.411 |
| pilC | Thermus thermophilus HB27 |
36.908 |
98.768 |
0.365 |