Detailed information
Overview
| Name | pilC | Type | Machinery gene |
| Locus tag | ICA18_RS06115 | Genome accession | NZ_CP062164 |
| Coordinates | 1383992..1385248 (-) | Length | 418 a.a. |
| NCBI ID | WP_104613429.1 | Uniprot ID | - |
| Organism | Xanthomonas arboricola pv. corylina strain A7 | ||
| Function | assembly of type IV pilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 1370694..1385248 | 1383992..1385248 | within | 0 |
Gene organization within MGE regions
Location: 1370694..1385248
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ICA18_RS06050 (ICA18_06055) | - | 1370694..1371479 (+) | 786 | WP_104567471.1 | hypothetical protein | - |
| ICA18_RS06055 (ICA18_06060) | - | 1371518..1372453 (+) | 936 | WP_104567470.1 | hypothetical protein | - |
| ICA18_RS06060 (ICA18_06065) | - | 1372722..1373824 (+) | 1103 | WP_104567867.1 | IS3 family transposase | - |
| ICA18_RS06065 (ICA18_06070) | - | 1374369..1374755 (+) | 387 | WP_026064907.1 | hypothetical protein | - |
| ICA18_RS06070 (ICA18_06075) | - | 1374844..1375236 (+) | 393 | WP_023904915.1 | H-NS family nucleoid-associated regulatory protein | - |
| ICA18_RS06075 (ICA18_06080) | glgX | 1375908..1378040 (-) | 2133 | WP_053051310.1 | glycogen debranching protein GlgX | - |
| ICA18_RS06080 (ICA18_06085) | rimK | 1378531..1379436 (+) | 906 | WP_212583745.1 | 30S ribosomal protein S6--L-glutamate ligase | - |
| ICA18_RS06085 (ICA18_06090) | - | 1379663..1380022 (+) | 360 | WP_223571086.1 | hypothetical protein | - |
| ICA18_RS06090 (ICA18_06095) | - | 1380123..1380800 (+) | 678 | WP_006448355.1 | response regulator transcription factor | - |
| ICA18_RS06095 (ICA18_06100) | - | 1380793..1382127 (+) | 1335 | WP_016904089.1 | HAMP domain-containing sensor histidine kinase | - |
| ICA18_RS06100 (ICA18_06105) | - | 1382230..1382385 (-) | 156 | Protein_1187 | SymE family type I addiction module toxin | - |
| ICA18_RS06105 (ICA18_06110) | coaE | 1382485..1383108 (-) | 624 | WP_212583746.1 | dephospho-CoA kinase | - |
| ICA18_RS06110 (ICA18_06115) | - | 1383122..1383985 (-) | 864 | WP_104612232.1 | A24 family peptidase | - |
| ICA18_RS06115 (ICA18_06120) | pilC | 1383992..1385248 (-) | 1257 | WP_104613429.1 | type II secretion system F family protein | Machinery gene |
Sequence
Protein
Download Length: 418 a.a. Molecular weight: 45803.19 Da Isoelectric Point: 10.2556
>NTDB_id=486692 ICA18_RS06115 WP_104613429.1 1383992..1385248(-) (pilC) [Xanthomonas arboricola pv. corylina strain A7]
MSVARNVIKKPVDRSTSQLQPFIWEGTDKRGVKMKGEQTARNVNMLRAELRRQGITPNVVKPKPKPLFGAAGKKVTPKDI
SFFSRQMATMMKSGVPIVGSLEIIGEGHKNPRMKQMVGQIRTDIEGGSSLHESISRHPVQFDELYRNLVRAGEGAGVLET
VLDTVATYKENIEALKGKIKKALFYPAMVVVVALVVSAILLVFVVPQFEEVFKGFGAELPAFTQMIVAASRFMVSYWWAL
LVIVAGAVIGFIYAYKRSPRMQHGMDRIILKVPVIGQIMHNSAIARFARTTAVTFKAGVPLVEALGIVAGATGNKVYEEA
VLRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDTMLFKVAEYFEQEVNNAVDALSSLLEPLIMVFIGTIVG
GMVIGMYLPIFKLGAVVG
MSVARNVIKKPVDRSTSQLQPFIWEGTDKRGVKMKGEQTARNVNMLRAELRRQGITPNVVKPKPKPLFGAAGKKVTPKDI
SFFSRQMATMMKSGVPIVGSLEIIGEGHKNPRMKQMVGQIRTDIEGGSSLHESISRHPVQFDELYRNLVRAGEGAGVLET
VLDTVATYKENIEALKGKIKKALFYPAMVVVVALVVSAILLVFVVPQFEEVFKGFGAELPAFTQMIVAASRFMVSYWWAL
LVIVAGAVIGFIYAYKRSPRMQHGMDRIILKVPVIGQIMHNSAIARFARTTAVTFKAGVPLVEALGIVAGATGNKVYEEA
VLRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDTMLFKVAEYFEQEVNNAVDALSSLLEPLIMVFIGTIVG
GMVIGMYLPIFKLGAVVG
Nucleotide
Download Length: 1257 bp
>NTDB_id=486692 ICA18_RS06115 WP_104613429.1 1383992..1385248(-) (pilC) [Xanthomonas arboricola pv. corylina strain A7]
ATGTCCGTCGCTCGCAACGTCATCAAAAAACCAGTCGATCGAAGCACAAGCCAGCTGCAGCCGTTTATCTGGGAAGGAAC
GGACAAGCGCGGCGTCAAGATGAAGGGAGAGCAAACGGCGCGCAACGTAAACATGTTGCGCGCCGAACTTCGGCGCCAGG
GTATTACACCAAACGTCGTTAAGCCCAAGCCAAAGCCACTGTTTGGCGCGGCAGGCAAAAAAGTGACTCCTAAGGACATC
TCGTTCTTTAGCCGCCAAATGGCAACCATGATGAAATCCGGCGTACCTATTGTTGGTTCACTGGAAATTATTGGGGAGGG
TCACAAAAACCCACGGATGAAGCAAATGGTTGGTCAGATAAGAACCGATATCGAGGGAGGCTCGTCCCTACATGAATCCA
TTAGCAGACACCCAGTTCAATTTGACGAGCTCTACCGAAACCTAGTCCGCGCAGGTGAAGGCGCGGGCGTGCTAGAAACC
GTACTCGACACAGTTGCAACTTACAAAGAAAATATTGAGGCATTGAAGGGCAAGATCAAGAAAGCTCTTTTCTACCCTGC
GATGGTCGTCGTAGTTGCTCTCGTGGTTAGCGCAATCCTGCTAGTTTTCGTAGTTCCACAGTTCGAAGAGGTTTTCAAGG
GCTTTGGTGCGGAATTACCTGCTTTCACACAAATGATCGTAGCCGCATCGCGCTTTATGGTTAGCTACTGGTGGGCTCTA
TTGGTCATCGTAGCGGGCGCAGTCATTGGTTTCATTTATGCCTACAAGCGCTCGCCACGCATGCAGCACGGCATGGATCG
GATTATCTTAAAGGTACCTGTCATAGGCCAAATCATGCACAACAGCGCGATCGCTCGCTTCGCCCGCACAACTGCCGTTA
CCTTTAAAGCCGGTGTTCCGTTAGTGGAAGCGCTAGGGATTGTGGCTGGCGCCACAGGCAATAAAGTCTACGAAGAAGCC
GTGCTTCGCATGCGAGACGATGTGTCGGTGGGCTACCCGGTCAACATGGCCATGAAACAGGTCAATTTATTCCCGCACAT
GGTGATCCAAATGACTGCCATTGGCGAGGAAGCAGGTGCGCTTGACACTATGCTGTTCAAGGTTGCCGAGTACTTCGAGC
AGGAAGTTAACAACGCCGTGGATGCGCTCAGCAGCCTACTGGAACCGCTGATCATGGTGTTCATCGGCACCATCGTAGGC
GGCATGGTCATAGGCATGTATCTTCCGATCTTCAAACTCGGCGCAGTGGTTGGATAA
ATGTCCGTCGCTCGCAACGTCATCAAAAAACCAGTCGATCGAAGCACAAGCCAGCTGCAGCCGTTTATCTGGGAAGGAAC
GGACAAGCGCGGCGTCAAGATGAAGGGAGAGCAAACGGCGCGCAACGTAAACATGTTGCGCGCCGAACTTCGGCGCCAGG
GTATTACACCAAACGTCGTTAAGCCCAAGCCAAAGCCACTGTTTGGCGCGGCAGGCAAAAAAGTGACTCCTAAGGACATC
TCGTTCTTTAGCCGCCAAATGGCAACCATGATGAAATCCGGCGTACCTATTGTTGGTTCACTGGAAATTATTGGGGAGGG
TCACAAAAACCCACGGATGAAGCAAATGGTTGGTCAGATAAGAACCGATATCGAGGGAGGCTCGTCCCTACATGAATCCA
TTAGCAGACACCCAGTTCAATTTGACGAGCTCTACCGAAACCTAGTCCGCGCAGGTGAAGGCGCGGGCGTGCTAGAAACC
GTACTCGACACAGTTGCAACTTACAAAGAAAATATTGAGGCATTGAAGGGCAAGATCAAGAAAGCTCTTTTCTACCCTGC
GATGGTCGTCGTAGTTGCTCTCGTGGTTAGCGCAATCCTGCTAGTTTTCGTAGTTCCACAGTTCGAAGAGGTTTTCAAGG
GCTTTGGTGCGGAATTACCTGCTTTCACACAAATGATCGTAGCCGCATCGCGCTTTATGGTTAGCTACTGGTGGGCTCTA
TTGGTCATCGTAGCGGGCGCAGTCATTGGTTTCATTTATGCCTACAAGCGCTCGCCACGCATGCAGCACGGCATGGATCG
GATTATCTTAAAGGTACCTGTCATAGGCCAAATCATGCACAACAGCGCGATCGCTCGCTTCGCCCGCACAACTGCCGTTA
CCTTTAAAGCCGGTGTTCCGTTAGTGGAAGCGCTAGGGATTGTGGCTGGCGCCACAGGCAATAAAGTCTACGAAGAAGCC
GTGCTTCGCATGCGAGACGATGTGTCGGTGGGCTACCCGGTCAACATGGCCATGAAACAGGTCAATTTATTCCCGCACAT
GGTGATCCAAATGACTGCCATTGGCGAGGAAGCAGGTGCGCTTGACACTATGCTGTTCAAGGTTGCCGAGTACTTCGAGC
AGGAAGTTAACAACGCCGTGGATGCGCTCAGCAGCCTACTGGAACCGCTGATCATGGTGTTCATCGGCACCATCGTAGGC
GGCATGGTCATAGGCATGTATCTTCCGATCTTCAAACTCGGCGCAGTGGTTGGATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| pilC | Legionella pneumophila strain ERS1305867 |
53.202 |
97.129 |
0.517 |
| pilC | Pseudomonas stutzeri DSM 10701 |
52.525 |
94.737 |
0.498 |
| pilC | Acinetobacter baylyi ADP1 |
51.365 |
96.411 |
0.495 |
| pilC | Acinetobacter baumannii D1279779 |
50.62 |
96.411 |
0.488 |
| pilG | Neisseria gonorrhoeae MS11 |
43.86 |
95.455 |
0.419 |
| pilG | Neisseria meningitidis 44/76-A |
43.358 |
95.455 |
0.414 |
| pilC | Vibrio cholerae strain A1552 |
41.5 |
95.694 |
0.397 |
| pilC | Vibrio campbellii strain DS40M4 |
40.695 |
96.411 |
0.392 |