Detailed information
Overview
| Name | pilC | Type | Machinery gene |
| Locus tag | NX720_RS18860 | Genome accession | NZ_CP103300 |
| Coordinates | 4685428..4686645 (-) | Length | 405 a.a. |
| NCBI ID | WP_262596681.1 | Uniprot ID | - |
| Organism | Endozoicomonas euniceicola strain EF212 | ||
| Function | assembly of type IV pilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Genomic island | 4668794..4685291 | 4685428..4686645 | flank | 137 |
Gene organization within MGE regions
Location: 4668794..4686645
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NX720_RS18795 (NX720_18795) | - | 4668794..4669432 (+) | 639 | WP_262596661.1 | VWA domain-containing protein | - |
| NX720_RS18800 (NX720_18800) | - | 4669467..4670114 (+) | 648 | WP_262596662.1 | VWA domain-containing protein | - |
| NX720_RS18805 (NX720_18805) | - | 4670130..4671161 (+) | 1032 | WP_262596663.1 | TerY-C metal binding domain-containing protein | - |
| NX720_RS18810 (NX720_18810) | - | 4671161..4673077 (+) | 1917 | WP_262596665.1 | PP2C family serine/threonine-protein phosphatase | - |
| NX720_RS18815 (NX720_18815) | - | 4673070..4674566 (+) | 1497 | WP_262596667.1 | helix-hairpin-helix domain-containing protein | - |
| NX720_RS18820 (NX720_18820) | - | 4675150..4675998 (-) | 849 | WP_262596669.1 | hypothetical protein | - |
| NX720_RS18825 (NX720_18825) | tnpC | 4676652..4678205 (-) | 1554 | WP_262596671.1 | IS66 family transposase | - |
| NX720_RS18830 (NX720_18830) | tnpB | 4678215..4678583 (-) | 369 | WP_262596672.1 | IS66 family insertion sequence element accessory protein TnpB | - |
| NX720_RS18835 (NX720_18835) | tnpA | 4678585..4678920 (-) | 336 | WP_262596673.1 | IS66 family insertion sequence element accessory protein TnpA | - |
| NX720_RS18840 (NX720_18840) | - | 4678917..4680824 (-) | 1908 | WP_262596675.1 | TnsD family transposase | - |
| NX720_RS18845 (NX720_18845) | - | 4680860..4682236 (-) | 1377 | WP_262596677.1 | ATP-binding protein | - |
| NX720_RS18850 (NX720_18850) | - | 4682252..4684468 (-) | 2217 | WP_262596678.1 | Mu transposase C-terminal domain-containing protein | - |
| NX720_RS18855 (NX720_18855) | - | 4684470..4685291 (-) | 822 | WP_262596680.1 | TnsA endonuclease N-terminal domain-containing protein | - |
| NX720_RS18860 (NX720_18860) | pilC | 4685428..4686645 (-) | 1218 | WP_262596681.1 | type II secretion system F family protein | Machinery gene |
Sequence
Protein
Download Length: 405 a.a. Molecular weight: 43608.37 Da Isoelectric Point: 9.7718
>NTDB_id=721823 NX720_RS18860 WP_262596681.1 4685428..4686645(-) (pilC) [Endozoicomonas euniceicola strain EF212]
MAKKSAKSSTFIWEGKDKSGRKTKGEIEGTSIALIKAELRKQGISATRVKKKGMSFGKKGGKITPLDIALFTRQLATMIK
AGVPLLNAFDITTDGIEKPAMKELLVKVKNEVAGGTTLAEALRAHPLYFDDLYCNLVSSGEQSGALETLLDRIATYKEKS
EALKAKIKKAMNYPVAVVCVAFIVTGILLVKVVPQFEEVFQGFGAELPAFTQMVIHISNFVQQWWLAAILGLAAFGFMIK
KLMLRSKAARDKKDRLVLKLPVIGPILEKSAVARFARTLSTTFAAGVPLVDALDSVSGAAGNVVFADATNQIKEDVSTGQ
QLQFAMKSSGIFPAMAIQMVSIGEESGSLDEMLDKVATFYENEVDNAVDGLTSLMEPLIMSVLGVLVGGLIVAMYLPIFQ
MGSVV
MAKKSAKSSTFIWEGKDKSGRKTKGEIEGTSIALIKAELRKQGISATRVKKKGMSFGKKGGKITPLDIALFTRQLATMIK
AGVPLLNAFDITTDGIEKPAMKELLVKVKNEVAGGTTLAEALRAHPLYFDDLYCNLVSSGEQSGALETLLDRIATYKEKS
EALKAKIKKAMNYPVAVVCVAFIVTGILLVKVVPQFEEVFQGFGAELPAFTQMVIHISNFVQQWWLAAILGLAAFGFMIK
KLMLRSKAARDKKDRLVLKLPVIGPILEKSAVARFARTLSTTFAAGVPLVDALDSVSGAAGNVVFADATNQIKEDVSTGQ
QLQFAMKSSGIFPAMAIQMVSIGEESGSLDEMLDKVATFYENEVDNAVDGLTSLMEPLIMSVLGVLVGGLIVAMYLPIFQ
MGSVV
Nucleotide
Download Length: 1218 bp
>NTDB_id=721823 NX720_RS18860 WP_262596681.1 4685428..4686645(-) (pilC) [Endozoicomonas euniceicola strain EF212]
ATGGCCAAAAAATCAGCCAAATCCTCTACTTTTATCTGGGAAGGTAAAGACAAGAGTGGACGTAAAACCAAAGGGGAAAT
AGAAGGTACCAGCATTGCCTTGATCAAGGCCGAGCTGCGTAAGCAGGGCATTTCTGCCACCAGGGTAAAAAAGAAAGGTA
TGTCCTTTGGCAAAAAAGGGGGCAAAATTACCCCCCTTGATATCGCCCTGTTCACCCGGCAGCTTGCCACTATGATAAAA
GCGGGTGTACCACTGCTCAACGCCTTTGACATAACAACAGACGGTATCGAAAAACCCGCCATGAAAGAGCTGCTTGTCAA
GGTTAAAAACGAGGTGGCAGGTGGTACGACTCTGGCCGAAGCCCTTCGGGCGCACCCACTCTATTTTGATGACCTGTACT
GCAACCTGGTCAGCTCCGGTGAACAGTCCGGAGCACTGGAAACATTGCTGGACAGAATTGCAACCTATAAAGAGAAGTCT
GAAGCCCTTAAAGCCAAAATCAAAAAAGCGATGAACTACCCGGTTGCGGTTGTCTGTGTTGCTTTCATTGTTACCGGCAT
TCTGCTGGTAAAAGTGGTGCCACAGTTTGAAGAAGTCTTTCAGGGATTCGGAGCTGAACTCCCGGCCTTCACCCAGATGG
TTATTCATATTTCCAACTTTGTTCAGCAATGGTGGCTGGCGGCTATTCTCGGACTGGCGGCCTTTGGTTTTATGATTAAA
AAACTGATGCTGCGCTCAAAAGCCGCAAGAGATAAAAAAGACAGGCTAGTGCTAAAGCTGCCTGTTATTGGTCCAATACT
CGAAAAGTCTGCCGTCGCACGTTTTGCCCGAACCCTTTCAACGACATTTGCTGCCGGTGTTCCATTAGTCGACGCACTGG
ACTCAGTATCAGGCGCTGCGGGCAATGTTGTCTTTGCGGATGCCACCAACCAAATCAAAGAAGATGTTTCCACAGGTCAG
CAGCTGCAATTTGCTATGAAGAGTTCAGGCATTTTTCCGGCGATGGCGATTCAGATGGTTTCCATCGGAGAAGAGTCTGG
CTCTCTGGATGAAATGCTGGATAAAGTCGCCACCTTCTACGAAAACGAAGTGGATAACGCCGTTGATGGCCTGACCAGCC
TGATGGAACCATTGATTATGTCGGTACTCGGGGTGCTGGTTGGAGGCCTGATTGTGGCTATGTACCTGCCTATCTTTCAG
ATGGGGTCTGTTGTGTAA
ATGGCCAAAAAATCAGCCAAATCCTCTACTTTTATCTGGGAAGGTAAAGACAAGAGTGGACGTAAAACCAAAGGGGAAAT
AGAAGGTACCAGCATTGCCTTGATCAAGGCCGAGCTGCGTAAGCAGGGCATTTCTGCCACCAGGGTAAAAAAGAAAGGTA
TGTCCTTTGGCAAAAAAGGGGGCAAAATTACCCCCCTTGATATCGCCCTGTTCACCCGGCAGCTTGCCACTATGATAAAA
GCGGGTGTACCACTGCTCAACGCCTTTGACATAACAACAGACGGTATCGAAAAACCCGCCATGAAAGAGCTGCTTGTCAA
GGTTAAAAACGAGGTGGCAGGTGGTACGACTCTGGCCGAAGCCCTTCGGGCGCACCCACTCTATTTTGATGACCTGTACT
GCAACCTGGTCAGCTCCGGTGAACAGTCCGGAGCACTGGAAACATTGCTGGACAGAATTGCAACCTATAAAGAGAAGTCT
GAAGCCCTTAAAGCCAAAATCAAAAAAGCGATGAACTACCCGGTTGCGGTTGTCTGTGTTGCTTTCATTGTTACCGGCAT
TCTGCTGGTAAAAGTGGTGCCACAGTTTGAAGAAGTCTTTCAGGGATTCGGAGCTGAACTCCCGGCCTTCACCCAGATGG
TTATTCATATTTCCAACTTTGTTCAGCAATGGTGGCTGGCGGCTATTCTCGGACTGGCGGCCTTTGGTTTTATGATTAAA
AAACTGATGCTGCGCTCAAAAGCCGCAAGAGATAAAAAAGACAGGCTAGTGCTAAAGCTGCCTGTTATTGGTCCAATACT
CGAAAAGTCTGCCGTCGCACGTTTTGCCCGAACCCTTTCAACGACATTTGCTGCCGGTGTTCCATTAGTCGACGCACTGG
ACTCAGTATCAGGCGCTGCGGGCAATGTTGTCTTTGCGGATGCCACCAACCAAATCAAAGAAGATGTTTCCACAGGTCAG
CAGCTGCAATTTGCTATGAAGAGTTCAGGCATTTTTCCGGCGATGGCGATTCAGATGGTTTCCATCGGAGAAGAGTCTGG
CTCTCTGGATGAAATGCTGGATAAAGTCGCCACCTTCTACGAAAACGAAGTGGATAACGCCGTTGATGGCCTGACCAGCC
TGATGGAACCATTGATTATGTCGGTACTCGGGGTGCTGGTTGGAGGCCTGATTGTGGCTATGTACCTGCCTATCTTTCAG
ATGGGGTCTGTTGTGTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| pilC | Pseudomonas stutzeri DSM 10701 |
67.16 |
100 |
0.672 |
| pilC | Acinetobacter baylyi ADP1 |
60.049 |
100 |
0.605 |
| pilC | Acinetobacter baumannii D1279779 |
59.753 |
100 |
0.598 |
| pilC | Legionella pneumophila strain ERS1305867 |
55.637 |
100 |
0.56 |
| pilG | Neisseria gonorrhoeae MS11 |
45.771 |
99.259 |
0.454 |
| pilG | Neisseria meningitidis 44/76-A |
45.522 |
99.259 |
0.452 |
| pilC | Vibrio cholerae strain A1552 |
44.584 |
98.025 |
0.437 |
| pilC | Vibrio campbellii strain DS40M4 |
42.857 |
98.519 |
0.422 |
| pilC | Thermus thermophilus HB27 |
38.596 |
98.519 |
0.38 |