Detailed information    

in silico: bioinformatically predicted

Overview


Name: pilC
Type: Machinery gene
Locus tag: ACLVWI_RS16550
Genome accession: NZ_LR962897
Coordinates: 3840949..3842205 (+)
Length: 418 a.a.
NCBI ID: WP_411862068.1
UniProt ID: -
Organism: Xanthomonas arboricola isolate Xanthomonas sp. CPBF 1586
Function: assembly of type IV pilus (predicted from homology)
DNA binding and uptake
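
The coordinates and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the helper name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1

gene_bp = span_length(3840949, 3842205)  # pilC coordinates from the overview
codons = gene_bp // 3                    # total codons, including the stop codon
protein_aa = codons - 1                  # assumed: exactly one terminal stop codon

print(gene_bp, protein_aa)
```

Under these assumptions the gene spans 1257 bp and encodes 418 residues, which agrees with the lengths given in the Sequence section.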

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type        MGE coordinates     Gene coordinates    Relative position  Distance (bp)
Genomic island  3840949..3849762    3840949..3842205    within             0
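
One way the "Relative position" and "Distance (bp)" columns could be derived from two coordinate ranges is sketched below. Only the "within" label comes from the table; the function name and the other labels ("upstream", "downstream", "overlapping") are hypothetical and may not match what VRprofile2 or the database actually reports.

```python
def relative_position(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene range against an MGE range (1-based, inclusive coordinates)."""
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:                      # gene ends before the MGE begins
        return "upstream", m_start - g_end   # hypothetical label
    if g_start > m_end:                      # gene begins after the MGE ends
        return "downstream", g_start - m_end # hypothetical label
    return "overlapping", 0                  # hypothetical label for partial overlap

print(relative_position((3840949, 3842205), (3840949, 3849762)))
```

For the pilC row this returns `("within", 0)`, because the gene starts at the first base of the genomic island and ends inside it.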


Gene organization within MGE regions


Location: 3840949..3849762
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                               Description
ACLVWI_RS16550 (X12_003302)  pilC       3840949..3842205 (+)  1257       WP_411862068.1  type II secretion system F family protein             Machinery gene
ACLVWI_RS16555 (X12_003303)  -          3842212..3843075 (+)  864        WP_411862069.1  prepilin peptidase                                    -
ACLVWI_RS16560 (X12_003304)  coaE       3843089..3843694 (+)  606        WP_411862070.1  dephospho-CoA kinase                                  -
ACLVWI_RS16565               -          3844120..3844566 (+)  447        Protein_3246    DUF6531 domain-containing protein                     -
ACLVWI_RS16570 (X12_003305)  -          3844747..3848328 (+)  3582       WP_411863117.1  RHS repeat-associated core domain-containing protein  -
ACLVWI_RS16575 (X12_003306)  -          3848671..3848994 (+)  324        WP_104646208.1  hypothetical protein                                  -
ACLVWI_RS16580 (X12_003307)  -          3849367..3849762 (+)  396        WP_411862071.1  SymE family type I addiction module toxin             -
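
The gene layout inside the island can be summarized from the coordinates alone. This is a minimal sketch assuming 1-based inclusive coordinates; the tuple layout and variable names are illustrative. It recomputes each gene's size and the intergenic gap to the next gene.

```python
# (locus tag, start, end, listed size in bp), copied from the table above
genes = [
    ("ACLVWI_RS16550", 3840949, 3842205, 1257),
    ("ACLVWI_RS16555", 3842212, 3843075, 864),
    ("ACLVWI_RS16560", 3843089, 3843694, 606),
    ("ACLVWI_RS16565", 3844120, 3844566, 447),
    ("ACLVWI_RS16570", 3844747, 3848328, 3582),
    ("ACLVWI_RS16575", 3848671, 3848994, 324),
    ("ACLVWI_RS16580", 3849367, 3849762, 396),
]

# Size of each gene recomputed from its coordinates
sizes = [end - start + 1 for _, start, end, _ in genes]

# Number of bases strictly between consecutive genes
gaps = [nxt[1] - cur[2] - 1 for cur, nxt in zip(genes, genes[1:])]

# Total span of the region, first gene start to last gene end
island_bp = genes[-1][2] - genes[0][1] + 1

print(sizes, gaps, island_bp)
```

The recomputed sizes match the "Size (bp)" column, and the gaps show that pilC and the downstream prepilin peptidase gene are separated by only 6 bp.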

Sequence


Protein


Length: 418 a.a.; Molecular weight: 45537.98 Da; Isoelectric point: 10.3308

>NTDB_id=1134954 ACLVWI_RS16550 WP_411862068.1 3840949..3842205(+) (pilC) [Xanthomonas arboricola isolate Xanthomonas sp. CPBF 1586]
MAAVRGVVKTKPTLQLEQASPFVWQGTDKRGVKMKGEQTAKNANLLRAELRRQGITPTVVKVKPKPLFGAAGKKVSAKDI
SFFSRQMATMMKSGVPIVGALEIIGSGQKNPRMRNMVGQIRADIEGGSSLHEAVSRHPVQFDELYRNLVKAGEGAGVLET
VLDTIATYKENIEALKGKIKKALFYPAMVVAVALIVSAILLIFVVPQFEDVFKGFGAELPAFTQIIVNMSRFMVSWWWLI
LFVVVVAIIGFIFAYKRSPSMQHGMDRVILKVPIIGQIMHNSSIARFARTTAVTFKAGVPLVEALSIVAGATGNSVYETA
VLRMRDDVSVGYPVNMAMKQVNLFPHMVVQMTAIGEEAGALDAMLFKVAEYFEQEVNNAVDALSSLIEPLIMVFIGTIVG
GMVIGMYLPIFKLASVVG
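
A molecular weight like the one listed above is typically obtained by summing average residue masses and adding one water for the free termini. The sketch below uses standard average residue masses; it is not necessarily the procedure the database used, so small differences from the listed 45537.98 Da are possible. The names `RESIDUE_MASS`, `PILC` and `molecular_weight` are illustrative.

```python
# Average residue masses in Da (free amino acid minus one water)
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "E": 129.1155, "Q": 128.1307, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1047, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # one water added back for the N- and C-terminal groups

# Protein sequence copied from the FASTA record above
PILC = (
    "MAAVRGVVKTKPTLQLEQASPFVWQGTDKRGVKMKGEQTAKNANLLRAELRRQGITPTVVKVKPKPLFGAAGKKVSAKDI"
    "SFFSRQMATMMKSGVPIVGALEIIGSGQKNPRMRNMVGQIRADIEGGSSLHEAVSRHPVQFDELYRNLVKAGEGAGVLET"
    "VLDTIATYKENIEALKGKIKKALFYPAMVVAVALIVSAILLIFVVPQFEDVFKGFGAELPAFTQIIVNMSRFMVSWWWLI"
    "LFVVVVAIIGFIFAYKRSPSMQHGMDRVILKVPIIGQIMHNSSIARFARTTAVTFKAGVPLVEALSIVAGATGNSVYETA"
    "VLRMRDDVSVGYPVNMAMKQVNLFPHMVVQMTAIGEEAGALDAMLFKVAEYFEQEVNNAVDALSSLIEPLIMVFIGTIVG"
    "GMVIGMYLPIFKLASVVG"
)

def molecular_weight(seq: str) -> float:
    """Average molecular weight of an unmodified linear peptide, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER

mw = molecular_weight(PILC)
print(len(PILC), round(mw, 2))
```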

Nucleotide


Length: 1257 bp

>NTDB_id=1134954 ACLVWI_RS16550 WP_411862068.1 3840949..3842205(+) (pilC) [Xanthomonas arboricola isolate Xanthomonas sp. CPBF 1586]
ATGGCAGCAGTACGTGGTGTGGTCAAGACGAAGCCGACCTTGCAATTAGAGCAGGCTAGTCCGTTTGTCTGGCAGGGAAC
AGACAAGCGCGGCGTGAAGATGAAGGGGGAGCAAACGGCAAAAAACGCCAACCTGCTACGTGCGGAACTCCGCAGGCAGG
GGATAACGCCTACTGTTGTAAAGGTAAAGCCGAAGCCGTTATTTGGCGCGGCTGGCAAGAAGGTTTCCGCTAAGGACATC
TCGTTCTTTAGCCGCCAGATGGCGACGATGATGAAGTCCGGAGTTCCGATCGTTGGGGCGCTTGAGATCATCGGCAGCGG
CCAGAAAAATCCGCGCATGCGGAACATGGTTGGGCAAATCCGCGCAGACATCGAAGGCGGTTCGTCACTGCATGAAGCAG
TCAGCAGACACCCCGTACAGTTTGACGAGCTCTACAGGAACCTTGTTAAAGCCGGCGAAGGTGCGGGCGTTTTGGAGACG
GTGCTGGACACCATTGCGACATACAAGGAGAACATCGAGGCGCTGAAGGGCAAGATCAAGAAAGCCTTGTTCTATCCGGC
AATGGTTGTCGCAGTAGCACTAATCGTCAGCGCGATTCTGCTCATCTTTGTGGTACCTCAATTTGAGGACGTGTTCAAGG
GCTTTGGCGCGGAACTGCCCGCCTTCACTCAGATCATTGTGAACATGTCGCGGTTCATGGTGTCGTGGTGGTGGCTGATT
CTTTTTGTTGTGGTCGTTGCCATCATTGGCTTCATCTTCGCCTACAAACGCTCCCCCTCAATGCAACACGGCATGGACCG
GGTCATCCTCAAAGTGCCGATAATCGGCCAAATCATGCACAACAGTTCCATTGCACGTTTTGCACGGACGACTGCAGTGA
CATTCAAGGCAGGCGTACCATTGGTCGAAGCATTGAGCATCGTGGCTGGCGCAACCGGCAACTCGGTATACGAGACTGCC
GTACTACGGATGCGGGACGATGTGTCAGTGGGCTACCCTGTGAACATGGCAATGAAACAGGTCAACCTGTTTCCGCACAT
GGTGGTGCAGATGACCGCCATTGGTGAAGAGGCTGGCGCACTGGATGCCATGCTATTCAAGGTGGCGGAGTACTTCGAGC
AAGAGGTCAACAATGCGGTTGATGCGCTGAGCAGCCTGATCGAACCCCTCATCATGGTGTTCATTGGTACCATCGTCGGC
GGCATGGTCATCGGCATGTACCTACCCATCTTCAAGCTCGCTTCGGTGGTTGGATAA
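
The nucleotide record can be checked against the protein record by translating it. The sketch below builds the standard codon table (bacterial translation table 11 assigns the same amino acids to each codon) and translates only two fragments copied from the FASTA lines above: the first 78 nt and the last 12 nt. The function and variable names are illustrative.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its amino acid; "*" marks a stop codon
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 78 nt and last 12 nt of the pilC nucleotide record
head = "ATGGCAGCAGTACGTGGTGTGGTCAAGACGAAGCCGACCTTGCAATTAGAGCAGGCTAGTCCGTTTGTCTGGCAGGGA"
tail = "GTGGTTGGATAA"

print(translate(head))
print(translate(tail))
```

The first fragment translates to the opening residues of the protein record (`MAAVRGVVKTKPTLQLEQASPFVWQG`), and the last fragment ends in a TAA stop codon after the final `VVG`.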


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.
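
TMHMM is a hidden Markov model and is not reproduced here. As a much simpler illustration of why parts of this protein look membrane-spanning, the sketch below computes a Kyte-Doolittle hydropathy profile with a 19-residue sliding window over one hydrophobic stretch copied from the protein record. The cutoff of about 1.6 is a commonly used rule of thumb, and the function name is illustrative.

```python
# Kyte-Doolittle hydropathy scale
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def hydropathy_profile(seq: str, window: int = 19) -> list[float]:
    """Mean Kyte-Doolittle score for every full window along the sequence."""
    return [
        sum(KD[aa] for aa in seq[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]

# Excerpt from the third line of the protein record
segment = "KGKIKKALFYPAMVVAVALIVSAILLIFVVPQFEDVFKG"

profile = hydropathy_profile(segment)
print(round(max(profile), 2))
```

A window mean well above the cutoff marks this stretch as a candidate membrane helix; it is a rough screen only and not a substitute for the TMHMM prediction.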




Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                  Identities (%)  Coverage (%)  Ha-value
pilC     Pseudomonas stutzeri DSM 10701            52.513          95.215        0.5
pilC     Legionella pneumophila strain ERS1305867  51.232          97.129        0.498
pilC     Acinetobacter baylyi ADP1                 51.134          94.976        0.486
pilC     Acinetobacter baumannii D1279779          50.63           94.976        0.481
pilG     Neisseria gonorrhoeae MS11                43.719          95.215        0.416
pilG     Neisseria meningitidis 44/76-A            43.216          95.215        0.411
pilC     Vibrio cholerae strain A1552              41.606          98.325        0.409
pilC     Vibrio campbellii strain DS40M4           40.399          95.933        0.388
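
The page does not define the Ha-value. In the rows listed here it appears to equal the product of identity and coverage, each expressed as a fraction. The sketch below checks that observation against the eight hits; treat the formula as an inference from these numbers, not as the database's documented definition.

```python
# (protein, organism, identities %, coverage %, listed Ha-value)
hits = [
    ("pilC", "Pseudomonas stutzeri DSM 10701", 52.513, 95.215, 0.5),
    ("pilC", "Legionella pneumophila strain ERS1305867", 51.232, 97.129, 0.498),
    ("pilC", "Acinetobacter baylyi ADP1", 51.134, 94.976, 0.486),
    ("pilC", "Acinetobacter baumannii D1279779", 50.63, 94.976, 0.481),
    ("pilG", "Neisseria gonorrhoeae MS11", 43.719, 95.215, 0.416),
    ("pilG", "Neisseria meningitidis 44/76-A", 43.216, 95.215, 0.411),
    ("pilC", "Vibrio cholerae strain A1552", 41.606, 98.325, 0.409),
    ("pilC", "Vibrio campbellii strain DS40M4", 40.399, 95.933, 0.388),
]

def inferred_ha(identity_pct: float, coverage_pct: float) -> float:
    """Assumed relation: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)

for protein, organism, ident, cov, listed in hits:
    print(f"{protein}  {organism}: inferred {inferred_ha(ident, cov):.3f}, listed {listed}")
```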


Multiple sequence alignment