Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   LOK39_RS15320 Genome accession   NZ_HG992337
Coordinates   3605861..3607123 (+) Length   420 a.a.
NCBI ID   WP_104593888.1    Uniprot ID   -
Organism   Xanthomonas arboricola strain 1314c isolate 1314c     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3607130..3618864 3605861..3607123 flank 7


Gene organization within MGE regions


Location: 3605861..3618864
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LOK39_RS15320 (XA1314C_30170) pilC 3605861..3607123 (+) 1263 WP_104593888.1 type II secretion system F family protein Machinery gene
  LOK39_RS15325 (XA1314C_30180) - 3607130..3607993 (+) 864 WP_002812278.1 A24 family peptidase -
  LOK39_RS15330 (XA1314C_30190) coaE 3608007..3608630 (+) 624 WP_039525803.1 dephospho-CoA kinase -
  LOK39_RS15335 (XA1314C_30200) - 3608730..3608885 (+) 156 Protein_3006 SymE family type I addiction module toxin -
  LOK39_RS15340 (XA1314C_30210) - 3608989..3610323 (-) 1335 WP_016904089.1 HAMP domain-containing sensor histidine kinase -
  LOK39_RS15345 (XA1314C_30220) - 3610316..3610993 (-) 678 WP_006448355.1 response regulator transcription factor -
  LOK39_RS15350 (XA1314C_30230) - 3611021..3611493 (-) 473 Protein_3009 hypothetical protein -
  LOK39_RS15355 (XA1314C_30240) rimK 3611684..3612589 (-) 906 WP_181116269.1 30S ribosomal protein S6--L-glutamate ligase -
  LOK39_RS15360 (XA1314C_30250) glgX 3613081..3615213 (+) 2133 WP_228599919.1 glycogen debranching protein GlgX -
  LOK39_RS15365 (XA1314C_30260) - 3615860..3616252 (-) 393 WP_023904915.1 H-NS family nucleoid-associated regulatory protein -
  LOK39_RS15370 (XA1314C_30270) - 3616341..3616727 (-) 387 WP_026064907.1 hypothetical protein -

Sequence


Protein


Download         Length: 420 a.a.        Molecular weight: 46110.54 Da        Isoelectric Point: 10.2556

>NTDB_id=1112317 LOK39_RS15320 WP_104593888.1 3605861..3607123(+) (pilC) [Xanthomonas arboricola strain 1314c isolate 1314c]
MSVTRNAIKKQTVDRSTSQQAQLFLWEGTDKRGVKMKGEQTARNMNMLRAELRRQGINPSVVKLKPKPLFGAAGKKITPK
DIAFFSRQMATMMKSGVPIVSSLEIIGEGHKNPRMKKMVGQVRTDIEGGSSLYESISKHPVQFDELYRNLVRAGEGAGVL
ETVLDTVATYKENIEALKGKIKKALFYPAMVIAVALIVSAILLIFVVPQFEEVFKGFGAELPAFTQMIVGASRFMVSYWW
IMLFVVAGAIAGFIFAYKRSPSMQHAMDRLILRVPVIGQIMHNSSIARFARTTAVTFKAGVPLVEALGIVAGATGNRVYE
DAVLRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDAMLFKVAEYFEQEVNNAVDALSSLLEPMIMVFIGVV
VGGMVIGMYLPIFKLGAVVG

Nucleotide


Download         Length: 1263 bp        

>NTDB_id=1112317 LOK39_RS15320 WP_104593888.1 3605861..3607123(+) (pilC) [Xanthomonas arboricola strain 1314c isolate 1314c]
ATGTCTGTCACACGTAATGCCATTAAGAAGCAGACGGTGGACCGCAGTACCAGTCAACAGGCACAGCTGTTCCTCTGGGA
AGGAACCGATAAACGCGGAGTCAAGATGAAAGGCGAACAGACCGCACGCAACATGAACATGTTGCGTGCGGAACTTCGCC
GCCAAGGTATCAACCCGTCGGTAGTCAAGCTCAAGCCCAAGCCACTGTTCGGTGCTGCAGGCAAAAAAATTACGCCGAAA
GACATAGCATTTTTCAGCCGTCAGATGGCGACCATGATGAAGTCGGGGGTACCGATTGTAAGCTCGCTTGAGATCATCGG
TGAGGGGCACAAAAACCCGCGCATGAAAAAGATGGTAGGCCAAGTGAGAACGGATATCGAGGGCGGCTCCTCTCTCTATG
AATCGATCAGCAAACATCCTGTGCAGTTTGATGAGCTCTACCGAAACCTCGTACGAGCAGGTGAAGGTGCAGGTGTTCTA
GAAACCGTATTAGATACGGTGGCCACTTACAAAGAAAACATAGAAGCGCTAAAGGGCAAGATCAAAAAGGCCCTGTTTTA
TCCAGCGATGGTGATTGCAGTCGCCCTTATTGTCAGTGCCATCCTGCTCATTTTTGTTGTACCTCAATTTGAAGAAGTCT
TTAAAGGTTTCGGCGCCGAATTACCTGCCTTTACTCAAATGATTGTGGGAGCATCACGCTTCATGGTGAGCTATTGGTGG
ATCATGCTATTCGTCGTAGCAGGTGCCATTGCGGGATTCATCTTTGCTTATAAACGCTCGCCAAGCATGCAACACGCTAT
GGACAGACTAATATTAAGAGTACCTGTCATAGGCCAAATCATGCATAACAGTTCAATTGCGCGTTTCGCGCGCACCACCG
CTGTTACCTTTAAGGCAGGTGTCCCACTGGTGGAGGCATTAGGCATTGTTGCTGGCGCCACCGGCAATCGCGTTTACGAA
GATGCGGTGCTCCGCATGCGCGACGATGTGTCCGTGGGATACCCTGTCAACATGGCGATGAAACAGGTGAACCTGTTTCC
ACACATGGTCATTCAAATGACTGCGATTGGCGAAGAAGCTGGTGCGCTAGATGCCATGCTCTTCAAGGTAGCGGAATACT
TTGAGCAGGAAGTCAACAATGCCGTAGATGCACTCAGCAGTCTGCTCGAACCGATGATTATGGTCTTCATAGGCGTTGTC
GTAGGCGGCATGGTCATCGGCATGTATCTTCCGATCTTCAAACTCGGCGCAGTGGTTGGATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

54.912

94.524

0.519

  pilC Legionella pneumophila strain ERS1305867

53.283

94.286

0.502

  pilC Acinetobacter baylyi ADP1

51.613

95.952

0.495

  pilC Acinetobacter baumannii D1279779

50.372

95.952

0.483

  pilG Neisseria gonorrhoeae MS11

44.059

96.19

0.424

  pilG Neisseria meningitidis 44/76-A

43.457

96.429

0.419

  pilC Vibrio campbellii strain DS40M4

40.541

96.905

0.393

  pilC Vibrio cholerae strain A1552

41.058

94.524

0.388


Multiple sequence alignment