Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   H5027_RS04820 Genome accession   NZ_LR861807
Coordinates   1158871..1160130 (-) Length   419 a.a.
NCBI ID   WP_053051318.1    Uniprot ID   A0A7U7D8L3
Organism   Xanthomonas arboricola pv. juglandis isolate 3     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1146863..1164770 1158871..1160130 within 0


Gene organization within MGE regions


Location: 1146863..1164770
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H5027_RS04755 (XSP_000951) - 1146863..1147423 (-) 561 WP_102596783.1 hypothetical protein -
  H5027_RS22610 - 1147940..1148089 (-) 150 WP_256468814.1 hypothetical protein -
  H5027_RS04765 (XSP_000953) - 1148158..1148670 (+) 513 WP_049805061.1 ImmA/IrrE family metallo-endopeptidase -
  H5027_RS04770 (XSP_000954) - 1149245..1149631 (+) 387 WP_026064907.1 hypothetical protein -
  H5027_RS04775 (XSP_000955) - 1149720..1150112 (+) 393 WP_023904915.1 H-NS family nucleoid-associated regulatory protein -
  H5027_RS04780 (XSP_000956) glgX 1150784..1152916 (-) 2133 WP_053051310.1 glycogen debranching protein GlgX -
  H5027_RS04785 (XSP_000957) rimK 1153407..1154312 (+) 906 WP_166767257.1 30S ribosomal protein S6--L-glutamate ligase -
  H5027_RS22645 (XSP_000958) - 1154503..1154980 (+) 478 Protein_937 hypothetical protein -
  H5027_RS04795 (XSP_000959) - 1155008..1155685 (+) 678 WP_006448355.1 response regulator transcription factor -
  H5027_RS04800 (XSP_000960) - 1155678..1157012 (+) 1335 WP_016904089.1 HAMP domain-containing sensor histidine kinase -
  H5027_RS04805 (XSP_000961) - 1157116..1157264 (-) 149 Protein_940 SymE family type I addiction module toxin -
  H5027_RS04810 (XSP_000962) coaE 1157364..1157987 (-) 624 WP_053051308.1 dephospho-CoA kinase -
  H5027_RS04815 (XSP_000963) - 1158001..1158864 (-) 864 WP_016849610.1 A24 family peptidase -
  H5027_RS04820 (XSP_000964) pilC 1158871..1160130 (-) 1260 WP_053051318.1 type II secretion system F family protein Machinery gene
  H5027_RS04825 (XSP_000965) pilA 1160484..1160909 (+) 426 WP_053051307.1 pilin Machinery gene
  H5027_RS04830 (XSP_000966) - 1161045..1162634 (+) 1590 WP_053051306.1 phosphoethanolamine transferase -
  H5027_RS04835 (XSP_000967) - 1162705..1163274 (+) 570 WP_231103946.1 hypothetical protein -
  H5027_RS04840 (XSP_000968) - 1163244..1164770 (+) 1527 WP_053051304.1 hypothetical protein -

Sequence


Protein


Download         Length: 419 a.a.        Molecular weight: 46099.61 Da        Isoelectric Point: 10.3006

>NTDB_id=1132470 H5027_RS04820 WP_053051318.1 1158871..1160130(-) (pilC) [Xanthomonas arboricola pv. juglandis isolate 3]
MSVARSVIKKQPVDRNTSMLQTFVWEGADKRGVKMKGEQTARNANMLRAELRRQGIVPSMVKQKPKPLFGAAGKKITPKD
IAFFSRQMATMMKSGVPIVSSLEIIGEGHKNPRMRKMIGQIRTDIEGGSSLYESISKHPVQFDELYRNLVRAGEGAGVLE
TVLDTVATYKENIEALKGKIKKALFYPAMVVAVAIIVSAILLIFVVPQFEEVFKSFGAELPAFTQLLVNASRFMVSYWWL
MLVVTVGSIVGFIFAYKRSPRMQHGLDRLILKVPVIGQIMHNSAIARFARTTAVTFKAGVPLVEALGIVAGATGNKLYEE
AVFRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDAMLFKVAEYFEQEVNNAVDALSSLLEPLIMVFIGTIV
GGMVIGMYLPIFKLGAVVG

Nucleotide


Download         Length: 1260 bp        

>NTDB_id=1132470 H5027_RS04820 WP_053051318.1 1158871..1160130(-) (pilC) [Xanthomonas arboricola pv. juglandis isolate 3]
ATGTCAGTCGCGCGTAGCGTCATCAAGAAGCAACCGGTCGACCGGAACACCAGCATGCTGCAGACCTTCGTTTGGGAAGG
AGCTGACAAGCGTGGCGTGAAGATGAAGGGAGAGCAGACAGCCCGCAACGCCAACATGTTGCGAGCAGAGCTGCGCCGTC
AGGGCATCGTGCCCAGCATGGTGAAGCAAAAGCCGAAGCCCCTGTTTGGAGCGGCGGGGAAGAAAATTACCCCAAAAGAC
ATTGCGTTCTTTAGCCGCCAGATGGCGACCATGATGAAGTCGGGCGTGCCCATCGTCAGTTCGTTGGAGATCATTGGCGA
GGGACACAAGAACCCACGCATGAGAAAAATGATCGGCCAGATCAGGACAGACATCGAGGGTGGATCCTCGCTCTACGAGT
CCATCAGCAAACATCCCGTTCAGTTTGACGAGCTCTACCGCAACCTTGTCCGCGCGGGCGAAGGCGCTGGCGTACTGGAG
ACAGTCCTCGATACGGTCGCCACCTACAAAGAAAACATTGAAGCCCTAAAGGGCAAGATCAAAAAGGCACTGTTCTACCC
TGCCATGGTGGTTGCGGTAGCAATCATCGTGAGTGCGATTTTGCTCATCTTCGTGGTGCCTCAGTTCGAAGAAGTATTCA
AAAGCTTCGGTGCCGAACTACCGGCATTCACTCAGCTCCTCGTCAACGCATCGCGCTTCATGGTCAGCTATTGGTGGTTG
ATGCTGGTGGTGACAGTTGGATCTATCGTTGGCTTCATCTTTGCCTACAAACGTTCTCCACGGATGCAGCATGGACTCGA
TCGATTAATCCTCAAGGTGCCCGTAATCGGGCAGATCATGCACAACAGTGCGATTGCACGCTTTGCACGGACCACTGCAG
TGACATTCAAAGCAGGCGTTCCTCTAGTGGAGGCACTTGGGATCGTCGCTGGCGCTACTGGCAATAAGCTTTATGAGGAG
GCTGTTTTCAGGATGCGGGACGATGTGTCTGTGGGTTACCCGGTCAACATGGCGATGAAACAGGTGAACCTATTTCCACA
CATGGTTATTCAAATGACTGCGATTGGCGAAGAAGCCGGCGCGCTTGATGCAATGCTCTTCAAAGTGGCTGAATACTTTG
AGCAAGAAGTCAACAACGCCGTAGACGCGCTCAGTAGCCTGCTCGAGCCACTGATCATGGTATTCATCGGCACTATTGTG
GGCGGCATGGTCATCGGCATGTATCTTCCGATCTTCAAACTCGGCGCAGTGGTGGGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Legionella pneumophila strain ERS1305867

53.202

96.897

0.516

  pilC Pseudomonas stutzeri DSM 10701

53.788

94.511

0.508

  pilC Acinetobacter baylyi ADP1

51.111

96.659

0.494

  pilC Acinetobacter baumannii D1279779

50.864

96.659

0.492

  pilG Neisseria gonorrhoeae MS11

43.86

95.227

0.418

  pilG Neisseria meningitidis 44/76-A

43.25

95.465

0.413

  pilC Vibrio cholerae strain A1552

40.5

95.465

0.387

  pilC Vibrio campbellii strain DS40M4

38.75

95.465

0.37


Multiple sequence alignment