Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilC
Type: Machinery gene
Locus tag: FHQ07_RS05760
Genome accession: NZ_CP040871
Coordinates: 1183502..1184773 (+)
Length: 423 a.a.
NCBI ID: WP_139715909.1
UniProt ID: A0A5B7ZPQ9
Organism: Thermomonas aquatica strain SY21
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake
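As a quick consistency check on the overview fields, the coordinate span should match the nucleotide length reported further down (1272 bp) and encode the stated 423 residues plus one stop codon. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive; the variable names are illustrative only.

```python
# Coordinates as reported for pilC (assumed 1-based, inclusive).
start, end = 1183502, 1184773
protein_length = 423  # residues, as reported in the overview

gene_length = end - start + 1   # inclusive span in bp
codons = gene_length // 3       # codon count, including the stop codon

print(gene_length)                    # 1272
print(gene_length % 3 == 0)           # True
print(codons - 1 == protein_length)   # True
```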

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 1175681..1191728 1183502..1184773 within 0
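The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate pairs. The sketch below is an illustration under stated assumptions, not the procedure used by VRprofile2 or this database: the function name `relative_position`, the labels other than "within", and the definition of distance as the difference between the nearest ends are all hypothetical.

```python
def relative_position(gene, mge):
    """Return (relation, distance_bp) for two (start, end) intervals.

    Coordinates are assumed 1-based and inclusive. Only "within" appears in
    the record; the other labels are placeholders chosen for this sketch.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


# pilC versus the predicted genomic island
print(relative_position((1183502, 1184773), (1175681, 1191728)))
# ('within', 0)
```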


Gene organization within MGE regions


Location: 1175681..1191728
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FHQ07_RS05720 (FHQ07_05720) - 1175681..1176859 (+) 1179 WP_139715903.1 class I SAM-dependent methyltransferase -
  FHQ07_RS14315 - 1176868..1177410 (+) 543 WP_168191471.1 GNAT family protein -
  FHQ07_RS05730 (FHQ07_05730) - 1177407..1178147 (+) 741 WP_139715904.1 WbqC family protein -
  FHQ07_RS05735 (FHQ07_05735) - 1178119..1178853 (+) 735 WP_240703555.1 CmcI family methyltransferase -
  FHQ07_RS05740 (FHQ07_05740) - 1178893..1179783 (+) 891 WP_139715905.1 glycosyltransferase family A protein -
  FHQ07_RS05745 (FHQ07_05745) - 1179783..1180772 (+) 990 WP_139715906.1 hypothetical protein -
  FHQ07_RS05750 (FHQ07_05750) - 1180769..1181650 (-) 882 WP_139715907.1 glycosyltransferase family 2 protein -
  FHQ07_RS05755 (FHQ07_05755) pilB 1181772..1183490 (+) 1719 WP_139715908.1 type IV-A pilus assembly ATPase PilB Machinery gene
  FHQ07_RS05760 (FHQ07_05760) pilC 1183502..1184773 (+) 1272 WP_139715909.1 type II secretion system F family protein Machinery gene
  FHQ07_RS05765 (FHQ07_05765) pilD 1184792..1185655 (+) 864 WP_139715910.1 A24 family peptidase Machinery gene
  FHQ07_RS05770 (FHQ07_05770) coaE 1185676..1186299 (+) 624 WP_139715911.1 dephospho-CoA kinase -
  FHQ07_RS05775 (FHQ07_05775) - 1186350..1187312 (-) 963 WP_139715912.1 Nudix family hydrolase -
  FHQ07_RS05780 (FHQ07_05780) secA 1187371..1190097 (-) 2727 WP_139715913.1 preprotein translocase subunit SecA -
  FHQ07_RS05785 (FHQ07_05785) - 1190244..1191200 (-) 957 WP_240703556.1 M23 family metallopeptidase -
  FHQ07_RS14660 - 1191225..1191728 (+) 504 WP_338419631.1 DciA family protein -
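The sizes in the table follow from the coordinates, and the three machinery genes (pilB, pilC, pilD) lie on the same strand with short spaces between them. The sketch below recomputes both from the listed values, assuming 1-based inclusive coordinates; the tuple layout is illustrative only.

```python
# (gene, start, end, size_bp) copied from the table above
genes = [
    ("pilB", 1181772, 1183490, 1719),
    ("pilC", 1183502, 1184773, 1272),
    ("pilD", 1184792, 1185655, 864),
]

# Each listed size should equal the inclusive coordinate span.
for name, start, end, size in genes:
    assert end - start + 1 == size, name

# Intergenic gap = bases strictly between one gene's end and the next gene's start.
gaps = {}
for (a, _, a_end, _), (b, b_start, _, _) in zip(genes, genes[1:]):
    gaps[f"{a}-{b}"] = b_start - a_end - 1

print(gaps)  # {'pilB-pilC': 11, 'pilC-pilD': 18}
```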

Sequence


Protein


Length: 423 a.a.; Molecular weight: 46378.06 Da; Isoelectric point: 9.9799

>NTDB_id=367749 FHQ07_RS05760 WP_139715909.1 1183502..1184773(+) (pilC) [Thermomonas aquatica strain SY21]
MSARRAAISKARTQAEARRLNPMVEFVWQGKDKRGVVMKGEQLAKNANLLRAELRKQGITPTVVKAKGKPMFGGGASRIK
PKDIAVFSRQLATMMKSGVPLVMALEIIGSGQKNPAMKKMVGGVKGDIEGGASIYEALSEYPVQFDELYRNLVRAGESSG
VLETVLDTIATYKENIETIKGKIKKALFYPTAIIAVAILICAILLIYVVPVFKETFQSYGADLPAFTELVFGISDYLVKW
WWLFGIVIAIAIGVFMFFYKRSTALKHFIDRMMLKIPVIGQVLHNSAIARFSRTLALTFRAGVPLVEALENVAGATGNMV
YEQAVLRMKNDVAVGYPVNVAMKQVNLFPHMVVQMTAIGEEAGALDAMLYKVAEFYEEEVNNAVDAISSLIEPFIMVIIG
GLVGSIVIAMYLPIFKIAMTVMG
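The FASTA header repeats the identifiers from the overview and can be parsed into fields. The sketch below uses a regular expression whose field layout is inferred from this single record, so it is an assumption rather than a documented format; the group names are hypothetical.

```python
import re

header = (">NTDB_id=367749 FHQ07_RS05760 WP_139715909.1 "
          "1183502..1184773(+) (pilC) [Thermomonas aquatica strain SY21]")

# Layout inferred from this record: id, locus tag, protein ID,
# coordinates with strand, gene name in parentheses, organism in brackets.
pattern = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)

match = pattern.match(header)
record = match.groupdict()
record["start"] = int(record["start"])
record["end"] = int(record["end"])

print(record["gene"], record["locus_tag"], record["start"], record["end"], record["strand"])
# pilC FHQ07_RS05760 1183502 1184773 +
```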

Nucleotide


Length: 1272 bp

>NTDB_id=367749 FHQ07_RS05760 WP_139715909.1 1183502..1184773(+) (pilC) [Thermomonas aquatica strain SY21]
ATGTCCGCACGCAGAGCCGCAATTAGCAAGGCCAGGACCCAGGCGGAAGCGCGCCGCCTCAACCCGATGGTCGAGTTCGT
CTGGCAGGGCAAGGACAAACGCGGCGTGGTGATGAAGGGCGAGCAACTGGCGAAGAACGCCAACCTGTTGCGCGCCGAAC
TGCGCAAGCAGGGCATCACCCCGACTGTCGTCAAGGCCAAGGGCAAGCCGATGTTCGGCGGCGGCGCCAGCCGCATCAAG
CCGAAGGACATTGCCGTCTTCAGCCGCCAGCTGGCGACCATGATGAAGTCCGGCGTGCCGCTGGTGATGGCGCTGGAGAT
CATCGGCAGCGGCCAGAAGAACCCGGCCATGAAGAAGATGGTCGGCGGCGTGAAGGGCGACATCGAGGGCGGCGCATCGA
TCTACGAGGCGCTCAGCGAATATCCGGTGCAGTTCGACGAGCTCTATCGCAACCTGGTGCGCGCAGGCGAATCGTCCGGC
GTACTGGAAACAGTCCTCGACACCATCGCGACTTACAAGGAAAACATCGAAACCATCAAGGGCAAGATCAAGAAAGCCCT
GTTCTACCCGACCGCGATCATCGCAGTGGCGATCCTGATCTGCGCGATTCTCCTGATCTACGTCGTTCCCGTCTTCAAGG
AAACGTTCCAGAGCTACGGAGCCGACCTCCCCGCATTCACCGAGCTGGTGTTCGGGATTTCCGACTATCTGGTCAAGTGG
TGGTGGCTATTCGGGATCGTCATCGCGATCGCGATCGGCGTTTTCATGTTCTTCTACAAGCGTTCGACGGCCCTGAAACA
TTTCATCGACCGGATGATGCTGAAGATCCCGGTGATCGGCCAGGTTCTGCACAACTCCGCGATCGCCCGCTTCTCCCGCA
CCCTGGCGCTCACCTTCAGGGCCGGCGTCCCGCTGGTGGAAGCGCTGGAGAACGTCGCCGGCGCCACCGGCAACATGGTC
TACGAACAGGCCGTGCTGCGCATGAAGAACGACGTGGCGGTCGGGTACCCAGTGAACGTGGCGATGAAGCAGGTCAACCT
GTTCCCGCACATGGTGGTGCAGATGACCGCGATCGGCGAAGAAGCCGGCGCGCTGGATGCGATGCTGTACAAGGTCGCCG
AGTTCTACGAGGAAGAGGTCAACAATGCGGTCGATGCGATCTCCAGCCTGATCGAGCCGTTCATCATGGTCATCATCGGC
GGCCTGGTCGGTTCGATCGTGATCGCGATGTACCTGCCGATCTTCAAGATCGCGATGACCGTGATGGGTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A5B7ZPQ9

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Legionella pneumophila strain ERS1305867 52.605 95.272 0.501
  pilC Acinetobacter baylyi ADP1 50.739 95.981 0.487
  pilC Pseudomonas stutzeri DSM 10701 50.877 94.326 0.48
  pilC Acinetobacter baumannii D1279779 49.383 95.745 0.473
  pilG Neisseria gonorrhoeae MS11 41.75 94.563 0.395
  pilG Neisseria meningitidis 44/76-A 41.5 94.563 0.392
  pilC Vibrio cholerae strain A1552 38.213 95.272 0.364
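The record does not define the Ha-value, but the listed numbers are consistent with the product of identity and coverage, each expressed as a fraction. The sketch below checks that relationship for every hit; the formula is inferred from the table values alone and should be treated as an assumption.

```python
# (protein, organism, identity_pct, coverage_pct, listed_ha) from the table above
hits = [
    ("pilC", "Legionella pneumophila strain ERS1305867", 52.605, 95.272, 0.501),
    ("pilC", "Acinetobacter baylyi ADP1", 50.739, 95.981, 0.487),
    ("pilC", "Pseudomonas stutzeri DSM 10701", 50.877, 94.326, 0.48),
    ("pilC", "Acinetobacter baumannii D1279779", 49.383, 95.745, 0.473),
    ("pilG", "Neisseria gonorrhoeae MS11", 41.75, 94.563, 0.395),
    ("pilG", "Neisseria meningitidis 44/76-A", 41.5, 94.563, 0.392),
    ("pilC", "Vibrio cholerae strain A1552", 38.213, 95.272, 0.364),
]

# Assumed relationship: Ha = (identity / 100) * (coverage / 100)
computed = []
for protein, organism, identity, coverage, listed in hits:
    ha = (identity / 100) * (coverage / 100)
    computed.append(round(ha, 3))
    print(f"{protein:5s} {organism:42s} computed={ha:.3f} listed={listed}")
```

Under this assumption every computed value rounds to the listed one at three decimal places.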


Multiple sequence alignment