Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilC   Type   Machinery gene
Locus tag   MM141_RS23820 Genome accession   NZ_CP093012
Coordinates   5113518..5114738 (+) Length   406 a.a.
NCBI ID   WP_016253888.1    Uniprot ID   -
Organism   Pseudomonas aeruginosa strain H20     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 5113518..5122222 5113518..5114738 within 0


Gene organization within MGE regions


Location: 5113518..5122222
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MM141_RS23820 (MM141_23830) pilC 5113518..5114738 (+) 1221 WP_016253888.1 type 4a pilus biogenesis protein PilC Machinery gene
  MM141_RS23825 (MM141_23835) pilD 5114742..5115614 (+) 873 WP_009875878.1 type IV prepilin peptidase/methyltransferase PilD Machinery gene
  MM141_RS23830 (MM141_23840) coaE 5115611..5116222 (+) 612 WP_003112838.1 dephospho-CoA kinase -
  MM141_RS23835 (MM141_23845) yacG 5116219..5116419 (+) 201 WP_003094656.1 DNA gyrase inhibitor YacG -
  MM141_RS23840 (MM141_23850) - 5116456..5116665 (-) 210 WP_003094660.1 hypothetical protein -
  MM141_RS23845 (MM141_23855) - 5116771..5117460 (-) 690 WP_003103868.1 energy-coupling factor ABC transporter permease -
  MM141_RS23850 (MM141_23860) - 5117457..5117927 (-) 471 WP_003094664.1 hypothetical protein -
  MM141_RS23855 (MM141_23865) - 5117924..5118349 (-) 426 WP_009875877.1 GNAT family N-acetyltransferase -
  MM141_RS23860 (MM141_23870) - 5118482..5119111 (+) 630 WP_003094668.1 DUF1780 domain-containing protein -
  MM141_RS23865 (MM141_23875) - 5119108..5119557 (+) 450 WP_003094670.1 MOSC domain-containing protein -
  MM141_RS23870 (MM141_23880) - 5119583..5119753 (+) 171 WP_003094672.1 DUF3094 family protein -
  MM141_RS23875 (MM141_23885) - 5119817..5121124 (+) 1308 WP_003112837.1 NAD(P)/FAD-dependent oxidoreductase -

Sequence


Protein


Download         Length: 406 a.a.        Molecular weight: 44549.37 Da        Isoelectric Point: 9.6983

>NTDB_id=661335 MM141_RS23820 WP_016253888.1 5113518..5114738(+) (pilC) [Pseudomonas aeruginosa strain H20]
MADKALKTSVFIWEGTDKKGAKVKGELTGQNPMLVKAHLRKQGINPLKVRKKGISLLGAGKKVKPMDIALFTRQMATMMG
AGVPLLQSFDIIGEGFDNPNMRKLVDEIKQEVSSGNSLANSLRKKPQYFDELYCNLVDAGEQSGALENLLDRVATYKEKT
ESLKAKIKKAMTYPIAVIIVALIVSAILLIKVVPQFQSVFEGFGAELPAFTQMIVNLSEFMQEWWFFIILAIAIFGFAFK
ELHKRSQKFRDTLDRTILKLPIFGGIVYKSAVARYARTLSTTFAAGVPLVDALDSVSGATGNIVFKNAVSKIKQDVSTGM
QLNFSMRTTSVFPNMAIQMTAIGEESGSLDEMLSKVASYYEEEVDNAVDNLTTLMEPMIMAVLGVLVGGLIVAMYLPIFQ
LGNVVG

Nucleotide


Download         Length: 1221 bp        

>NTDB_id=661335 MM141_RS23820 WP_016253888.1 5113518..5114738(+) (pilC) [Pseudomonas aeruginosa strain H20]
ATGGCGGACAAAGCGTTAAAAACCAGCGTTTTCATCTGGGAGGGCACCGACAAGAAAGGCGCCAAGGTCAAGGGCGAACT
GACCGGGCAGAATCCCATGCTGGTGAAAGCCCATCTGCGCAAGCAAGGCATCAATCCGCTCAAGGTACGCAAGAAAGGTA
TCTCCCTGCTGGGCGCAGGCAAGAAAGTGAAACCCATGGACATCGCCCTGTTCACCCGGCAGATGGCGACCATGATGGGC
GCTGGCGTTCCCCTCCTGCAATCGTTCGACATCATCGGCGAGGGCTTCGACAACCCCAACATGCGCAAGCTTGTGGATGA
AATCAAACAGGAAGTTTCCTCAGGTAACAGCCTAGCCAACTCCTTGAGAAAAAAGCCCCAGTATTTTGACGAGCTTTATT
GCAACCTGGTAGATGCAGGGGAACAGTCTGGCGCCTTGGAAAACCTTCTCGATCGGGTGGCAACCTATAAAGAAAAGACG
GAATCACTGAAAGCCAAGATCAAAAAGGCGATGACCTATCCCATTGCCGTCATCATTGTCGCACTGATTGTATCTGCGAT
CCTCCTGATTAAAGTGGTTCCACAATTTCAGTCGGTCTTTGAAGGTTTCGGCGCGGAACTTCCCGCCTTTACCCAGATGA
TTGTCAATCTATCGGAGTTCATGCAGGAGTGGTGGTTCTTCATCATACTGGCGATAGCGATATTTGGCTTTGCATTCAAA
GAATTGCATAAACGCTCACAAAAATTCCGTGACACACTCGATAGAACGATCCTCAAACTTCCCATTTTCGGAGGCATCGT
CTACAAATCTGCGGTCGCCCGTTATGCACGGACCTTGTCCACGACCTTCGCCGCGGGTGTTCCCCTGGTCGATGCGCTCG
ACTCCGTCTCCGGAGCGACCGGCAATATCGTGTTCAAGAACGCGGTCAGCAAGATCAAGCAAGACGTTTCCACCGGCATG
CAGCTCAACTTCTCCATGCGCACCACCAGCGTCTTTCCCAACATGGCGATCCAGATGACCGCCATCGGCGAGGAGTCCGG
TTCGCTCGATGAGATGCTGAGCAAAGTCGCCAGCTACTACGAAGAGGAAGTCGACAACGCCGTGGACAACCTCACCACGC
TCATGGAACCGATGATCATGGCCGTTCTCGGCGTACTGGTTGGCGGTCTGATCGTGGCCATGTACCTTCCGATCTTCCAA
CTCGGCAACGTCGTCGGATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilC Pseudomonas stutzeri DSM 10701

76.543

99.754

0.764

  pilC Acinetobacter baumannii D1279779

61.386

99.507

0.611

  pilC Acinetobacter baylyi ADP1

60.837

100

0.608

  pilC Legionella pneumophila strain ERS1305867

55.051

97.537

0.537

  pilG Neisseria gonorrhoeae MS11

46.173

99.754

0.461

  pilG Neisseria meningitidis 44/76-A

45.409

99.261

0.451

  pilC Vibrio cholerae strain A1552

42.611

100

0.426

  pilC Vibrio campbellii strain DS40M4

42.065

97.783

0.411

  pilC Thermus thermophilus HB27

36.908

98.768

0.365