Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comP
Type   Machinery gene
Locus tag   ACA097_RS17680
Genome accession   NZ_CP166971
Coordinates   3767609..3768031 (+)
Length   140 a.a.
NCBI ID   WP_371364865.1
UniProt ID   -
Organism   Pseudomonas sp. QL9
Function   assembly of type IV pilus (predicted from homology); DNA binding and uptake
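
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the usual GenBank convention) and a coding sequence whose final codon is a stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(3767609, 3768031)
print(cds_bp, protein_length(cds_bp))  # 423 140
```

The result agrees with the 423 bp nucleotide length and the 140 a.a. protein length reported for comP.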

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3764456..3793076 3767609..3768031 within 0
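
The "Relative position" and "Distance (bp)" columns can be reproduced by comparing the two coordinate ranges. This is only a sketch under stated assumptions: the record shows just the value "within" with a distance of 0, so the other labels ("before", "after", "overlapping") and the way the gap is measured are hypothetical and may differ from what VRprofile2 or the database actually reports.

```python
def relative_position(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene relative to an MGE; coordinates are 1-based, inclusive.

    Only the label "within" (distance 0) is attested in the record;
    the remaining labels are hypothetical placeholders.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "before", m_start - g_end   # gap between gene end and MGE start
    if g_start > m_end:
        return "after", g_start - m_end    # gap between MGE end and gene start
    return "overlapping", 0                # partial overlap of the two ranges


print(relative_position((3767609, 3768031), (3764456, 3793076)))  # ('within', 0)
```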


Gene organization within MGE regions


Location: 3764456..3793076
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACA097_RS17670 (ACA097_17665) pilC 3764456..3765676 (-) 1221 WP_371364859.1 type II secretion system F family protein Machinery gene
  ACA097_RS17675 (ACA097_17670) pilB 3765680..3767383 (-) 1704 WP_371364862.1 type IV-A pilus assembly ATPase PilB Machinery gene
  ACA097_RS17680 (ACA097_17675) comP 3767609..3768031 (+) 423 WP_371364865.1 pilin Machinery gene
  ACA097_RS17685 (ACA097_17680) - 3768079..3769167 (+) 1089 WP_371364867.1 hypothetical protein -
  ACA097_RS17695 (ACA097_17690) - 3769469..3770788 (+) 1320 WP_371364870.1 tyrosine-type recombinase/integrase -
  ACA097_RS17700 (ACA097_17695) - 3770785..3771603 (+) 819 WP_371364872.1 hypothetical protein -
  ACA097_RS17705 (ACA097_17700) - 3771695..3771928 (+) 234 WP_371364874.1 helix-turn-helix transcriptional regulator -
  ACA097_RS17710 (ACA097_17705) - 3771925..3772338 (+) 414 WP_371364877.1 helix-turn-helix domain-containing protein -
  ACA097_RS17715 (ACA097_17710) - 3772774..3773088 (+) 315 WP_371364879.1 hypothetical protein -
  ACA097_RS17720 (ACA097_17715) - 3773085..3773396 (+) 312 WP_371364881.1 hypothetical protein -
  ACA097_RS17725 (ACA097_17720) - 3773401..3774324 (+) 924 WP_371364884.1 toprim domain-containing protein -
  ACA097_RS17730 (ACA097_17725) - 3774321..3775865 (+) 1545 WP_371364888.1 DUF3631 domain-containing protein -
  ACA097_RS17735 (ACA097_17730) - 3775865..3776158 (+) 294 WP_371364890.1 hypothetical protein -
  ACA097_RS17740 (ACA097_17735) - 3776214..3777110 (+) 897 WP_371364893.1 hypothetical protein -
  ACA097_RS17745 (ACA097_17740) - 3777405..3778136 (-) 732 WP_371364896.1 HNH endonuclease -
  ACA097_RS17750 (ACA097_17745) - 3778136..3779470 (-) 1335 WP_371364899.1 AAA family ATPase -
  ACA097_RS17755 (ACA097_17750) - 3779633..3779866 (+) 234 WP_371364902.1 hypothetical protein -
  ACA097_RS17760 (ACA097_17755) - 3780169..3780621 (+) 453 WP_371364904.1 DUF1441 family protein -
  ACA097_RS17765 (ACA097_17760) - 3780766..3782178 (+) 1413 WP_371364906.1 HEPN domain-containing protein -
  ACA097_RS17770 (ACA097_17765) - 3783412..3787746 (+) 4335 WP_371364908.1 AAA domain-containing protein -
  ACA097_RS17775 (ACA097_17770) - 3787846..3788589 (+) 744 WP_371364910.1 hypothetical protein -
  ACA097_RS17780 (ACA097_17775) - 3789260..3790117 (-) 858 WP_371364913.1 HNH endonuclease -
  ACA097_RS17785 (ACA097_17780) - 3790557..3791795 (+) 1239 Protein_3492 transposase -
  ACA097_RS17790 (ACA097_17785) - 3792006..3793076 (+) 1071 WP_371364915.1 diguanylate cyclase -

Sequence


Protein


Length: 140 a.a.        Molecular weight: 13903.95 Da        Isoelectric point: 8.1113

>NTDB_id=1036487 ACA097_RS17680 WP_371364865.1 3767609..3768031(+) (comP) [Pseudomonas sp. QL9]
MKAQKGFTLIELMIVVAIIGILAAVAIPAYQDYTVRARVSELILAASSARTCVTEASQLASAVSAGNCQAPAVVGMVASS
SITGGTISVTGTTGSNSPQGTNITLTPTWNSTANTVTWACTGSPAKYLPGSCAAAAAPAP

Nucleotide


Length: 423 bp

>NTDB_id=1036487 ACA097_RS17680 WP_371364865.1 3767609..3768031(+) (comP) [Pseudomonas sp. QL9]
ATGAAAGCTCAAAAAGGTTTCACCCTTATCGAATTGATGATCGTAGTAGCGATTATCGGCATTCTGGCCGCTGTGGCCAT
TCCCGCCTACCAGGATTACACCGTACGAGCTCGTGTTTCCGAGCTTATTCTTGCGGCGAGCAGCGCTCGTACCTGTGTAA
CAGAGGCATCGCAGCTGGCCAGTGCTGTATCTGCAGGCAACTGCCAGGCTCCGGCTGTGGTTGGTATGGTCGCGAGCTCC
AGTATTACCGGCGGAACAATTTCGGTCACCGGTACTACCGGTAGCAATAGCCCGCAAGGCACTAATATCACTCTGACTCC
CACCTGGAACAGTACTGCCAATACTGTTACCTGGGCTTGCACTGGCTCGCCGGCCAAGTATCTGCCCGGTTCTTGCGCCG
CTGCTGCGGCTCCTGCTCCGTAA
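
The protein and nucleotide sequences above can be checked against each other by translating the coding sequence. The sketch below assumes the standard codon assignments (NCBI translation table 11, used for bacteria, shares these assignments with table 1 and differs only in permitted start codons); the `translate` helper and the constant names are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

NUCLEOTIDE = (
    "ATGAAAGCTCAAAAAGGTTTCACCCTTATCGAATTGATGATCGTAGTAGCGATTATCGGCATTCTGGCCGCTGTGGCCAT"
    "TCCCGCCTACCAGGATTACACCGTACGAGCTCGTGTTTCCGAGCTTATTCTTGCGGCGAGCAGCGCTCGTACCTGTGTAA"
    "CAGAGGCATCGCAGCTGGCCAGTGCTGTATCTGCAGGCAACTGCCAGGCTCCGGCTGTGGTTGGTATGGTCGCGAGCTCC"
    "AGTATTACCGGCGGAACAATTTCGGTCACCGGTACTACCGGTAGCAATAGCCCGCAAGGCACTAATATCACTCTGACTCC"
    "CACCTGGAACAGTACTGCCAATACTGTTACCTGGGCTTGCACTGGCTCGCCGGCCAAGTATCTGCCCGGTTCTTGCGCCG"
    "CTGCTGCGGCTCCTGCTCCGTAA"
)
PROTEIN = (
    "MKAQKGFTLIELMIVVAIIGILAAVAIPAYQDYTVRARVSELILAASSARTCVTEASQLASAVSAGNCQAPAVVGMVASS"
    "SITGGTISVTGTTGSNSPQGTNITLTPTWNSTANTVTWACTGSPAKYLPGSCAAAAAPAP"
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon and drop a single trailing stop codon."""
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3)]
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


print(len(NUCLEOTIDE), len(PROTEIN))
print(translate(NUCLEOTIDE) == PROTEIN)
```

Writing the codon table out in TCAG order keeps the example free of third-party dependencies; a library such as Biopython would normally be used for this in practice.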


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comP Acinetobacter baylyi ADP1 51.701 100 0.543
  pilA2 Legionella pneumophila str. Paris 50.735 97.143 0.493
  pilA2 Legionella pneumophila strain ERS1305867 50 97.143 0.486
  pilA Ralstonia pseudosolanacearum GMI1000 47.015 95.714 0.45
  pilE Neisseria gonorrhoeae MS11 37.576 100 0.443
  pilA Pseudomonas aeruginosa PAK 39.216 100 0.429
  pilA/pilA1 Eikenella corrodens VA1 40 100 0.429
  pilA/pilAI Pseudomonas stutzeri DSM 10701 40 100 0.414
  pilA/pilAII Pseudomonas stutzeri DSM 10701 39.583 100 0.407
  pilA Glaesserella parasuis strain SC1401 36.242 100 0.386
  pilA Acinetobacter baumannii strain A118 37.5 100 0.386
  pilA Haemophilus influenzae Rd KW20 39.259 96.429 0.379
  pilA Haemophilus influenzae 86-028NP 37.681 98.571 0.371


Multiple sequence alignment