Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilE   Type   Machinery gene
Locus tag   ACMGRF_RS14105 Genome accession   NZ_CP179923
Coordinates   3058281..3058718 (-) Length   145 a.a.
NCBI ID   WP_032957219.1    Uniprot ID   -
Organism   Stenotrophomonas sp. EMP41     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3050221..3070706 3058281..3058718 within 0


Gene organization within MGE regions


Location: 3050221..3070706
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACMGRF_RS14070 (ACMGRF_14070) - 3050221..3051435 (+) 1215 WP_414485327.1 glycosyltransferase -
  ACMGRF_RS14075 (ACMGRF_14075) - 3051479..3052675 (+) 1197 WP_262224786.1 polysaccharide biosynthesis protein -
  ACMGRF_RS14080 (ACMGRF_14080) - 3052638..3053753 (-) 1116 WP_032955899.1 glycosyltransferase -
  ACMGRF_RS14085 (ACMGRF_14085) - 3053750..3054562 (-) 813 WP_262224784.1 FkbM family methyltransferase -
  ACMGRF_RS14090 (ACMGRF_14090) - 3054732..3055823 (+) 1092 WP_414485328.1 glycosyltransferase family 4 protein -
  ACMGRF_RS14095 (ACMGRF_14095) - 3055857..3056273 (-) 417 WP_032957223.1 pilin -
  ACMGRF_RS14100 (ACMGRF_14100) - 3056277..3058166 (-) 1890 WP_262224782.1 hypothetical protein -
  ACMGRF_RS14105 (ACMGRF_14105) pilE 3058281..3058718 (-) 438 WP_032957219.1 pilin Machinery gene
  ACMGRF_RS14110 (ACMGRF_14110) pilC 3059074..3060330 (+) 1257 WP_262224781.1 type II secretion system F family protein Machinery gene
  ACMGRF_RS14115 (ACMGRF_14115) - 3060338..3061201 (+) 864 WP_049416956.1 prepilin peptidase -
  ACMGRF_RS14120 (ACMGRF_14120) coaE 3061213..3061824 (+) 612 WP_099487579.1 dephospho-CoA kinase -
  ACMGRF_RS14125 (ACMGRF_14125) - 3061998..3062384 (-) 387 WP_111177536.1 hypothetical protein -
  ACMGRF_RS14130 (ACMGRF_14130) - 3062926..3064296 (-) 1371 WP_032954439.1 sensor histidine kinase -
  ACMGRF_RS14135 (ACMGRF_14135) - 3064259..3064936 (-) 678 WP_005410804.1 response regulator transcription factor -
  ACMGRF_RS14140 (ACMGRF_14140) - 3065099..3065560 (-) 462 WP_049412681.1 hypothetical protein -
  ACMGRF_RS14145 (ACMGRF_14145) - 3065676..3066146 (-) 471 WP_049412683.1 hypothetical protein -
  ACMGRF_RS14150 (ACMGRF_14150) - 3066245..3066724 (-) 480 WP_049412685.1 hypothetical protein -
  ACMGRF_RS14155 (ACMGRF_14155) rimK 3066983..3067861 (-) 879 WP_005418651.1 30S ribosomal protein S6--L-glutamate ligase -
  ACMGRF_RS14160 (ACMGRF_14160) - 3067959..3068201 (-) 243 WP_049412687.1 hypothetical protein -
  ACMGRF_RS14165 (ACMGRF_14165) - 3068525..3069748 (-) 1224 WP_049412688.1 FAD-dependent oxidoreductase -
  ACMGRF_RS14170 (ACMGRF_14170) - 3069879..3070706 (+) 828 WP_049412690.1 AraC family transcriptional regulator -

Sequence


Protein


Download         Length: 145 a.a.        Molecular weight: 14808.02 Da        Isoelectric Point: 9.6273

>NTDB_id=1093866 ACMGRF_RS14105 WP_032957219.1 3058281..3058718(-) (pilE) [Stenotrophomonas sp. EMP41]
MKNQKGFTLIELMIVVAIIAILAAIALPAYQDYVARSQVSEAMTLSSGAKTAVTEYYADKGAFPTTNAMAGLAGASSING
KYVASVTVGANGVITAKMKGSGSVSSKVAGRDFALVPADAGGSITWDCKGSATTIDAKYRPSSCR

Nucleotide


Download         Length: 438 bp        

>NTDB_id=1093866 ACMGRF_RS14105 WP_032957219.1 3058281..3058718(-) (pilE) [Stenotrophomonas sp. EMP41]
ATGAAGAACCAGAAGGGCTTCACCCTTATCGAACTGATGATCGTTGTTGCGATCATCGCCATCCTGGCTGCCATCGCGCT
GCCGGCTTACCAGGACTATGTGGCACGCTCGCAGGTTTCCGAAGCGATGACCCTGAGCTCGGGTGCCAAGACCGCTGTTA
CCGAGTACTACGCCGACAAGGGCGCCTTCCCGACCACCAATGCAATGGCCGGCCTTGCTGGCGCGTCGTCGATCAACGGC
AAGTACGTGGCCAGCGTGACCGTTGGTGCCAACGGCGTGATTACCGCCAAGATGAAGGGCTCGGGCTCGGTGAGCTCCAA
GGTCGCCGGTCGCGACTTTGCGCTGGTCCCGGCAGACGCTGGCGGCTCGATCACCTGGGACTGCAAGGGCTCCGCCACGA
CCATCGACGCCAAGTACCGTCCGTCGTCCTGCCGCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilE Neisseria gonorrhoeae MS11

49.682

100

0.538

  pilE Neisseria gonorrhoeae strain FA1090

50

100

0.538

  pilA2 Legionella pneumophila str. Paris

50

99.31

0.497

  pilA2 Legionella pneumophila strain ERS1305867

49.306

99.31

0.49

  comP Acinetobacter baylyi ADP1

45.455

100

0.483

  pilE Neisseria elongata subsp. glycolytica ATCC 29315

38.043

100

0.483

  pilA/pilA1 Eikenella corrodens VA1

42.675

100

0.462

  pilA Ralstonia pseudosolanacearum GMI1000

38.922

100

0.448

  pilA Acinetobacter baumannii strain A118

39.865

100

0.407

  pilA Vibrio cholerae O1 biovar El Tor strain E7946

38.562

100

0.407

  pilA Vibrio cholerae strain A1552

38.562

100

0.407

  pilA Vibrio cholerae C6706

38.562

100

0.407

  pilA/pilAI Pseudomonas stutzeri DSM 10701

40.845

97.931

0.4

  pilA Vibrio campbellii strain DS40M4

36.242

100

0.372

  pilA Haemophilus influenzae Rd KW20

37.5

99.31

0.372

  pilA Haemophilus influenzae 86-028NP

38.571

96.552

0.372


Multiple sequence alignment