Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilE
Type: Machinery gene
Locus tag: EO087_RS15065
Genome accession: NZ_CP035300
Coordinates: 3284936..3285379 (-)
Length: 147 a.a.
NCBI ID: WP_128899580.1
UniProt ID: A0A410ULL4
Organism: Dyella sp. M7H15-1
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake
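
The coordinates and the protein length in the overview are consistent with each other, which a short check makes explicit. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon; the helper name `span_length` is illustrative, not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Coordinates of pilE (EO087_RS15065) as listed in the overview.
gene_bp = span_length(3284936, 3285379)

# Assumption: the final codon is a stop codon and encodes no residue.
codons = gene_bp // 3
protein_aa = codons - 1

print(gene_bp, protein_aa)  # 444 147
```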

Genomic Context


Location: 3279936..3290379
Locus tag                     Gene name   Coordinates (strand)   Size (bp)   Protein ID       Product                                 Description
EO087_RS15055 (EO087_15055)   -           3280868..3282844 (-)   1977        WP_128899578.1   hypothetical protein                    -
EO087_RS15060 (EO087_15060)   -           3282841..3284373 (-)   1533        WP_240669069.1   tetratricopeptide repeat protein        -
EO087_RS15065 (EO087_15065)   pilE        3284936..3285379 (-)   444         WP_128899580.1   pilin                                   Machinery gene
EO087_RS15070 (EO087_15070)   -           3285790..3286230 (-)   441         WP_128899581.1   pilin                                   -
EO087_RS15075 (EO087_15075)   -           3286445..3287725 (-)   1281        WP_164931867.1   glycosyltransferase family 39 protein   -
EO087_RS15080 (EO087_15080)   -           3287889..3290027 (+)   2139        WP_128899583.1   S9 family peptidase                     -
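
The displayed location (3279936..3290379) matches the pilE coordinates extended by 5000 bp on each side. The sketch below reproduces that window and lists the tabulated genes that fall entirely inside it. The 5000 bp flank is inferred from the numbers on this page and is not a documented parameter; `FLANK` and the dictionary layout are illustrative.

```python
FLANK = 5000  # assumed flank size, inferred from the coordinates on this page

gene = (3284936, 3285379)  # pilE, 1-based inclusive
window = (gene[0] - FLANK, gene[1] + FLANK)

# Coordinates copied from the genomic context table.
context = {
    "EO087_RS15055": (3280868, 3282844),
    "EO087_RS15060": (3282841, 3284373),
    "EO087_RS15065": (3284936, 3285379),
    "EO087_RS15070": (3285790, 3286230),
    "EO087_RS15075": (3286445, 3287725),
    "EO087_RS15080": (3287889, 3290027),
}

# Keep genes whose full span lies within the window.
inside = [
    tag for tag, (start, end) in context.items()
    if window[0] <= start and end <= window[1]
]

print(window)
print(inside)
```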

Sequence


Protein


Length: 147 a.a.   Molecular weight: 15046.26 Da   Isoelectric point: 9.0355

>NTDB_id=339065 EO087_RS15065 WP_128899580.1 3284936..3285379(-) (pilE) [Dyella sp. M7H15-1]
MKKIQQGFTLIELMIVVAIIAILAAIAIPAYQNYLIRAQVSEGAVLADGAKTAVGEFFTNTGRLPGNNTSAGLAASTSIT
GKYVSSVTVAGTGITAAFSQAATNAAIYSDIFALSPITSAGSIVWHCGSTQTTVPQKYLPTSCRNGG

Nucleotide


Length: 444 bp

>NTDB_id=339065 EO087_RS15065 WP_128899580.1 3284936..3285379(-) (pilE) [Dyella sp. M7H15-1]
ATGAAGAAAATCCAGCAGGGCTTCACCCTTATCGAACTGATGATCGTAGTTGCGATCATTGCCATCCTAGCGGCCATTGC
CATCCCGGCTTATCAGAACTACCTGATCCGCGCACAAGTTTCTGAAGGTGCGGTGCTAGCCGATGGCGCCAAGACGGCTG
TAGGCGAGTTCTTTACCAATACGGGCCGCCTGCCAGGCAACAACACCTCTGCTGGTCTCGCGGCGTCCACCAGCATCACC
GGTAAGTACGTCAGCTCCGTTACAGTCGCCGGCACCGGAATCACCGCAGCCTTCAGTCAGGCCGCCACCAACGCTGCCAT
CTATAGTGACATTTTTGCGCTGTCGCCCATTACCAGTGCGGGCAGCATCGTTTGGCACTGCGGCAGCACCCAAACCACCG
TTCCCCAGAAGTACCTGCCCACTAGCTGCCGTAACGGCGGCTGA
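
Although the gene lies on the minus strand, the nucleotide record above is already in coding orientation: it starts with ATG and ends with TGA. It can therefore be translated directly, without reverse-complementing. The following is a minimal sketch using a hand-built standard codon table; the names `CODON_TABLE` and `translate` are illustrative. It assumes the standard codon assignments apply and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Nucleotide record for EO087_RS15065, copied from the FASTA block above.
CDS = (
    "ATGAAGAAAATCCAGCAGGGCTTCACCCTTATCGAACTGATGATCGTAGTTGCGATCATTGCCATCCTAGCGGCCATTGC"
    "CATCCCGGCTTATCAGAACTACCTGATCCGCGCACAAGTTTCTGAAGGTGCGGTGCTAGCCGATGGCGCCAAGACGGCTG"
    "TAGGCGAGTTCTTTACCAATACGGGCCGCCTGCCAGGCAACAACACCTCTGCTGGTCTCGCGGCGTCCACCAGCATCACC"
    "GGTAAGTACGTCAGCTCCGTTACAGTCGCCGGCACCGGAATCACCGCAGCCTTCAGTCAGGCCGCCACCAACGCTGCCAT"
    "CTATAGTGACATTTTTGCGCTGTCGCCCATTACCAGTGCGGGCAGCATCGTTTGGCACTGCGGCAGCACCCAAACCACCG"
    "TTCCCCAGAAGTACCTGCCCACTAGCTGCCGTAACGGCGGCTGA"
)

protein = translate(CDS)
print(len(CDS), len(protein))
print(protein)
```

The translated sequence can be compared against the protein record listed earlier in this section to confirm that the two FASTA entries agree.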


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB   ID: A0A410ULL4

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein      Organism                                             Identities (%)   Coverage (%)   Ha-value
pilE         Neisseria gonorrhoeae MS11                           44.375           100            0.483
pilE         Neisseria gonorrhoeae strain FA1090                  44.025           100            0.476
pilE         Neisseria elongata subsp. glycolytica ATCC 29315     35.979           100            0.463
pilA         Ralstonia pseudosolanacearum GMI1000                 39.759           100            0.449
pilA2        Legionella pneumophila str. Paris                    45.775           96.599         0.442
pilA2        Legionella pneumophila strain ERS1305867             45.775           96.599         0.442
comP         Acinetobacter baylyi ADP1                            41.333           100            0.422
pilA/pilA1   Eikenella corrodens VA1                              38.961           100            0.408
pilA         Acinetobacter baumannii strain A118                  39.161           97.279         0.381
pilA         Pseudomonas aeruginosa PAK                           35.526           100            0.367


Multiple sequence alignment