Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilE
Type: Machinery gene
Locus tag: CA923_RS03165
Genome accession: NZ_CP021267
Coordinates: 662486..662935 (-)
Length: 149 a.a.
NCBI ID: WP_010946364.1
UniProt ID: Q5ZXV4
Organism: Legionella pneumophila subsp. pneumophila strain Burlington
Function: assembly of type IV pilus (predicted from homology)
DNA binding and uptake
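
The coordinates and the protein length listed above are consistent with each other. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (the usual convention for `start..end` locations) and that the span includes the stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive start..end interval."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


cds_bp = span_length(662486, 662935)
print(cds_bp, protein_length(cds_bp))  # prints: 450 149
```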

Genomic Context


Location: 657486..667935
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CA923_RS03145 - 657761..658153 (-) 393 WP_010946360.1 hypothetical protein -
  CA923_RS03150 - 658290..658667 (+) 378 WP_010946361.1 hypothetical protein -
  CA923_RS03155 - 658868..660175 (+) 1308 WP_011213222.1 tetratricopeptide repeat protein -
  CA923_RS03160 comEC 660201..662408 (-) 2208 WP_010946363.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  CA923_RS03165 pilE 662486..662935 (-) 450 WP_010946364.1 type IV pilin protein Machinery gene
  CA923_RS03170 - 662947..666456 (-) 3510 WP_011213223.1 PilC/PilY family type IV pilus protein -
  CA923_RS03175 - 666469..666981 (-) 513 WP_010946366.1 pilus assembly protein -
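
The displayed location appears to be the pilE coordinates extended by 5000 bp on each side, and each listed size matches its coordinate span. The sketch below checks both points; the 5000 bp flank is inferred from the numbers rather than stated on the page, and the helper names are hypothetical.

```python
import re

FLANK_BP = 5000  # assumption: inferred from the listed coordinates, not documented


def parse_location(text: str) -> tuple[int, int]:
    """Parse a 1-based inclusive 'start..end' string into two integers."""
    match = re.fullmatch(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_bp(location: str) -> int:
    """Length in bp of a 'start..end' location."""
    start, end = parse_location(location)
    return end - start + 1


gene_start, gene_end = parse_location("662486..662935")
window = (gene_start - FLANK_BP, gene_end + FLANK_BP)

# (locus tag, coordinates, size in bp) copied from the genomic context table
genes = [
    ("CA923_RS03145", "657761..658153", 393),
    ("CA923_RS03150", "658290..658667", 378),
    ("CA923_RS03155", "658868..660175", 1308),
    ("CA923_RS03160", "660201..662408", 2208),
    ("CA923_RS03165", "662486..662935", 450),
    ("CA923_RS03170", "662947..666456", 3510),
    ("CA923_RS03175", "666469..666981", 513),
]
mismatches = [tag for tag, loc, size in genes if span_bp(loc) != size]

print(window, mismatches)  # prints: (657486, 667935) []
```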

Sequence


Protein


Length: 149 a.a.
Molecular weight: 16296.57 Da
Isoelectric point: 7.6922

>NTDB_id=229409 CA923_RS03165 WP_010946364.1 662486..662935(-) (pilE) [Legionella pneumophila subsp. pneumophila strain Burlington]
MLSHVHFMKNSRMKQSAFTLVEVLISMVIMGILVSIAYPSYLQYIQKSRRADAHATLTQDQIILERCYSQNFSYAAACGA
LPAFPQTTPNGYYTINISNLTATTYTLTATPVGTQAKDTECASMSINQANVKTAVDSSANAQPECWNPG
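
The reported molecular weight can be approximated by summing average residue masses over the sequence and adding one water molecule. This is a sketch under stated assumptions: the mass table holds standard average residue masses and is not taken from the database, which does not say how its value was computed. The result applies to the unmodified, full-length chain, and under these assumptions it should agree with the reported 16296.57 Da to within rounding.

```python
# Average (not monoisotopic) residue masses in Da. These are standard
# reference values, assumed here; the database page does not list them.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one H2O for the free N- and C-termini

PILE_PROTEIN = (
    "MLSHVHFMKNSRMKQSAFTLVEVLISMVIMGILVSIAYPSYLQYIQKSRRADAHATLTQDQIILERCYSQNFSYAAACGA"
    "LPAFPQTTPNGYYTINISNLTATTYTLTATPVGTQAKDTECASMSINQANVKTAVDSSANAQPECWNPG"
)


def molecular_weight(seq: str) -> float:
    """Average molecular weight in Da of an unmodified linear peptide."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


print(len(PILE_PROTEIN), round(molecular_weight(PILE_PROTEIN), 2))
```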

Nucleotide


Length: 450 bp

>NTDB_id=229409 CA923_RS03165 WP_010946364.1 662486..662935(-) (pilE) [Legionella pneumophila subsp. pneumophila strain Burlington]
ATGCTCAGTCATGTACATTTTATGAAGAATAGCCGTATGAAACAATCCGCGTTTACCCTGGTTGAAGTTCTGATCAGCAT
GGTCATTATGGGCATTCTGGTTTCAATTGCCTATCCATCCTATTTACAATATATCCAAAAATCCCGTCGTGCCGATGCTC
ACGCCACATTGACACAAGATCAAATTATTTTAGAACGCTGTTATTCACAGAATTTTTCTTATGCTGCGGCGTGTGGCGCC
TTACCAGCATTTCCTCAAACAACGCCGAACGGGTACTATACTATCAATATTTCAAACCTGACAGCCACAACGTATACCTT
AACTGCAACCCCTGTTGGAACTCAAGCCAAAGATACCGAGTGTGCCAGCATGTCAATTAACCAGGCCAATGTAAAAACAG
CAGTAGATTCCTCCGCTAATGCGCAACCAGAATGCTGGAATCCCGGTTAA
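
Although the gene lies on the minus strand, the nucleotide record above is already written in the coding direction: it starts with ATG and ends with TAA, so it can be translated without reverse-complementing. The sketch below translates it with the standard codon assignments (shared by the bacterial table 11) and does not depend on any external library; the names `CODON_TABLE`, `PILE_CDS` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI base order (T, C, A, G); "*" marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Coding sequence copied from the nucleotide record above.
PILE_CDS = (
    "ATGCTCAGTCATGTACATTTTATGAAGAATAGCCGTATGAAACAATCCGCGTTTACCCTGGTTGAAGTTCTGATCAGCAT"
    "GGTCATTATGGGCATTCTGGTTTCAATTGCCTATCCATCCTATTTACAATATATCCAAAAATCCCGTCGTGCCGATGCTC"
    "ACGCCACATTGACACAAGATCAAATTATTTTAGAACGCTGTTATTCACAGAATTTTTCTTATGCTGCGGCGTGTGGCGCC"
    "TTACCAGCATTTCCTCAAACAACGCCGAACGGGTACTATACTATCAATATTTCAAACCTGACAGCCACAACGTATACCTT"
    "AACTGCAACCCCTGTTGGAACTCAAGCCAAAGATACCGAGTGTGCCAGCATGTCAATTAACCAGGCCAATGTAAAAACAG"
    "CAGTAGATTCCTCCGCTAATGCGCAACCAGAATGCTGGAATCCCGGTTAA"
)


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


protein = translate(PILE_CDS)
print(len(PILE_CDS), len(protein), protein[:10])  # prints: 450 149 MLSHVHFMKN
```

Comparing `protein` with the protein record is a quick way to confirm that the two sequences on this page describe the same gene product.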


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q5ZXV4

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilE Legionella pneumophila str. Paris 100 100 1
  pilE Legionella pneumophila strain ERS1305867 100 100 1


Multiple sequence alignment