Detailed information    

In silico: bioinformatically predicted

Overview


Name: comP
Type: Machinery gene
Locus tag: E4T54_RS04560
Genome accession: NZ_CP038271
Coordinates: 1044253..1044696 (-)
Length: 147 a.a.
NCBI ID: WP_028385927.1
UniProt ID: A0A0W0U957
Organism: Legionella geestiana strain 1308
Function: assembly of type IV pilus (predicted from homology); DNA binding and uptake
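
The coordinates and the protein length in this overview can be cross-checked with simple arithmetic. A minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene ends with a single stop codon (the variable names are illustrative, not part of the database):

```python
# Coordinates copied from the overview; assumed 1-based and inclusive.
start, end = 1044253, 1044696

gene_length_bp = end - start + 1      # inclusive span on the genome
codons = gene_length_bp // 3          # complete codons, including the stop
protein_length_aa = codons - 1        # assumes one terminal stop codon

print(gene_length_bp, protein_length_aa)  # 444 147
```

Both values agree with the 444 bp and 147 a.a. lengths reported in this record.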

Genomic Context


Location: 1039253..1049696
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                            Description
E4T54_RS04550 (E4T54_04550)  letS       1040238..1043000 (+)  2763       WP_172607336.1  two-component system sensor histidine kinase LetS  Regulator
E4T54_RS04555 (E4T54_04555)  -          1043011..1044243 (+)  1233       WP_028385926.1  6-phosphofructokinase                              -
E4T54_RS04560 (E4T54_04560)  comP       1044253..1044696 (-)  444        WP_028385927.1  pilin                                              Machinery gene
E4T54_RS04565 (E4T54_04565)  -          1044814..1045137 (+)  324        WP_028385928.1  BolA family protein                                -
E4T54_RS04570 (E4T54_04570)  -          1045214..1046617 (+)  1404       WP_028385929.1  APC family permease                                -
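
The genomic context table lends itself to a few consistency checks. The Python sketch below recomputes each gene size from its coordinates, the intergenic gaps between neighbours, and the flanks of the Location window around comP. It assumes 1-based inclusive coordinates; that the window is the gene extended by a fixed flank is an inference from the numbers, not something the record states.

```python
# Rows copied from the genomic context table: (locus tag, start, end, size in bp).
genes = [
    ("E4T54_RS04550", 1040238, 1043000, 2763),
    ("E4T54_RS04555", 1043011, 1044243, 1233),
    ("E4T54_RS04560", 1044253, 1044696, 444),   # comP
    ("E4T54_RS04565", 1044814, 1045137, 324),
    ("E4T54_RS04570", 1045214, 1046617, 1404),
]
target = (1044253, 1044696)   # comP coordinates
window = (1039253, 1049696)   # "Location" line of this section

# Size should equal the inclusive coordinate span.
sizes_consistent = all(end - start + 1 == size for _, start, end, size in genes)

# Bases lying strictly between consecutive genes.
gaps_bp = [nxt[1] - cur[2] - 1 for cur, nxt in zip(genes, genes[1:])]

# Distance from each window edge to the corresponding gene edge.
flanks_bp = (target[0] - window[0], window[1] - target[1])

print(sizes_consistent)  # True
print(gaps_bp)           # [10, 9, 117, 76]
print(flanks_bp)         # (5000, 5000)
```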

Sequence


Protein


Length: 147 a.a.; Molecular weight: 15699.34 Da; Isoelectric point: 7.7814

>NTDB_id=353036 E4T54_RS04560 WP_028385927.1 1044253..1044696(-) (comP) [Legionella geestiana strain 1308]
MKQKGFTLIELMIVLAIIGILAAIAVPAYRDYTVRARVMEGLNLATAAKTSVVEAAQTAGGLEAITAENIGYTFSGATEN
VKTITVAPKTGIITIRMQPVAMDVELLLTPVESPRGSGALTWRCSVKSPEENRYVPQMCRVSDLNAS
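
The FASTA header above packs several fields into one line. A minimal Python sketch of how it could be parsed, assuming the field order seen in this record (ID, locus tag, protein accession, coordinates with strand, gene name, organism). The regular expression and field names are hypothetical and not an official format specification.

```python
import re

# Header and sequence lines copied from this record.
header = (">NTDB_id=353036 E4T54_RS04560 WP_028385927.1 "
          "1044253..1044696(-) (comP) [Legionella geestiana strain 1308]")
sequence_lines = [
    "MKQKGFTLIELMIVLAIIGILAAIAVPAYRDYTVRARVMEGLNLATAAKTSVVEAAQTAGGLEAITAENIGYTFSGATEN",
    "VKTITVAPKTGIITIRMQPVAMDVELLLTPVESPRGSGALTWRCSVKSPEENRYVPQMCRVSDLNAS",
]

# Hypothetical pattern matching the layout of this particular header.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)

match = HEADER_RE.match(header)
fields = match.groupdict() if match else {}
protein = "".join(sequence_lines)   # FASTA bodies may wrap across lines

print(fields["gene"], fields["strand"], len(protein))  # comP - 147
```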

Nucleotide


Length: 444 bp

>NTDB_id=353036 E4T54_RS04560 WP_028385927.1 1044253..1044696(-) (comP) [Legionella geestiana strain 1308]
ATGAAACAAAAAGGATTCACACTCATAGAACTGATGATAGTGCTTGCCATCATCGGTATCCTGGCAGCCATCGCTGTACC
CGCCTATCGCGATTACACTGTGCGTGCGCGGGTGATGGAGGGGCTCAATCTTGCCACAGCCGCAAAAACGTCTGTGGTGG
AAGCCGCGCAAACCGCGGGCGGTCTTGAAGCCATTACCGCCGAGAACATCGGGTATACGTTTTCTGGAGCAACCGAAAAT
GTCAAAACCATTACGGTTGCCCCAAAAACCGGGATTATTACCATTCGCATGCAGCCGGTAGCCATGGACGTGGAGCTGCT
GCTGACGCCTGTGGAGTCTCCCAGAGGCTCAGGCGCCCTGACCTGGCGCTGCTCGGTCAAGTCGCCTGAAGAAAACCGCT
ATGTGCCACAGATGTGTCGAGTCAGCGACTTAAATGCATCCTGA
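The nucleotide and protein sequences of this record can be checked against each other by translation. The Python sketch below uses the standard codon assignments. It assumes the nucleotide sequence is already given as the coding strand in 5' to 3' direction (it starts with ATG although the gene lies on the minus strand), so no reverse complement is applied. The `translate` helper is illustrative.

```python
from itertools import product

# Sequences copied from this record.
NUCLEOTIDE = (
    "ATGAAACAAAAAGGATTCACACTCATAGAACTGATGATAGTGCTTGCCATCATCGGTATCCTGGCAGCCATCGCTGTACC"
    "CGCCTATCGCGATTACACTGTGCGTGCGCGGGTGATGGAGGGGCTCAATCTTGCCACAGCCGCAAAAACGTCTGTGGTGG"
    "AAGCCGCGCAAACCGCGGGCGGTCTTGAAGCCATTACCGCCGAGAACATCGGGTATACGTTTTCTGGAGCAACCGAAAAT"
    "GTCAAAACCATTACGGTTGCCCCAAAAACCGGGATTATTACCATTCGCATGCAGCCGGTAGCCATGGACGTGGAGCTGCT"
    "GCTGACGCCTGTGGAGTCTCCCAGAGGCTCAGGCGCCCTGACCTGGCGCTGCTCGGTCAAGTCGCCTGAAGAAAACCGCT"
    "ATGTGCCACAGATGTGTCGAGTCAGCGACTTAAATGCATCCTGA"
)
PROTEIN = (
    "MKQKGFTLIELMIVLAIIGILAAIAVPAYRDYTVRARVMEGLNLATAAKTSVVEAAQTAGGLEAITAENIGYTFSGATEN"
    "VKTITVAPKTGIITIRMQPVAMDVELLLTPVESPRGSGALTWRCSVKSPEENRYVPQMCRVSDLNAS"
)

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)

print(len(NUCLEOTIDE), translate(NUCLEOTIDE) == PROTEIN)  # 444 True
```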


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source        ID
AlphaFold DB  A0A0W0U957

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein      Organism                                  Identities (%)  Coverage (%)  Ha-value
comP         Acinetobacter baylyi ADP1                 50              100           0.503
pilA2        Legionella pneumophila strain ERS1305867  49.286          95.238        0.469
pilA2        Legionella pneumophila str. Paris         49.286          95.238        0.469
pilA         Ralstonia pseudosolanacearum GMI1000      40.123          100           0.442
pilE         Neisseria gonorrhoeae MS11                35.032          100           0.374
pilA/pilAII  Pseudomonas stutzeri DSM 10701            40.146          93.197        0.374
pilA/pilA1   Eikenella corrodens VA1                   37.162          100           0.374
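
This record does not define the Ha-value. The listed figures are, however, numerically consistent with the number of identical aligned residues divided by the query length (147 a.a.). The Python sketch below illustrates that reading. The match counts are inferred from the percentages and are an assumption, not database output, and the formula itself is a guess that happens to fit these seven rows.

```python
QUERY_LENGTH = 147  # length of this comP protein in amino acids

# (protein, organism, inferred identical residues, Ha-value listed in the table)
rows = [
    ("comP", "Acinetobacter baylyi ADP1", 74, 0.503),
    ("pilA2", "Legionella pneumophila strain ERS1305867", 69, 0.469),
    ("pilA2", "Legionella pneumophila str. Paris", 69, 0.469),
    ("pilA", "Ralstonia pseudosolanacearum GMI1000", 65, 0.442),
    ("pilE", "Neisseria gonorrhoeae MS11", 55, 0.374),
    ("pilA/pilAII", "Pseudomonas stutzeri DSM 10701", 55, 0.374),
    ("pilA/pilA1", "Eikenella corrodens VA1", 55, 0.374),
]

# Assumed formula: identical residues / query length, rounded to 3 decimals.
computed = [round(matches / QUERY_LENGTH, 3) for _, _, matches, _ in rows]

for (protein, organism, _, listed), value in zip(rows, computed):
    print(f"{protein:12s} {organism:42s} listed={listed:.3f} computed={value:.3f}")
```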


Multiple sequence alignment