Detailed information    

In silico: bioinformatically predicted

Overview


Name: pilA
Type: Machinery gene
Locus tag: INP94_RS00255
Genome accession: NZ_CP063120
Coordinates: 54120..54563 (+)
Length: 147 a.a.
NCBI ID: WP_005697917.1
UniProt ID: A0A369YN52
Organism: Haemophilus parainfluenzae strain M1C137_2
Function: type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake
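The coordinates and the two reported lengths (444 bp, 147 a.a.) are mutually consistent, which a short check can confirm. This is a minimal sketch that assumes 1-based inclusive coordinates and an annotated span that includes the stop codon; the function name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate span."""
    return end - start + 1


nt_len = span_length(54120, 54563)  # pilA coordinates from the record
codons = nt_len // 3                # codon count, including the terminal stop codon
aa_len = codons - 1                 # residues in the translated protein

print(nt_len, codons, aa_len)
```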

Genomic Context


Location: 49120..59563
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
INP94_RS00225 (INP94_00225) | tadA | 49339..49824 (-) | 486 | WP_049371643.1 | tRNA adenosine(34) deaminase TadA | -
INP94_RS00230 (INP94_00230) | - | 49861..50712 (-) | 852 | WP_049364941.1 | thymidylate synthase | -
INP94_RS00235 (INP94_00235) | lgt | 50726..51532 (-) | 807 | WP_070582537.1 | prolipoprotein diacylglyceryl transferase | -
INP94_RS00240 (INP94_00240) | - | 51541..52335 (-) | 795 | WP_049371641.1 | sulfite exporter TauE/SafE family protein | -
INP94_RS00245 (INP94_00245) | rppH | 52335..52928 (-) | 594 | WP_049371640.1 | RNA pyrophosphohydrolase | -
INP94_RS00250 (INP94_00250) | ampD | 53414..53977 (-) | 564 | WP_049371639.1 | 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD | -
INP94_RS00255 (INP94_00255) | pilA | 54120..54563 (+) | 444 | WP_005697917.1 | prepilin peptidase-dependent pilin | Machinery gene
INP94_RS00260 (INP94_00260) | pilB | 54570..55949 (+) | 1380 | WP_197543655.1 | GspE/PulE family protein | Machinery gene
INP94_RS00265 (INP94_00265) | pilC | 55958..57181 (+) | 1224 | WP_197543656.1 | type II secretion system F family protein | Machinery gene
INP94_RS00270 (INP94_00270) | pilD | 57178..57855 (+) | 678 | WP_197543657.1 | prepilin peptidase | Machinery gene
INP94_RS00275 (INP94_00275) | coaE | 57944..58564 (+) | 621 | WP_197543658.1 | dephospho-CoA kinase | -
INP94_RS00280 (INP94_00280) | yacG | 58557..58763 (+) | 207 | WP_005695505.1 | DNA gyrase inhibitor YacG | -
INP94_RS00285 (INP94_00285) | - | 58760..59041 (+) | 282 | WP_197543659.1 | GNAT family N-acetyltransferase | -
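The genomic context can be sanity-checked with a few lines of arithmetic. The sketch below copies the coordinates from the table, recomputes each gene size, reproduces the Location window, and measures the spacing between the adjacent pilA, pilB, pilC, and pilD genes. The 5000 bp flank is an assumption inferred from the Location line (it is not stated in the record), and all function names are hypothetical.

```python
# (gene label, start, end, strand, size in bp), copied from the table above.
# Unnamed genes are labeled by their locus tag.
GENES = [
    ("tadA", 49339, 49824, "-", 486),
    ("INP94_RS00230", 49861, 50712, "-", 852),
    ("lgt", 50726, 51532, "-", 807),
    ("INP94_RS00240", 51541, 52335, "-", 795),
    ("rppH", 52335, 52928, "-", 594),
    ("ampD", 53414, 53977, "-", 564),
    ("pilA", 54120, 54563, "+", 444),
    ("pilB", 54570, 55949, "+", 1380),
    ("pilC", 55958, 57181, "+", 1224),
    ("pilD", 57178, 57855, "+", 678),
    ("coaE", 57944, 58564, "+", 621),
    ("yacG", 58557, 58763, "+", 207),
    ("INP94_RS00285", 58760, 59041, "+", 282),
]

FLANK = 5000  # assumption: inferred from the Location line, not documented


def window(start, end, flank=FLANK):
    """Context window extending `flank` bp on each side of a gene."""
    return (start - flank, end + flank)


def size_mismatches(genes):
    """Genes whose listed size differs from end - start + 1."""
    return [name for name, start, end, _, size in genes if end - start + 1 != size]


def gaps(genes):
    """Base pairs between consecutive genes; a negative value is an overlap."""
    return {(a[0], b[0]): b[1] - a[2] - 1 for a, b in zip(genes, genes[1:])}


pil = [g for g in GENES if g[0].startswith("pil")]
print(window(54120, 54563))
print(size_mismatches(GENES))
print(gaps(pil))
```

Under these assumptions the four pil genes sit on the same strand with single-digit spacing, and pilC and pilD overlap by 4 bp. That layout is compatible with co-transcription, but the record itself makes no operon claim.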

Sequence


Protein


Length: 147 a.a.; Molecular weight: 15151.42 Da; Isoelectric point: 8.4674

>NTDB_id=493062 INP94_RS00255 WP_005697917.1 54120..54563(+) (pilA) [Haemophilus parainfluenzae strain M1C137_2]
MKLTFSKPLHKGFTLIELMIVIAIIAILATIAIPSYQNYTKKAAISELLQASAPYKSDVELCVYSTNAPTNCSGGSNGIA
ADITTAKGYVKSITTKSGVITVTGNGALDGISYSLTATGSTSSGVTWTTACPSNADLFPAGFCSPTK

Nucleotide


Length: 444 bp

>NTDB_id=493062 INP94_RS00255 WP_005697917.1 54120..54563(+) (pilA) [Haemophilus parainfluenzae strain M1C137_2]
ATGAAACTGACTTTTTCTAAACCTTTACATAAAGGTTTTACGTTAATTGAATTGATGATCGTGATTGCGATTATTGCTAT
TCTCGCGACGATCGCCATTCCGTCTTATCAAAATTATACCAAAAAAGCGGCTATCTCCGAATTGTTGCAAGCTTCGGCAC
CTTATAAATCTGATGTGGAGTTATGTGTCTATAGCACCAATGCCCCGACAAACTGTTCAGGTGGTTCAAATGGCATTGCA
GCAGACATCACTACTGCAAAAGGCTATGTTAAATCCATTACCACGAAATCGGGTGTGATTACGGTGACAGGAAATGGGGC
ATTGGATGGGATCAGTTATTCTTTAACGGCGACAGGCTCAACCTCATCAGGTGTCACCTGGACAACTGCTTGCCCAAGCA
ATGCGGATTTATTCCCTGCGGGTTTCTGCTCTCCTACCAAATAG
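The nucleotide and protein records can be cross-checked by translating the coding sequence. The sketch below builds the standard genetic code by hand (bacterial translation table 11 uses the same codon-to-residue assignments and differs only in permitted start codons) and compares the result with the protein sequence above. The names `CODON_TABLE`, `translate`, `PILA_DNA`, and `PILA_PROTEIN` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

PILA_DNA = (
    "ATGAAACTGACTTTTTCTAAACCTTTACATAAAGGTTTTACGTTAATTGAATTGATGATCGTGATTGCGATTATTGCTAT"
    "TCTCGCGACGATCGCCATTCCGTCTTATCAAAATTATACCAAAAAAGCGGCTATCTCCGAATTGTTGCAAGCTTCGGCAC"
    "CTTATAAATCTGATGTGGAGTTATGTGTCTATAGCACCAATGCCCCGACAAACTGTTCAGGTGGTTCAAATGGCATTGCA"
    "GCAGACATCACTACTGCAAAAGGCTATGTTAAATCCATTACCACGAAATCGGGTGTGATTACGGTGACAGGAAATGGGGC"
    "ATTGGATGGGATCAGTTATTCTTTAACGGCGACAGGCTCAACCTCATCAGGTGTCACCTGGACAACTGCTTGCCCAAGCA"
    "ATGCGGATTTATTCCCTGCGGGTTTCTGCTCTCCTACCAAATAG"
)

PILA_PROTEIN = (
    "MKLTFSKPLHKGFTLIELMIVIAIIAILATIAIPSYQNYTKKAAISELLQASAPYKSDVELCVYSTNAPTNCSGGSNGIA"
    "ADITTAKGYVKSITTKSGVITVTGNGALDGISYSLTATGSTSSGVTWTTACPSNADLFPAGFCSPTK"
)


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1; stop codons become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


translated = translate(PILA_DNA)
print(len(PILA_DNA), len(PILA_PROTEIN))
print(translated.rstrip("*") == PILA_PROTEIN)
```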


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | A0A369YN52

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
pilA | Haemophilus influenzae 86-028NP | 75.694 | 97.959 | 0.741
pilA | Haemophilus influenzae Rd KW20 | 73.611 | 97.959 | 0.721
pilA | Glaesserella parasuis strain SC1401 | 56.081 | 100 | 0.565
pilA | Vibrio campbellii strain DS40M4 | 41.096 | 99.32 | 0.408
pilE | Neisseria gonorrhoeae MS11 | 41.985 | 89.116 | 0.374
pilA2 | Legionella pneumophila str. Paris | 42.188 | 87.075 | 0.367
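The record does not define the Ha-value. As an unconfirmed working assumption, the sketch below tests whether it behaves like the product of the identity and coverage fractions. The function name `identity_times_coverage` and the 0.001 tolerance are illustrative choices, not part of the database.

```python
# (protein, organism, identities %, coverage %, listed Ha-value), copied from the table above.
ROWS = [
    ("pilA", "Haemophilus influenzae 86-028NP", 75.694, 97.959, 0.741),
    ("pilA", "Haemophilus influenzae Rd KW20", 73.611, 97.959, 0.721),
    ("pilA", "Glaesserella parasuis strain SC1401", 56.081, 100.0, 0.565),
    ("pilA", "Vibrio campbellii strain DS40M4", 41.096, 99.32, 0.408),
    ("pilE", "Neisseria gonorrhoeae MS11", 41.985, 89.116, 0.374),
    ("pilA2", "Legionella pneumophila str. Paris", 42.188, 87.075, 0.367),
]

TOLERANCE = 0.001  # illustrative threshold for "agrees to about three decimals"


def identity_times_coverage(identity_pct: float, coverage_pct: float) -> float:
    """Assumed score: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


within = []
for protein, organism, identity, coverage, listed in ROWS:
    estimate = identity_times_coverage(identity, coverage)
    agrees = abs(estimate - listed) <= TOLERANCE
    within.append(agrees)
    print(f"{protein:6s}{organism:38s}listed={listed:.3f} estimate={estimate:.3f} agrees={agrees}")

print(sum(within), "of", len(ROWS), "rows agree")
```

Five of the six rows agree within the tolerance. The Glaesserella parasuis row does not (about 0.561 estimated against 0.565 listed), so the product is at best an approximation of how the database computes the score.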