Detailed information    

insolico Bioinformatically predicted

Overview


Name   pilA   Type   Machinery gene
Locus tag   DV401_RS04775 Genome accession   NZ_CP031254
Coordinates   935458..935907 (+) Length   149 a.a.
NCBI ID   WP_103708055.1    Uniprot ID   -
Organism   Haemophilus influenzae strain M25588     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 930458..940907
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV401_RS04755 rsmE 930923..931660 (-) 738 WP_014550478.1 16S rRNA (uracil(1498)-N(3))-methyltransferase -
  DV401_RS04760 lnt 931710..933239 (-) 1530 WP_080481790.1 apolipoprotein N-acyltransferase -
  DV401_RS04765 corC 933262..934161 (-) 900 WP_005656283.1 CNNM family magnesium/cobalt transport protein CorC -
  DV401_RS04770 ampD 934792..935343 (-) 552 WP_071162800.1 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD -
  DV401_RS04775 pilA 935458..935907 (+) 450 WP_103708055.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  DV401_RS04780 pilB 935904..937298 (+) 1395 WP_114877323.1 GspE/PulE family protein Machinery gene
  DV401_RS04785 pilC 937295..938512 (+) 1218 WP_114877325.1 type II secretion system F family protein Machinery gene
  DV401_RS04790 pilD 938509..939201 (+) 693 WP_114877327.1 prepilin peptidase Machinery gene
  DV401_RS04795 rho 939255..940517 (-) 1263 WP_005666690.1 transcription termination factor Rho -

Sequence


Protein


Download         Length: 149 a.a.        Molecular weight: 15683.22 Da        Isoelectric Point: 9.3724

>NTDB_id=305104 DV401_RS04775 WP_103708055.1 935458..935907(+) (pilA) [Haemophilus influenzae strain M25588]
MKLTTQQTLKKGFTLIELMIVIAIIAILATIAIPSYQNYTKKAAVSELLQASAPYKADVELCVYSTKEITNCTGGKNGIA
ADIKTTKGYVKSVTTSNGAITVKGDGTLANMEYILQATGNAATGVTWTTTCKGTDASLFPANFCGSVKK

Nucleotide


Download         Length: 450 bp        

>NTDB_id=305104 DV401_RS04775 WP_103708055.1 935458..935907(+) (pilA) [Haemophilus influenzae strain M25588]
ATGAAACTAACAACACAGCAAACCTTGAAAAAAGGGTTTACATTAATAGAGCTAATGATTGTGATTGCAATTATTGCTAT
TTTAGCCACTATCGCAATTCCTTCTTATCAAAATTATACTAAAAAAGCAGCGGTATCTGAATTACTGCAAGCGTCAGCAC
CTTATAAGGCTGATGTGGAATTATGCGTTTATAGCACAAAGGAAATAACAAACTGTACGGGTGGAAAAAATGGTATTGCA
GCGGATATAAAGACAACAAAAGGCTATGTAAAATCAGTGACAACAAGCAACGGTGCAATAACAGTAAAAGGGGATGGCAC
ATTGGCAAATATGGAATATATTTTGCAAGCTACAGGTAATGCTGCAACAGGTGTAACTTGGACAACAACTTGCAAAGGAA
CGGATGCCTCTTTATTTCCAGCAAATTTTTGCGGAAGTGTCAAAAAATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  pilA Haemophilus influenzae 86-028NP

95.973

100

0.96

  pilA Haemophilus influenzae Rd KW20

90.604

100

0.906

  pilA Glaesserella parasuis strain SC1401

54.73

99.329

0.544

  pilA2 Legionella pneumophila str. Paris

42.177

98.658

0.416

  pilA2 Legionella pneumophila strain ERS1305867

41.497

98.658

0.409

  pilA Vibrio campbellii strain DS40M4

39.716

94.631

0.376


Multiple sequence alignment