Detailed information    

insolico Bioinformatically predicted

Overview


Name   comN   Type   Machinery gene
Locus tag   ACDK45_RS03215 Genome accession   NZ_CP167879
Coordinates   625594..626106 (+) Length   170 a.a.
NCBI ID   WP_011272299.1    Uniprot ID   Q4QLX0
Organism   Haemophilus influenzae strain NTHi52     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 620594..631106
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACDK45_RS03195 (ACDK45_03195) nrfF 620898..622040 (-) 1143 WP_005688390.1 heme lyase NrfEFG subunit NrfF -
  ACDK45_RS03200 (ACDK45_03200) - 622037..622567 (-) 531 WP_005688388.1 DsbE family thiol:disulfide interchange protein -
  ACDK45_RS03205 (ACDK45_03205) nrfE 622567..624473 (-) 1907 Protein_602 heme lyase NrfEFG subunit NrfE -
  ACDK45_RS03210 (ACDK45_03210) suhB 624582..625394 (-) 813 WP_011272298.1 inositol-1-monophosphatase -
  ACDK45_RS03215 (ACDK45_03215) comN 625594..626106 (+) 513 WP_011272299.1 Tfp pilus assembly protein FimT/FimU Machinery gene
  ACDK45_RS03220 (ACDK45_03220) comO 626106..626825 (+) 720 WP_011272300.1 type II secretion system protein J Machinery gene
  ACDK45_RS03225 (ACDK45_03225) comP 626822..627505 (+) 684 WP_011272301.1 DUF2572 family protein Machinery gene
  ACDK45_RS03230 (ACDK45_03230) comQ 627498..627785 (+) 288 WP_021034778.1 DUF5374 domain-containing protein Machinery gene

Sequence


Protein


Download         Length: 170 a.a.        Molecular weight: 19874.13 Da        Isoelectric Point: 8.7271

>NTDB_id=1040083 ACDK45_RS03215 WP_011272299.1 625594..626106(+) (comN) [Haemophilus influenzae strain NTHi52]
MQKGVTLVELLIGLAIISIVLSFVVPLWQTDSPKTILAKEQHRLYLFLRQIQVRAENSSEVWFLLINRNLATQQWCLTAQ
VKNNQTCDCLNPINCPKEVYAHFYYPYFPNKTMIQSHHIYPKEITRFDGIRNTIVTRCFILQAENERTLFLFFNVGSIRL
KTNQFDSACN

Nucleotide


Download         Length: 513 bp        

>NTDB_id=1040083 ACDK45_RS03215 WP_011272299.1 625594..626106(+) (comN) [Haemophilus influenzae strain NTHi52]
ATGCAGAAAGGTGTGACATTAGTGGAATTATTGATTGGATTAGCAATCATCAGCATTGTGCTGAGTTTTGTCGTGCCATT
ATGGCAAACCGATTCACCTAAAACGATTTTAGCCAAAGAGCAACATCGCTTGTATTTATTTCTACGACAAATTCAGGTTC
GTGCAGAAAATTCATCGGAAGTGTGGTTTTTACTTATCAATCGGAACCTTGCGACACAACAATGGTGCTTAACGGCACAA
GTAAAAAATAACCAAACTTGTGATTGTTTAAATCCGATAAATTGCCCGAAAGAGGTTTATGCGCATTTTTACTATCCTTA
TTTTCCTAATAAAACGATGATTCAAAGCCATCATATTTATCCAAAAGAAATCACGAGATTTGATGGCATTCGTAATACTA
TCGTTACTCGTTGCTTTATTTTGCAAGCAGAAAATGAACGTACGTTATTTTTATTTTTCAATGTTGGCAGTATTCGTTTA
AAAACCAATCAATTTGATAGTGCTTGTAATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q4QLX0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comN Haemophilus influenzae Rd KW20

95.882

100

0.959


Multiple sequence alignment