Detailed information    

insolico Bioinformatically predicted

Overview


Name   comN   Type   Machinery gene
Locus tag   INP89_RS05350 Genome accession   NZ_CP063127
Coordinates   1086222..1086734 (-) Length   170 a.a.
NCBI ID   WP_005651545.1    Uniprot ID   A0A0H3PJN2
Organism   Haemophilus influenzae strain M1C112_1     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1081222..1091734
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  INP89_RS05335 (INP89_05330) comQ 1084528..1084815 (-) 288 WP_012055313.1 DUF5374 domain-containing protein Machinery gene
  INP89_RS05340 (INP89_05335) comP 1084808..1085491 (-) 684 WP_012055312.1 DUF2572 family protein Machinery gene
  INP89_RS05345 (INP89_05340) comO 1085506..1086222 (-) 717 WP_012055311.1 type II secretion system protein J Machinery gene
  INP89_RS05350 (INP89_05345) comN 1086222..1086734 (-) 513 WP_005651545.1 Tfp pilus assembly protein FimT/FimU Machinery gene
  INP89_RS05355 (INP89_05350) suhB 1086934..1087737 (+) 804 WP_012055310.1 inositol-1-monophosphatase -
  INP89_RS05360 (INP89_05355) nrfE 1087845..1089752 (+) 1908 WP_012055309.1 heme lyase NrfEFG subunit NrfE -
  INP89_RS05365 (INP89_05360) - 1089749..1090279 (+) 531 WP_041174822.1 DsbE family thiol:disulfide interchange protein -
  INP89_RS05370 (INP89_05365) nrfF 1090276..1091430 (+) 1155 WP_012055307.1 heme lyase NrfEFG subunit NrfF -

Sequence


Protein


Download         Length: 170 a.a.        Molecular weight: 19877.15 Da        Isoelectric Point: 8.9743

>NTDB_id=493227 INP89_RS05350 WP_005651545.1 1086222..1086734(-) (comN) [Haemophilus influenzae strain M1C112_1]
MQKGMTLVELLIGLAIISIVLNFAVPLWKTDSPKTILAKEQHRLYLFLRQIQARAENSSEVWFLLINRNLATQQWCLTAQ
VKNNQTCDCLNPINCPKEVYAHFYYPYFPNKTMIQSHHIYPKEITRFDGIRNTIVTRCFILQAENERTLFLFFNVGSIRL
KTNQFDSACN

Nucleotide


Download         Length: 513 bp        

>NTDB_id=493227 INP89_RS05350 WP_005651545.1 1086222..1086734(-) (comN) [Haemophilus influenzae strain M1C112_1]
ATGCAGAAAGGTATGACATTAGTGGAATTATTGATTGGGTTAGCCATTATCAGTATTGTGCTGAATTTTGCAGTACCATT
ATGGAAAACCGATTCGCCTAAAACGATTTTAGCCAAAGAGCAACATCGCCTGTATTTATTTCTACGCCAAATTCAGGCTC
GGGCAGAAAATTCATCGGAAGTGTGGTTTTTACTTATCAATCGTAACCTTGCGACACAGCAATGGTGCTTAACGGCACAA
GTAAAAAATAACCAAACTTGTGATTGTTTAAATCCGATAAATTGCCCGAAAGAGGTTTATGCTCATTTTTACTACCCTTA
TTTTCCTAACAAAACGATGATTCAAAGTCATCATATTTATCCAAAAGAAATCACGAGATTTGATGGCATTCGTAATACTA
TCGTTACTCGTTGCTTTATTTTGCAAGCTGAAAATGAACGTACGTTATTTTTATTTTTCAATGTTGGCAGTATTCGTTTG
AAAACTAATCAATTTGATAGTGCTTGTAATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0H3PJN2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comN Haemophilus influenzae Rd KW20

98.824

100

0.988