Detailed information    

insolico Bioinformatically predicted

Overview


Name   comN   Type   Machinery gene
Locus tag   CH628_RS00535 Genome accession   NZ_CP031681
Coordinates   93538..94050 (-) Length   170 a.a.
NCBI ID   WP_011272299.1    Uniprot ID   Q4QLX0
Organism   Haemophilus influenzae strain P669-6977     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 88538..99050
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CH628_RS00520 (CH628_00520) comQ 91859..92146 (-) 288 WP_021034778.1 DUF5374 domain-containing protein Machinery gene
  CH628_RS00525 (CH628_00525) comP 92139..92822 (-) 684 WP_011272301.1 DUF2572 family protein Machinery gene
  CH628_RS00530 (CH628_00530) comO 92819..93538 (-) 720 WP_011272300.1 type II secretion system protein J Machinery gene
  CH628_RS00535 (CH628_00535) comN 93538..94050 (-) 513 WP_011272299.1 type II secretion system protein Machinery gene
  CH628_RS00540 (CH628_00540) suhB 94250..95062 (+) 813 WP_011272298.1 inositol-1-monophosphatase -
  CH628_RS00545 (CH628_00545) nrfE 95171..97078 (+) 1908 WP_011272297.1 heme lyase NrfEFG subunit NrfE -
  CH628_RS00550 (CH628_00550) - 97078..97608 (+) 531 WP_005688388.1 DsbE family thiol:disulfide interchange protein -
  CH628_RS00555 (CH628_00555) nrfF 97605..98747 (+) 1143 WP_005688390.1 heme lyase NrfEFG subunit NrfF -

Sequence


Protein


Download         Length: 170 a.a.        Molecular weight: 19874.13 Da        Isoelectric Point: 8.7271

>NTDB_id=309869 CH628_RS00535 WP_011272299.1 93538..94050(-) (comN) [Haemophilus influenzae strain P669-6977]
MQKGVTLVELLIGLAIISIVLSFVVPLWQTDSPKTILAKEQHRLYLFLRQIQVRAENSSEVWFLLINRNLATQQWCLTAQ
VKNNQTCDCLNPINCPKEVYAHFYYPYFPNKTMIQSHHIYPKEITRFDGIRNTIVTRCFILQAENERTLFLFFNVGSIRL
KTNQFDSACN

Nucleotide


Download         Length: 513 bp        

>NTDB_id=309869 CH628_RS00535 WP_011272299.1 93538..94050(-) (comN) [Haemophilus influenzae strain P669-6977]
ATGCAGAAAGGTGTGACATTAGTGGAATTATTGATTGGATTAGCAATCATCAGCATTGTGCTGAGTTTTGTCGTGCCATT
ATGGCAAACCGATTCACCTAAAACGATTTTAGCCAAAGAGCAACATCGCTTGTATTTATTTCTACGACAAATTCAGGTTC
GTGCAGAAAATTCATCGGAAGTGTGGTTTTTACTTATCAATCGGAACCTTGCGACACAACAATGGTGCTTAACGGCACAA
GTAAAAAATAACCAAACTTGTGATTGTTTAAATCCGATAAATTGCCCGAAAGAGGTTTATGCGCATTTTTACTATCCTTA
TTTTCCTAATAAAACGATGATTCAAAGCCATCATATTTATCCAAAAGAAATCACGAGATTTGATGGCATTCGTAATACTA
TCGTTACTCGTTGCTTTATTTTGCAAGCAGAAAATGAACGTACGTTATTTTTATTTTTCAATGTTGGCAGTATTCGTTTA
AAAACCAATCAATTTGATAGTGCTTGTAATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q4QLX0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comN Haemophilus influenzae Rd KW20

95.882

100

0.959


Multiple sequence alignment