Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comP
Type: Machinery gene
Locus tag: Hhaem_RS09165
Genome accession: NZ_AP024093
Coordinates: 1866427..1867110 (+)
Length: 227 a.a.
NCBI ID: WP_223893568.1
UniProt ID: -
Organism: Haemophilus haemolyticus strain 2019-19
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake

Genomic Context


Location: 1861427..1872110
Locus tag (old locus tag)   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
Hhaem_RS09150 (Hhaem_17400)   suhB   1864151..1864954 (-)   804   WP_005633301.1   inositol-1-monophosphatase   -
Hhaem_RS09155 (Hhaem_17410)   comN   1865154..1865672 (+)   519   WP_215770196.1   type II secretion system protein   Machinery gene
Hhaem_RS09160 (Hhaem_17420)   comO   1865666..1866382 (+)   717   WP_223894344.1   type II secretion system protein J   Machinery gene
Hhaem_RS09165 (Hhaem_17430)   comP   1866427..1867110 (+)   684   WP_223893568.1   DUF2572 family protein   Machinery gene
Hhaem_RS09170 (Hhaem_17440)   comQ   1867103..1867390 (+)   288   WP_223894345.1   DUF5374 domain-containing protein   Machinery gene
Hhaem_RS09175 (Hhaem_17450)   recC   1867439..1870798 (+)   3360   WP_223893570.1   exodeoxyribonuclease V subunit gamma   -
Hhaem_RS09180 (Hhaem_17460)   nrdR   1870861..1871310 (+)   450   WP_223893572.1   transcriptional regulator NrdR   -
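
The coordinates in the Genomic Context table can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the database record: the function names are illustrative, and the 5,000 bp flank on each side of comP is inferred from the Location line rather than stated by the source. It recomputes each gene size from its coordinates and reports the distance between neighbouring genes, where a negative value means the two genes overlap.

```python
# Rows copied from the Genomic Context table:
# (gene, start, end, strand, listed size in bp)
GENES = [
    ("suhB", 1864151, 1864954, "-", 804),
    ("comN", 1865154, 1865672, "+", 519),
    ("comO", 1865666, 1866382, "+", 717),
    ("comP", 1866427, 1867110, "+", 684),
    ("comQ", 1867103, 1867390, "+", 288),
    ("recC", 1867439, 1870798, "+", 3360),
    ("nrdR", 1870861, 1871310, "+", 450),
]


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def neighbour_gaps(genes):
    """Return (left gene, right gene, gap in bp); a negative gap is an overlap."""
    gaps = []
    for left, right in zip(genes, genes[1:]):
        gaps.append((left[0], right[0], right[1] - left[2] - 1))
    return gaps


# Every listed size should equal end - start + 1.
sizes_ok = all(span_length(s, e) == size for _, s, e, _, size in GENES)

# Assumption: the displayed window is comP plus 5,000 bp on each side.
FLANK = 5000
window = (1866427 - FLANK, 1867110 + FLANK)

print(sizes_ok, window)
for left, right, gap in neighbour_gaps(GENES):
    print(f"{left} -> {right}: {gap} bp")
```

Under these assumptions the computed window matches the Location line, and comN/comO and comP/comQ come out as short overlaps, which is common for genes transcribed together.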

Sequence


Protein


Length: 227 a.a.   Molecular weight: 25495.24 Da   Isoelectric Point: 8.4559

>NTDB_id=83444 Hhaem_RS09165 WP_223893568.1 1866427..1867110(+) (comP) [Haemophilus haemolyticus strain 2019-19]
MTIQKGIITLTILIFISGLLTVILLLDDSHLSFFRAQQNQRKHYVERTLQLQKMTGEKRQTACLDLPLNNDESVKQISIT
LDGATDAIQYFLWCERMSLFKKSPTRGDNQGALKDFIHTEKLTDFRPRFSSPPKILNANKTPKLYWFSDSQAEVQINGTV
SAVLIAEGDLKLTGKGRISGAVITNGNLTLDGVTLAYGKPVVTTLVQQYSQWQLAEKSWSDFNVPDE

Nucleotide


Length: 684 bp

>NTDB_id=83444 Hhaem_RS09165 WP_223893568.1 1866427..1867110(+) (comP) [Haemophilus haemolyticus strain 2019-19]
ATGACAATACAAAAAGGCATTATTACGCTGACTATTCTGATTTTTATTTCAGGTTTATTAACCGTAATCTTATTGTTGGA
TGACAGTCATTTAAGTTTTTTTCGTGCGCAACAAAATCAACGAAAACACTATGTAGAAAGAACATTACAACTACAAAAAA
TGACAGGGGAGAAAAGACAAACCGCCTGCCTTGATTTACCGTTAAATAATGATGAAAGTGTAAAGCAAATCAGCATCACG
CTTGATGGTGCCACCGATGCAATTCAATATTTTCTTTGGTGTGAAAGAATGAGCCTATTTAAAAAATCGCCCACAAGAGG
CGATAATCAAGGTGCATTGAAAGATTTTATTCACACAGAAAAACTAACGGATTTTCGACCGCGCTTTTCTTCCCCGCCCA
AGATTTTAAATGCGAATAAAACACCTAAACTATATTGGTTTTCAGATTCACAAGCGGAGGTTCAAATTAATGGCACCGTG
TCTGCCGTATTAATTGCGGAGGGAGATTTAAAACTAACGGGCAAAGGAAGGATTAGTGGCGCGGTGATCACCAACGGGAA
TTTAACTTTAGATGGGGTAACTTTAGCCTATGGAAAACCTGTCGTAACAACCTTAGTGCAACAATATAGCCAGTGGCAGT
TGGCAGAAAAAAGTTGGAGTGATTTTAATGTTCCAGATGAATAA
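
As a consistency check on the two sequence records above, the following Python sketch confirms that the coordinate span, the nucleotide length, and the protein length agree (684 bp is 227 codons plus one stop codon) and translates the first and last codons of the gene. The function names are illustrative. Use of the standard genetic code is an assumption, since the record does not name a translation table; the bacterial table 11 assigns the same amino acids.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: the record does not
# name a translation table; NCBI tables 1 and 11 share these assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Values taken from the record.
START, END = 1866427, 1867110
NT_LENGTH = 684
AA_LENGTH = 227

gene_length = span_length(START, END)
codons = gene_length // 3  # includes the terminal stop codon

first_codons = "ATGACAATACAAAAAGGC"  # first 18 nt of the nucleotide record
last_codons = "CCAGATGAATAA"         # last 12 nt of the nucleotide record

print(gene_length, codons)
print(translate(first_codons), translate(last_codons))
```

The translated ends can be compared directly with the start (MTIQKG) and end (PDE) of the protein record; the full sequence can be passed to `translate` in the same way.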


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comP   Haemophilus influenzae Rd KW20   88.546   100   0.885
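
The record does not define the Ha-value. The figures listed for the Haemophilus influenzae Rd KW20 comP match are consistent with the product of fractional identity and fractional coverage, so the sketch below assumes that definition; both the formula and the function name `ha_value` are assumptions made for illustration, not the database's documented method.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: fractional identity times fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the similar-proteins entry.
identity_pct = 88.546
coverage_pct = 100.0

score = round(ha_value(identity_pct, coverage_pct), 3)
print(score)
```

With the listed identity and coverage, the rounded result agrees with the tabulated 0.885.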

