Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comO
Type: Machinery gene
Locus tag: ACHWYH_RS09115
Genome accession: NZ_CP172084
Coordinates: 1812149..1812865 (+)
Length: 238 a.a.
NCBI ID: WP_112103370.1
UniProt ID: -
Organism: Haemophilus influenzae strain GA81666
Function: type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 1807149..1817865
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| ACHWYH_RS09095 | - | 1808088..1808618 (-) | 531 | WP_005666027.1 | DsbE family thiol:disulfide interchange protein | - |
| ACHWYH_RS09100 | nrfE | 1808618..1810525 (-) | 1908 | WP_112103369.1 | heme lyase NrfEFG subunit NrfE | - |
| ACHWYH_RS09105 | suhB | 1810634..1811437 (-) | 804 | WP_105886188.1 | inositol-1-monophosphatase | - |
| ACHWYH_RS09110 | comN | 1811637..1812149 (+) | 513 | WP_050948683.1 | Tfp pilus assembly protein FimT/FimU | Machinery gene |
| ACHWYH_RS09115 | comO | 1812149..1812865 (+) | 717 | WP_112103370.1 | type II secretion system protein J | Machinery gene |
| ACHWYH_RS09120 | comP | 1812880..1813563 (+) | 684 | WP_112103371.1 | DUF2572 family protein | Machinery gene |
| ACHWYH_RS09125 | comQ | 1813556..1813843 (+) | 288 | WP_054249354.1 | DUF5374 domain-containing protein | Machinery gene |
| ACHWYH_RS09130 | recC | 1813889..1817269 (+) | 3381 | WP_112103372.1 | exodeoxyribonuclease V subunit gamma | - |
| ACHWYH_RS09135 | nrdR | 1817324..1817773 (+) | 450 | WP_005648026.1 | transcriptional regulator NrdR | - |
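The Size (bp) column follows from the coordinates if they are read as 1-based, inclusive ranges. The sketch below is only an illustration of that arithmetic; the helper name `span_length` is hypothetical and not part of the database. It also shows how the 238 a.a. protein length relates to the 717 bp gene, assuming the final codon is a stop codon.

```python
def span_length(coords: str) -> int:
    """Length in bp of an inclusive, 1-based range written as 'start..end'."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


# comO coordinates from the record above.
como_bp = span_length("1812149..1812865")
print(como_bp)  # 717

# Assumption: the last codon is a stop codon, so it is not counted as a residue.
como_aa = (como_bp - 3) // 3
print(como_aa)  # 238

# The displayed genomic context window.
print(span_length("1807149..1817865"))  # 10717
```

With these numbers, the context window starts 5000 bp before comO and ends 5000 bp after it.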

Sequence


Protein


Length: 238 a.a.; Molecular weight: 26680.05 Da; Isoelectric point: 9.5422

>NTDB_id=1065069 ACHWYH_RS09115 WP_112103370.1 1812149..1812865(+) (comO) [Haemophilus influenzae strain GA81666]
MMKTLLKGQTLLALMISLALSSLLLLSISHFYVQIQTQNQQMLLHLKLQAELQRILQLIGKDLRRLGFRALNAKLTESNL
SLFELDEQGTAIFISQEDNAPPNSCVLFFYDLNKNGCIGKGSPKTCMKKGKNTSKSSTEELFGYKVSNKMIKTKLTYQSV
IPTNCTAETCKRAFQQTACNAGGGWADLLDNNEYEITKLQFNWLIEGKGLEIKLKGNLKQASNISYETSIVVALWNQK

Nucleotide


Length: 717 bp

>NTDB_id=1065069 ACHWYH_RS09115 WP_112103370.1 1812149..1812865(+) (comO) [Haemophilus influenzae strain GA81666]
ATGATGAAAACATTATTAAAAGGGCAAACCCTATTGGCACTGATGATTTCACTGGCTTTGTCTTCTTTATTGCTGCTAAG
CATTTCGCATTTTTATGTGCAAATACAAACACAAAACCAACAAATGTTATTACATTTAAAATTACAAGCTGAATTACAAC
GAATATTACAGTTAATAGGCAAAGATCTTCGCCGATTAGGATTTCGAGCATTAAATGCAAAACTGACGGAAAGCAATTTA
TCTTTATTTGAATTAGATGAACAAGGCACAGCCATTTTTATTAGCCAAGAAGATAATGCGCCACCTAATAGCTGCGTGTT
GTTCTTTTATGATTTAAATAAAAATGGATGTATTGGTAAAGGTTCGCCGAAAACCTGTATGAAAAAAGGTAAAAATACAT
CTAAAAGTAGTACTGAAGAATTATTCGGCTATAAAGTGAGTAACAAAATGATAAAAACCAAACTGACTTATCAAAGTGTT
ATTCCTACTAATTGCACGGCAGAAACGTGTAAACGTGCTTTTCAACAAACCGCTTGCAACGCTGGGGGAGGTTGGGCAGA
TTTATTAGATAACAACGAATATGAAATCACTAAGCTACAATTTAATTGGTTAATCGAAGGAAAAGGATTAGAAATCAAGC
TAAAAGGAAATTTAAAACAAGCCTCAAACATCAGTTATGAAACCTCCATTGTTGTTGCGCTATGGAATCAAAAATAA
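As a quick consistency check between the two sequences, the nucleotide record can be translated and compared with the protein record. The following is a minimal sketch, assuming the standard codon assignments; the `translate` helper and the two short excerpts (the first 30 and last 21 bases of the nucleotide sequence above) are chosen here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGATGAAAACATTATTAAAAGGGCAAACC"  # start of the nucleotide record
last_21 = "GCGCTATGGAATCAAAAATAA"            # end of the nucleotide record

print(translate(first_30))  # MMKTLLKGQT
print(translate(last_21))   # ALWNQK*
```

The two outputs match the first ten and last six residues of the protein sequence (MMKTLLKGQT... and ...ALWNQK), followed by the stop codon TAA.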

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



[Figure: predicted transmembrane helix probability]


Similar proteins


Only experimentally validated proteins are listed.

| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comO | Haemophilus influenzae Rd KW20 | 96.218 | 100 | 0.962 |

