Detailed information    

In silico: bioinformatically predicted

Overview


Name: comP
Type: Machinery gene
Locus tag: HICON_RS00935
Genome accession: NC_014922
Coordinates: 204014..204697 (-)
Length: 227 a.a.
NCBI ID: WP_013527275.1
UniProt ID: A0AAV2U0L5
Organism: Haemophilus influenzae F3047
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
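
The coordinates and the protein length above are related by simple arithmetic: an inclusive coordinate range gives the gene length in base pairs, and dividing by three and dropping the stop codon gives the number of residues. A minimal sketch of that check follows; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (length in bp, length in amino acids) for a coding sequence.

    Coordinates are 1-based and inclusive, as in the record above.
    The final codon is assumed to be a stop codon and is not counted
    as a residue.
    """
    length_bp = end - start + 1
    if length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    length_aa = length_bp // 3 - 1  # drop the stop codon
    return length_bp, length_aa


if __name__ == "__main__":
    # comP (HICON_RS00935): coordinates 204014..204697 on the minus strand
    print(cds_lengths(204014, 204697))
```

For comP this reproduces the 684 bp and 227 a.a. figures listed in this record.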

Genomic Context


Location: 199014..209697
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
HICON_RS00920 (HICON_04570) | nrdR | 199804..200253 (-) | 450 | WP_013527273.1 | transcriptional regulator NrdR | -
HICON_RS00925 (HICON_04580) | recC | 200308..203688 (-) | 3381 | WP_041175139.1 | exodeoxyribonuclease V subunit gamma | -
HICON_RS00930 (HICON_04590) | comQ | 203734..204021 (-) | 288 | WP_006995601.1 | DUF5374 domain-containing protein | Machinery gene
HICON_RS00935 (HICON_04600) | comP | 204014..204697 (-) | 684 | WP_013527275.1 | DUF2572 family protein | Machinery gene
HICON_RS00940 (HICON_04610) | comO | 204712..205428 (-) | 717 | WP_032826301.1 | type II secretion system protein J | Machinery gene
HICON_RS00945 (HICON_04620) | comN | 205422..205940 (-) | 519 | WP_013527276.1 | type II secretion system protein | Machinery gene
HICON_RS00950 (HICON_04630) | suhB | 206140..206943 (+) | 804 | WP_005664157.1 | inositol-1-monophosphatase | -
HICON_RS00955 (HICON_04640) | nrfE | 207052..208959 (+) | 1908 | WP_006995596.1 | heme lyase NrfEFG subunit NrfE | -
HICON_RS00960 (HICON_04650) | - | 208959..209489 (+) | 531 | WP_006995595.1 | DsbE family thiol:disulfide interchange protein | -
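
The displayed location 199014..209697 is consistent with the comP coordinates extended by 5000 bp on each side, and neighbouring genes in the table can share a few bases. The sketch below illustrates both calculations. The 5000 bp flank is an assumption inferred from the numbers in this record, not a documented setting, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from 204014 - 199014 and 209697 - 204697


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a 1-based inclusive gene range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based inclusive range."""
    return end - start + 1


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two 1-based inclusive ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


if __name__ == "__main__":
    com_q = (203734, 204021)
    com_p = (204014, 204697)
    com_o = (204712, 205428)
    print(context_window(*com_p))    # window around comP
    print(span_length(*com_q))       # size of comQ in bp
    print(overlap_bp(com_q, com_p))  # bases shared by comQ and comP
    print(overlap_bp(com_p, com_o))  # comP and comO do not overlap
```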

Sequence


Protein


Length: 227 a.a.
Molecular weight: 25567.41 Da
Isoelectric point: 8.8913

>NTDB_id=39439 HICON_RS00935 WP_013527275.1 204014..204697(-) (comP) [Haemophilus influenzae F3047]
MTIQKGIITLTILIFISGLLTVILLLDDSHLSFFRAQQNQRKHYVERTLQLQNMTEEKKQTACIDLPLNNNESVKQISIA
LEGAADAIQYFLWCERMSLFKKSPKKGDNQGALKDFVSGEKLAYFRPHFSSPRRILNANKMPKLYWFSDSQAEVEINGTV
YAVLIAEGDLKLTGKGRISGAVITSGNLTLDGVTLAYGKKTVVALVQQYSQWQLAEKSWSDFNVQDE

Nucleotide


Length: 684 bp

>NTDB_id=39439 HICON_RS00935 WP_013527275.1 204014..204697(-) (comP) [Haemophilus influenzae F3047]
ATGACAATACAAAAAGGCATTATCACGCTGACTATTCTGATTTTTATTTCAGGTTTATTAACCGTAATCTTATTGTTGGA
TGACAGCCATTTAAGTTTTTTTCGTGCGCAACAAAATCAACGAAAACACTATGTGGAAAGAACATTACAACTGCAAAACA
TGACAGAGGAGAAAAAACAAACTGCCTGTATTGATTTACCCTTAAATAATAATGAAAGTGTGAAGCAAATCAGCATCGCC
CTTGAGGGTGCCGCCGATGCAATTCAATATTTTCTTTGGTGTGAAAGAATGAGCCTATTTAAAAAATCGCCTAAAAAGGG
AGATAATCAAGGTGCATTGAAAGATTTTGTGAGTGGCGAAAAACTTGCCTATTTTCGACCGCACTTTTCTTCCCCGCGCA
GAATTTTAAACGCGAATAAAATGCCTAAACTTTATTGGTTTTCAGATTCACAAGCAGAGGTTGAAATTAATGGCACCGTA
TATGCCGTATTAATTGCAGAGGGCGATTTAAAATTGACTGGCAAAGGGAGGATTAGTGGTGCAGTGATTACCAGCGGGAA
TTTAACTTTAGATGGCGTAACTTTAGCTTATGGGAAAAAGACGGTGGTTGCTTTAGTGCAACAATATAGTCAGTGGCAGT
TAGCAGAAAAAAGTTGGAGTGATTTTAATGTTCAGGATGAATAA
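
The nucleotide record above is given in the coding orientation (it starts with ATG and ends with the stop codon TAA), so it can be translated directly to recover the protein sequence. The following is a minimal sketch using the standard genetic code; the function name `translate` is hypothetical, and only the first and last few codons of the record are used as examples.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# The bacterial code (NCBI translation table 11) uses the same codon
# assignments and differs only in the permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGACAATACAAAAAGGCATTATC"))  # first eight codons of comP
    print(translate("GTTCAGGATGAATAA"))           # last five codons, including TAA
```

Pasting the full 684 bp sequence into `translate` should yield the 227-residue protein listed under Protein.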


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comP | Haemophilus influenzae Rd KW20 | 94.714 | 100 | 0.947

Multiple sequence alignment