Detailed information    

in silico   Bioinformatically predicted

Overview


Name   comP   Type   Machinery gene
Locus tag   INP89_RS05340 Genome accession   NZ_CP063127
Coordinates   1084808..1085491 (-) Length   227 a.a.
NCBI ID   WP_012055312.1    Uniprot ID   -
Organism   Haemophilus influenzae strain M1C112_1     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake
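
The coordinates and the two lengths in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the coding sequence is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values are copied from the Overview fields above.

def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(1084808, 1085491)
print(cds_bp, protein_length(cds_bp))  # 684 227
```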

Genomic Context


Location: 1079808..1090491
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  INP89_RS05325 (INP89_05320) nrdR 1080598..1081047 (-) 450 WP_005648026.1 transcriptional regulator NrdR -
  INP89_RS05330 (INP89_05325) recC 1081102..1084482 (-) 3381 WP_012055314.1 exodeoxyribonuclease V subunit gamma -
  INP89_RS05335 (INP89_05330) comQ 1084528..1084815 (-) 288 WP_012055313.1 DUF5374 domain-containing protein Machinery gene
  INP89_RS05340 (INP89_05335) comP 1084808..1085491 (-) 684 WP_012055312.1 DUF2572 family protein Machinery gene
  INP89_RS05345 (INP89_05340) comO 1085506..1086222 (-) 717 WP_012055311.1 type II secretion system protein J Machinery gene
  INP89_RS05350 (INP89_05345) comN 1086222..1086734 (-) 513 WP_005651545.1 Tfp pilus assembly protein FimT/FimU Machinery gene
  INP89_RS05355 (INP89_05350) suhB 1086934..1087737 (+) 804 WP_012055310.1 inositol-1-monophosphatase -
  INP89_RS05360 (INP89_05355) nrfE 1087845..1089752 (+) 1908 WP_012055309.1 heme lyase NrfEFG subunit NrfE -
  INP89_RS05365 (INP89_05360) - 1089749..1090279 (+) 531 WP_041174822.1 DsbE family thiol:disulfide interchange protein -
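
The Location range above is consistent with the comP coordinates extended by 5000 bp on each side, and the table shows adjacent competence genes that touch or overlap. The sketch below reproduces both observations. The 5000 bp flank is inferred from the numbers in this record, not documented in it, and the helper names are hypothetical.

```python
FLANK = 5000  # assumption: inferred from the record, not stated in it


def context_window(start: int, end: int, flank: int = FLANK) -> tuple:
    """Gene range widened by `flank` bp on each side (clamped at position 1)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple, b: tuple) -> int:
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Coordinates copied from the Genomic Context table.
comQ = (1084528, 1084815)
comP = (1084808, 1085491)
comO = (1085506, 1086222)
comN = (1086222, 1086734)

print(context_window(*comP))   # (1079808, 1090491)
print(overlap_bp(comQ, comP))  # 8
print(overlap_bp(comP, comO))  # 0
print(overlap_bp(comO, comN))  # 1
```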

Sequence


Protein


Length: 227 a.a.        Molecular weight: 25480.37 Da        Isoelectric Point: 8.9057

>NTDB_id=493225 INP89_RS05340 WP_012055312.1 1084808..1085491(-) (comP) [Haemophilus influenzae strain M1C112_1]
MTIQKGIITLTILIFISGLLTVILLLDDSHLSFFRAQQNQRKHYVERTLQLQKMTEEKKQTACIDLPLNNNESVKQISIA
LEGSTDAIQYFLWCERMSLFKKSPKKGDNQGALKDFVSGEKLAYFRLHFSSPPKILNANKMPKLYWFSDSQAEVEINGTV
SAVLIAEGDLKLTGKGRISGAVITSGNLTLDGVTLAYGKKTVVALVQQYSQWQLAEKSWSDFNVQDE

Nucleotide


Length: 684 bp

>NTDB_id=493225 INP89_RS05340 WP_012055312.1 1084808..1085491(-) (comP) [Haemophilus influenzae strain M1C112_1]
ATGACAATACAAAAAGGCATTATCACGCTGACTATTCTGATTTTTATTTCAGGTTTATTAACCGTAATCTTATTGTTGGA
TGACAGCCATTTAAGTTTTTTTCGTGCGCAACAAAATCAACGAAAACACTATGTGGAAAGAACATTACAACTGCAAAAAA
TGACAGAGGAGAAAAAACAAACTGCCTGTATTGATTTACCCTTAAATAATAATGAAAGTGTGAAGCAAATCAGCATCGCC
CTTGAGGGTTCCACCGATGCAATTCAATATTTTCTTTGGTGTGAAAGAATGAGCCTATTTAAAAAATCGCCTAAAAAGGG
AGATAATCAAGGCGCATTGAAAGATTTTGTGAGTGGCGAAAAACTTGCCTATTTTCGACTGCACTTTTCTTCCCCGCCCA
AAATTTTAAACGCGAATAAAATGCCTAAACTGTATTGGTTTTCAGATTCACAAGCAGAGGTTGAAATTAATGGAACCGTA
TCTGCCGTATTAATTGCAGAGGGCGATTTAAAATTGACTGGCAAAGGGAGAATTAGTGGCGCAGTGATTACCAGTGGGAA
TTTAACTTTAGATGGCGTAACTTTAGCTTATGGGAAAAAGACGGTGGTTGCTTTAGTGCAACAATATAGTCAGTGGCAGT
TAGCAGAAAAAAGTTGGAGTGATTTTAATGTTCAGGATGAATAA
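
Although the gene lies on the minus strand, the nucleotide record above is given in coding orientation, so it translates directly to the protein record. As a minimal sketch, the code below translates the first 30 nt and the last 42 nt of the sequence with the standard genetic code (bacterial table 11 assigns the same amino acids to every codon). The helper names are illustrative, and only these two fragments are checked, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the nucleotide record above.
first_30 = "ATGACAATACAAAAAGGCATTATCACGCTG"
last_42 = "GCAGAAAAAAGTTGGAGTGATTTTAATGTTCAGGATGAATAA"

print(translate(first_30))  # MTIQKGIITL
print(translate(last_42))   # AEKSWSDFNVQDE*
```

Both results match the start and end of the protein record, with the trailing `*` corresponding to the TAA stop codon.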


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comP Haemophilus influenzae Rd KW20 96.035 100 0.96
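
This record does not define the Ha-value. The reported number is consistent with one common reading, identity multiplied by coverage, both taken as fractions. The sketch below works under that assumption only; the function name is hypothetical and the formula should be confirmed against the database documentation.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """ASSUMED definition: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values copied from the Similar proteins table.
print(round(ha_value(96.035, 100), 2))  # 0.96
```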