Detailed information    

In silico: bioinformatically predicted

Overview


Name: comP
Type: Machinery gene
Locus tag: AT683_RS00270
Genome accession: NZ_LN831035
Coordinates: 56000..56683 (-)
Length: 227 a.a.
NCBI ID: WP_011272301.1
Uniprot ID: Q4QLW8
Organism: Haemophilus influenzae strain NCTC8143
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake
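The coordinates and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


# Values taken from the Overview fields of this record.
bp = feature_length(56000, 56683)
aa = protein_length(bp)
print(bp, aa)
```

Both results agree with the lengths reported in the Sequence section (684 bp and 227 a.a.).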

Genomic Context


Location: 51000..61683
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
AT683_RS00255 (ERS450003_00049) | nrdR | 51805..52254 (-) | 450 | WP_005648026.1 | transcriptional regulator NrdR | -
AT683_RS00260 (ERS450003_00050) | recC | 52309..55674 (-) | 3366 | WP_050845847.1 | exodeoxyribonuclease V subunit gamma | -
AT683_RS00265 (ERS450003_00051) | comQ | 55720..56007 (-) | 288 | WP_021034778.1 | DUF5374 domain-containing protein | Machinery gene
AT683_RS00270 (ERS450003_00052) | comP | 56000..56683 (-) | 684 | WP_011272301.1 | DUF2572 family protein | Machinery gene
AT683_RS00275 (ERS450003_00053) | comO | 56680..57399 (-) | 720 | WP_050845846.1 | type II secretion system protein J | Machinery gene
AT683_RS00280 (ERS450003_00054) | comN | 57399..57911 (-) | 513 | WP_005651545.1 | type II secretion system protein | Machinery gene
AT683_RS00285 (ERS450003_00055) | suhB | 58111..58914 (+) | 804 | WP_005664157.1 | inositol-1-monophosphatase | -
AT683_RS00290 (ERS450003_00056) | nrfE | 59022..60929 (+) | 1908 | WP_050845845.1 | heme lyase NrfEFG subunit NrfE | -
AT683_RS00295 (ERS450003_00057) | - | 60929..61459 (+) | 531 | WP_050845844.1 | DsbE family thiol:disulfide interchange protein | -
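Several neighbouring genes in this table share coordinates, for example comQ ends at 56007 while comP starts at 56000. The following sketch lists such overlaps from the coordinates above, assuming 1-based inclusive positions and genes sorted by start; the `overlaps` helper is hypothetical, and the unnamed gene is labelled by its locus tag.

```python
# (gene name, start, end, strand), copied from the Genomic Context table.
genes = [
    ("nrdR", 51805, 52254, "-"),
    ("recC", 52309, 55674, "-"),
    ("comQ", 55720, 56007, "-"),
    ("comP", 56000, 56683, "-"),
    ("comO", 56680, 57399, "-"),
    ("comN", 57399, 57911, "-"),
    ("suhB", 58111, 58914, "+"),
    ("nrfE", 59022, 60929, "+"),
    ("AT683_RS00295", 60929, 61459, "+"),
]


def overlaps(sorted_genes):
    """Return (left gene, right gene, shared bp) for adjacent overlapping genes."""
    found = []
    for (name_a, _, end_a, _), (name_b, start_b, _, _) in zip(sorted_genes, sorted_genes[1:]):
        shared = end_a - start_b + 1  # positive only if the two ranges overlap
        if shared > 0:
            found.append((name_a, name_b, shared))
    return found


for left, right, shared in overlaps(genes):
    print(f"{left} / {right}: {shared} bp")
```

On these coordinates the comQ, comP, comO, and comN genes each overlap their neighbour on the minus strand, which is what the listed positions imply for this cluster.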

Sequence


Protein


Length: 227 a.a.        Molecular weight: 25371.21 Da        Isoelectric Point: 9.5157

>NTDB_id=1114261 AT683_RS00270 WP_011272301.1 56000..56683(-) (comP) [Haemophilus influenzae strain NCTC8143]
MTIQKGIITLTILIFISGLLSVILLLDDSNLSFFRAQQNQRKLYVERTLQLQRITALKKQTACLDLSLNNDESVKQISIT
LDGATDSIQYFLWCERMSLFKKSPKKGDNQGALKDFVSGEKLAYFRPHFSSPPRILNANKTPKLYWFSDSQAEVEINGTV
SAVLIAEGDLKLTGKGRISGAVITNGNLTLDGVTLAYGKKTVVALVQQYSQWKLAEKSWSDFNVQDE

Nucleotide


Length: 684 bp

>NTDB_id=1114261 AT683_RS00270 WP_011272301.1 56000..56683(-) (comP) [Haemophilus influenzae strain NCTC8143]
ATGACAATACAAAAAGGTATTATTACGCTGACTATTCTGATTTTTATTTCAGGCTTATTAAGCGTAATCTTATTGTTAGA
TGATAGCAATTTAAGTTTTTTTCGGGCGCAACAAAATCAACGAAAACTGTATGTGGAAAGAACATTACAATTACAAAGAA
TAACAGCCTTGAAAAAACAAACTGCCTGCCTTGATTTATCATTAAATAATGATGAAAGTGTAAAGCAAATCAGCATTACG
CTTGATGGTGCCACAGATTCAATTCAATATTTTCTTTGGTGTGAAAGAATGAGCCTATTTAAAAAATCGCCTAAAAAGGG
AGATAATCAAGGCGCATTGAAAGATTTTGTGAGTGGCGAAAAACTTGCCTATTTTCGACCGCACTTTTCTTCTCCGCCCA
GAATTTTAAACGCGAATAAAACGCCTAAACTTTATTGGTTTTCAGATTCACAAGCAGAGGTTGAAATTAATGGCACGGTG
TCTGCCGTATTAATTGCAGAGGGCGATTTAAAATTGACTGGCAAAGGGAGGATTAGTGGCGCAGTGATTACCAACGGGAA
TTTAACTTTAGATGGCGTAACTTTAGCTTATGGGAAAAAGACGGTGGTCGCTTTAGTGCAACAATATAGTCAGTGGAAGT
TAGCAGAAAAAAGTTGGAGTGATTTTAATGTTCAGGATGAATAA
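The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly and compared with the protein record. The sketch below builds the standard codon table with the Python standard library and translates the first 30 and last 18 bases of the sequence above; the `translate` helper is illustrative and is not the tool used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


head = "ATGACAATACAAAAAGGTATTATTACGCTG"  # first 30 nt of the record
tail = "AATGTTCAGGATGAATAA"              # last 18 nt, in frame (684 - 18 = 666)

print(translate(head))
print(translate(tail))
```

The two translations correspond to the start (MTIQKGIITL) and the end (NVQDE) of the protein sequence listed in the Protein section.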


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: Q4QLW8

Transmembrane helices


Transmembrane helices were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comP | Haemophilus influenzae Rd KW20 | 91.63 | 100 | 0.916
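This page does not define the Ha-value. The numbers in the row above are consistent with it being the product of the fractional identity and the fractional coverage, and the sketch below checks that reading. The formula and the `ha_value` name are assumptions made for illustration, not a documented definition.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# Identity and coverage for comP from Haemophilus influenzae Rd KW20, as listed above.
score = ha_value(91.63, 100)
print(score)
```

Under this assumption the computed score matches the listed value of 0.916.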


Multiple sequence alignment