Detailed information    

In silico: bioinformatically predicted

Overview


Name: comP
Type: Machinery gene
Locus tag: ERO09_RS02990
Genome accession: NZ_CP035368
Coordinates: 617775..618461 (+)
Length: 228 a.a.
NCBI ID: WP_172622002.1
UniProt ID: -
Organism: Haemophilus parainfluenzae strain LC_1315_18
Function: type IV pilus biogenesis and function (predicted from homology)
DNA binding and uptake
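
The coordinates, nucleotide length and protein length given in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database itself: the helper name `feature_length` is hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_bp = feature_length(617775, 618461)  # comP coordinates from this record
codons = gene_bp // 3                     # codon count, including the stop codon
residues = codons - 1                     # assumes exactly one terminal stop codon

print(gene_bp, codons, residues)  # 687 229 228
```

Under these assumptions the result agrees with the 687 bp and 228 a.a. lengths listed in the record.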

Genomic Context


Location: 612775..623461
Locus tag (old locus tag) | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
ERO09_RS02965 (ERO09_02955) | thiQ | 613780..614409 (+) | 630 | WP_128787083.1 | thiamine ABC transporter ATP-binding protein | -
ERO09_RS02970 (ERO09_02960) | bioB | 614484..615500 (+) | 1017 | WP_049355387.1 | biotin synthase BioB | -
ERO09_RS02975 (ERO09_02965) | suhB | 615546..616346 (-) | 801 | WP_128787084.1 | inositol-1-monophosphatase | -
ERO09_RS02980 (ERO09_02970) | comN | 616547..617065 (+) | 519 | WP_128787085.1 | prepilin-type cleavage/methylation domain-containing protein | Machinery gene
ERO09_RS02985 (ERO09_02975) | comO | 617053..617778 (+) | 726 | WP_172622001.1 | type II secretion system protein J | Machinery gene
ERO09_RS02990 (ERO09_02980) | comP | 617775..618461 (+) | 687 | WP_172622002.1 | DUF2572 family protein | Machinery gene
ERO09_RS02995 (ERO09_02985) | comQ | 618475..618741 (+) | 267 | WP_049364346.1 | DUF5374 domain-containing protein | Machinery gene
ERO09_RS03000 (ERO09_02990) | recC | 618779..622138 (+) | 3360 | WP_128787087.1 | exodeoxyribonuclease V subunit gamma | -
ERO09_RS03005 (ERO09_02995) | nrdR | 622216..622665 (+) | 450 | WP_005695670.1 | transcriptional regulator NrdR | -
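
The spacing between the four consecutive plus-strand machinery genes (comN, comO, comP, comQ) can be read off the coordinates listed above. The sketch below is illustrative only: the `spacing` helper is a hypothetical name, and it assumes 1-based inclusive coordinates, so a negative value means the two genes overlap by that many base pairs.

```python
# (gene, start, end) copied from the genomic context listing above
genes = [
    ("comN", 616547, 617065),
    ("comO", 617053, 617778),
    ("comP", 617775, 618461),
    ("comQ", 618475, 618741),
]


def spacing(prev_end: int, next_start: int) -> int:
    """Intergenic gap in bp; a negative value is an overlap of that many bp."""
    return next_start - prev_end - 1


# Compare each gene with its downstream neighbour
spacings = {
    f"{a[0]}-{b[0]}": spacing(a[2], b[1])
    for a, b in zip(genes, genes[1:])
}
print(spacings)  # {'comN-comO': -13, 'comO-comP': -4, 'comP-comQ': 13}
```

By this calculation comN/comO overlap by 13 bp, comO/comP overlap by 4 bp, and comP and comQ are separated by 13 bp.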

Sequence


Protein


Length: 228 a.a.        Molecular weight: 25588.61 Da        Isoelectric point: 9.2585

>NTDB_id=339350 ERO09_RS02990 WP_172622002.1 617775..618461(+) (comP) [Haemophilus parainfluenzae strain LC_1315_18]
MIMKKGMVTLTALILLSGLLALILLFDEQIFAFFRAQMSQRKYYVEQSLALQKISQQQQTHICQNLPLNGTEKVKQVFFE
SSGVEDKVAYSVWCKRAELFKKSPTKGINENMLRDFISSEKQADFQPHFVKVDTTLTAQKTPQVYWITQSQLEIKGNVSG
ILLAEGNLTLTGKGRISGAVIIGGSLKLEEGVTVAYGKAVVTKLVQEYSQWRLLDKSWSDLSAQDQSE

Nucleotide


Length: 687 bp

>NTDB_id=339350 ERO09_RS02990 WP_172622002.1 617775..618461(+) (comP) [Haemophilus parainfluenzae strain LC_1315_18]
ATGATCATGAAAAAAGGGATGGTGACACTGACGGCTCTGATTTTACTTTCGGGATTATTGGCATTGATCTTATTATTTGA
TGAACAGATCTTTGCGTTTTTTCGAGCTCAAATGAGTCAGCGAAAATATTATGTAGAACAAAGCCTGGCGTTACAGAAAA
TCAGTCAGCAACAACAAACGCACATTTGCCAAAACTTGCCTTTAAATGGTACTGAAAAAGTGAAACAAGTCTTTTTCGAG
TCTTCAGGGGTAGAGGATAAAGTCGCCTATTCTGTTTGGTGTAAACGTGCAGAGTTATTTAAGAAATCGCCCACAAAAGG
CATTAATGAAAATATGCTGAGAGATTTTATTTCCAGCGAAAAACAAGCTGATTTTCAACCGCACTTTGTGAAAGTAGATA
CCACTTTAACTGCTCAAAAAACACCACAAGTGTATTGGATAACGCAAAGCCAATTAGAAATTAAAGGGAATGTGAGTGGT
ATTCTGCTAGCAGAAGGAAATCTAACCTTAACCGGCAAAGGACGAATAAGCGGTGCAGTAATTATAGGTGGTTCGCTTAA
GTTAGAGGAGGGCGTGACGGTGGCTTACGGTAAAGCCGTGGTGACAAAACTCGTACAAGAATATAGCCAATGGCGTTTGT
TGGATAAAAGTTGGAGTGATTTAAGTGCGCAAGATCAAAGTGAATAA
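
The relationship between the nucleotide and protein records can be spot-checked by translating codons directly. The sketch below is a minimal, dependency-free illustration rather than the database's own pipeline: `translate` is a hypothetical helper, and the codon table assumes the standard amino acid assignments (which the bacterial table 11 shares; the two differ only in permitted start codons). Only the first ten codons and the last three codons of the nucleotide sequence above are used.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order ('*' marks a stop codon)
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon; trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGATCATGAAAAAAGGGATGGTGACACTG"  # first 30 nt of the comP sequence
last_codons = "AGTGAATAA"                        # last 9 nt of the comP sequence

print(translate(first_codons))  # MIMKKGMVTL
print(translate(last_codons))   # SE*
```

The translated fragments match the start (MIMKKGMVTL) and the end (SE, followed by the stop codon) of the protein sequence listed above.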


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comP | Haemophilus influenzae Rd KW20 | 52.632 | 100 | 0.526


Multiple sequence alignment