Detailed information    

insolico Bioinformatically predicted

Overview


Name   comP   Type   Machinery gene
Locus tag   INP94_RS10270 Genome accession   NZ_CP063120
Coordinates   2077228..2077914 (-) Length   228 a.a.
NCBI ID   WP_197543566.1    Uniprot ID   A0A7M1NWA8
Organism   Haemophilus parainfluenzae strain M1C137_2     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2072228..2082914
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  INP94_RS10255 (INP94_10250) nrdR 2073024..2073473 (-) 450 WP_005695670.1 transcriptional regulator NrdR -
  INP94_RS10260 (INP94_10255) recC 2073551..2076910 (-) 3360 WP_197543564.1 exodeoxyribonuclease V subunit gamma -
  INP94_RS10265 (INP94_10260) comQ 2076948..2077235 (-) 288 WP_197543565.1 DUF5374 domain-containing protein Machinery gene
  INP94_RS10270 (INP94_10265) comP 2077228..2077914 (-) 687 WP_197543566.1 DUF2572 family protein Machinery gene
  INP94_RS10275 (INP94_10270) comO 2077911..2078627 (-) 717 WP_197544304.1 PulJ/GspJ family protein Machinery gene
  INP94_RS10280 (INP94_10275) comN 2078624..2079142 (-) 519 WP_197543567.1 prepilin-type N-terminal cleavage/methylation domain-containing protein Machinery gene
  INP94_RS10285 (INP94_10280) suhB 2079342..2080142 (+) 801 WP_197543568.1 inositol-1-monophosphatase -
  INP94_RS10290 (INP94_10285) bioB 2080187..2081203 (-) 1017 WP_197543569.1 biotin synthase BioB -
  INP94_RS10295 (INP94_10290) thiQ 2081277..2081906 (-) 630 WP_197543570.1 thiamine ABC transporter ATP-binding protein -

Sequence


Protein


Download         Length: 228 a.a.        Molecular weight: 25562.51 Da        Isoelectric Point: 8.9006

>NTDB_id=493087 INP94_RS10270 WP_197543566.1 2077228..2077914(-) (comP) [Haemophilus parainfluenzae strain M1C137_2]
MIMKKGIVTLTALILLSGLLALILLFDEQIFAFFRAQMSQRKYYVEQGLALQKISQQQQKHICQNLPLNGAEKVKQVFFE
SNGAEDKVAYSVWCKRAELFKKSPTKGINENMLQDFISSEKQADFQPHFVKVDTTLTAQKTLQVYWITQNQLEVKGNVSG
ILLAEGDLTLTGKGRISGAVITGGSLKLEEGVTVAYGKAVVTKLVQEYSQWRLVDKSWSDLSAQDQHE

Nucleotide


Download         Length: 687 bp        

>NTDB_id=493087 INP94_RS10270 WP_197543566.1 2077228..2077914(-) (comP) [Haemophilus parainfluenzae strain M1C137_2]
ATGATCATGAAAAAAGGGATTGTGACACTGACGGCACTGATTTTACTTTCAGGCTTATTGGCGCTGATCTTATTATTTGA
TGAACAGATCTTTGCATTTTTTCGAGCTCAAATGAGTCAGCGAAAATATTATGTAGAACAAGGCCTGGCGTTACAGAAAA
TCAGTCAGCAGCAACAAAAACATATTTGCCAAAACTTGCCTTTAAATGGTGCTGAAAAAGTGAAACAAGTCTTTTTCGAG
TCTAATGGGGCAGAGGATAAAGTAGCCTATTCTGTTTGGTGTAAACGAGCAGAGTTATTTAAGAAATCGCCCACAAAAGG
CATTAATGAAAATATGCTGCAAGATTTTATTTCCAGCGAAAAACAAGCTGATTTTCAACCGCACTTTGTGAAAGTAGATA
CTACTTTAACTGCTCAAAAAACACTACAAGTGTATTGGATAACGCAAAACCAATTAGAGGTTAAAGGGAATGTGAGTGGC
ATTTTGTTAGCAGAAGGGGATTTAACTTTAACCGGCAAAGGGCGAATAAGTGGTGCAGTGATTACAGGTGGTTCGCTTAA
GTTAGAGGAGGGCGTGACGGTGGCTTACGGTAAAGCCGTGGTAACAAAACTCGTACAAGAATATAGCCAATGGCGTTTGG
TGGATAAAAGTTGGAGTGACTTAAGTGCGCAAGATCAACATGAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7M1NWA8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comP Haemophilus influenzae Rd KW20

52.632

100

0.526