Detailed information    

insolico Bioinformatically predicted

Overview


Name   HI0659   Type   Machinery gene
Locus tag   SPP_RS02970 Genome accession   NC_012467
Coordinates   541978..542244 (+) Length   88 a.a.
NCBI ID   WP_001844589.1    Uniprot ID   -
Organism   Streptococcus pneumoniae P1031     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IS/Tn 541405..541707 541978..542244 flank 271


Gene organization within MGE regions


Location: 541405..542244
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SPP_RS02960 (SPP_0599) - 541405..541852 (+) 448 Protein_564 IS30 family transposase -
  SPP_RS02970 (SPP_0601) HI0659 541978..542244 (+) 267 WP_001844589.1 helix-turn-helix domain-containing protein Machinery gene

Sequence


Protein


Download         Length: 88 a.a.        Molecular weight: 9629.24 Da        Isoelectric Point: 5.1633

>NTDB_id=33219 SPP_RS02970 WP_001844589.1 541978..542244(+) (HI0659) [Streptococcus pneumoniae P1031]
MEGCPSELFSKEEILESDMRVAIMSELIEARNKQGISQKKLEELSGVSQPVIARMETGKTSPQLDTVLKVLASLGKILAV
VRLEQGKS

Nucleotide


Download         Length: 267 bp        

>NTDB_id=33219 SPP_RS02970 WP_001844589.1 541978..542244(+) (HI0659) [Streptococcus pneumoniae P1031]
TTGGAAGGATGTCCATCTGAGCTCTTTAGCAAGGAGGAAATCCTTGAAAGTGATATGCGAGTAGCTATCATGAGCGAGTT
GATTGAAGCCAGAAATAAGCAAGGAATCAGTCAGAAAAAGCTAGAGGAACTCAGTGGAGTGAGTCAGCCTGTTATAGCTA
GGATGGAGACAGGTAAGACTAGTCCACAGTTGGACACAGTCTTAAAAGTCCTAGCTAGTCTAGGAAAGATACTAGCAGTC
GTCCGACTTGAACAGGGGAAAAGTTGA

Domains


Predicted by InterproScan.

(28-75)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  HI0659 Haemophilus influenzae Rd KW20

59.74

87.5

0.523


Multiple sequence alignment