Detailed information    

In silico: bioinformatically predicted

Overview


Name   HI0659   Type   Machinery gene
Locus tag   INV104_RS02865   Genome accession   NC_017591
Coordinates   543433..543699 (+)   Length   88 a.a.
NCBI ID   WP_001818764.1   UniProt ID   -
Organism   Streptococcus pneumoniae INV104
Function   DNA uptake (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
IS/Tn   542861..543355   543433..543699   flank   78
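
The distance in the summary table can be reproduced from the two coordinate ranges. The sketch below assumes the distance is computed as the start of the downstream feature minus the end of the upstream feature, which matches the 78 bp listed. `flank_distance` is a hypothetical helper written for illustration and is not part of VRprofile2.

```python
def flank_distance(mge, gene):
    """Gap between two 1-based, inclusive intervals given as (start, end).

    Assumption: distance = start of the downstream feature minus the end of
    the upstream feature. Overlapping intervals return 0.
    """
    (m_start, m_end), (g_start, g_end) = mge, gene
    if g_start > m_end:      # gene lies downstream of the MGE
        return g_start - m_end
    if m_start > g_end:      # gene lies upstream of the MGE
        return m_start - g_end
    return 0                 # intervals overlap

# Coordinates taken from the gene-MGE association summary
mge = (542861, 543355)
gene = (543433, 543699)
distance = flank_distance(mge, gene)
print(distance)
```

With these coordinates the helper gives 543433 - 543355 = 78.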


Gene organization within MGE regions


Location: 542861..543699
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
INV104_RS02860 (INV104_04860)   -   542861..543349 (+)   489   Protein_555   transposase   -
INV104_RS02865 (INV104_04870)   HI0659   543433..543699 (+)   267   WP_001818764.1   helix-turn-helix domain-containing protein   Machinery gene

Sequence


Protein


Length: 88 a.a.   Molecular weight: 9627.08 Da   Isoelectric point: 4.2720

>NTDB_id=49151 INV104_RS02865 WP_001818764.1 543433..543699(+) (HI0659) [Streptococcus pneumoniae INV104]
MEGCPSELFSKEEILESDMRVAIMSELIEARYEQGISQKKLEEVSGVSQPVIARMETGESSLQLDTVLKVLTSLGKTLAV
VPLEQGKS

Nucleotide


Length: 267 bp

>NTDB_id=49151 INV104_RS02865 WP_001818764.1 543433..543699(+) (HI0659) [Streptococcus pneumoniae INV104]
TTGGAAGGATGTCCATCTGAGCTCTTTAGCAAGGAGGAAATCCTTGAAAGTGATATGCGAGTGGCTATCATGAGCGAGTT
GATTGAGGCTAGGTATGAACAAGGAATCAGTCAGAAAAAGCTGGAAGAAGTCAGTGGAGTGAGTCAACCTGTCATAGCTA
GGATGGAAACAGGAGAGAGCAGTCTTCAGTTGGATACGGTCTTAAAAGTTCTAACCAGTCTAGGAAAGACACTAGCAGTC
GTTCCACTTGAACAGGGGAAAAGTTGA
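
The nucleotide and protein records above can be cross-checked by translating the coding sequence. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline. It assumes the standard codon assignments (shared by the bacterial genetic code) and that an alternative start codon such as TTG is read as methionine at the first position. `translate_cds` and the constant names are hypothetical.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplified set of bacterial start codons (assumption: ATG, GTG, TTG only)
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_cds(nt):
    """Translate a complete forward-strand CDS, dropping the stop codon."""
    nt = "".join(nt.split()).upper()
    if len(nt) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [nt[i:i + 3] for i in range(0, len(nt), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons[0] in START_CODONS:
        residues[0] = "M"    # the initiator codon is read as Met
    if residues[-1] == "*":
        residues.pop()       # remove the terminal stop
    return "".join(residues)

NUCLEOTIDE = (
    "TTGGAAGGATGTCCATCTGAGCTCTTTAGCAAGGAGGAAATCCTTGAAAGTGATATGCGAGTGGCTATCATGAGCGAGTT"
    "GATTGAGGCTAGGTATGAACAAGGAATCAGTCAGAAAAAGCTGGAAGAAGTCAGTGGAGTGAGTCAACCTGTCATAGCTA"
    "GGATGGAAACAGGAGAGAGCAGTCTTCAGTTGGATACGGTCTTAAAAGTTCTAACCAGTCTAGGAAAGACACTAGCAGTC"
    "GTTCCACTTGAACAGGGGAAAAGTTGA"
)
PROTEIN = (
    "MEGCPSELFSKEEILESDMRVAIMSELIEARYEQGISQKKLEEVSGVSQPVIARMETGESSLQLDTVLKVLTSLGKTLAV"
    "VPLEQGKS"
)

protein = translate_cds(NUCLEOTIDE)
print(len(NUCLEOTIDE), len(protein), protein == PROTEIN)
```

The same check ties the lengths to the coordinates: 543699 - 543433 + 1 = 267 bp, which is 88 codons plus one stop codon.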

Domains


Predicted by InterProScan.

Predicted domain region: residues 29-79


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
HI0659   Haemophilus influenzae Rd KW20   57.143   87.5   0.5


Multiple sequence alignment