Detailed information    

insolico Bioinformatically predicted

Overview


Name   hexA   Type   Machinery gene
Locus tag   SIR_RS19005 Genome accession   NC_022246
Coordinates   1898097..1900655 (-) Length   852 a.a.
NCBI ID   WP_021003324.1    Uniprot ID   T1ZH40
Organism   Streptococcus intermedius B196     
Function   DNA mismatch repair (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1893097..1905655
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SIR_RS18980 (SIR_1809) - 1893728..1894234 (+) 507 WP_021003319.1 helix-turn-helix domain-containing protein -
  SIR_RS18985 (SIR_1810) - 1894218..1894727 (+) 510 WP_041787616.1 RDD family protein -
  SIR_RS18990 (SIR_1811) hexB 1894873..1896819 (-) 1947 WP_021003321.1 DNA mismatch repair endonuclease MutL Machinery gene
  SIR_RS18995 (SIR_1812) brsR 1897061..1897501 (+) 441 WP_041787618.1 LytTR family DNA-binding domain-containing protein Regulator
  SIR_RS19000 (SIR_1813) - 1897498..1897956 (+) 459 WP_021003323.1 DUF3021 domain-containing protein -
  SIR_RS19005 (SIR_1814) hexA 1898097..1900655 (-) 2559 WP_021003324.1 DNA mismatch repair protein MutS Machinery gene
  SIR_RS19010 (SIR_1815) argR 1900818..1901258 (-) 441 WP_003075336.1 arginine repressor -
  SIR_RS19015 (SIR_1816) argS 1901362..1903050 (+) 1689 WP_021003325.1 arginine--tRNA ligase -
  SIR_RS19020 (SIR_1817) nrdI 1903102..1903557 (-) 456 WP_009569662.1 class Ib ribonucleoside-diphosphate reductase assembly flavoprotein NrdI -
  SIR_RS19025 (SIR_1818) - 1903633..1904142 (-) 510 WP_021003326.1 hypothetical protein -

Sequence


Protein


Download         Length: 852 a.a.        Molecular weight: 96214.67 Da        Isoelectric Point: 4.9651

>NTDB_id=51168 SIR_RS19005 WP_021003324.1 1898097..1900655(-) (hexA) [Streptococcus intermedius B196]
MTKEKLSPGMQQYLDIKKDYPDAFLLFRMGDFYELFYEDAINAAQILEIALTSRNKNSEKPIPMAGVPYHSVQQYIDVLI
ESGYKVAIAEQVEDPKKAVGVVKREVVQVITPGTAVDSSKPDSQNNFLVALDKLEDFYGLAYMDVVTGEFQVTTLSDFNM
VCGEIRNLRAREVVLGYELPEAEHQVLANQMNLLLSQVETAFEDVQLLGDDLSRLEYQVAGKLLEYVHQTQLRELSHLKR
VHHYEIKDFLQMNYATMTSLDLTENARTGKKHGSLYWLMDETKTAMGMRLLRRWIQHPLLDKERILKRQDVVQVFLDHFF
ERSDLTDSLKGVYDIERLASRVSFGKTNPKDLLQLAATLSNVPQIKGILQGIDHPVLGQLIENLDDIPELANLIQSAISP
DAPNVITEGNIIQTGFDETLDKYRVVLRDGTSWIADIEAKERVASGINNLKIDYNKKDGYYFHVTNSQLEHVPSHFFRKA
TLKNSERFGTEELARIEGEMLEAREKSANLEYEIFIRIREEAGKYIKRLQSLAQTLATVDVLQSFAAVAEKQHFVRPEFI
ERPSIEIDKGRHAVVEKVMGAQTYIPNSISMDENVNLQLITGPNMSGKSTYMRQLAIIVIMAQMGSYVSAERAQLPIFDA
IFTRIGAADDLVSGQSTFMVEMMEANHAISQATEHSLILFDELGRGTATYDGMALAQAIIEYIHNRTGAKTLFATHYHEL
TDLSTSLTQLENVHVATLEKDGQVTFLHKIEAGPADKSYGIHVARIAGLPNDLLMRADQILARLEEQANEKPSLNPSNKG
ANDSKENQVSEQISLFTETTESPILDELRQLDIYNMTPMEVMLAIAEMKKHL

Nucleotide


Download         Length: 2559 bp        

>NTDB_id=51168 SIR_RS19005 WP_021003324.1 1898097..1900655(-) (hexA) [Streptococcus intermedius B196]
ATGACAAAAGAAAAACTATCTCCGGGGATGCAACAGTATTTAGATATAAAAAAAGATTATCCAGATGCTTTTTTGCTATT
TCGCATGGGAGATTTTTATGAATTATTCTATGAAGATGCGATCAATGCAGCACAGATTTTAGAAATTGCTCTGACTAGCC
GTAATAAAAATTCGGAAAAGCCAATCCCGATGGCAGGAGTTCCCTATCATTCGGTGCAACAATATATTGATGTTTTAATT
GAATCGGGCTATAAAGTAGCGATTGCAGAGCAGGTGGAAGATCCTAAAAAAGCAGTTGGTGTAGTCAAACGTGAGGTGGT
ACAAGTCATCACACCGGGGACAGCCGTTGATTCTTCAAAACCAGATAGTCAAAATAATTTCTTGGTAGCTTTAGATAAGC
TAGAAGATTTCTATGGGCTAGCTTATATGGATGTGGTAACTGGTGAATTTCAGGTGACAACTCTCAGTGACTTTAACATG
GTTTGTGGGGAAATTCGAAATCTACGAGCGCGTGAAGTGGTGTTGGGATATGAATTACCTGAAGCAGAGCACCAAGTGTT
GGCAAATCAAATGAATTTGTTATTATCACAGGTGGAAACAGCTTTTGAAGATGTTCAGCTATTAGGAGATGATTTGTCTC
GCCTAGAATATCAAGTAGCTGGGAAACTGTTAGAATATGTTCACCAGACGCAACTGCGTGAGCTTAGTCACTTAAAGCGA
GTTCATCATTATGAAATTAAAGATTTCTTACAGATGAACTATGCAACTATGACAAGTCTAGATTTAACGGAGAATGCACG
GACGGGGAAGAAGCATGGCAGTCTTTATTGGTTGATGGATGAGACGAAGACAGCTATGGGGATGCGACTTCTAAGAAGAT
GGATCCAGCATCCGTTGCTTGATAAGGAACGGATTCTTAAGCGACAGGATGTCGTACAAGTCTTTTTAGATCATTTTTTT
GAGCGTAGTGATTTGACAGATAGTCTCAAAGGGGTTTATGATATTGAGCGCTTGGCAAGCCGTGTTTCTTTTGGGAAGAC
AAACCCGAAGGATTTATTGCAGTTGGCGGCAACATTGAGCAATGTCCCTCAGATTAAGGGAATTTTACAAGGAATCGATC
ATCCTGTTTTGGGACAGTTGATTGAAAACTTGGATGATATTCCAGAATTGGCAAATTTGATTCAGTCGGCAATTTCTCCT
GATGCTCCAAATGTTATTACTGAAGGGAATATCATTCAAACTGGTTTTGATGAAACCTTAGATAAGTATCGAGTGGTTCT
GCGAGATGGTACAAGCTGGATTGCCGATATTGAAGCGAAAGAAAGGGTGGCTAGTGGGATTAATAATCTAAAAATTGATT
ACAATAAAAAAGACGGTTATTATTTCCATGTTACTAATTCACAGTTGGAGCACGTACCTAGCCACTTTTTCCGAAAAGCA
ACGTTGAAAAATTCAGAACGATTTGGTACAGAGGAATTGGCTCGTATTGAAGGCGAAATGTTGGAAGCGCGTGAAAAATC
GGCCAATTTAGAGTATGAGATTTTTATACGTATTCGGGAAGAAGCTGGCAAATATATCAAACGATTACAATCCTTGGCAC
AAACTCTAGCAACAGTGGATGTATTGCAGAGTTTTGCAGCAGTTGCTGAAAAGCAACACTTTGTACGCCCAGAATTTATT
GAACGTCCTTCCATTGAAATCGATAAAGGACGGCATGCAGTTGTAGAAAAGGTTATGGGCGCACAAACTTATATTCCGAA
TAGTATTTCAATGGACGAGAATGTCAATCTTCAGCTAATTACAGGTCCAAATATGAGTGGGAAGTCAACCTATATGCGTC
AATTAGCGATTATTGTTATCATGGCGCAAATGGGTTCCTATGTTTCAGCTGAACGTGCTCAATTACCAATTTTTGATGCC
ATCTTCACTCGAATTGGTGCAGCAGATGACTTGGTATCTGGGCAATCAACCTTTATGGTAGAAATGATGGAAGCAAATCA
TGCTATTTCTCAGGCGACTGAACACTCTCTTATTCTTTTTGATGAGTTGGGAAGAGGGACAGCAACATATGATGGGATGG
CTCTTGCTCAGGCGATTATTGAATATATCCACAATCGAACAGGAGCAAAGACCCTTTTTGCCACACACTATCATGAGTTG
ACAGACTTGTCAACTAGCTTGACACAGTTAGAAAATGTTCATGTAGCAACTTTAGAAAAAGATGGGCAAGTGACCTTCCT
TCATAAGATTGAAGCTGGTCCTGCAGATAAGTCTTATGGGATTCATGTCGCAAGAATTGCAGGTTTACCAAATGATTTGT
TGATGAGAGCCGACCAGATACTAGCAAGACTAGAAGAACAAGCAAATGAGAAACCATCTCTCAATCCTTCTAATAAAGGA
GCTAATGATAGCAAGGAAAATCAAGTATCTGAGCAGATATCTTTATTCACAGAAACAACTGAATCCCCTATTTTAGATGA
ACTACGTCAGTTGGATATTTATAATATGACCCCAATGGAAGTGATGCTAGCTATAGCTGAGATGAAAAAACATCTCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB T1ZH40

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  hexA Streptococcus pneumoniae R6

75.704

100

0.757

  mutS Pseudomonas stutzeri strain ATCC 17587

36.195

100

0.366


Multiple sequence alignment