Detailed information    

insolico Bioinformatically predicted

Overview


Name   amiA3   Type   Regulator
Locus tag   R8507_RS01770 Genome accession   NZ_AP026925
Coordinates   326558..328540 (+) Length   660 a.a.
NCBI ID   WP_000842571.1    Uniprot ID   A0A4P8GBQ2
Organism   Streptococcus pneumoniae strain PZ900700097     
Function   binding to XIP (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 316914..326380 326558..328540 flank 178


Gene organization within MGE regions


Location: 316914..328540
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R8507_RS01720 (PC0094_03310) - 316914..317393 (+) 480 WP_044812589.1 glycosyltransferase -
  R8507_RS01725 (PC0094_03320) - 317635..318732 (+) 1098 WP_044812587.1 glycosyltransferase family 1 protein -
  R8507_RS01730 (PC0094_03330) - 318760..319428 (+) 669 WP_044812604.1 DUF1919 domain-containing protein -
  R8507_RS01735 (PC0094_03340) - 319431..320153 (+) 723 WP_044812586.1 glycosyltransferase family 32 protein -
  R8507_RS01740 (PC0094_03350) - 320150..321322 (+) 1173 WP_044812585.1 O-antigen ligase -
  R8507_RS01745 (PC0094_03360) - 321312..321812 (+) 501 WP_044812583.1 acyltransferase -
  R8507_RS01750 (PC0094_03370) - 321822..323009 (+) 1188 WP_044812582.1 CDP-glycerol glycerophosphotransferase family protein -
  R8507_RS01755 (PC0094_03380) - 323026..324429 (+) 1404 WP_044812602.1 lipopolysaccharide biosynthesis protein -
  R8507_RS01760 (PC0094_03390) tagD 324542..324934 (+) 393 WP_044812580.1 glycerol-3-phosphate cytidylyltransferase -
  R8507_RS01765 (PC0094_03400) - 325352..326380 (+) 1029 WP_044812578.1 acyltransferase family protein -
  R8507_RS01770 (PC0094_03410) amiA3 326558..328540 (+) 1983 WP_000842571.1 peptide ABC transporter substrate-binding protein Regulator

Sequence


Protein


Download         Length: 660 a.a.        Molecular weight: 73038.64 Da        Isoelectric Point: 4.7528

>NTDB_id=98326 R8507_RS01770 WP_000842571.1 326558..328540(+) (amiA3) [Streptococcus pneumoniae strain PZ900700097]
MKSSKLFALAGVTLLAATTLAACSGSGSSTKGEKTFSYIYETDPDNLNYLTTAKAATANITSNVVDGLLENDRYGNFVPS
MAEDWSVSKDGLTYTYTIRKDAKWYTSEGEEYAAVKAQDFVTGLKYAADKKSDALYLVQESIKGLDAYVKGEIKDFSQVG
IKALDDQTVQYTLNKPESFWNSKTTMGVLAPVNEEFLNSKGDDFAKATDPSSLLYNGPYLLKSIVTKSSVEFAKNPNYWD
KDNVHIDKVKLSFWDGQDTSKPAENFKDGSLTAARLYPTSASFAELEKSMKDNIVYTQQDSITYLVGTNIDRQSYKYTSK
TSDEQKASTKKALLNKDFRQAIAFGFDRTAYASQLNGQTGASKILRNIFVPPTFVQADGKNFGDMVKEKLVTYGDEWKDV
NLADSQDGLYNPEKAKAEFAKAKSALQAEGVTFPIHLDMPVDQTATTKVQRVQSMKQSLEATLGADNVVIDIQQLQKDEV
NNITYFAENAAGEDWDLSDNVGWGPDFADPSTYLDIIKPSVGESTKTYLGFDSGEDNVAAKKVGLYDYEKLVTEAGDETT
DVAKRYDKYAAAQAWLTDSALIIPTTSRTGRPILSKMVPFTIPFALSGNKGTSEPVLYKYLELQDKAVTVDEYQKAQEKW
MKEKEESNKKAQEDLAKHVK

Nucleotide


Download         Length: 1983 bp        

>NTDB_id=98326 R8507_RS01770 WP_000842571.1 326558..328540(+) (amiA3) [Streptococcus pneumoniae strain PZ900700097]
ATGAAAAGTTCAAAACTATTTGCCCTTGCGGGCGTGACATTATTGGCGGCGACTACTTTAGCTGCATGCTCTGGATCAGG
TTCAAGCACTAAAGGTGAGAAGACATTCTCATACATTTATGAGACAGACCCTGATAACCTCAACTATTTGACAACTGCTA
AGGCTGCGACAGCAAATATTACCAGTAACGTGGTTGATGGTTTGCTAGAAAATGATCGCTACGGGAACTTTGTGCCGTCT
ATGGCTGAGGATTGGTCTGTATCCAAGGATGGATTGACTTACACTTATACTATCCGTAAGGATGCAAAATGGTATACTTC
TGAAGGTGAAGAATACGCGGCAGTCAAAGCTCAAGACTTTGTAACAGGACTAAAATATGCTGCTGATAAAAAATCAGATG
CTCTTTACCTTGTTCAAGAATCAATCAAAGGGTTGGATGCCTATGTAAAAGGGGAAATCAAAGATTTCTCACAAGTAGGA
ATTAAGGCTCTGGATGATCAGACAGTTCAGTACACTTTGAACAAACCTGAAAGTTTTTGGAACTCAAAAACAACCATGGG
TGTGCTTGCGCCAGTTAATGAAGAGTTTTTGAACTCAAAAGGGGATGATTTTGCCAAAGCTACGGATCCAAGTAGTCTCT
TGTATAATGGACCTTATTTGTTGAAATCCATTGTGACCAAATCTTCTGTTGAATTTGCGAAAAATCCGAACTACTGGGAT
AAGGACAATGTGCATATTGACAAAGTTAAATTGTCATTCTGGGATGGTCAAGATACCAGCAAACCTGCAGAAAACTTTAA
AGATGGTAGCCTTACAGCAGCTCGTCTCTATCCAACAAGTGCAAGTTTCGCAGAGCTTGAGAAGAGTATGAAGGACAATA
TTGTCTATACTCAACAAGACTCTATTACGTATCTAGTTGGTACAAATATTGACCGTCAGTCCTATAAATACACATCTAAG
ACCAGCGACGAACAAAAGGCATCGACTAAAAAGGCTCTCTTAAACAAGGATTTCCGTCAGGCTATTGCCTTTGGTTTTGA
TCGTACAGCCTATGCCTCTCAGTTGAATGGACAAACTGGAGCAAGCAAAATCTTACGTAATATCTTTGTTCCACCAACAT
TTGTTCAAGCAGATGGTAAAAACTTTGGCGATATGGTCAAAGAGAAATTGGTTACTTATGGGGATGAATGGAAGGATGTT
AATCTTGCAGATTCTCAGGATGGTCTTTACAATCCAGAAAAAGCCAAGGCTGAATTTGCTAAAGCTAAATCAGCCTTACA
AGCAGAAGGTGTGACATTCCCAATTCATTTGGATATGCCAGTTGACCAAACAGCAACTACAAAAGTTCAGCGCGTCCAAT
CTATGAAACAATCCTTGGAAGCAACTTTAGGAGCGGATAATGTAGTCATTGATATTCAACAACTACAAAAAGACGAAGTA
AACAATATTACATATTTTGCTGAAAATGCTGCTGGCGAAGACTGGGATTTATCAGATAATGTCGGTTGGGGTCCAGACTT
TGCCGATCCATCAACCTACCTTGATATCATCAAACCATCTGTAGGAGAAAGTACTAAAACATATTTAGGGTTTGACTCAG
GGGAAGATAATGTAGCTGCTAAAAAAGTAGGTCTATATGACTACGAAAAATTGGTTACTGAGGCTGGTGATGAGACTACA
GATGTTGCTAAACGCTATGATAAATACGCTGCAGCCCAAGCTTGGTTGACAGATAGTGCTTTGATTATTCCAACTACATC
TCGTACAGGGCGCCCAATCTTGTCTAAGATGGTACCATTTACAATACCATTTGCATTGTCAGGAAATAAAGGTACAAGTG
AACCAGTCTTGTATAAATACTTGGAACTTCAAGACAAGGCAGTCACTGTAGATGAATACCAAAAAGCTCAGGAAAAATGG
ATGAAAGAAAAAGAAGAGTCTAATAAAAAGGCTCAAGAAGATCTCGCAAAACATGTGAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4P8GBQ2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  amiA3 Streptococcus thermophilus LMD-9

58.699

100

0.588

  amiA Streptococcus salivarius strain HSISS4

58.245

100

0.583

  amiA3 Streptococcus thermophilus LMG 18311

58.094

100

0.582


Multiple sequence alignment