Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   STMM20_RS02590 Genome accession   NZ_CP065485
Coordinates   493132..493827 (+) Length   231 a.a.
NCBI ID   WP_014621862.1    Uniprot ID   -
Organism   Streptococcus thermophilus strain MM20     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 488132..498827
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  STMM20_RS02565 (STMM20_02565) - 488260..489153 (-) 894 WP_023909869.1 ABC transporter ATP-binding protein -
  STMM20_RS02570 (STMM20_02570) - 489146..489409 (-) 264 WP_014621867.1 hypothetical protein -
  STMM20_RS02575 (STMM20_02575) - 489396..490481 (-) 1086 WP_023909868.1 radical SAM/SPASM domain-containing protein -
  STMM20_RS02580 (STMM20_02580) - 490471..491604 (-) 1134 WP_231838380.1 radical SAM protein -
  STMM20_RS02585 (STMM20_02585) - 491890..492762 (-) 873 WP_014621863.1 Rgg/GadR/MutR family transcriptional regulator -
  STMM20_RS02590 (STMM20_02590) comEA 493132..493827 (+) 696 WP_014621862.1 helix-hairpin-helix domain-containing protein Machinery gene
  STMM20_RS02595 (STMM20_02595) - 493817..496057 (+) 2241 WP_347103612.1 DNA internalization-related competence protein ComEC/Rec2 -
  STMM20_RS02600 (STMM20_02600) - 496143..497405 (+) 1263 WP_347103613.1 UDP-N-acetylglucosamine 1-carboxyvinyltransferase -
  STMM20_RS02605 (STMM20_02605) - 497471..498016 (+) 546 WP_084829230.1 GNAT family N-acetyltransferase -

Sequence


Protein


Download         Length: 231 a.a.        Molecular weight: 24487.43 Da        Isoelectric Point: 4.6300

>NTDB_id=510472 STMM20_RS02590 WP_014621862.1 493132..493827(+) (comEA) [Streptococcus thermophilus strain MM20]
MKEKILAYVKDNRLFVSVIAVLMVIFCFFLWMTCGAGSSMEAETSYTDVTALSTSSSKQSSQSLSEASSQSKTEGSEKVK
SKVTVDVKGAVVNPGVYTLKAGARVTDVIQEAGGMTEDADAKSVNLAASLSDEEVIYVANKDENVSVLDQTGTGQVSDKG
GQAVSKDGKINLNTATSEQLQTISGIGAKRTEDIIAYRESHGGFQSVDDLKNVSGIGDKTLDKIRESLYVA

Nucleotide


Download         Length: 696 bp        

>NTDB_id=510472 STMM20_RS02590 WP_014621862.1 493132..493827(+) (comEA) [Streptococcus thermophilus strain MM20]
GTGAAGGAAAAGATTCTAGCCTATGTCAAAGATAATCGTCTGTTTGTGAGTGTCATTGCTGTACTGATGGTGATTTTTTG
CTTCTTTTTATGGATGACTTGTGGTGCCGGCAGCAGCATGGAGGCGGAGACGTCTTATACAGATGTGACAGCTTTGTCAA
CGTCTTCATCCAAACAAAGTTCACAGTCTCTTTCTGAGGCGTCTTCCCAGTCAAAGACCGAAGGAAGTGAAAAAGTTAAG
TCAAAAGTAACGGTAGATGTTAAGGGGGCTGTGGTCAATCCAGGTGTCTATACGCTAAAGGCAGGCGCTAGGGTGACAGA
TGTCATTCAAGAAGCTGGAGGAATGACAGAAGATGCAGACGCTAAGAGTGTTAACTTAGCCGCAAGCTTGTCAGATGAAG
AGGTTATTTATGTAGCCAATAAAGATGAAAATGTTTCTGTCCTTGATCAAACAGGTACTGGTCAGGTCTCTGATAAAGGA
GGGCAGGCTGTATCTAAGGATGGCAAAATTAACTTAAATACAGCAACCTCAGAGCAGTTGCAAACCATTTCTGGAATTGG
AGCTAAGCGGACAGAGGATATCATTGCCTATCGTGAAAGTCATGGCGGCTTTCAGTCCGTAGATGACTTGAAAAATGTCT
CAGGAATTGGTGATAAAACATTAGATAAAATCAGAGAGTCCCTCTATGTGGCTTAA

Domains


Predicted by InterproScan.

(85-138)

(167-228)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Streptococcus thermophilus LMD-9

98.701

100

0.987

  comEA/celA/cilE Streptococcus mitis SK321

39.912

98.701

0.394

  comEA/celA/cilE Streptococcus pneumoniae D39

39.035

98.701

0.385

  comEA/celA/cilE Streptococcus pneumoniae R6

39.035

98.701

0.385

  comEA/celA/cilE Streptococcus pneumoniae Rx1

39.035

98.701

0.385

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

39.035

98.701

0.385

  comEA/celA/cilE Streptococcus mitis NCTC 12261

37.719

98.701

0.372