Detailed information    

In silico: bioinformatically predicted

Overview


Name: sepM
Type: Regulator
Locus tag: SCI_RS02345
Genome accession: NC_022238
Coordinates: 456370..457425 (+)
Length: 351 a.a.
NCBI ID: WP_020997642.1
UniProt ID: U2ZNX3
Organism: Streptococcus constellatus subsp. pharyngis C1050
Function: processing of CSP (predicted from homology); competence regulation
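
The coordinates and the protein length in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a CDS.

    Assumes 1-based inclusive coordinates and that the last codon
    is a stop codon, which is excluded from the protein length.
    """
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    aa_length = nt_length // 3 - 1  # drop the stop codon
    return nt_length, aa_length


# Coordinates taken from this record: 456370..457425 (+)
print(cds_lengths(456370, 457425))
```

Under these assumptions the result agrees with the 1056 bp and 351 a.a. lengths reported for sepM.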

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) predicted in the genome by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
ICE   434024..467336   456370..457425   within   0
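
The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The sketch below is one plausible way to derive them, not the method used by the database: the function name `relate_gene_to_mge` and the labels other than "within" are assumptions introduced for illustration.

```python
def relate_gene_to_mge(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene interval relative to an MGE interval.

    Both intervals are (start, end) with 1-based inclusive coordinates.
    Only the label "within" appears in the record; the other labels
    are hypothetical placeholders.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0  # partial overlap with an MGE boundary


# Values from the summary row: ICE 434024..467336, gene 456370..457425
print(relate_gene_to_mge((456370, 457425), (434024, 467336)))
```

For the sepM row, the gene range lies entirely inside the ICE range, which corresponds to the reported "within" and a distance of 0.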


Gene organization within MGE regions


Location: 434024..467336
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
SCI_RS02235 (SCI_0444)   -   434462..435238 (-)   777   WP_006267458.1   aminoglycoside 3'-phosphotransferase   -
SCI_RS02240 (SCI_0445)   rlmD   435295..436659 (+)   1365   WP_006267121.1   23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD   -
SCI_RS02245 (SCI_0446)   -   437517..438449 (-)   933   WP_006267664.1   nitronate monooxygenase   -
SCI_RS02250 (SCI_0447)   -   438784..439017 (+)   234   WP_003070208.1   DUF1858 domain-containing protein   -
SCI_RS02255 (SCI_0448)   -   439017..440378 (+)   1362   WP_006267202.1   DUF438 domain-containing protein   -
SCI_RS02260 (SCI_0449)   -   440391..440645 (+)   255   WP_006267273.1   DUF1912 family protein   -
SCI_RS02265 (SCI_0451)   -   441032..441610 (+)   579   WP_003070202.1   GNAT family N-acetyltransferase   -
SCI_RS02270 (SCI_0452)   -   441607..442413 (+)   807   WP_020999313.1   hypothetical protein   -
SCI_RS02275 (SCI_0453)   -   442567..445215 (+)   2649   WP_006267276.1   valine--tRNA ligase   -
SCI_RS02280   -   445223..445612 (+)   390   WP_006267502.1   hypothetical protein   -
SCI_RS11115 (SCI_0456)   -   447106..448614 (-)   1509   WP_020999314.1   YSIRK signal domain/LPXTG anchor domain surface protein   -
SCI_RS10150 (SCI_0457)   -   448753..449633 (-)   881   Protein_433   ISAs1 family transposase   -
SCI_RS10155 (SCI_0458)   -   449810..450079 (-)   270   Protein_434   IS30 family transposase   -
SCI_RS10160 (SCI_0459)   -   450184..450908 (-)   725   Protein_435   ISL3 family transposase   -
SCI_RS02310 (SCI_0460)   cysK   450862..451791 (-)   930   WP_020997639.1   cysteine synthase A   -
SCI_RS02315 (SCI_0461)   -   451890..452525 (-)   636   WP_003035211.1   YigZ family protein   -
SCI_RS02320 (SCI_0462)   comFA/cflA   452582..453883 (+)   1302   WP_020997640.1   DEAD/DEAH box helicase   Machinery gene
SCI_RS02325 (SCI_0463)   -   453880..454545 (+)   666   WP_006267632.1   ComF family protein   -
SCI_RS02330 (SCI_0464)   hpf   454624..455166 (+)   543   WP_006267638.1   ribosome hibernation-promoting factor, HPF/YfiA family   -
SCI_RS02335 (SCI_0465)   rsmD   455363..455902 (+)   540   WP_020997641.1   16S rRNA (guanine(966)-N(2))-methyltransferase RsmD   -
SCI_RS02340 (SCI_0466)   coaD   455892..456389 (+)   498   WP_006267255.1   pantetheine-phosphate adenylyltransferase   -
SCI_RS02345 (SCI_0467)   sepM   456370..457425 (+)   1056   WP_020997642.1   SepM family pheromone-processing serine protease   Regulator
SCI_RS02350 (SCI_0468)   -   457712..458269 (+)   558   WP_006267391.1   YutD family protein   -
SCI_RS02355 (SCI_0469)   rlmN   458303..459385 (+)   1083   WP_006269861.1   23S rRNA (adenine(2503)-C(2))-methyltransferase RlmN   -
SCI_RS02360 (SCI_0470)   -   459378..459896 (+)   519   WP_003070167.1   VanZ family protein   -
SCI_RS02365 (SCI_0471)   cclA/cilC   459891..460565 (-)   675   WP_020997643.1   A24 family peptidase   Machinery gene
SCI_RS02370 (SCI_0472)   -   460716..461243 (+)   528   WP_006267546.1   DNA starvation/stationary phase protection protein   -
SCI_RS02375 (SCI_0474)   -   461460..462650 (-)   1191   WP_006267506.1   site-specific integrase   -
SCI_RS02380 (SCI_0475)   -   462726..462929 (-)   204   WP_002983016.1   excisionase   -
SCI_RS02385 (SCI_0476)   -   463359..463610 (-)   252   WP_006267285.1   helix-turn-helix domain-containing protein   -
SCI_RS02390 (SCI_0477)   -   463591..464034 (-)   444   WP_006267609.1   RNA polymerase sigma factor   -
SCI_RS02395 (SCI_0478)   -   464375..465709 (-)   1335   WP_006267371.1   MATE family efflux transporter   -
SCI_RS10170 (SCI_0479)   -   466343..466667 (+)   325   Protein_454   plasmid mobilization protein   -
SCI_RS10875   -   466634..466864 (+)   231   Protein_455   relaxase/mobilization nuclease domain-containing protein   -
SCI_RS10180   -   466836..466988 (+)   153   WP_080654533.1   helix-turn-helix domain-containing protein   -

Sequence


Protein


Length: 351 a.a.   Molecular weight: 38352.39 Da   Isoelectric point: 10.2387

>NTDB_id=61583 SCI_RS02345 WP_020997642.1 456370..457425(+) (sepM) [Streptococcus constellatus subsp. pharyngis C1050]
MEKSRTKFKKWPIVAVVGLLLLFMSFIVPLPYYIEVPGGAADVRQVLRVDNKVDKEKGSYNFVTVGIQHATFAHLVYAWL
TPFTDIYSAKDLTGGTSDKEYMRINQFYMETSQNLAKYQGLKKAGKDISLKYLGVYVLQVTKNSTFKGILNIADTVTGVN
DKTFNSSKDLIDYVGSQKIGSKVKVNYVEDGHKKSAIGKIIKLENGKNGIGISLIDRTEVTSNIPIEFSTAGIGGPSAGL
MFSLAIYTQIADPTLRDGRNIAGTGSIDREGKVGDIGGIDKKVVSAAKNGAEIFFAPNNPVTREVKKSNPKAKTNYETAL
AAAKKIKTKMKIVPVKTLQDAIDYLKSTKKS

Nucleotide


Length: 1056 bp

>NTDB_id=61583 SCI_RS02345 WP_020997642.1 456370..457425(+) (sepM) [Streptococcus constellatus subsp. pharyngis C1050]
ATGGAAAAATCACGAACTAAATTTAAAAAATGGCCGATAGTGGCTGTAGTAGGGCTGCTTCTACTGTTCATGTCTTTTAT
AGTTCCTCTGCCTTATTACATTGAGGTGCCAGGTGGTGCAGCAGATGTTCGGCAAGTATTACGTGTAGATAATAAAGTGG
ATAAGGAAAAAGGTTCTTATAACTTTGTCACTGTTGGGATTCAGCATGCAACCTTTGCTCATTTGGTTTATGCTTGGTTG
ACTCCTTTTACAGATATTTATTCGGCGAAAGATCTGACTGGCGGAACTTCTGATAAAGAATATATGCGAATCAATCAATT
TTATATGGAAACATCACAAAACTTAGCCAAATATCAAGGGTTAAAAAAAGCAGGCAAGGATATTAGCTTGAAATATTTGG
GTGTTTATGTGTTGCAAGTTACCAAGAATTCAACCTTTAAAGGGATTCTAAATATTGCAGATACTGTGACAGGCGTGAAT
GATAAGACTTTCAACAGTTCAAAAGATTTGATTGATTATGTAGGCTCACAGAAAATTGGCAGCAAGGTCAAGGTGAACTA
TGTAGAAGACGGTCATAAAAAATCTGCTATAGGTAAAATTATCAAACTTGAAAATGGGAAAAATGGAATTGGTATTAGCT
TAATTGACCGAACAGAAGTCACAAGTAATATACCAATTGAATTTTCAACAGCTGGGATTGGTGGACCAAGTGCGGGTTTG
ATGTTTAGCTTGGCGATTTATACACAGATTGCAGACCCGACATTGAGAGATGGCAGAAATATTGCTGGGACAGGATCGAT
TGACCGTGAAGGAAAAGTTGGTGATATTGGTGGGATTGATAAAAAAGTTGTTTCAGCAGCTAAAAATGGCGCAGAAATTT
TCTTTGCGCCAAATAATCCAGTTACCAGGGAAGTGAAAAAATCCAATCCTAAAGCGAAAACAAATTATGAGACAGCTTTG
GCAGCTGCTAAAAAAATTAAAACTAAAATGAAAATTGTACCTGTCAAGACCTTGCAAGACGCAATTGATTACCTCAAAAG
TACTAAAAAATCTTAA
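
The nucleotide and protein entries above describe the same gene, so translating the coding sequence should reproduce the protein. The sketch below is a minimal illustration, assuming the standard codon assignments; the names `CODON_TABLE` and `translate` are hypothetical, and only short fragments copied from the start and end of the nucleotide record are used as input.

```python
from itertools import product

# Standard codon assignments, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate FASTA line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 33 nt and last 15 nt of the sepM nucleotide record
print(translate("ATGGAAAAATCACGAACTAAATTTAAAAAATGG"))
print(translate("ACTAAAAAATCTTAA"))
```

The two fragments translate to the beginning (MEKSRTKFKKW) and the end (TKKS, followed by the stop codon) of the protein sequence listed in this record. Pasting the full 1056 bp sequence into `translate` would allow the same comparison over the whole gene.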


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
AlphaFold DB   U2ZNX3

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
sepM   Streptococcus mutans UA159   63.372   98.006   0.621
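
This page does not define the Ha-value. The figure listed for the Streptococcus mutans UA159 hit is numerically consistent with the product of the identity and coverage fractions, so the sketch below works under that assumption; the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Identity and coverage from the sepM (S. mutans UA159) hit above
print(round(ha_value(63.372, 98.006), 3))
```

If the database uses a different formula, the agreement for this single hit would be coincidental, so treat this as a consistency check rather than a definition.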


Multiple sequence alignment