Detailed information    

insolico Bioinformatically predicted

Overview


Name   sepM   Type   Regulator
Locus tag   I6H72_RS04070 Genome accession   NZ_CP066055
Coordinates   826855..827910 (-) Length   351 a.a.
NCBI ID   WP_070669255.1    Uniprot ID   -
Organism   Streptococcus constellatus strain FDAARGOS_1015     
Function   processing of CSP (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 816934..850042 826855..827910 within 0


Gene organization within MGE regions


Location: 816934..850042
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6H72_RS04010 (I6H72_04010) - 817282..817434 (-) 153 WP_080654533.1 helix-turn-helix domain-containing protein -
  I6H72_RS10225 - 817406..817636 (-) 231 Protein_801 relaxase/mobilization nuclease domain-containing protein -
  I6H72_RS04015 (I6H72_04015) - 817603..817927 (-) 325 Protein_802 plasmid mobilization protein -
  I6H72_RS04020 (I6H72_04020) - 818561..819895 (+) 1335 WP_198458340.1 MATE family efflux transporter -
  I6H72_RS04025 (I6H72_04025) - 820236..820679 (+) 444 WP_198458341.1 RNA polymerase sigma factor -
  I6H72_RS04030 (I6H72_04030) - 820660..820911 (+) 252 WP_006267285.1 helix-turn-helix domain-containing protein -
  I6H72_RS04035 (I6H72_04035) - 821351..821554 (+) 204 WP_048801117.1 excisionase -
  I6H72_RS04040 (I6H72_04040) - 821630..822820 (+) 1191 WP_198458342.1 site-specific integrase -
  I6H72_RS04045 (I6H72_04045) - 823037..823564 (-) 528 WP_006267546.1 Dps family protein -
  I6H72_RS04050 (I6H72_04050) cclA/cilC 823715..824389 (+) 675 WP_020997643.1 prepilin peptidase Machinery gene
  I6H72_RS04055 (I6H72_04055) - 824384..824902 (-) 519 WP_003070167.1 VanZ family protein -
  I6H72_RS04060 (I6H72_04060) rlmN 824895..825977 (-) 1083 WP_049476508.1 23S rRNA (adenine(2503)-C(2))-methyltransferase RlmN -
  I6H72_RS04065 (I6H72_04065) - 826011..826568 (-) 558 WP_003070172.1 YutD family protein -
  I6H72_RS04070 (I6H72_04070) sepM 826855..827910 (-) 1056 WP_070669255.1 SepM family pheromone-processing serine protease Regulator
  I6H72_RS04075 (I6H72_04075) coaD 827891..828388 (-) 498 WP_006267255.1 pantetheine-phosphate adenylyltransferase -
  I6H72_RS04080 (I6H72_04080) rsmD 828378..828917 (-) 540 WP_020997641.1 16S rRNA (guanine(966)-N(2))-methyltransferase RsmD -
  I6H72_RS04085 (I6H72_04085) hpf 829114..829656 (-) 543 WP_003032381.1 ribosome hibernation-promoting factor, HPF/YfiA family -
  I6H72_RS04090 (I6H72_04090) - 829735..830400 (-) 666 WP_006267632.1 ComF family protein -
  I6H72_RS04095 (I6H72_04095) comFA/cflA 830397..831698 (-) 1302 WP_198458343.1 DEAD/DEAH box helicase Machinery gene
  I6H72_RS04100 (I6H72_04100) - 831755..832390 (+) 636 WP_003035211.1 YigZ family protein -
  I6H72_RS04105 (I6H72_04105) cysK 832489..833418 (+) 930 WP_198458344.1 cysteine synthase A -
  I6H72_RS04110 (I6H72_04110) - 833483..834096 (+) 614 Protein_821 transposase -
  I6H72_RS04115 (I6H72_04115) - 834201..834470 (+) 270 Protein_822 IS30 family transposase -
  I6H72_RS04120 (I6H72_04120) - 834647..835527 (+) 881 Protein_823 ISAs1 family transposase -
  I6H72_RS10450 (I6H72_04125) - 835666..836973 (+) 1308 WP_198458345.1 albumin-binding GA domain-containing protein -
  I6H72_RS04130 (I6H72_04130) - 837151..838394 (+) 1244 WP_198458346.1 ISL3 family transposase -
  I6H72_RS04135 (I6H72_04135) - 838455..838844 (-) 390 WP_198458347.1 hypothetical protein -
  I6H72_RS04140 (I6H72_04140) - 838852..841500 (-) 2649 WP_006267276.1 valine--tRNA ligase -
  I6H72_RS04145 (I6H72_04145) - 841522..842460 (-) 939 WP_003070201.1 hypothetical protein -
  I6H72_RS04150 (I6H72_04150) - 842457..843035 (-) 579 WP_003070202.1 GNAT family N-acetyltransferase -
  I6H72_RS04155 (I6H72_04155) - 843422..843676 (-) 255 WP_006267273.1 DUF1912 family protein -
  I6H72_RS04160 (I6H72_04160) - 843689..845050 (-) 1362 WP_198458348.1 DUF438 domain-containing protein -
  I6H72_RS04165 (I6H72_04165) - 845050..845283 (-) 234 WP_003070208.1 DUF1858 domain-containing protein -
  I6H72_RS04170 (I6H72_04170) - 845617..846549 (+) 933 WP_198458349.1 nitronate monooxygenase -
  I6H72_RS04175 (I6H72_04175) rlmD 847407..848771 (-) 1365 WP_006267121.1 23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD -
  I6H72_RS04180 (I6H72_04180) - 848828..849604 (+) 777 Protein_835 aminoglycoside 3'-phosphotransferase -

Sequence


Protein


Download         Length: 351 a.a.        Molecular weight: 38315.36 Da        Isoelectric Point: 10.2305

>NTDB_id=516846 I6H72_RS04070 WP_070669255.1 826855..827910(-) (sepM) [Streptococcus constellatus strain FDAARGOS_1015]
MEKSRTKFKKWPIVAVVGLLLLFMSFIVPLPYYIEVPGGAADVRQVLRVDNKVDKEKGSYNFVTVGIQHATFAHLVYAWL
TPFTDIYSAKDLTGGTSDKEYMRINQFYMETSQNLAKYQGLKKAGKDISLKYLGVYVLQVTKNSTFKGILNIADTVTGVN
DKTFNSSKDLIDYVGSQKIGSKVKVNYVEDGQKKSAIGKIIKLENGKNGIGISLIDRTEVTSNIPIEFSTAGIGGPSAGL
MFSLAIYTQIADPTLRDGRNIAGTGSIDREGKVGDIGGIDKKVVSAAKNGAEIFFAPNNPVTKEVKKSNPKAKTNYETAL
AAAKKIKTKMKIVPVKTLQDAIDYLKSTKKS

Nucleotide


Download         Length: 1056 bp        

>NTDB_id=516846 I6H72_RS04070 WP_070669255.1 826855..827910(-) (sepM) [Streptococcus constellatus strain FDAARGOS_1015]
ATGGAAAAATCACGAACTAAATTTAAAAAATGGCCGATAGTGGCTGTAGTAGGGCTGCTTCTACTGTTCATGTCTTTTAT
AGTTCCTCTGCCTTATTACATTGAGGTGCCAGGTGGTGCAGCAGATGTTCGGCAAGTATTACGTGTAGATAATAAAGTGG
ATAAGGAAAAAGGTTCTTATAACTTTGTCACTGTTGGGATTCAGCATGCAACCTTTGCTCATTTGGTTTATGCTTGGTTG
ACTCCTTTTACAGATATTTATTCGGCGAAAGATCTGACTGGCGGAACTTCTGATAAAGAATATATGCGAATCAATCAATT
TTATATGGAAACATCACAAAACTTAGCCAAATATCAAGGGTTAAAAAAAGCAGGCAAGGATATTAGCTTGAAATATTTGG
GTGTTTATGTGTTGCAAGTTACCAAGAATTCAACCTTTAAAGGGATTCTAAATATTGCAGATACTGTGACAGGCGTGAAT
GATAAGACTTTCAACAGTTCAAAAGATTTGATTGATTATGTAGGCTCACAGAAAATTGGCAGCAAGGTCAAGGTGAACTA
TGTAGAAGACGGTCAGAAAAAATCTGCTATAGGTAAAATTATCAAACTTGAAAATGGGAAAAATGGAATTGGTATTAGCT
TAATTGACCGAACAGAAGTCACAAGTAATATACCAATTGAATTTTCAACAGCTGGGATTGGTGGACCAAGTGCGGGTTTA
ATGTTTAGCTTGGCGATTTATACACAGATTGCAGACCCGACATTGAGAGATGGCAGAAATATTGCTGGGACAGGATCGAT
TGACCGTGAAGGAAAAGTTGGTGATATTGGTGGGATTGATAAAAAAGTTGTTTCAGCAGCTAAAAATGGCGCAGAAATTT
TCTTTGCGCCAAATAATCCAGTCACCAAGGAAGTGAAAAAATCCAATCCTAAAGCGAAAACAAACTATGAGACAGCTTTG
GCAGCTGCTAAAAAAATTAAAACTAAAATGAAAATTGTACCTGTCAAGACCTTGCAAGACGCAATTGATTACCTCAAAAG
TACTAAAAAATCTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  sepM Streptococcus mutans UA159

63.663

98.006

0.624