Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA/nlmT   Type   Regulator
Locus tag   SDEG_RS02575 Genome accession   NC_012891
Coordinates   490276..492429 (-) Length   717 a.a.
NCBI ID   WP_003062135.1    Uniprot ID   -
Organism   Streptococcus dysgalactiae subsp. equisimilis GGS_124     
Function   transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 424793..498126 490276..492429 within 0
IS/Tn 492824..493852 490276..492429 flank 395


Gene organization within MGE regions


Location: 424793..498126
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SDEG_RS02255 (SDEG_0456) serC 425404..426495 (+) 1092 WP_012766619.1 3-phosphoserine/phosphohydroxythreonine transaminase -
  SDEG_RS02260 (SDEG_0457) - 426521..427072 (+) 552 WP_003061315.1 GNAT family N-acetyltransferase -
  SDEG_RS02265 (SDEG_0458) - 427133..427630 (+) 498 WP_003062357.1 methylated-DNA--[protein]-cysteine S-methyltransferase -
  SDEG_RS02270 (SDEG_0459) - 427627..427983 (+) 357 WP_012766620.1 arsenate reductase family protein -
  SDEG_RS02275 (SDEG_0460) - 428034..428342 (-) 309 WP_003060273.1 VOC family protein -
  SDEG_RS02280 (SDEG_0461) - 428344..428733 (-) 390 WP_012766621.1 glyoxalase/bleomycin resistance/extradiol dioxygenase family protein -
  SDEG_RS02285 (SDEG_0462) - 428831..429766 (-) 936 WP_012766622.1 LacI family DNA-binding transcriptional regulator -
  SDEG_RS02290 (SDEG_0463) - 429763..430575 (-) 813 WP_003062594.1 HAD family hydrolase -
  SDEG_RS02295 (SDEG_0464) - 430704..431531 (-) 828 WP_012766623.1 exodeoxyribonuclease III -
  SDEG_RS02300 (SDEG_0465) - 431820..432452 (+) 633 WP_012766624.1 SdpI family protein -
  SDEG_RS02305 (SDEG_0466) - 432760..434280 (+) 1521 WP_003061592.1 L-lactate permease -
  SDEG_RS02310 (SDEG_0467) lctO 434422..435603 (+) 1182 WP_003055631.1 L-lactate oxidase -
  SDEG_RS11015 - 435855..440605 (+) 4751 Protein_413 S8 family serine peptidase -
  SDEG_RS02320 (SDEG_0471) - 440818..442482 (+) 1665 WP_012766629.1 hypothetical protein -
  SDEG_RS02325 (SDEG_0472) - 442767..443474 (+) 708 WP_003061462.1 YoaK family protein -
  SDEG_RS02330 (SDEG_0473) metG 443742..445739 (+) 1998 WP_041788833.1 methionine--tRNA ligase -
  SDEG_RS02335 (SDEG_0474) nrdF 446272..447285 (+) 1014 WP_003055629.1 class 1b ribonucleoside-diphosphate reductase subunit beta -
  SDEG_RS02340 (SDEG_0475) nrdI 447289..447762 (+) 474 WP_003055616.1 class Ib ribonucleoside-diphosphate reductase assembly flavoprotein NrdI -
  SDEG_RS02345 (SDEG_0476) nrdE 447746..449932 (+) 2187 WP_012766631.1 class 1b ribonucleoside-diphosphate reductase subunit alpha -
  SDEG_RS02350 (SDEG_0477) brnQ 450230..451564 (+) 1335 WP_003055639.1 branched-chain amino acid transport system II carrier protein -
  SDEG_RS02355 (SDEG_0478) - 451902..452765 (-) 864 WP_003060834.1 IS982 family transposase -
  SDEG_RS02360 - 452927..453280 (+) 354 WP_041788838.1 hypothetical protein -
  SDEG_RS02365 (SDEG_0479) - 453629..453865 (+) 237 WP_003054401.1 DUF2829 domain-containing protein -
  SDEG_RS02370 (SDEG_0480) - 453858..454556 (+) 699 WP_003054338.1 3-oxoacyl-ACP reductase -
  SDEG_RS02375 - 454553..454795 (+) 243 WP_003049859.1 DUF3977 family protein -
  SDEG_RS02380 (SDEG_0481) - 454813..455262 (+) 450 WP_012766632.1 ASCH domain-containing protein -
  SDEG_RS02385 (SDEG_0482) - 455264..456223 (+) 960 WP_003054341.1 Gfo/Idh/MocA family protein -
  SDEG_RS02390 (SDEG_0483) - 456557..457894 (+) 1338 WP_003054334.1 MFS transporter -
  SDEG_RS02395 (SDEG_0484) glmU 458066..459448 (+) 1383 WP_003061627.1 bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase GlmU -
  SDEG_RS02400 (SDEG_0485) - 459479..460033 (+) 555 WP_003059987.1 NUDIX domain-containing protein -
  SDEG_RS02405 macP 460034..460285 (+) 252 WP_003054365.1 cell wall synthase accessory phosphoprotein MacP -
  SDEG_RS02410 (SDEG_0486) - 460303..460998 (+) 696 WP_003054376.1 5'-methylthioadenosine/adenosylhomocysteine nucleosidase -
  SDEG_RS02415 (SDEG_0487) - 461071..461718 (-) 648 WP_003059888.1 metal-dependent transcriptional regulator -
  SDEG_RS02420 (SDEG_0488) - 461864..462796 (+) 933 WP_003054333.1 metal ABC transporter substrate-binding protein -
  SDEG_RS02425 (SDEG_0489) - 462861..463586 (+) 726 WP_003054363.1 metal ABC transporter ATP-binding protein -
  SDEG_RS02430 (SDEG_0490) - 463587..464435 (+) 849 WP_003054335.1 metal ABC transporter permease -
  SDEG_RS02435 (SDEG_0491) - 464566..465372 (-) 807 WP_012766633.1 peptidylprolyl isomerase -
  SDEG_RS02440 (SDEG_0492) - 465590..467998 (+) 2409 WP_041789178.1 DNA translocase FtsK -
  SDEG_RS02445 (SDEG_0493) - 468067..468420 (-) 354 WP_012766635.1 DUF3397 domain-containing protein -
  SDEG_RS02450 (SDEG_0494) - 468576..469598 (-) 1023 WP_171841606.1 IS30 family transposase -
  SDEG_RS02455 (SDEG_0495) rplK 469840..470265 (+) 426 WP_003049875.1 50S ribosomal protein L11 -
  SDEG_RS02460 (SDEG_0496) rplA 470371..471060 (+) 690 WP_003049876.1 50S ribosomal protein L1 -
  SDEG_RS11415 - 471335..471469 (-) 135 WP_269146492.1 hypothetical protein -
  SDEG_RS11155 - 471518..471685 (-) 168 WP_003054374.1 hypothetical protein -
  SDEG_RS02470 (SDEG_0497) pyrH 471967..472695 (+) 729 WP_003049878.1 UMP kinase -
  SDEG_RS02475 (SDEG_0498) frr 472724..473281 (+) 558 WP_003054381.1 ribosome recycling factor -
  SDEG_RS02480 (SDEG_0499) - 473390..474247 (+) 858 WP_012766638.1 CvfB family protein -
  SDEG_RS02485 (SDEG_0500) msrA 474320..474829 (+) 510 WP_003054395.1 peptide-methionine (S)-S-oxide reductase MsrA -
  SDEG_RS02490 (SDEG_0501) - 474826..475041 (+) 216 WP_003054340.1 YozE family protein -
  SDEG_RS02495 (SDEG_0502) - 475200..476396 (+) 1197 WP_012766639.1 LysM peptidoglycan-binding domain-containing protein -
  SDEG_RS02500 (SDEG_0503) - 476707..478479 (+) 1773 WP_041788841.1 oleate hydratase -
  SDEG_RS02505 (SDEG_0504) - 478640..479701 (+) 1062 WP_012766641.1 PhoH family protein -
  SDEG_RS02510 (SDEG_0505) - 479748..480323 (+) 576 WP_003054351.1 uracil-DNA glycosylase family protein -
  SDEG_RS02515 (SDEG_0506) ybeY 480482..480979 (+) 498 WP_003054373.1 rRNA maturation RNase YbeY -
  SDEG_RS02520 (SDEG_0507) - 480960..481367 (+) 408 WP_003054389.1 diacylglycerol kinase family protein -
  SDEG_RS02525 (SDEG_0508) era 481488..482384 (+) 897 WP_012766643.1 GTPase Era -
  SDEG_RS11275 (SDEG_0509) - 482571..482876 (+) 306 Protein_457 NUDIX domain-containing protein -
  SDEG_RS02530 (SDEG_0510) - 482854..484113 (-) 1260 Protein_458 ABC transporter transmembrane domain-containing protein -
  SDEG_RS02535 (SDEG_0511) - 484417..484644 (+) 228 WP_003062179.1 Blp family class II bacteriocin -
  SDEG_RS02540 (SDEG_0512) - 484693..485010 (+) 318 WP_002994587.1 hypothetical protein -
  SDEG_RS02545 (SDEG_0513) - 485418..485657 (+) 240 WP_012766647.1 Blp family class II bacteriocin -
  SDEG_RS02550 - 485671..485856 (+) 186 WP_003054392.1 hypothetical protein -
  SDEG_RS02560 (SDEG_0515) comE/blpR 486540..487289 (+) 750 WP_012766649.1 response regulator transcription factor Regulator
  SDEG_RS02565 (SDEG_0516) - 487290..488624 (+) 1335 WP_012766650.1 sensor histidine kinase -
  SDEG_RS10710 (SDEG_0517) - 488685..488810 (-) 126 WP_011017462.1 ComC/BlpC family leader-containing pheromone/bacteriocin -
  SDEG_RS02570 (SDEG_0518) - 488901..490265 (-) 1365 WP_012766651.1 bacteriocin secretion accessory protein -
  SDEG_RS02575 (SDEG_0519) comA/nlmT 490276..492429 (-) 2154 WP_003062135.1 peptide cleavage/export ABC transporter Regulator
  SDEG_RS02580 (SDEG_0520) - 492830..493852 (+) 1023 WP_171841606.1 IS30 family transposase -
  SDEG_RS02585 - 493902..494120 (+) 219 WP_269146493.1 Blp family class II bacteriocin -
  SDEG_RS02590 (SDEG_0521) - 494133..494333 (+) 201 WP_003060803.1 class IIb bacteriocin, lactobin A/cerein 7B family -
  SDEG_RS02600 (SDEG_0522) - 494628..494822 (+) 195 WP_012766652.1 hypothetical protein -
  SDEG_RS02610 (SDEG_0524) - 495348..495758 (+) 411 WP_012766654.1 hypothetical protein -
  SDEG_RS11250 - 496005..496252 (+) 248 Protein_473 hypothetical protein -
  SDEG_RS10720 (SDEG_0526) - 496274..496504 (+) 231 WP_012766656.1 CPBP family intramembrane glutamic endopeptidase -
  SDEG_RS02620 (SDEG_0527) - 496930..497667 (+) 738 WP_231844023.1 hypothetical protein -

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80558.51 Da        Isoelectric Point: 7.0109

>NTDB_id=34379 SDEG_RS02575 WP_003062135.1 490276..492429(-) (comA/nlmT) [Streptococcus dysgalactiae subsp. equisimilis GGS_124]
MISYRKTFVAQIDARDCGVAALASIAKYYGSDYSLAHLRELAKTNKEGTTALGIVKAAKLMGFETRAIQADMTLFDIEDV
PYPFIVHVNKEGKFQHYYVVYQNKKNYLIIGDPDPTVKVTKMTKERFTSEWTGVAIFLAPEPSYKPHKDKKNGLISFLPL
IFKQRSLIFYIILASLLVTLINIVGSYYLQGILDDYIPNQLKSTLGIISIGLIITYILQQMMSFSRDYLLTVLSQRLSID
VILSYIRHIFELPMSFFATRRTGEIISRFTDANAIIDALASTILSLFLDVSILTIVGTVLLVQNTNLFLLSLVSVPLYIV
IIFIFMHPFEKMNNDVMQSNSMVSSAIIEDINGIETIKSLTSEENRYQKIDSEFVDYLDKSFKLSKYSILQSSLKQGAQL
ILNVIVLWFGAKLVMGGKISVGQLITFNTLLSYFTNPLENIINLQTKLQSAKVANNRLNEVYLVDSEFQEAGTLVNQELL
HGDIQFEELSYKYGFGRDTLSNINLTIKQGDKVSLVGISGSGKTTLAKMIVNFFEPYNGRITINHNDLKMIDKKSLRQHI
NYLPQQAYIFNGSILENLTLGANDCTSHEDILRACEVAEIRQDIEQMPMGYQTELSDGAGLSGGQKQRIALARALLTKAP
VLILDEATSGLDVLTEKRVIDNLLAMTDKTIIFVAHRLSISARTNQVIVLDEGKIIEIGSHQELMTKQGFYHHLFSS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=34379 SDEG_RS02575 WP_003062135.1 490276..492429(-) (comA/nlmT) [Streptococcus dysgalactiae subsp. equisimilis GGS_124]
ATGATATCCTATCGAAAAACATTTGTTGCTCAGATTGATGCTAGAGACTGTGGTGTGGCTGCTCTTGCTTCCATCGCTAA
ATATTACGGTTCAGACTACTCTTTAGCTCATTTAAGAGAGTTAGCTAAAACAAATAAAGAGGGAACAACAGCGCTTGGCA
TCGTCAAAGCTGCCAAATTAATGGGATTTGAAACAAGAGCTATTCAAGCAGACATGACCCTCTTTGATATAGAAGATGTC
CCTTATCCTTTTATTGTTCATGTTAATAAAGAAGGTAAATTCCAACACTACTATGTCGTCTATCAAAACAAAAAAAATTA
TCTGATAATCGGCGACCCTGACCCGACAGTAAAAGTCACTAAGATGACAAAGGAGCGATTTACTTCTGAATGGACGGGTG
TCGCTATTTTTCTAGCACCCGAGCCCAGCTACAAGCCTCATAAGGACAAAAAGAACGGTCTAATAAGCTTTTTACCTCTC
ATTTTTAAACAGCGATCTCTGATTTTCTATATCATATTAGCTAGCCTACTCGTTACTCTGATTAATATTGTAGGTTCTTA
TTACCTACAAGGTATATTAGATGACTATATTCCAAATCAACTGAAATCAACACTCGGTATTATTTCAATTGGTCTTATCA
TTACCTATATCCTCCAACAGATGATGAGTTTTTCGCGAGATTATCTCCTGACTGTACTGAGTCAGCGATTAAGCATTGAT
GTTATTTTATCCTATATTCGTCATATTTTTGAGCTTCCCATGTCTTTTTTTGCAACTCGCAGAACAGGAGAAATCATCTC
ACGTTTTACAGATGCTAATGCTATTATTGATGCACTCGCTTCTACTATTTTGTCTCTTTTTCTTGATGTCAGTATTTTAA
CTATCGTAGGGACAGTTCTGTTAGTGCAAAATACCAATCTTTTTCTTCTATCTCTGGTTTCAGTCCCTCTATATATTGTT
ATTATCTTTATATTTATGCACCCATTTGAAAAGATGAATAATGATGTCATGCAGAGTAACTCTATGGTTAGCTCAGCCAT
TATCGAAGATATCAATGGCATTGAAACTATTAAATCACTGACCAGTGAAGAAAATCGTTATCAGAAAATTGATAGTGAAT
TTGTGGACTATCTGGATAAATCTTTCAAGTTAAGTAAGTACTCTATTTTACAAAGCAGTCTCAAACAAGGGGCGCAACTA
ATCCTTAACGTGATAGTCTTATGGTTTGGGGCCAAATTAGTTATGGGAGGAAAAATTTCTGTTGGTCAGTTGATTACGTT
TAATACTCTCTTATCTTATTTTACCAATCCTTTAGAAAATATCATTAATCTTCAAACTAAACTGCAATCAGCCAAGGTAG
CTAATAATCGTCTCAATGAAGTTTACCTAGTCGATTCTGAGTTTCAAGAGGCTGGGACATTAGTCAATCAAGAATTATTA
CACGGAGACATTCAATTTGAAGAGTTATCCTATAAATATGGCTTTGGACGCGATACTCTATCAAATATTAATCTCACTAT
TAAGCAAGGGGACAAAGTTAGCCTTGTTGGTATTAGTGGATCTGGTAAAACGACTCTAGCTAAAATGATTGTTAATTTTT
TTGAGCCTTATAACGGCCGAATTACCATTAATCATAATGACTTAAAAATGATTGATAAAAAAAGTCTTCGCCAACATATT
AATTATCTCCCCCAACAAGCCTATATTTTTAATGGTTCTATTTTAGAAAATTTAACCTTAGGTGCTAATGACTGTACTAG
TCATGAAGATATTCTAAGAGCTTGTGAGGTTGCAGAAATTCGACAAGATATTGAACAAATGCCTATGGGCTATCAAACAG
AGCTTTCTGATGGTGCCGGACTCTCAGGAGGCCAAAAGCAACGAATTGCACTTGCCAGAGCTCTCCTAACCAAGGCGCCT
GTGCTTATTCTAGATGAGGCTACAAGCGGTCTTGACGTACTAACTGAAAAGAGGGTTATTGATAATCTTTTGGCTATGAC
GGATAAAACTATTATTTTCGTAGCTCATCGTCTCAGCATTTCAGCGCGGACCAATCAAGTCATTGTGCTTGATGAGGGTA
AAATTATTGAAATAGGATCTCATCAAGAATTAATGACTAAACAAGGTTTCTATCATCATTTATTTAGTAGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA/nlmT Streptococcus mutans UA159

72.867

99.721

0.727

  comA Streptococcus mitis NCTC 12261

67.647

99.582

0.674

  comA Streptococcus mitis SK321

67.227

99.582

0.669

  comA Streptococcus pneumoniae Rx1

67.227

99.582

0.669

  comA Streptococcus pneumoniae D39

67.227

99.582

0.669

  comA Streptococcus pneumoniae R6

67.227

99.582

0.669

  comA Streptococcus pneumoniae TIGR4

66.947

99.582

0.667

  comA Streptococcus gordonii str. Challis substr. CH1

66.017

100

0.661


Multiple sequence alignment