Detailed information    

In silico: bioinformatically predicted

Overview


Name: comA/nlmT
Type: Regulator
Locus tag: MGCS35823_RS02825
Genome accession: NZ_CP117289
Coordinates: 525552..527705 (-)
Length: 717 a.a.
NCBI ID: WP_084916306.1
Uniprot ID: -
Organism: Streptococcus dysgalactiae subsp. equisimilis strain MGCS35823
Function: transport of ComC (predicted from homology)
Competence regulation
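
The coordinates and the protein length listed above agree with one another if the coordinates are read as 1-based and inclusive and the 717 a.a. count excludes the stop codon. Both points are assumptions about the record's conventions, not statements taken from the database. A minimal sketch of that check follows; the function name `gene_length_bp` is illustrative.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Values copied from the Overview fields above.
START, END = 525552, 527705
PROTEIN_LENGTH_AA = 717

length_bp = gene_length_bp(START, END)
codons = length_bp // 3  # codon count, assumed to include the terminal stop codon

print(length_bp)                        # 2154
print(length_bp % 3 == 0)               # True
print(codons - 1 == PROTEIN_LENGTH_AA)  # True
```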

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
ICE        495711..546531    525552..527705     within              0
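
The "within" label and zero distance in the summary above follow from comparing the two coordinate ranges. The sketch below shows one way such a comparison could be made. It is not the VRprofile2 or database implementation: the labels other than "within" and the distance definition are hypothetical, and coordinates are assumed to be 1-based and inclusive.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples, 1-based and inclusive.
    Returns (label, distance_bp). Only "within" appears in the source
    record; the other labels are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "before", m_start - g_end   # gap to the MGE start
    if g_start > m_end:
        return "after", g_start - m_end    # gap from the MGE end
    return "overlapping", 0


# Coordinates copied from the summary table above.
print(relative_position((525552, 527705), (495711, 546531)))  # ('within', 0)
```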


Gene organization within MGE regions


Location: 495711..546531
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MGCS35823_RS02660 (MGCS35823_01084) - 495810..498218 (+) 2409 WP_037590009.1 DNA translocase FtsK -
  MGCS35823_RS02665 (MGCS35823_01086) - 498287..498640 (-) 354 WP_037587264.1 DUF3397 domain-containing protein -
  MGCS35823_RS02670 (MGCS35823_01088) rplK 498883..499308 (+) 426 WP_003049875.1 50S ribosomal protein L11 -
  MGCS35823_RS02675 (MGCS35823_01090) rplA 499414..500103 (+) 690 WP_003049876.1 50S ribosomal protein L1 -
  MGCS35823_RS02680 - 500378..500512 (-) 135 WP_003054384.1 hypothetical protein -
  MGCS35823_RS02685 (MGCS35823_01092) - 500606..500773 (-) 168 WP_003054374.1 hypothetical protein -
  MGCS35823_RS02690 (MGCS35823_01094) pyrH 501055..501783 (+) 729 WP_003049878.1 UMP kinase -
  MGCS35823_RS02695 (MGCS35823_01096) frr 501812..502369 (+) 558 WP_003054381.1 ribosome recycling factor -
  MGCS35823_RS02700 (MGCS35823_01098) - 502478..503335 (+) 858 WP_003054345.1 S1 RNA-binding domain-containing protein -
  MGCS35823_RS02705 (MGCS35823_01100) msrA 503408..503917 (+) 510 WP_003054395.1 peptide-methionine (S)-S-oxide reductase MsrA -
  MGCS35823_RS02710 (MGCS35823_01102) - 503914..504129 (+) 216 WP_003054340.1 YozE family protein -
  MGCS35823_RS02715 (MGCS35823_01104) - 504289..505524 (+) 1236 WP_003054361.1 LysM domain-containing protein -
  MGCS35823_RS02720 (MGCS35823_01106) - 505835..507607 (+) 1773 WP_003054339.1 oleate hydratase -
  MGCS35823_RS02725 (MGCS35823_01108) - 507768..508829 (+) 1062 WP_003054347.1 PhoH family protein -
  MGCS35823_RS02730 (MGCS35823_01110) - 508876..509451 (+) 576 WP_003054351.1 uracil-DNA glycosylase family protein -
  MGCS35823_RS02735 (MGCS35823_01112) ybeY 509610..510107 (+) 498 WP_003054373.1 rRNA maturation RNase YbeY -
  MGCS35823_RS02740 (MGCS35823_01114) - 510088..510495 (+) 408 WP_003054389.1 diacylglycerol kinase -
  MGCS35823_RS02745 (MGCS35823_01116) era 510616..511512 (+) 897 WP_002985743.1 GTPase Era -
  MGCS35823_RS02750 (MGCS35823_01118) - 511533..512009 (+) 477 WP_002993018.1 NUDIX domain-containing protein -
  MGCS35823_RS02755 (MGCS35823_01120) - 512318..512569 (-) 252 WP_080589767.1 hypothetical protein -
  MGCS35823_RS02760 - 512870..513088 (+) 219 WP_072135532.1 LytTR family transcriptional regulator DNA-binding domain-containing protein -
  MGCS35823_RS02765 (MGCS35823_01124) - 513369..515082 (-) 1714 Protein_488 peptide cleavage/export ABC transporter -
  MGCS35823_RS02770 (MGCS35823_01126) - 515380..515607 (+) 228 WP_022554332.1 Blp family class II bacteriocin -
  MGCS35823_RS02775 (MGCS35823_01128) - 515656..515973 (+) 318 WP_002994587.1 hypothetical protein -
  MGCS35823_RS02780 (MGCS35823_01130) - 516156..517177 (+) 1022 Protein_491 IS3 family transposase -
  MGCS35823_RS02785 (MGCS35823_01132) - 517391..518809 (+) 1419 WP_231873805.1 IS1182 family transposase -
  MGCS35823_RS02795 (MGCS35823_01136) - 520415..521165 (+) 751 Protein_494 response regulator transcription factor -
  MGCS35823_RS02805 (MGCS35823_01140) - 521507..522640 (-) 1134 WP_275902597.1 ISAs1-like element IS1548 family transposase -
  MGCS35823_RS02810 (MGCS35823_01142) - 523003..523827 (+) 825 WP_275910586.1 GHKL domain-containing protein -
  MGCS35823_RS02815 (MGCS35823_01144) - 523975..524100 (-) 126 WP_011017462.1 ComC/BlpC family leader-containing pheromone/bacteriocin -
  MGCS35823_RS02820 (MGCS35823_01148) - 524177..525541 (-) 1365 WP_084916305.1 bacteriocin secretion accessory protein -
  MGCS35823_RS02825 (MGCS35823_01150) comA/nlmT 525552..527705 (-) 2154 WP_084916306.1 peptide cleavage/export ABC transporter Regulator
  MGCS35823_RS02830 (MGCS35823_01152) - 527992..528219 (+) 228 WP_002992018.1 Blp family class II bacteriocin -
  MGCS35823_RS02835 (MGCS35823_01154) - 528232..528432 (+) 201 WP_003060803.1 class IIb bacteriocin, lactobin A/cerein 7B family -
  MGCS35823_RS02840 (MGCS35823_01156) - 528726..528920 (+) 195 WP_002990768.1 hypothetical protein -
  MGCS35823_RS02845 (MGCS35823_01158) - 528968..529249 (+) 282 WP_011054252.1 hypothetical protein -
  MGCS35823_RS02850 (MGCS35823_01160) - 529443..529853 (+) 411 WP_275910587.1 hypothetical protein -
  MGCS35823_RS02855 - 530100..530347 (+) 248 Protein_505 hypothetical protein -
  MGCS35823_RS02860 (MGCS35823_01162) - 530369..530623 (+) 255 WP_014612089.1 CPBP family intramembrane glutamic endopeptidase -
  MGCS35823_RS02865 (MGCS35823_01166) - 531035..531772 (+) 738 WP_228113938.1 hypothetical protein -
  MGCS35823_RS02870 (MGCS35823_01168) - 531922..532864 (-) 943 Protein_508 hypothetical protein -
  MGCS35823_RS02875 shp2 532933..533001 (-) 69 WP_134771781.1 peptide pheromone SHP2 -
  MGCS35823_RS02880 (MGCS35823_01170) rgg2 533091..533945 (+) 855 Protein_510 quorum-sensing system transcriptional regulator Rgg2 -
  MGCS35823_RS02885 (MGCS35823_01172) mutM 534127..534954 (+) 828 WP_084916308.1 DNA-formamidopyrimidine glycosylase -
  MGCS35823_RS02890 (MGCS35823_01174) coaE 534933..535544 (+) 612 WP_172453883.1 dephospho-CoA kinase -
  MGCS35823_RS02895 - 535559..535792 (-) 234 WP_159335598.1 hypothetical protein -
  MGCS35823_RS02900 (MGCS35823_01176) - 535912..536613 (+) 702 WP_003056868.1 ABC transporter ATP-binding protein -
  MGCS35823_RS02905 (MGCS35823_01178) - 536603..538240 (+) 1638 WP_022554345.1 hypothetical protein -
  MGCS35823_RS02910 (MGCS35823_01180) - 538350..539852 (+) 1503 WP_084916309.1 helicase HerA-like domain-containing protein -
  MGCS35823_RS02915 (MGCS35823_01182) - 540016..541233 (+) 1218 WP_223846048.1 multidrug efflux MFS transporter -
  MGCS35823_RS02920 rpmG 541230..541376 (+) 147 WP_003047126.1 50S ribosomal protein L33 -
  MGCS35823_RS02925 (MGCS35823_01184) secG 541422..541658 (+) 237 WP_003047129.1 preprotein translocase subunit SecG -
  MGCS35823_RS02930 (MGCS35823_01186) rnr 541755..544085 (+) 2331 WP_084916311.1 ribonuclease R -
  MGCS35823_RS02935 (MGCS35823_01188) smpB 544088..544555 (+) 468 WP_022554349.1 SsrA-binding protein SmpB -
  MGCS35823_RS02940 (MGCS35823_01190) - 544734..545090 (+) 357 WP_003053910.1 hypothetical protein -
  MGCS35823_RS02945 (MGCS35823_01192) - 545183..546531 (+) 1349 WP_111678974.1 IS3 family transposase -

Sequence


Protein


Length: 717 a.a.    Molecular weight: 80504.46 Da    Isoelectric Point: 7.1728

>NTDB_id=783436 MGCS35823_RS02825 WP_084916306.1 525552..527705(-) (comA/nlmT) [Streptococcus dysgalactiae subsp. equisimilis strain MGCS35823]
MISYRKTFVAQIDARDCGVAALASIAKYYGSDYSLAHLRELAKTNKEGTTALGIVKAAKLMGFETRAIQADMTLFDIEDV
PYPFIVHVNKEGKFQHYYVIYQNKKNYLIIGDPDSTVKVTKMTKERFTSEWTGVAIFLAPEPSYKPHKDKKNGLISFLPL
IFKQRSLIFYIILASLLVTLINIVGSYYLQGILDDYIPNQLKSTLGIISIGLIITYILQQMMSFSRDYLLTVLSQRLSID
VILSYIRHIFELPMSFFATRRTGEIISRFTDANAIIDALASTILSLFLDVSILTIVGTVLLVQNTNLFLLSLVSVPLYIV
IIFIFMHPFEKMNNDVMQSNSMVSSAIIEDINGIETIKSLTSEENRYQKIDSEFVGYLDKSFKLSKYSILQSSLKQGAQL
ILNVIVLWFGAKLVMGGKISVGQLITFNTLLSYFTNPLENIINLQTKLQSAKVANNRLNEVYLVDSEFQEAGTLVNQELL
HGDIQFEELSYKYGFGRDTLSNINLTIKQGDKVSLVGISGSGKTTLAKMIVNFFEPYNGRITINHNDLKMIDKKSLRQHI
NYLPQQAYIFNGSILENLTLGANDCTSHEDILRACEVAEIRQDIEQMPMGYQTELSDGAGLSGGQKQRIALARALLTKAP
VLILDEATSGLDVLTEKRVIDNLLAMTDKTIIFVAHRLSISARTNQVIVLDEGKIIEIGSHQELMTKQGFYHHLFSS

Nucleotide


Length: 2154 bp

>NTDB_id=783436 MGCS35823_RS02825 WP_084916306.1 525552..527705(-) (comA/nlmT) [Streptococcus dysgalactiae subsp. equisimilis strain MGCS35823]
ATGATATCCTATCGAAAAACATTTGTTGCTCAGATTGATGCTAGAGACTGTGGTGTGGCTGCTCTTGCTTCCATCGCTAA
ATATTACGGTTCAGACTACTCTTTAGCTCATTTAAGAGAGTTAGCTAAAACAAATAAAGAGGGAACAACAGCGCTTGGCA
TCGTCAAAGCTGCCAAATTAATGGGATTTGAAACAAGAGCTATTCAAGCAGACATGACCCTCTTTGATATAGAAGATGTC
CCTTATCCTTTTATTGTTCATGTTAATAAAGAAGGTAAATTCCAACACTACTATGTCATCTATCAAAACAAAAAAAATTA
TCTGATAATCGGCGACCCTGACTCGACAGTAAAAGTCACTAAGATGACAAAGGAGCGATTTACTTCTGAATGGACGGGTG
TCGCTATTTTTCTAGCACCCGAGCCCAGCTACAAGCCTCATAAGGACAAAAAGAACGGTCTAATAAGCTTTTTACCTCTC
ATTTTTAAACAGCGATCTCTGATTTTCTATATCATATTAGCTAGCCTACTCGTTACTCTGATTAATATTGTAGGTTCTTA
TTACCTACAAGGTATATTAGATGACTATATTCCAAATCAACTGAAATCAACACTCGGCATTATTTCAATTGGTCTTATCA
TTACCTATATCCTCCAACAGATGATGAGTTTTTCGCGAGATTATCTCCTGACTGTACTGAGTCAGCGATTAAGCATTGAT
GTTATTTTATCCTATATTCGTCATATTTTTGAGCTTCCCATGTCTTTCTTTGCAACACGTCGGACAGGAGAAATCATCTC
ACGTTTTACAGATGCTAATGCTATTATTGATGCACTCGCTTCTACTATTTTGTCTCTTTTTCTTGATGTCAGTATTTTAA
CTATCGTAGGGACAGTCCTGTTAGTGCAAAATACCAATCTTTTTCTTCTATCTCTGGTTTCAGTCCCTCTATATATTGTT
ATTATCTTTATATTTATGCACCCATTTGAAAAGATGAATAATGATGTCATGCAGAGTAACTCTATGGTTAGCTCAGCCAT
TATCGAAGATATCAATGGCATTGAAACTATTAAATCACTGACCAGTGAAGAAAATCGTTATCAGAAAATTGATAGTGAAT
TTGTGGGCTATCTGGATAAATCCTTCAAGTTAAGTAAGTACTCTATTTTACAAAGCAGTCTCAAACAAGGGGCGCAACTA
ATCCTTAACGTGATAGTCTTATGGTTTGGGGCCAAATTAGTTATGGGAGGAAAAATTTCTGTTGGTCAGTTGATTACGTT
TAATACTCTCTTATCTTATTTTACCAATCCTTTAGAAAATATCATTAATCTTCAAACTAAACTGCAATCAGCCAAGGTAG
CTAATAATCGTCTCAATGAAGTTTACCTAGTCGATTCTGAGTTTCAAGAGGCTGGGACATTAGTCAATCAAGAATTATTA
CACGGAGACATTCAATTTGAAGAGTTATCCTATAAATATGGCTTTGGACGCGATACTCTATCAAATATTAATCTCACTAT
TAAGCAAGGGGACAAAGTTAGCCTTGTTGGTATTAGTGGATCTGGTAAAACGACTCTAGCTAAAATGATTGTTAATTTTT
TTGAGCCTTATAACGGCCGAATTACCATTAATCATAATGACTTAAAAATGATTGATAAAAAAAGTCTTCGCCAACATATT
AATTATCTCCCCCAACAAGCCTATATTTTTAATGGTTCTATTTTAGAAAATTTAACCTTAGGTGCTAATGACTGTACTAG
TCATGAAGATATTCTAAGAGCTTGTGAGGTTGCAGAAATTCGACAAGATATTGAACAAATGCCTATGGGCTATCAAACAG
AGCTTTCTGATGGTGCCGGACTCTCAGGAGGCCAAAAGCAACGAATTGCACTTGCCAGAGCTCTCCTAACCAAGGCGCCT
GTGCTTATTCTAGATGAGGCTACAAGCGGTCTTGACGTACTAACTGAAAAGAGGGTTATTGATAATCTTTTGGCTATGAC
GGATAAAACTATTATTTTCGTAGCTCATCGTCTCAGCATTTCAGCGCGGACCAATCAAGTCATTGTGCTTGATGAGGGCA
AAATTATTGAAATAGGATCTCATCAAGAATTAATGACTAAACAAGGTTTCTATCATCATTTATTTAGTAGTTAA
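
Although the gene lies on the (-) strand, the nucleotide record above appears to be given in coding orientation: it starts with ATG and ends with TAA. The sketch below translates such a sequence without reverse-complementing it, using the standard codon assignments, which NCBI translation table 11 (bacterial) shares with table 1. The function and variable names are illustrative, and only short excerpts of the record are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from FASTA wrapping
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the nucleotide record above.
print(translate("ATGATATCCTATCGAAAAACATTTGTTGCT"))  # MISYRKTFVA
print(translate("TTTAGTAGTTAA"))                    # FSS
```

The two outputs match the beginning (MISYRKTFVA) and end (FSS) of the protein record, which is consistent with the orientation assumption.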


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.




Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein     Organism                                           Identities (%)   Coverage (%)   Ha-value
comA/nlmT   Streptococcus mutans UA159                         72.448           99.721         0.722
comA        Streptococcus mitis NCTC 12261                     67.227           99.582         0.669
comA        Streptococcus mitis SK321                          66.807           99.582         0.665
comA        Streptococcus pneumoniae Rx1                       66.807           99.582         0.665
comA        Streptococcus pneumoniae D39                       66.807           99.582         0.665
comA        Streptococcus pneumoniae R6                        66.807           99.582         0.665
comA        Streptococcus pneumoniae TIGR4                     66.527           99.582         0.662
comA        Streptococcus gordonii str. Challis substr. CH1    65.738           100            0.658