Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpX   Type   Regulator
Locus tag   GE021_RS04300 Genome accession   NZ_CP053792
Coordinates   869104..870333 (+) Length   409 a.a.
NCBI ID   WP_125073848.1    Uniprot ID   A0A3P5Y7D1
Organism   Streptococcus canis strain HL_77_1     
Function   require for competence development (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 862714..915603 869104..870333 within 0


Gene organization within MGE regions


Location: 862714..915603
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GE021_RS04265 (GE021_004265) fni 862714..863706 (+) 993 WP_125073854.1 type 2 isopentenyl-diphosphate Delta-isomerase -
  GE021_RS11530 (GE021_004270) - 864015..864161 (+) 147 WP_367302866.1 transposase -
  GE021_RS11535 (GE021_004275) - 864349..864603 (+) 255 WP_125073853.1 IS3 family transposase -
  GE021_RS04280 (GE021_004280) - 864701..865975 (-) 1275 WP_125073852.1 hydroxymethylglutaryl-CoA reductase, degradative -
  GE021_RS04285 (GE021_004285) - 865956..867137 (-) 1182 WP_125073851.1 hydroxymethylglutaryl-CoA synthase -
  GE021_RS04290 (GE021_004290) - 867345..868184 (+) 840 WP_125073850.1 thymidylate synthase -
  GE021_RS04295 (GE021_004295) - 868264..868761 (+) 498 WP_125073849.1 dihydrofolate reductase -
  GE021_RS04300 (GE021_004300) clpX 869104..870333 (+) 1230 WP_125073848.1 ATP-dependent Clp protease ATP-binding subunit ClpX Regulator
  GE021_RS04305 (GE021_004305) yihA 870343..870942 (+) 600 WP_125073847.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  GE021_RS04310 (GE021_004310) - 871092..871832 (+) 741 WP_125073846.1 hypothetical protein -
  GE021_RS04315 (GE021_004315) - 871961..872927 (-) 967 Protein_819 IS30 family transposase -
  GE021_RS04320 (GE021_004320) - 873073..873870 (-) 798 Protein_820 IS982 family transposase -
  GE021_RS04325 (GE021_004325) - 873966..874346 (-) 381 WP_125073844.1 hypothetical protein -
  GE021_RS04330 (GE021_004330) - 874399..874557 (-) 159 WP_142998413.1 helix-turn-helix domain-containing protein -
  GE021_RS04335 (GE021_004335) - 875410..876606 (-) 1197 WP_164407705.1 site-specific integrase -
  GE021_RS04340 (GE021_004340) - 876959..878161 (-) 1203 WP_302051309.1 IS110 family transposase -
  GE021_RS04345 (GE021_004345) - 878410..879003 (-) 594 WP_164407456.1 HIRAN domain-containing protein -
  GE021_RS04350 (GE021_004350) - 879015..879398 (-) 384 WP_164407458.1 ImmA/IrrE family metallo-endopeptidase -
  GE021_RS04355 (GE021_004355) - 879402..879752 (-) 351 WP_063629762.1 helix-turn-helix domain-containing protein -
  GE021_RS04360 (GE021_004360) - 880042..880206 (+) 165 WP_164407460.1 hypothetical protein -
  GE021_RS04365 (GE021_004365) - 880248..880451 (+) 204 WP_015984784.1 hypothetical protein -
  GE021_RS11430 - 880453..880584 (+) 132 WP_261006046.1 hypothetical protein -
  GE021_RS04370 (GE021_004370) - 880696..881025 (-) 330 WP_164407462.1 hypothetical protein -
  GE021_RS04375 (GE021_004375) - 881025..881456 (-) 432 WP_164407464.1 hypothetical protein -
  GE021_RS04380 (GE021_004380) - 881516..882235 (+) 720 WP_164407466.1 ORF6C domain-containing protein -
  GE021_RS04385 (GE021_004385) - 882309..882566 (+) 258 WP_164407468.1 DNA-binding protein -
  GE021_RS04390 (GE021_004390) - 882704..882892 (+) 189 WP_164407470.1 hypothetical protein -
  GE021_RS04395 (GE021_004395) - 882905..883489 (+) 585 WP_164407472.1 hypothetical protein -
  GE021_RS04400 (GE021_004400) - 883538..884449 (+) 912 WP_217267494.1 DnaD domain protein -
  GE021_RS04405 (GE021_004405) - 884485..884712 (+) 228 WP_164407476.1 hypothetical protein -
  GE021_RS04410 (GE021_004410) - 884726..884932 (+) 207 WP_155782846.1 hypothetical protein -
  GE021_RS04415 (GE021_004415) - 884942..885127 (+) 186 WP_085577387.1 hypothetical protein -
  GE021_RS04420 (GE021_004420) - 885124..885288 (+) 165 WP_164407478.1 hypothetical protein -
  GE021_RS04425 (GE021_004425) - 885285..885569 (+) 285 WP_164407480.1 hypothetical protein -
  GE021_RS04430 (GE021_004430) - 885573..886055 (+) 483 WP_164407482.1 class I SAM-dependent methyltransferase -
  GE021_RS04435 (GE021_004435) - 886057..886791 (+) 735 WP_164407484.1 DNA cytosine methyltransferase -
  GE021_RS04440 (GE021_004440) - 886772..887275 (+) 504 WP_254388134.1 DNA cytosine methyltransferase -
  GE021_RS11125 - 887265..887831 (+) 567 WP_164407486.1 DUF3310 domain-containing protein -
  GE021_RS04450 (GE021_004450) - 887824..888156 (+) 333 WP_002986215.1 hypothetical protein -
  GE021_RS04455 (GE021_004455) - 888149..888376 (+) 228 WP_164407488.1 hypothetical protein -
  GE021_RS04460 (GE021_004460) ssb 888366..888758 (+) 393 WP_155782953.1 single-stranded DNA-binding protein Machinery gene
  GE021_RS04465 (GE021_004465) - 888772..889050 (+) 279 WP_029713970.1 hypothetical protein -
  GE021_RS04470 (GE021_004470) - 889047..889388 (+) 342 WP_164407490.1 helix-turn-helix domain-containing protein -
  GE021_RS04475 (GE021_004475) - 889511..890338 (+) 828 WP_164407492.1 prohibitin family protein -
  GE021_RS04480 (GE021_004480) - 890353..890754 (+) 402 WP_164407495.1 transcriptional regulator -
  GE021_RS04485 (GE021_004485) - 890914..891489 (+) 576 WP_011054435.1 site-specific integrase -
  GE021_RS04490 (GE021_004490) - 891643..891939 (+) 297 WP_164407497.1 hypothetical protein -
  GE021_RS04495 (GE021_004495) - 891978..892364 (+) 387 WP_164407499.1 hypothetical protein -
  GE021_RS04500 (GE021_004500) - 892357..892662 (+) 306 WP_164407501.1 HNH endonuclease -
  GE021_RS04505 (GE021_004505) - 892805..893122 (+) 318 WP_164407503.1 P27 family phage terminase small subunit -
  GE021_RS04510 (GE021_004510) - 893135..894865 (+) 1731 WP_164407506.1 terminase large subunit -
  GE021_RS04515 (GE021_004515) - 895233..896420 (+) 1188 WP_164407508.1 phage portal protein -
  GE021_RS04520 (GE021_004520) - 896401..897207 (+) 807 WP_164407510.1 head maturation protease, ClpP-related -
  GE021_RS04525 (GE021_004525) - 897224..898357 (+) 1134 WP_164407512.1 phage major capsid protein -
  GE021_RS04530 (GE021_004530) - 898378..898551 (+) 174 WP_164407515.1 hypothetical protein -
  GE021_RS04535 (GE021_004535) - 898551..898859 (+) 309 WP_027970290.1 hypothetical protein -
  GE021_RS04540 (GE021_004540) - 898852..899214 (+) 363 WP_037562728.1 hypothetical protein -
  GE021_RS04545 (GE021_004545) - 899216..899614 (+) 399 WP_143978750.1 HK97 gp10 family phage protein -
  GE021_RS04550 (GE021_004550) - 899607..899987 (+) 381 WP_164407517.1 hypothetical protein -
  GE021_RS04555 (GE021_004555) - 899999..900583 (+) 585 WP_164407519.1 major tail protein -
  GE021_RS04560 (GE021_004560) - 900679..900981 (+) 303 WP_164407521.1 hypothetical protein -
  GE021_RS04565 (GE021_004565) - 900996..901163 (+) 168 WP_164407454.1 hypothetical protein -
  GE021_RS04570 (GE021_004570) - 901207..905307 (+) 4101 WP_164407523.1 phage tail tape measure protein -
  GE021_RS04575 (GE021_004575) - 905320..906090 (+) 771 WP_164407525.1 distal tail protein Dit -
  GE021_RS04580 (GE021_004580) - 906087..908135 (+) 2049 WP_164407527.1 phage tail spike protein -
  GE021_RS11435 - 908135..909319 (+) 1185 WP_302051313.1 hypothetical protein -
  GE021_RS04590 (GE021_004590) - 909316..909690 (+) 375 WP_164407529.1 hypothetical protein -
  GE021_RS04595 (GE021_004595) - 909702..911405 (+) 1704 WP_172774133.1 gp58-like family protein -
  GE021_RS04600 (GE021_004600) - 911417..911845 (+) 429 WP_164407648.1 DUF1617 family protein -
  GE021_RS04605 (GE021_004605) - 911848..912273 (+) 426 WP_143928019.1 DUF1366 domain-containing protein -
  GE021_RS04610 (GE021_004610) - 912299..912460 (+) 162 WP_164406953.1 hypothetical protein -
  GE021_RS04615 (GE021_004615) - 912469..912843 (+) 375 WP_125073767.1 phage holin family protein -
  GE021_RS04620 (GE021_004620) - 912966..914183 (+) 1218 WP_164406948.1 peptidoglycan amidohydrolase family protein -
  GE021_RS04625 (GE021_004625) - 914327..914578 (-) 252 WP_164406943.1 hypothetical protein -
  GE021_RS04630 (GE021_004630) - 914719..914865 (-) 147 WP_164406938.1 hypothetical protein -
  GE021_RS04635 (GE021_004635) - 915018..915227 (+) 210 WP_011054450.1 helix-turn-helix domain-containing protein -
  GE021_RS04640 (GE021_004640) prx 915421..915603 (+) 183 WP_164406925.1 Paratox Regulator

Sequence


Protein


Download         Length: 409 a.a.        Molecular weight: 44972.48 Da        Isoelectric Point: 4.6207

>NTDB_id=447274 GE021_RS04300 WP_125073848.1 869104..870333(+) (clpX) [Streptococcus canis strain HL_77_1]
MAGNRTNDIKVYCSFCGKSQDEVKKIIAGNNVFICNECVALSQEIIKEELAEEVLADLTEVPKPKELLNILNQYVVGQDR
AKRALAVAVYNHYKRVSFAESRDDEDVDLQKSNILMIGPTGSGKTFLAQTLAKSLNVPFAIADATSLTEAGYVGEDVENI
LLKLIQAADYNVERAERGIIYVDEIDKIAKKGENVSITRDVSGEGVQQALLKIIEGTVASVPPQGGRKHPNQEMIQIDTK
NILFIVGGAFDGIEEIVKQRLGEKIIGFGQNSRKIDDNASYMQEIISEDIQKFGLIPEFIGRLPVVAALEQLNTSDLIQI
LTEPRNALVKQYQALLSYDGVELEFEKGALEAIASKAIERKTGARGLRSIIEETMLDIMFEVPSQEDVTKVRVTKAAVEG
KSKPVLETA

Nucleotide


Download         Length: 1230 bp        

>NTDB_id=447274 GE021_RS04300 WP_125073848.1 869104..870333(+) (clpX) [Streptococcus canis strain HL_77_1]
ATGGCAGGAAATCGTACTAACGATATTAAGGTATATTGTTCATTTTGTGGCAAAAGCCAAGATGAAGTCAAAAAAATCAT
TGCAGGGAACAACGTTTTTATTTGTAATGAATGTGTAGCCCTATCACAGGAAATTATCAAAGAAGAATTGGCAGAAGAAG
TACTGGCTGATTTAACAGAGGTTCCAAAACCAAAAGAACTTCTTAATATTTTAAACCAATACGTGGTTGGTCAAGACCGT
GCCAAGCGGGCCTTAGCAGTTGCAGTCTATAACCATTACAAGCGTGTTTCATTTGCTGAAAGCCGTGATGACGAAGATGT
TGATTTGCAAAAATCAAATATTTTAATGATTGGCCCAACAGGCTCTGGAAAGACGTTTTTGGCACAGACATTGGCTAAGA
GTTTAAATGTGCCTTTTGCTATTGCAGATGCTACCTCCTTAACAGAAGCTGGATATGTGGGAGAGGACGTTGAAAATATC
CTTCTCAAACTGATTCAAGCAGCGGATTATAATGTGGAGCGAGCAGAGCGTGGCATCATCTATGTTGATGAAATTGATAA
GATTGCTAAAAAAGGTGAGAATGTGTCCATCACACGAGATGTGTCAGGCGAGGGAGTACAACAGGCTCTGTTAAAAATTA
TTGAAGGTACTGTTGCGAGTGTTCCTCCTCAAGGTGGGCGGAAGCATCCTAATCAGGAAATGATTCAAATTGATACAAAA
AATATCTTATTCATTGTTGGTGGGGCATTTGATGGCATTGAGGAGATTGTCAAACAGCGTCTAGGGGAAAAAATCATTGG
TTTTGGTCAAAACAGCCGTAAAATTGATGATAATGCTTCTTACATGCAAGAAATTATATCAGAAGATATCCAAAAATTTG
GTCTTATTCCTGAATTTATTGGCCGTTTACCAGTTGTGGCAGCCCTTGAACAGCTTAACACCTCAGACTTGATCCAAATA
TTAACAGAGCCAAGAAATGCTCTTGTTAAACAATACCAAGCCTTGTTATCTTATGATGGTGTTGAATTAGAATTTGAGAA
GGGTGCCCTTGAAGCCATTGCAAGTAAAGCGATTGAACGCAAAACAGGAGCACGTGGTCTCCGTTCTATCATTGAAGAAA
CCATGTTGGATATTATGTTTGAAGTGCCAAGTCAAGAAGATGTGACTAAAGTTCGTGTTACCAAAGCTGCAGTTGAAGGC
AAGTCAAAACCCGTTTTAGAAACAGCATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A3P5Y7D1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpX Streptococcus mutans UA159

89.242

100

0.892

  clpX Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819

58.354

98.044

0.572