Detailed information    

insolico Bioinformatically predicted

Overview


Name   clpX   Type   Regulator
Locus tag   AAA985_RS07865 Genome accession   NZ_AP028603
Coordinates   1510801..1512033 (-) Length   410 a.a.
NCBI ID   WP_000106346.1    Uniprot ID   A0A064C077
Organism   Streptococcus pneumoniae strain Pne2     
Function   require for competence development (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1465734..1520031 1510801..1512033 within 0


Gene organization within MGE regions


Location: 1465734..1520031
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AAA985_RS07545 (TKY181970_14990) - 1465734..1466702 (-) 969 WP_338168205.1 N-acetylmuramoyl-L-alanine amidase family protein -
  AAA985_RS07550 (TKY181970_15000) - 1466706..1467038 (-) 333 WP_001186242.1 phage holin -
  AAA985_RS07555 (TKY181970_15010) - 1467042..1467458 (-) 417 WP_050211460.1 phage holin family protein -
  AAA985_RS07560 (TKY181970_15020) - 1467468..1467818 (-) 351 WP_000852249.1 hypothetical protein -
  AAA985_RS07565 (TKY181970_15030) - 1467821..1468024 (-) 204 WP_001091109.1 hypothetical protein -
  AAA985_RS07570 - 1468005..1468121 (-) 117 WP_001063632.1 hypothetical protein -
  AAA985_RS07575 (TKY181970_15040) - 1468118..1474561 (-) 6444 WP_338168206.1 tail fiber domain-containing protein -
  AAA985_RS07580 (TKY181970_15050) - 1474566..1474916 (-) 351 WP_000068032.1 DUF6711 family protein -
  AAA985_RS07585 (TKY181970_15060) - 1474925..1478578 (-) 3654 WP_000212167.1 hypothetical protein -
  AAA985_RS07590 (TKY181970_15070) - 1478565..1478915 (-) 351 WP_000478011.1 hypothetical protein -
  AAA985_RS07595 (TKY181970_15080) - 1478954..1479334 (-) 381 WP_001185636.1 DUF6096 family protein -
  AAA985_RS07600 (TKY181970_15090) - 1479339..1479752 (-) 414 WP_000880674.1 phage tail tube protein -
  AAA985_RS07605 (TKY181970_15100) - 1479755..1480123 (-) 369 WP_000608235.1 hypothetical protein -
  AAA985_RS07610 (TKY181970_15110) - 1480120..1480635 (-) 516 WP_001292567.1 HK97-gp10 family putative phage morphogenesis protein -
  AAA985_RS07615 (TKY181970_15120) - 1480610..1480948 (-) 339 WP_000478942.1 hypothetical protein -
  AAA985_RS07620 (TKY181970_15130) - 1480929..1481240 (-) 312 WP_000021221.1 phage head-tail connector protein -
  AAA985_RS07625 (TKY181970_15140) - 1481242..1481430 (-) 189 WP_000669352.1 hypothetical protein -
  AAA985_RS07630 (TKY181970_15150) - 1481420..1481602 (-) 183 WP_000054936.1 Rho termination factor N-terminal domain-containing protein -
  AAA985_RS07635 (TKY181970_15160) - 1481607..1482452 (-) 846 WP_000123893.1 N4-gp56 family major capsid protein -
  AAA985_RS07640 (TKY181970_15170) - 1482458..1483027 (-) 570 WP_001001557.1 DUF4355 domain-containing protein -
  AAA985_RS07645 (TKY181970_15180) - 1483230..1483481 (-) 252 WP_000913247.1 DUF6275 family protein -
  AAA985_RS07650 (TKY181970_15190) - 1483483..1483728 (-) 246 WP_000877357.1 hypothetical protein -
  AAA985_RS07655 (TKY181970_15200) - 1483771..1484184 (-) 414 WP_000565276.1 HD domain-containing protein -
  AAA985_RS07660 (TKY181970_15210) - 1484181..1484390 (-) 210 WP_000651747.1 hypothetical protein -
  AAA985_RS07665 (TKY181970_15220) - 1484392..1486029 (-) 1638 WP_179855908.1 minor capsid protein -
  AAA985_RS07670 (TKY181970_15230) - 1485938..1487407 (-) 1470 WP_000285396.1 phage portal protein -
  AAA985_RS07675 (TKY181970_15240) - 1487419..1488717 (-) 1299 WP_000084430.1 PBSX family phage terminase large subunit -
  AAA985_RS07680 (TKY181970_15250) - 1488695..1489195 (-) 501 WP_012677066.1 terminase small subunit -
  AAA985_RS07690 (TKY181970_15260) - 1489587..1489991 (-) 405 WP_001030243.1 DUF1492 domain-containing protein -
  AAA985_RS07695 (TKY181970_15270) - 1490062..1490433 (-) 372 WP_001247152.1 hypothetical protein -
  AAA985_RS07700 (TKY181970_15280) - 1490430..1490642 (-) 213 WP_000160160.1 hypothetical protein -
  AAA985_RS07705 (TKY181970_15290) - 1490643..1491143 (-) 501 WP_001021770.1 DUF1642 domain-containing protein -
  AAA985_RS07710 (TKY181970_15300) - 1491145..1491459 (-) 315 WP_000391841.1 hypothetical protein -
  AAA985_RS07715 (TKY181970_15310) - 1491610..1491948 (-) 339 WP_000119452.1 hypothetical protein -
  AAA985_RS07720 (TKY181970_15320) - 1491948..1492100 (-) 153 WP_000122746.1 hypothetical protein -
  AAA985_RS07725 (TKY181970_15330) - 1492394..1493737 (-) 1344 WP_000580027.1 virulence-associated E family protein -
  AAA985_RS07730 (TKY181970_15340) - 1493740..1494561 (-) 822 WP_001838318.1 bifunctional DNA primase/polymerase -
  AAA985_RS07735 (TKY181970_15350) - 1494734..1495237 (-) 504 WP_000054338.1 hypothetical protein -
  AAA985_RS07740 (TKY181970_15360) - 1495257..1496033 (-) 777 WP_000885998.1 AAA family ATPase -
  AAA985_RS07745 (TKY181970_15370) - 1496033..1496350 (-) 318 WP_001044592.1 hypothetical protein -
  AAA985_RS07750 (TKY181970_15380) - 1496369..1497562 (-) 1194 WP_000028860.1 DEAD/DEAH box helicase family protein -
  AAA985_RS07755 (TKY181970_15390) - 1497525..1498010 (-) 486 WP_000696056.1 hypothetical protein -
  AAA985_RS07760 (TKY181970_15400) - 1498010..1498369 (-) 360 WP_000422309.1 hypothetical protein -
  AAA985_RS07765 (TKY181970_15410) - 1498369..1498851 (-) 483 WP_000157807.1 siphovirus Gp157 family protein -
  AAA985_RS07770 (TKY181970_15420) - 1498844..1499566 (-) 723 WP_000544343.1 HNH endonuclease signature motif containing protein -
  AAA985_RS07775 (TKY181970_15430) - 1499580..1499810 (-) 231 WP_000252080.1 hypothetical protein -
  AAA985_RS07780 (TKY181970_15440) - 1499811..1500152 (-) 342 WP_000189405.1 hypothetical protein -
  AAA985_RS07785 (TKY181970_15450) - 1500164..1500355 (-) 192 WP_000970850.1 hypothetical protein -
  AAA985_RS07790 (TKY181970_15460) - 1500352..1500546 (-) 195 WP_001050924.1 hypothetical protein -
  AAA985_RS07795 (TKY181970_15470) - 1500543..1500815 (-) 273 WP_000450967.1 hypothetical protein -
  AAA985_RS07800 (TKY181970_15480) - 1500958..1501161 (+) 204 WP_128969726.1 hypothetical protein -
  AAA985_RS07805 (TKY181970_15500) - 1501626..1501820 (-) 195 WP_029680340.1 helix-turn-helix transcriptional regulator -
  AAA985_RS07810 (TKY181970_15510) - 1501854..1502039 (-) 186 WP_023396929.1 helix-turn-helix transcriptional regulator -
  AAA985_RS07815 (TKY181970_15520) - 1502187..1502378 (+) 192 WP_000834623.1 hypothetical protein -
  AAA985_RS07820 (TKY181970_15540) - 1502595..1503284 (+) 690 WP_000577241.1 DUF4145 domain-containing protein -
  AAA985_RS07825 (TKY181970_15550) - 1503546..1504313 (+) 768 WP_001859593.1 S24 family peptidase -
  AAA985_RS07830 (TKY181970_15560) - 1504335..1505405 (+) 1071 WP_000420652.1 hypothetical protein -
  AAA985_RS07835 (TKY181970_15570) - 1505775..1506902 (+) 1128 WP_000266845.1 site-specific integrase -
  AAA985_RS07840 (TKY181970_15580) whiA 1506990..1507901 (-) 912 WP_000011306.1 DNA-binding protein WhiA -
  AAA985_RS07845 (TKY181970_15590) - 1507898..1508875 (-) 978 WP_001231086.1 YvcK family protein -
  AAA985_RS07850 (TKY181970_15600) rapZ 1508872..1509762 (-) 891 WP_000163033.1 RNase adapter RapZ -
  AAA985_RS07855 (TKY181970_15610) - 1509814..1510194 (-) 381 WP_001140412.1 RidA family protein -
  AAA985_RS07860 (TKY181970_15620) yihA 1510205..1510792 (-) 588 WP_000422599.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  AAA985_RS07865 (TKY181970_15630) clpX 1510801..1512033 (-) 1233 WP_000106346.1 ATP-dependent Clp protease ATP-binding subunit ClpX Regulator
  AAA985_RS07870 - 1512065..1512235 (-) 171 WP_000442275.1 hypothetical protein -
  AAA985_RS07875 (TKY181970_15640) - 1512235..1512741 (-) 507 WP_000162484.1 dihydrofolate reductase -
  AAA985_RS07880 (TKY181970_15650) - 1512871..1513389 (-) 519 WP_000229874.1 Dps family protein -
  AAA985_RS07885 (TKY181970_15660) - 1513963..1515219 (+) 1257 WP_000436627.1 ISL3 family transposase -
  AAA985_RS07890 (TKY181970_15670) lytC 1515308..1516813 (-) 1506 WP_075575672.1 choline binding-anchored murein hydrolase LytC -
  AAA985_RS07895 (TKY181970_15680) tpiA 1516851..1517609 (-) 759 WP_000087897.1 triose-phosphate isomerase -
  AAA985_RS07900 (TKY181970_15690) - 1517707..1518384 (-) 678 WP_000221611.1 DnaD domain-containing protein -
  AAA985_RS07905 (TKY181970_15700) metA 1518393..1519337 (-) 945 WP_001122712.1 homoserine O-succinyltransferase -
  AAA985_RS07910 (TKY181970_15710) - 1519519..1520031 (-) 513 WP_001049323.1 adenine phosphoribosyltransferase -

Sequence


Protein


Download         Length: 410 a.a.        Molecular weight: 45798.35 Da        Isoelectric Point: 4.4550

>NTDB_id=105971 AAA985_RS07865 WP_000106346.1 1510801..1512033(-) (clpX) [Streptococcus pneumoniae strain Pne2]
MSTNRKNDMMVYCSFCGKNQEEVQKIIAGNNAFICNECVELAQEIIREELVEEVLADLSEVPKPIELLHILNHYVIGQDR
AKRALAVAVYNHYKRINFHDTREESEDVDLQKSNILMIGPTGSGKTFLAQTLAKSLNVPFAIADATALTEAGYVGEDVEN
ILLKLLQVADFNIERAERGIIYVDEIDKIAKKSENVSITRDVSGEGVQQALLKIIEGTVASVPPQGGRKHPQQEMIQVDT
KNILFIVGGAFDGIEEIVKQRLGEKVIGFGQNNKAIDENSSYMQEIIAEDIQKFGIIPELIGRLPVFAALEQLTVDDLVR
ILKEPRNALVKQYQTLLSYDDVELEFDDEALQEIANKAIERKTGARGLRSIIEETMLDVMFEVPSQENVKLVRITKETVD
GTDKPILETA

Nucleotide


Download         Length: 1233 bp        

>NTDB_id=105971 AAA985_RS07865 WP_000106346.1 1510801..1512033(-) (clpX) [Streptococcus pneumoniae strain Pne2]
ATGTCTACAAATAGAAAAAATGATATGATGGTTTATTGCTCATTTTGTGGCAAAAACCAAGAAGAAGTACAAAAAATAAT
TGCTGGCAACAATGCTTTTATTTGTAATGAATGCGTGGAGTTAGCTCAGGAAATCATTCGAGAAGAATTGGTTGAGGAAG
TCTTGGCAGACTTGTCTGAGGTGCCAAAACCAATTGAACTCCTCCATATCTTGAACCACTATGTAATTGGTCAAGATCGT
GCCAAGCGTGCCTTGGCAGTGGCGGTTTATAACCACTACAAACGCATCAATTTCCACGATACACGCGAAGAGTCAGAAGA
TGTGGATTTGCAGAAGTCAAACATTTTGATGATTGGCCCAACTGGTTCAGGGAAAACTTTCCTTGCCCAGACCTTGGCTA
AGAGCTTGAATGTACCTTTTGCTATTGCGGATGCGACAGCTCTGACGGAGGCTGGTTATGTGGGTGAGGATGTGGAAAAT
ATCCTCCTCAAACTCTTGCAGGTTGCTGACTTTAACATCGAACGTGCAGAGCGTGGCATTATCTATGTGGATGAAATTGA
CAAGATTGCCAAGAAGAGTGAGAATGTGTCTATCACACGTGATGTTTCTGGTGAAGGGGTGCAACAAGCCCTTCTCAAGA
TTATCGAGGGAACTGTTGCTAGCGTACCGCCTCAAGGTGGACGCAAACATCCACAACAAGAGATGATTCAAGTGGATACA
AAAAATATCCTCTTCATCGTGGGTGGTGCTTTTGATGGTATTGAAGAAATTGTCAAACAACGTCTGGGTGAAAAAGTCAT
CGGATTTGGTCAAAACAATAAGGCGATTGACGAAAACAGCTCATACATGCAAGAAATCATCGCTGAAGACATTCAAAAAT
TTGGTATTATCCCTGAGTTGATTGGACGCTTGCCTGTTTTTGCGGCTCTTGAGCAATTGACCGTTGATGACTTGGTTCGC
ATCTTGAAAGAGCCAAGAAATGCCTTGGTGAAACAATACCAAACCTTGCTTTCTTATGATGATGTTGAGTTGGAATTTGA
CGACGAAGCCCTTCAAGAGATTGCTAATAAAGCAATCGAACGGAAGACAGGGGCGCGTGGACTTCGCTCCATCATCGAAG
AAACCATGCTAGATGTTATGTTTGAGGTGCCGAGTCAGGAAAATGTGAAATTGGTTCGCATCACTAAAGAAACTGTCGAT
GGAACGGATAAACCGATCCTAGAAACAGCCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A064C077

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  clpX Streptococcus mutans UA159

86.585

100

0.866

  clpX Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819

57.214

98.049

0.561


Multiple sequence alignment