Detailed information    

insolico Bioinformatically predicted

Overview


Name   codY   Type   Regulator
Locus tag   QZH34_RS18970 Genome accession   NZ_CP135587
Coordinates   3642966..3643745 (-) Length   259 a.a.
NCBI ID   WP_000421288.1    Uniprot ID   Q81WK7
Organism   Bacillus anthracis strain BA20200413YY     
Function   repression of comK (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 3608644..3682567 3642966..3643745 within 0


Gene organization within MGE regions


Location: 3608644..3682567
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QZH34_RS18820 (QZH34_18780) - 3608988..3610658 (-) 1671 WP_000823085.1 ribonuclease J -
  QZH34_RS18825 (QZH34_18785) dapA 3611424..3612302 (-) 879 WP_000564761.1 4-hydroxy-tetrahydrodipicolinate synthase -
  QZH34_RS18830 (QZH34_18790) dapG 3612314..3613546 (-) 1233 WP_000692470.1 aspartate kinase -
  QZH34_RS18835 (QZH34_18795) asd 3613570..3614616 (-) 1047 WP_000414849.1 aspartate-semialdehyde dehydrogenase -
  QZH34_RS18840 (QZH34_18800) dpaB 3614767..3615366 (-) 600 WP_001049484.1 dipicolinate synthase subunit B -
  QZH34_RS18845 (QZH34_18805) dpaA 3615363..3616265 (-) 903 WP_000954726.1 dipicolinic acid synthetase subunit A -
  QZH34_RS18850 (QZH34_18810) - 3616540..3616791 (-) 252 WP_001239752.1 YlmC/YmxH family sporulation protein -
  QZH34_RS18855 (QZH34_18815) - 3616918..3618159 (-) 1242 WP_000592993.1 pitrilysin family protein -
  QZH34_RS18860 (QZH34_18820) - 3618246..3619145 (-) 900 WP_000647529.1 polysaccharide deacetylase family protein -
  QZH34_RS18865 (QZH34_18825) pnp 3619297..3621435 (-) 2139 WP_000076733.1 polyribonucleotide nucleotidyltransferase -
  QZH34_RS18870 (QZH34_18830) rpsO 3621596..3621865 (-) 270 WP_001229392.1 30S ribosomal protein S15 -
  QZH34_RS18875 (QZH34_18835) ribF 3621966..3622937 (-) 972 WP_000766711.1 bifunctional riboflavin kinase/FAD synthetase -
  QZH34_RS18880 (QZH34_18840) truB 3622981..3623904 (-) 924 WP_404978092.1 tRNA pseudouridine(55) synthase TruB -
  QZH34_RS18885 (QZH34_18845) rbfA 3623991..3624347 (-) 357 WP_000776446.1 30S ribosome-binding factor RbfA -
  QZH34_RS18890 (QZH34_18850) - 3624363..3624644 (-) 282 WP_000582364.1 DUF503 domain-containing protein -
  QZH34_RS18895 (QZH34_18855) infB 3624641..3626701 (-) 2061 WP_000036339.1 translation initiation factor IF-2 -
  QZH34_RS18900 (QZH34_18860) - 3626706..3627017 (-) 312 WP_001286523.1 YlxQ family RNA-binding protein -
  QZH34_RS18905 (QZH34_18865) rnpM 3627018..3627290 (-) 273 WP_000071128.1 RNase P modulator RnpM -
  QZH34_RS18910 (QZH34_18870) nusA 3627302..3628408 (-) 1107 WP_000102609.1 transcription termination factor NusA -
  QZH34_RS18915 (QZH34_18875) rimP 3628426..3628896 (-) 471 WP_000359097.1 ribosome maturation factor RimP -
  QZH34_RS18920 (QZH34_18880) - 3629229..3633530 (-) 4302 WP_000059985.1 PolC-type DNA polymerase III -
  QZH34_RS18925 (QZH34_18885) - 3633655..3635355 (-) 1701 WP_000814312.1 proline--tRNA ligase -
  QZH34_RS18930 (QZH34_18890) rseP 3635465..3636721 (-) 1257 WP_001090228.1 RIP metalloprotease RseP -
  QZH34_RS18935 (QZH34_18895) dxr 3636738..3637880 (-) 1143 WP_000790359.1 1-deoxy-D-xylulose-5-phosphate reductoisomerase -
  QZH34_RS18940 (QZH34_18900) cdsA 3637904..3638695 (-) 792 WP_000813581.1 phosphatidate cytidylyltransferase -
  QZH34_RS18945 (QZH34_18905) uppS 3638713..3639489 (-) 777 WP_000971303.1 isoprenyl transferase -
  QZH34_RS18950 (QZH34_18910) frr 3639575..3640132 (-) 558 WP_000531498.1 ribosome recycling factor -
  QZH34_RS18955 (QZH34_18915) pyrH 3640135..3640857 (-) 723 WP_000042663.1 UMP kinase -
  QZH34_RS18960 (QZH34_18920) tsf 3640924..3641811 (-) 888 WP_001018581.1 translation elongation factor Ts -
  QZH34_RS18965 (QZH34_18925) rpsB 3641915..3642616 (-) 702 WP_000111483.1 30S ribosomal protein S2 -
  QZH34_RS18970 (QZH34_18930) codY 3642966..3643745 (-) 780 WP_000421288.1 GTP-sensing pleiotropic transcriptional regulator CodY Regulator
  QZH34_RS18975 (QZH34_18935) hslU 3643823..3645214 (-) 1392 WP_000550088.1 ATP-dependent protease ATPase subunit HslU -
  QZH34_RS18980 (QZH34_18940) hslV 3645237..3645779 (-) 543 WP_000526272.1 ATP-dependent protease proteolytic subunit HslV -
  QZH34_RS18985 (QZH34_18945) xerC 3645822..3646721 (-) 900 WP_001101226.1 tyrosine recombinase XerC -
  QZH34_RS18990 (QZH34_18950) trmFO 3646787..3648091 (-) 1305 WP_003161605.1 FADH(2)-oxidizing methylenetetrahydrofolate--tRNA-(uracil(54)-C(5))- methyltransferase TrmFO -
  QZH34_RS18995 (QZH34_18955) topA 3648142..3650220 (-) 2079 WP_001286969.1 type I DNA topoisomerase -
  QZH34_RS19000 (QZH34_18960) dprA 3650365..3651113 (-) 749 Protein_3705 DNA-processing protein DprA -
  QZH34_RS19005 (QZH34_18965) sucD 3651321..3652223 (-) 903 WP_000115178.1 succinate--CoA ligase subunit alpha -
  QZH34_RS19010 (QZH34_18970) sucC 3652244..3653404 (-) 1161 WP_001020786.1 ADP-forming succinate--CoA ligase subunit beta -
  QZH34_RS19015 (QZH34_18975) rnhB 3653598..3654371 (-) 774 WP_001174712.1 ribonuclease HII -
  QZH34_RS19020 (QZH34_18980) ylqF 3654423..3655313 (-) 891 WP_000236707.1 ribosome biogenesis GTPase YlqF -
  QZH34_RS19025 (QZH34_18985) lepB 3655334..3655885 (-) 552 WP_000711857.1 signal peptidase I -
  QZH34_RS19030 (QZH34_18990) rplS 3655987..3656331 (-) 345 WP_001186516.1 50S ribosomal protein L19 -
  QZH34_RS19035 (QZH34_18995) trmD 3656478..3657212 (-) 735 WP_000686892.1 tRNA (guanosine(37)-N1)-methyltransferase TrmD -
  QZH34_RS19040 (QZH34_19000) rimM 3657212..3657727 (-) 516 WP_000170269.1 ribosome maturation factor RimM -
  QZH34_RS19045 (QZH34_19005) - 3657848..3658075 (-) 228 WP_000737398.1 KH domain-containing protein -
  QZH34_RS19050 (QZH34_19010) rpsP 3658090..3658362 (-) 273 WP_000268750.1 30S ribosomal protein S16 -
  QZH34_RS19055 (QZH34_19015) ffh 3658463..3659812 (-) 1350 WP_000863456.1 signal recognition particle protein -
  QZH34_RS19060 (QZH34_19020) - 3659825..3660157 (-) 333 WP_000891062.1 putative DNA-binding protein -
  QZH34_RS19065 (QZH34_19025) ftsY 3660290..3661279 (-) 990 WP_000007650.1 signal recognition particle-docking protein FtsY -
  QZH34_RS19070 (QZH34_19030) smc 3661295..3664864 (-) 3570 WP_000478985.1 chromosome segregation protein SMC -
  QZH34_RS19075 (QZH34_19035) rncS 3665011..3665748 (-) 738 WP_001146873.1 ribonuclease III -
  QZH34_RS19080 (QZH34_19040) acpP 3665807..3666040 (-) 234 WP_000786062.1 acyl carrier protein -
  QZH34_RS19085 (QZH34_19045) fabG 3666110..3666850 (-) 741 WP_000911782.1 3-oxoacyl-[acyl-carrier-protein] reductase -
  QZH34_RS19090 (QZH34_19050) fabD 3666850..3667794 (-) 945 WP_000516958.1 ACP S-malonyltransferase -
  QZH34_RS19095 (QZH34_19055) plsX 3667809..3668801 (-) 993 WP_000684111.1 phosphate acyltransferase PlsX -
  QZH34_RS19100 (QZH34_19060) fapR 3668798..3669391 (-) 594 WP_000747352.1 transcription factor FapR -
  QZH34_RS19105 (QZH34_19065) recG 3669480..3671528 (-) 2049 WP_001006884.1 ATP-dependent DNA helicase RecG -
  QZH34_RS19110 (QZH34_19070) - 3671818..3673494 (-) 1677 WP_000027136.1 DAK2 domain-containing protein -
  QZH34_RS19115 (QZH34_19075) - 3673517..3673879 (-) 363 WP_000021109.1 Asp23/Gls24 family envelope stress response protein -
  QZH34_RS19120 (QZH34_19080) rpmB 3674258..3674446 (+) 189 WP_000124778.1 50S ribosomal protein L28 -
  QZH34_RS19125 (QZH34_19085) spoVM 3674519..3674599 (-) 81 WP_001213599.1 stage V sporulation protein SpoVM -
  QZH34_RS19130 (QZH34_19090) - 3674666..3675346 (-) 681 WP_025388434.1 thiamine diphosphokinase -
  QZH34_RS19135 (QZH34_19095) rpe 3675446..3676090 (-) 645 WP_000589966.1 ribulose-phosphate 3-epimerase -
  QZH34_RS19140 (QZH34_19100) rsgA 3676093..3676974 (-) 882 WP_001113935.1 ribosome small subunit-dependent GTPase A -
  QZH34_RS19145 (QZH34_19105) prkC 3677243..3679216 (-) 1974 WP_000904759.1 serine/threonine protein kinase PrkC -
  QZH34_RS19150 (QZH34_19110) - 3679225..3679977 (-) 753 WP_000648699.1 Stp1/IreP family PP2C-type Ser/Thr phosphatase -
  QZH34_RS19155 (QZH34_19115) rlmN 3679982..3681070 (-) 1089 WP_000450543.1 23S rRNA (adenine(2503)-C(2))-methyltransferase RlmN -
  QZH34_RS19160 (QZH34_19120) rsmB 3681075..3682409 (-) 1335 WP_001249680.1 16S rRNA (cytosine(967)-C(5))-methyltransferase RsmB -

Sequence


Protein


Download         Length: 259 a.a.        Molecular weight: 28774.05 Da        Isoelectric Point: 4.7947

>NTDB_id=886983 QZH34_RS18970 WP_000421288.1 3642966..3643745(-) (codY) [Bacillus anthracis strain BA20200413YY]
MELLAKTRKLNALLQSAAGKPVNFREMSDTMCEVIEANVFVVSRRGKLLGYAIHQQIENERMKQMLAERQFPEEYTQSLF
NITETSSNLDVNSAYTAFPVENKELFGQGLTTIVPIVGGGERLGTLVLARLGQEFLDDDLILAEYSSTVVGMEILREKAE
EIEEEARSKAVVQMAISSLSYSELEAIEHIFEELNGTEGLLVASKIADRVGITRSVIVNALRKLESAGVIESRSLGMKGT
YIKVLNDKFLHELAKLKTN

Nucleotide


Download         Length: 780 bp        

>NTDB_id=886983 QZH34_RS18970 WP_000421288.1 3642966..3643745(-) (codY) [Bacillus anthracis strain BA20200413YY]
ATGGAATTATTAGCAAAAACAAGAAAATTAAATGCGTTATTACAGAGCGCAGCAGGAAAGCCTGTAAACTTTAGAGAAAT
GTCTGACACAATGTGTGAAGTAATCGAAGCGAACGTATTCGTAGTAAGTCGTCGTGGTAAATTACTAGGTTATGCAATTC
ACCAACAAATCGAGAATGAGCGTATGAAACAAATGCTTGCAGAACGTCAATTCCCAGAAGAGTATACACAAAGCTTATTC
AACATTACAGAAACATCTTCAAACTTAGATGTAAACAGTGCTTACACAGCATTCCCAGTAGAAAATAAAGAATTATTTGG
TCAAGGTTTAACTACAATCGTACCGATCGTTGGTGGCGGTGAGCGTCTAGGTACATTAGTTTTAGCTCGTCTTGGTCAAG
AGTTCTTAGATGATGATTTAATTCTTGCTGAGTACAGCTCAACTGTTGTAGGTATGGAAATTTTACGTGAAAAAGCAGAA
GAAATCGAAGAAGAAGCACGTAGCAAAGCTGTTGTTCAAATGGCGATCAGCTCATTATCTTACAGTGAATTAGAAGCAAT
CGAGCACATCTTCGAAGAATTAAACGGAACAGAAGGTTTACTTGTTGCAAGTAAAATTGCTGACCGCGTAGGAATCACTC
GTTCGGTAATCGTAAATGCACTTCGTAAATTAGAAAGTGCTGGTGTAATTGAGTCGCGTTCTTTAGGTATGAAAGGAACA
TACATTAAAGTATTAAACGACAAATTCTTACATGAACTTGCTAAATTAAAAACAAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q81WK7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  codY Bacillus subtilis subsp. subtilis str. 168

81.081

100

0.811

  codY Lactococcus lactis subsp. lactis strain DGCC12653

46.667

98.456

0.459