Detailed information
Overview
| Field | Value |
|---|---|
| Name | comX/comX2 |
| Type | Regulator |
| Locus tag | R8623_RS00070 |
| Genome accession | NZ_AP026928 |
| Coordinates | 14255..14734 (+) |
| Length | 159 a.a. |
| NCBI ID | WP_000588925.1 |
| Uniprot ID | A0AAX3HEV8 |
| Organism | Streptococcus pneumoniae strain PZ900700406 |
| Function | activate transcription of late competence genes (predicted from homology); competence regulation |
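The coordinates and the protein length in the overview can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names `parse_coords` and `cds_lengths` are hypothetical, and it assumes inclusive, 1-based coordinates and a coding sequence that ends in a single stop codon.

```python
import re

def parse_coords(text):
    """Parse a string such as '14255..14734 (+)' into (start, end, strand)."""
    match = re.fullmatch(r"(\d+)\.\.(\d+)\s*\(([+-])\)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)

def cds_lengths(start, end):
    """Return (nucleotides, amino acids) for an inclusive, 1-based interval.

    Assumes the interval covers a complete CDS whose last codon is a stop,
    so the protein is one residue shorter than the codon count.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("interval length is not a multiple of three")
    return nucleotides, nucleotides // 3 - 1

start, end, strand = parse_coords("14255..14734 (+)")
print(cds_lengths(start, end))
```

For this entry the interval spans 480 bp, which is 160 codons including the stop, consistent with the listed length of 159 a.a.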
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 12175..44018 | 14255..14734 | within | 0 |
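The relative position and distance columns can be read as a simple interval comparison. The following is a minimal sketch, not VRprofile2's actual method: the function name `relate_gene_to_mge` is hypothetical, the labels other than `within` do not appear in this record, and the distance is assumed to be the gap in bp between the nearest edges.

```python
def relate_gene_to_mge(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples with inclusive coordinates.
    Returns (relative_position, distance_bp). Only 'within' is taken from
    the record; the other labels are illustrative assumptions.
    """
    gene_start, gene_end = gene
    mge_start, mge_end = mge
    if mge_start <= gene_start and gene_end <= mge_end:
        return "within", 0
    if gene_end < mge_start:
        return "upstream", mge_start - gene_end
    if gene_start > mge_end:
        return "downstream", gene_start - mge_end
    return "overlapping", 0

# comX/comX2 (14255..14734) against the predicted prophage (12175..44018)
print(relate_gene_to_mge((14255, 14734), (12175, 44018)))
```

With the values from the table above, the gene lies entirely inside the prophage interval, so the sketch reports `within` with a distance of 0.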
Gene organization within MGE regions
Location: 12175..44018
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R8623_RS00065 (PC0401_00130) | ftsH | 12175..14133 (+) | 1959 | WP_000744557.1 | ATP-dependent zinc metalloprotease FtsH | - |
| R8623_RS00070 (PC0401_00140) | comX/comX2 | 14255..14734 (+) | 480 | WP_000588925.1 | sigma-70 family RNA polymerase sigma factor | Regulator |
| R8623_RS00105 | - | 20225..21071 (+) | 847 | Protein_14 | IS630 family transposase | - |
| R8623_RS00110 | - | 21106..21903 (-) | 798 | Protein_15 | transposase | - |
| R8623_RS00115 (PC0401_00180) | comW | 22169..22405 (+) | 237 | WP_000939544.1 | sigma(X)-activator ComW | Regulator |
| R8623_RS00120 (PC0401_00190) | - | 22636..23922 (+) | 1287 | WP_000205044.1 | adenylosuccinate synthase | - |
| R8623_RS00125 (PC0401_00200) | tadA | 24123..24590 (+) | 468 | WP_000291870.1 | tRNA adenosine(34) deaminase TadA | - |
| R8623_RS00135 (PC0401_00210) | - | 24799..25497 (-) | 699 | WP_001106362.1 | site-specific integrase | - |
| R8623_RS00140 (PC0401_00220) | - | 25587..25940 (-) | 354 | WP_001814135.1 | hypothetical protein | - |
| R8623_RS00145 (PC0401_00230) | - | 25995..27059 (-) | 1065 | WP_061374999.1 | type I restriction endonuclease | - |
| R8623_RS00150 (PC0401_00240) | - | 27076..27456 (-) | 381 | WP_000170931.1 | ImmA/IrrE family metallo-endopeptidase | - |
| R8623_RS00155 (PC0401_00250) | - | 27469..27732 (-) | 264 | WP_000285962.1 | type II toxin-antitoxin system RelE/ParE family toxin | - |
| R8623_RS00160 (PC0401_00260) | - | 27732..27965 (-) | 234 | WP_000156419.1 | hypothetical protein | - |
| R8623_RS00165 (PC0401_00270) | - | 27965..28333 (-) | 369 | WP_000464160.1 | helix-turn-helix domain-containing protein | - |
| R8623_RS00170 (PC0401_00290) | - | 28905..29096 (+) | 192 | WP_001112859.1 | DNA-binding protein | - |
| R8623_RS00175 (PC0401_00300) | - | 29119..29322 (+) | 204 | WP_001247549.1 | hypothetical protein | - |
| R8623_RS00180 (PC0401_00320) | - | 29477..29644 (-) | 168 | WP_000024181.1 | YjzC family protein | - |
| R8623_RS00185 (PC0401_00330) | - | 29649..30029 (+) | 381 | Protein_29 | autolysin | - |
| R8623_RS00190 (PC0401_00340) | - | 30249..30428 (-) | 180 | WP_001209433.1 | hypothetical protein | - |
| R8623_RS00195 (PC0401_00350) | - | 30570..30719 (-) | 150 | WP_001030863.1 | hypothetical protein | - |
| R8623_RS00200 (PC0401_00360) | - | 31024..31467 (+) | 444 | WP_000701992.1 | dUTP diphosphatase | - |
| R8623_RS00205 (PC0401_00370) | - | 31469..31984 (+) | 516 | WP_000691236.1 | histidine phosphatase family protein | - |
| R8623_RS00210 (PC0401_00380) | radA | 31998..33359 (+) | 1362 | WP_075213698.1 | DNA repair protein RadA | Machinery gene |
| R8623_RS00215 (PC0401_00390) | - | 33432..33929 (+) | 498 | WP_001809263.1 | carbonic anhydrase | - |
| R8623_RS00220 (PC0401_00400) | - | 33954..34737 (+) | 784 | Protein_36 | PrsW family glutamic-type intramembrane protease | - |
| R8623_RS00225 (PC0401_00410) | - | 34882..35850 (+) | 969 | WP_000010157.1 | ribose-phosphate diphosphokinase | - |
| R8623_RS00230 | - | 35984..36265 (-) | 282 | Protein_38 | ISL3 family transposase | - |
| R8623_RS00235 | - | 36392..37274 (-) | 883 | Protein_39 | Rpn family recombination-promoting nuclease/putative transposase | - |
| R8623_RS00240 (PC0401_00440) | polA | 37530..40163 (+) | 2634 | WP_317649668.1 | DNA polymerase I | - |
| R8623_RS00245 (PC0401_00450) | - | 40248..40685 (+) | 438 | WP_000076483.1 | CoA-binding protein | - |
| R8623_RS00250 (PC0401_00460) | - | 40923..41933 (-) | 1011 | WP_000009141.1 | YeiH family protein | - |
| R8623_RS00255 (PC0401_00480) | - | 42082..43251 (+) | 1170 | WP_000366376.1 | pyridoxal phosphate-dependent aminotransferase | - |
| R8623_RS00260 (PC0401_00490) | recO | 43248..44018 (+) | 771 | WP_000616164.1 | DNA repair protein RecO | - |
Sequence
Protein
Length: 159 a.a.; Molecular weight: 19887.54 Da; Isoelectric point: 7.3798
>NTDB_id=98557 R8623_RS00070 WP_000588925.1 14255..14734(+) (comX/comX2) [Streptococcus pneumoniae strain PZ900700406]
MIKELYEEVQGTVYKCRNEYYLHLWELSDWEQEGMLCLHELISREEGLVDDIPRLRKYFKTKFRNRILDYIRKQESQKRR
YDKEPYEEVGEISHRISEGGLWLDDYYLFHETLRDYRNKQSKEKQEELERVLSNERFRGRQRVLRDLRIVFKEFTIRTH
Nucleotide
Length: 480 bp
>NTDB_id=98557 R8623_RS00070 WP_000588925.1 14255..14734(+) (comX/comX2) [Streptococcus pneumoniae strain PZ900700406]
ATGATTAAAGAATTGTATGAAGAAGTCCAAGGGACTGTGTATAAGTGTAGAAATGAATATTACCTTCATTTATGGGAATT
GTCGGATTGGGAGCAAGAAGGCATGCTCTGCTTACATGAATTGATTAGTAGAGAAGAAGGACTGGTAGACGATATTCCAC
GTTTAAGGAAATATTTCAAGACCAAGTTTCGAAATCGAATTTTAGACTATATCCGTAAACAGGAAAGTCAGAAGCGTAGA
TACGATAAAGAACCCTATGAAGAAGTGGGTGAGATCAGTCATCGTATAAGTGAGGGGGGTCTCTGGCTAGATGATTATTA
TCTCTTTCATGAAACACTAAGAGATTATAGAAACAAACAAAGTAAAGAGAAACAAGAAGAACTAGAACGCGTCTTAAGCA
ATGAACGATTTCGAGGGCGTCAAAGAGTATTAAGAGACTTACGCATTGTGTTTAAGGAGTTTACTATCCGTACCCATTAG
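The nucleotide and protein records can be checked against each other by translating the coding sequence. This is a minimal sketch assuming the standard codon assignments (which the bacterial translation table shares for sense and stop codons); the `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(nucleotides):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    sequence = "".join(nucleotides.split()).upper()  # drop line breaks
    protein = []
    for i in range(0, len(sequence) - len(sequence) % 3, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First five codons of the record above
print(translate("ATGATTAAAGAATTG"))
```

Applied to the first codons of the nucleotide record, the sketch yields `MIKEL`, which matches the start of the protein sequence; the final codon `TAG` is a stop and is not translated.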
Domains
No domain identified.
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.