Detailed information
Overview
| Name | comB | Type | Regulator |
| Locus tag | R8559_RS00465 | Genome accession | NZ_AP026926 |
| Coordinates | 79798..81147 (+) | Length | 449 a.a. |
| NCBI ID | WP_219599548.1 | Uniprot ID | - |
| Organism | Streptococcus pneumoniae strain PZ900700119 | | |
| Function | Transport of ComC (predicted from homology); competence regulation | | |
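The coordinates and length reported in the overview can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; the function name `gene_lengths` is illustrative and not part of the database.

```python
from typing import Tuple


def gene_lengths(start: int, end: int) -> Tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    nt_len = end - start + 1  # inclusive range
    if nt_len % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # the stop codon encodes no residue
    return nt_len, aa_len


# comB coordinates from the overview table: 79798..81147 (+)
print(gene_lengths(79798, 81147))  # (1350, 449)
```

Under these assumptions the result agrees with the 1350 bp nucleotide length and the 449 a.a. protein length listed for this entry.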
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 12174..91107 | 79798..81147 | within | 0 |
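The "within" call and the distance of 0 bp follow from comparing the two coordinate ranges. The sketch below is one plausible way to derive them, not VRprofile2's actual logic: the labels other than "within" (`upstream`, `downstream`, `overlapping`) and the distance definition (gap between the nearest ends) are assumptions made for illustration.

```python
from typing import Tuple

Interval = Tuple[int, int]  # (start, end), 1-based inclusive


def relative_position(gene: Interval, mge: Interval) -> Tuple[str, int]:
    """Classify a gene relative to an MGE region and return (label, distance in bp)."""
    g_start, g_end = gene
    m_start, m_end = mge
    if g_start >= m_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:  # gene lies entirely before the region
        return "upstream", m_start - g_end
    if g_start > m_end:  # gene lies entirely after the region
        return "downstream", g_start - m_end
    return "overlapping", 0  # partial overlap with a region boundary


# comB (79798..81147) against the predicted prophage region (12174..91107)
print(relative_position((79798, 81147), (12174, 91107)))  # ('within', 0)
```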
Gene organization within MGE regions
Location: 12174..91107
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| R8559_RS00065 (PC0116_00130) | ftsH | 12174..14132 (+) | 1959 | WP_000744557.1 | ATP-dependent zinc metalloprotease FtsH | - |
| R8559_RS00070 (PC0116_00140) | comX/comX2 | 14254..14733 (+) | 480 | WP_000588897.1 | sigma-70 family RNA polymerase sigma factor | Regulator |
| R8559_RS00105 | - | 20235..21016 (-) | 782 | Protein_14 | transposase | - |
| R8559_RS00110 (PC0116_00160) | comW | 21282..21518 (+) | 237 | WP_000939545.1 | sigma(X)-activator ComW | Regulator |
| R8559_RS00115 (PC0116_00170) | - | 21749..23035 (+) | 1287 | WP_000205044.1 | adenylosuccinate synthase | - |
| R8559_RS00120 (PC0116_00180) | - | 23299..24447 (-) | 1149 | WP_000876727.1 | site-specific integrase | - |
| R8559_RS00125 (PC0116_00190) | - | 24618..25220 (-) | 603 | WP_370597176.1 | HIRAN domain-containing protein | - |
| R8559_RS11540 | - | 25335..25532 (-) | 198 | Protein_19 | hypothetical protein | - |
| R8559_RS00130 (PC0116_00200) | - | 25545..26300 (-) | 756 | WP_000069476.1 | XRE family transcriptional regulator | - |
| R8559_RS00135 (PC0116_00210) | - | 26474..26680 (+) | 207 | WP_001171134.1 | helix-turn-helix transcriptional regulator | - |
| R8559_RS00140 (PC0116_00240) | - | 26988..27665 (-) | 678 | WP_000289447.1 | DUF4145 domain-containing protein | - |
| R8559_RS00145 (PC0116_00250) | - | 27720..28433 (+) | 714 | WP_001002349.1 | ORF6C domain-containing protein | - |
| R8559_RS00150 (PC0116_00260) | - | 28446..28703 (+) | 258 | WP_000370959.1 | hypothetical protein | - |
| R8559_RS00155 (PC0116_00270) | - | 28789..29109 (+) | 321 | WP_000462823.1 | hypothetical protein | - |
| R8559_RS00160 (PC0116_00280) | - | 29125..29421 (+) | 297 | WP_050241622.1 | hypothetical protein | - |
| R8559_RS00165 (PC0116_00290) | - | 29412..30284 (+) | 873 | WP_050241623.1 | phage replisome organizer N-terminal domain-containing protein | - |
| R8559_RS00170 (PC0116_00300) | - | 30357..30524 (+) | 168 | WP_000233203.1 | hypothetical protein | - |
| R8559_RS00175 (PC0116_00310) | - | 30511..30720 (+) | 210 | WP_050241624.1 | hypothetical protein | - |
| R8559_RS00180 (PC0116_00320) | - | 30695..31009 (+) | 315 | WP_050241625.1 | hypothetical protein | - |
| R8559_RS00185 (PC0116_00330) | - | 31011..31211 (+) | 201 | WP_050241626.1 | hypothetical protein | - |
| R8559_RS00190 (PC0116_00340) | - | 31202..31540 (+) | 339 | WP_050241627.1 | helix-turn-helix domain-containing protein | - |
| R8559_RS00195 (PC0116_00350) | - | 31537..32001 (+) | 465 | WP_050244776.1 | hypothetical protein | - |
| R8559_RS00200 (PC0116_00360) | - | 32110..32652 (+) | 543 | WP_001028147.1 | site-specific integrase | - |
| R8559_RS00205 (PC0116_00380) | - | 33193..33399 (+) | 207 | WP_223842409.1 | HNH endonuclease | - |
| R8559_RS00210 (PC0116_00390) | - | 33536..34021 (+) | 486 | WP_000601030.1 | hypothetical protein | - |
| R8559_RS00215 (PC0116_00400) | - | 34014..35726 (+) | 1713 | WP_000230006.1 | terminase TerL endonuclease subunit | - |
| R8559_RS00220 (PC0116_00410) | - | 35735..36877 (+) | 1143 | WP_001812652.1 | phage portal protein | - |
| R8559_RS00225 (PC0116_00420) | - | 36924..37466 (+) | 543 | WP_000413203.1 | HK97 family phage prohead protease | - |
| R8559_RS00230 (PC0116_00430) | - | 37481..38734 (+) | 1254 | WP_000855224.1 | phage major capsid protein | - |
| R8559_RS00235 (PC0116_00440) | - | 38760..39095 (+) | 336 | WP_000154006.1 | hypothetical protein | - |
| R8559_RS00240 (PC0116_00450) | - | 39092..39397 (+) | 306 | WP_000842790.1 | head-tail adaptor protein | - |
| R8559_RS00245 (PC0116_00460) | - | 39397..39744 (+) | 348 | WP_001074487.1 | hypothetical protein | - |
| R8559_RS00250 (PC0116_00470) | - | 39731..40075 (+) | 345 | WP_000534621.1 | hypothetical protein | - |
| R8559_RS00255 (PC0116_00480) | - | 40089..40757 (+) | 669 | WP_000221469.1 | hypothetical protein | - |
| R8559_RS00260 (PC0116_00490) | - | 40759..41235 (+) | 477 | WP_000591561.1 | hypothetical protein | - |
| R8559_RS00265 (PC0116_00500) | - | 41422..44160 (+) | 2739 | WP_050241635.1 | phage tail tape measure protein | - |
| R8559_RS00270 (PC0116_00510) | - | 44157..44879 (+) | 723 | WP_000161559.1 | phage tail protein | - |
| R8559_RS00275 (PC0116_00520) | - | 44880..53006 (+) | 8127 | WP_317638279.1 | phage tail spike protein | - |
| R8559_RS00280 | - | 53003..53119 (+) | 117 | WP_001063633.1 | hypothetical protein | - |
| R8559_RS00285 (PC0116_00530) | - | 53100..53303 (+) | 204 | WP_001091118.1 | hypothetical protein | - |
| R8559_RS00290 (PC0116_00540) | - | 53306..53656 (+) | 351 | WP_050218371.1 | hypothetical protein | - |
| R8559_RS00295 (PC0116_00550) | - | 53666..54082 (+) | 417 | WP_050211460.1 | phage holin family protein | - |
| R8559_RS00300 (PC0116_00560) | - | 54086..54418 (+) | 333 | WP_001186242.1 | phage holin | - |
| R8559_RS00305 (PC0116_00570) | lytA | 54422..55378 (+) | 957 | WP_050218372.1 | N-acetylmuramoyl-L-alanine amidase LytA | - |
| R8559_RS00310 (PC0116_00580) | - | 55598..55777 (-) | 180 | WP_001209433.1 | hypothetical protein | - |
| R8559_RS00315 (PC0116_00590) | - | 55919..56068 (-) | 150 | WP_001030863.1 | hypothetical protein | - |
| R8559_RS00320 (PC0116_00600) | tadA | 56349..56816 (+) | 468 | WP_000291870.1 | tRNA adenosine(34) deaminase TadA | - |
| R8559_RS00330 (PC0116_00610) | - | 57002..57445 (+) | 444 | WP_000701992.1 | dUTP diphosphatase | - |
| R8559_RS00335 (PC0116_00620) | - | 57447..57962 (+) | 516 | WP_001842035.1 | histidine phosphatase family protein | - |
| R8559_RS00340 (PC0116_00630) | radA | 57976..59337 (+) | 1362 | WP_074017595.1 | DNA repair protein RadA | Machinery gene |
| R8559_RS00345 (PC0116_00640) | - | 59410..59907 (+) | 498 | WP_001809263.1 | carbonic anhydrase | - |
| R8559_RS00350 (PC0116_00650) | - | 59932..60746 (+) | 815 | Protein_63 | PrsW family intramembrane metalloprotease | - |
| R8559_RS00355 (PC0116_00660) | - | 60891..61859 (+) | 969 | WP_000010163.1 | ribose-phosphate diphosphokinase | - |
| R8559_RS00360 | - | 61993..62274 (-) | 282 | Protein_65 | ISL3 family transposase | - |
| R8559_RS00365 | - | 62401..63308 (-) | 908 | Protein_66 | Rpn family recombination-promoting nuclease/putative transposase | - |
| R8559_RS00370 (PC0116_00700) | polA | 63559..66192 (+) | 2634 | WP_001812647.1 | DNA polymerase I | - |
| R8559_RS00375 (PC0116_00710) | - | 66277..66714 (+) | 438 | WP_000076479.1 | CoA-binding protein | - |
| R8559_RS00380 (PC0116_00720) | - | 66755..66958 (+) | 204 | WP_025171757.1 | hypothetical protein | - |
| R8559_RS00385 (PC0116_00730) | - | 66987..67997 (-) | 1011 | WP_000009158.1 | YeiH family protein | - |
| R8559_RS00390 (PC0116_00750) | - | 68146..69315 (+) | 1170 | WP_000366342.1 | pyridoxal phosphate-dependent aminotransferase | - |
| R8559_RS00395 (PC0116_00760) | recO | 69312..70082 (+) | 771 | WP_000616164.1 | DNA repair protein RecO | - |
| R8559_RS00400 (PC0116_00770) | plsX | 70079..71071 (+) | 993 | WP_000717458.1 | phosphate acyltransferase PlsX | - |
| R8559_RS00405 (PC0116_00780) | - | 71077..71310 (+) | 234 | WP_000136447.1 | acyl carrier protein | - |
| R8559_RS00410 (PC0116_00790) | - | 71347..71646 (+) | 300 | Protein_75 | transposase family protein | - |
| R8559_RS00415 (PC0116_00800) | blpU | 71849..72079 (+) | 231 | WP_001093075.1 | bacteriocin-like peptide BlpU | - |
| R8559_RS00420 | - | 72082..72207 (+) | 126 | WP_000346297.1 | PncF family bacteriocin immunity protein | - |
| R8559_RS00425 | - | 72618..72812 (+) | 195 | WP_000756776.1 | hypothetical protein | - |
| R8559_RS00430 (PC0116_00810) | - | 72833..73726 (+) | 894 | WP_000343888.1 | XRE family transcriptional regulator | - |
| R8559_RS00435 (PC0116_00820) | - | 74269..74565 (+) | 297 | WP_000837726.1 | uberolysin/carnocyclin family circular bacteriocin | - |
| R8559_RS00440 | - | 74572..75708 (+) | 1137 | WP_000698828.1 | hypothetical protein | - |
| R8559_RS00445 (PC0116_00830) | - | 75705..76187 (+) | 483 | WP_001232094.1 | stage II sporulation protein M | - |
| R8559_RS00450 (PC0116_00840) | - | 76184..76780 (+) | 597 | WP_000687787.1 | ATP-binding cassette domain-containing protein | - |
| R8559_RS00455 (PC0116_00850) | - | 76761..77252 (+) | 492 | WP_000671990.1 | hypothetical protein | - |
| R8559_RS00460 (PC0116_00860) | comA | 77632..79785 (+) | 2154 | WP_317638280.1 | peptide cleavage/export ABC transporter ComA | Regulator |
| R8559_RS00465 (PC0116_00870) | comB | 79798..81147 (+) | 1350 | WP_219599548.1 | competence pheromone export protein ComB | Regulator |
| R8559_RS00470 (PC0116_00880) | purC | 81317..82024 (+) | 708 | WP_219599549.1 | phosphoribosylaminoimidazolesuccinocarboxamide synthase | - |
| R8559_RS00475 (PC0116_00890) | - | 82081..85806 (+) | 3726 | WP_000361203.1 | phosphoribosylformylglycinamidine synthase | - |
| R8559_RS00480 (PC0116_00900) | purF | 85899..87341 (+) | 1443 | WP_219599550.1 | amidophosphoribosyltransferase | - |
| R8559_RS00485 (PC0116_00910) | purM | 87378..88400 (+) | 1023 | WP_317638281.1 | phosphoribosylformylglycinamidine cyclo-ligase | - |
| R8559_RS00490 (PC0116_00920) | purN | 88397..88942 (+) | 546 | WP_000717506.1 | phosphoribosylglycinamide formyltransferase | - |
| R8559_RS00495 (PC0116_00930) | - | 89026..89535 (+) | 510 | WP_000894018.1 | VanZ family protein | - |
| R8559_RS00500 (PC0116_00940) | purH | 89560..91107 (+) | 1548 | WP_219599551.1 | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase | - |
Sequence
Protein
Length: 449 a.a.; Molecular weight: 49609.51 Da; Isoelectric point: 5.7846
>NTDB_id=98403 R8559_RS00465 WP_219599548.1 79798..81147(+) (comB) [Streptococcus pneumoniae strain PZ900700119]
MKPEFLESAEFYNRRYHNFSSSVIVPMALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSNNRILVNHLEENKL
VKKGDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFRDYISQAGSLRASTSQQN
KTIASQNAAASQTQSEIGNLISQTEAKIRDYQTAKSAIETGASLAGQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQIS
QLESNLATYRVQYAGSGTQQAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKITASEDGVLH
LNPETSDSSMVAEGALLAQLYPSLEREGKAKLTAYLSSKDVARIKVGDSVRYTTTHDAGNQLFLDSTITSIDATATKTEK
GNFFKIEAETNLTSEQAEKLRYGVEGRLQMITGKKSYLRYYLDQFLNKE
Nucleotide
Length: 1350 bp
>NTDB_id=98403 R8559_RS00465 WP_219599548.1 79798..81147(+) (comB) [Streptococcus pneumoniae strain PZ900700119]
ATGAAACCAGAATTTTTAGAAAGTGCGGAGTTTTATAATCGTCGTTACCATAATTTTTCCAGTAGTGTGATTGTACCCAT
GGCCCTTCTGCTCGTGTTTTTACTTGGCTTTGCAACTGTTGCAGAGAAGGAGATGAGTTTGTCCACTAGAGCTACTGTCG
AACCCAGTCGTATCCTTGCAAATATCCAGTCAACTAGCAACAATCGTATTCTTGTCAATCATTTGGAAGAAAATAAGCTG
GTTAAGAAGGGGGATCTTTTGGTTCAATACCAAGAAGGGGCAGAGGGTGTCCAAGCGGAGTCCTATGCCAGTCAGTTGGA
CATGCTAAAGGATCAAAAAAAGCAATTGGAGTATCTGCAAAAGAGCCTGCAAGAAGGGGAGAACCACTTTCCAGAGGAGG
ATAAGTTTGGCTACCAAGCCACCTTTCGCGACTACATCAGTCAAGCAGGCAGTCTTAGGGCTAGTACATCGCAACAAAAT
AAGACCATCGCGTCCCAGAATGCAGCAGCTAGCCAAACCCAATCCGAAATCGGCAACCTCATCAGTCAAACAGAGGCTAA
AATTCGCGATTACCAGACAGCTAAGTCAGCTATTGAAACAGGTGCTTCCTTGGCCGGTCAGAATCTAGCCTACTCTCTTT
ACCAGTCCTACAAGTCTCAGGGCGAGGAAAATCCCCAAACTAAGGTTCAGGCAGTTGCACAGGTTGAAGCACAGATTTCT
CAGTTAGAATCTAATCTTGCTACTTACCGTGTCCAGTATGCAGGTTCAGGTACCCAGCAAGCCTATGCGTCAGGGTTAAG
CAGTCAATTGGAATCCCTTAAATCCCAACACTTGGCAAAGGTTGGTCAGGAATTGACCCTTCTAGCCCAGAAAATCTTGG
AGGCAGAGTCAGGTAAGAAGGTACAGGGAAATCTTTTAGACAAGGGGAAAATTACGGCGAGTGAGGATGGGGTGCTTCAT
CTTAATCCTGAGACCAGTGATTCTAGCATGGTTGCAGAAGGTGCCCTACTAGCCCAACTTTATCCATCTTTGGAAAGAGA
AGGGAAAGCCAAACTCACAGCTTATCTAAGTTCAAAAGATGTAGCAAGAATCAAGGTCGGTGATTCTGTTCGCTATACTA
CGACTCATGATGCCGGGAATCAACTTTTCCTAGATTCTACTATTACAAGTATTGATGCGACAGCTACTAAGACTGAGAAA
GGGAATTTCTTTAAAATCGAGGCGGAGACTAATCTAACTTCGGAGCAGGCTGAAAAACTTAGGTATGGGGTGGAAGGCCG
TTTACAGATGATTACGGGCAAGAAAAGTTATCTACGTTATTATTTGGATCAATTTTTGAACAAAGAGTAA
Domains
No domain identified.
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comB | Streptococcus pneumoniae TIGR4 | 98.886 | 100 | 0.989 |
| comB | Streptococcus pneumoniae D39 | 98.664 | 100 | 0.987 |
| comB | Streptococcus pneumoniae R6 | 98.664 | 100 | 0.987 |
| comB | Streptococcus pneumoniae Rx1 | 98.664 | 100 | 0.987 |
| comB | Streptococcus mitis SK321 | 94.432 | 100 | 0.944 |
| comB | Streptococcus mitis NCTC 12261 | 93.987 | 100 | 0.94 |
| comB | Streptococcus gordonii str. Challis substr. CH1 | 55.333 | 100 | 0.555 |