Detailed information    

In silico: bioinformatically predicted

Overview


Name: comB
Type: Regulator
Locus tag: R8559_RS00465
Genome accession: NZ_AP026926
Coordinates: 79798..81147 (+)
Length: 449 a.a.
NCBI ID: WP_219599548.1
Uniprot ID: -
Organism: Streptococcus pneumoniae strain PZ900700119
Function: transport of ComC (predicted from homology)
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 12174..91107 79798..81147 within 0
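
The "Relative position" and "Distance (bp)" values in the row above follow from comparing the two coordinate ranges. Below is a minimal Python sketch of that comparison, assuming inclusive 1-based coordinates and a distance of 0 whenever the gene lies inside the MGE. The function name `relate_gene_to_mge` and the labels other than "within" are hypothetical and are not taken from VRprofile2.

```python
def relate_gene_to_mge(gene, mge):
    """Return (relative_position, distance_bp) for two inclusive 1-based ranges.

    Only the "within" label appears in the record; the other labels are
    illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0                      # gene fully inside the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end      # gene ends before the MGE starts
    if g_start > m_end:
        return "downstream", g_start - m_end    # gene starts after the MGE ends
    return "overlap", 0                         # partial overlap


# comB (79798..81147) against the predicted prophage (12174..91107)
print(relate_gene_to_mge((79798, 81147), (12174, 91107)))
```

For the coordinates listed in this record the sketch returns "within" with a distance of 0, matching the summary row.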


Gene organization within MGE regions


Location: 12174..91107
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R8559_RS00065 (PC0116_00130) ftsH 12174..14132 (+) 1959 WP_000744557.1 ATP-dependent zinc metalloprotease FtsH -
  R8559_RS00070 (PC0116_00140) comX/comX2 14254..14733 (+) 480 WP_000588897.1 sigma-70 family RNA polymerase sigma factor Regulator
  R8559_RS00105 - 20235..21016 (-) 782 Protein_14 transposase -
  R8559_RS00110 (PC0116_00160) comW 21282..21518 (+) 237 WP_000939545.1 sigma(X)-activator ComW Regulator
  R8559_RS00115 (PC0116_00170) - 21749..23035 (+) 1287 WP_000205044.1 adenylosuccinate synthase -
  R8559_RS00120 (PC0116_00180) - 23299..24447 (-) 1149 WP_000876727.1 site-specific integrase -
  R8559_RS00125 (PC0116_00190) - 24618..25220 (-) 603 WP_370597176.1 HIRAN domain-containing protein -
  R8559_RS11540 - 25335..25532 (-) 198 Protein_19 hypothetical protein -
  R8559_RS00130 (PC0116_00200) - 25545..26300 (-) 756 WP_000069476.1 XRE family transcriptional regulator -
  R8559_RS00135 (PC0116_00210) - 26474..26680 (+) 207 WP_001171134.1 helix-turn-helix transcriptional regulator -
  R8559_RS00140 (PC0116_00240) - 26988..27665 (-) 678 WP_000289447.1 DUF4145 domain-containing protein -
  R8559_RS00145 (PC0116_00250) - 27720..28433 (+) 714 WP_001002349.1 ORF6C domain-containing protein -
  R8559_RS00150 (PC0116_00260) - 28446..28703 (+) 258 WP_000370959.1 hypothetical protein -
  R8559_RS00155 (PC0116_00270) - 28789..29109 (+) 321 WP_000462823.1 hypothetical protein -
  R8559_RS00160 (PC0116_00280) - 29125..29421 (+) 297 WP_050241622.1 hypothetical protein -
  R8559_RS00165 (PC0116_00290) - 29412..30284 (+) 873 WP_050241623.1 phage replisome organizer N-terminal domain-containing protein -
  R8559_RS00170 (PC0116_00300) - 30357..30524 (+) 168 WP_000233203.1 hypothetical protein -
  R8559_RS00175 (PC0116_00310) - 30511..30720 (+) 210 WP_050241624.1 hypothetical protein -
  R8559_RS00180 (PC0116_00320) - 30695..31009 (+) 315 WP_050241625.1 hypothetical protein -
  R8559_RS00185 (PC0116_00330) - 31011..31211 (+) 201 WP_050241626.1 hypothetical protein -
  R8559_RS00190 (PC0116_00340) - 31202..31540 (+) 339 WP_050241627.1 helix-turn-helix domain-containing protein -
  R8559_RS00195 (PC0116_00350) - 31537..32001 (+) 465 WP_050244776.1 hypothetical protein -
  R8559_RS00200 (PC0116_00360) - 32110..32652 (+) 543 WP_001028147.1 site-specific integrase -
  R8559_RS00205 (PC0116_00380) - 33193..33399 (+) 207 WP_223842409.1 HNH endonuclease -
  R8559_RS00210 (PC0116_00390) - 33536..34021 (+) 486 WP_000601030.1 hypothetical protein -
  R8559_RS00215 (PC0116_00400) - 34014..35726 (+) 1713 WP_000230006.1 terminase TerL endonuclease subunit -
  R8559_RS00220 (PC0116_00410) - 35735..36877 (+) 1143 WP_001812652.1 phage portal protein -
  R8559_RS00225 (PC0116_00420) - 36924..37466 (+) 543 WP_000413203.1 HK97 family phage prohead protease -
  R8559_RS00230 (PC0116_00430) - 37481..38734 (+) 1254 WP_000855224.1 phage major capsid protein -
  R8559_RS00235 (PC0116_00440) - 38760..39095 (+) 336 WP_000154006.1 hypothetical protein -
  R8559_RS00240 (PC0116_00450) - 39092..39397 (+) 306 WP_000842790.1 head-tail adaptor protein -
  R8559_RS00245 (PC0116_00460) - 39397..39744 (+) 348 WP_001074487.1 hypothetical protein -
  R8559_RS00250 (PC0116_00470) - 39731..40075 (+) 345 WP_000534621.1 hypothetical protein -
  R8559_RS00255 (PC0116_00480) - 40089..40757 (+) 669 WP_000221469.1 hypothetical protein -
  R8559_RS00260 (PC0116_00490) - 40759..41235 (+) 477 WP_000591561.1 hypothetical protein -
  R8559_RS00265 (PC0116_00500) - 41422..44160 (+) 2739 WP_050241635.1 phage tail tape measure protein -
  R8559_RS00270 (PC0116_00510) - 44157..44879 (+) 723 WP_000161559.1 phage tail protein -
  R8559_RS00275 (PC0116_00520) - 44880..53006 (+) 8127 WP_317638279.1 phage tail spike protein -
  R8559_RS00280 - 53003..53119 (+) 117 WP_001063633.1 hypothetical protein -
  R8559_RS00285 (PC0116_00530) - 53100..53303 (+) 204 WP_001091118.1 hypothetical protein -
  R8559_RS00290 (PC0116_00540) - 53306..53656 (+) 351 WP_050218371.1 hypothetical protein -
  R8559_RS00295 (PC0116_00550) - 53666..54082 (+) 417 WP_050211460.1 phage holin family protein -
  R8559_RS00300 (PC0116_00560) - 54086..54418 (+) 333 WP_001186242.1 phage holin -
  R8559_RS00305 (PC0116_00570) lytA 54422..55378 (+) 957 WP_050218372.1 N-acetylmuramoyl-L-alanine amidase LytA -
  R8559_RS00310 (PC0116_00580) - 55598..55777 (-) 180 WP_001209433.1 hypothetical protein -
  R8559_RS00315 (PC0116_00590) - 55919..56068 (-) 150 WP_001030863.1 hypothetical protein -
  R8559_RS00320 (PC0116_00600) tadA 56349..56816 (+) 468 WP_000291870.1 tRNA adenosine(34) deaminase TadA -
  R8559_RS00330 (PC0116_00610) - 57002..57445 (+) 444 WP_000701992.1 dUTP diphosphatase -
  R8559_RS00335 (PC0116_00620) - 57447..57962 (+) 516 WP_001842035.1 histidine phosphatase family protein -
  R8559_RS00340 (PC0116_00630) radA 57976..59337 (+) 1362 WP_074017595.1 DNA repair protein RadA Machinery gene
  R8559_RS00345 (PC0116_00640) - 59410..59907 (+) 498 WP_001809263.1 carbonic anhydrase -
  R8559_RS00350 (PC0116_00650) - 59932..60746 (+) 815 Protein_63 PrsW family intramembrane metalloprotease -
  R8559_RS00355 (PC0116_00660) - 60891..61859 (+) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  R8559_RS00360 - 61993..62274 (-) 282 Protein_65 ISL3 family transposase -
  R8559_RS00365 - 62401..63308 (-) 908 Protein_66 Rpn family recombination-promoting nuclease/putative transposase -
  R8559_RS00370 (PC0116_00700) polA 63559..66192 (+) 2634 WP_001812647.1 DNA polymerase I -
  R8559_RS00375 (PC0116_00710) - 66277..66714 (+) 438 WP_000076479.1 CoA-binding protein -
  R8559_RS00380 (PC0116_00720) - 66755..66958 (+) 204 WP_025171757.1 hypothetical protein -
  R8559_RS00385 (PC0116_00730) - 66987..67997 (-) 1011 WP_000009158.1 YeiH family protein -
  R8559_RS00390 (PC0116_00750) - 68146..69315 (+) 1170 WP_000366342.1 pyridoxal phosphate-dependent aminotransferase -
  R8559_RS00395 (PC0116_00760) recO 69312..70082 (+) 771 WP_000616164.1 DNA repair protein RecO -
  R8559_RS00400 (PC0116_00770) plsX 70079..71071 (+) 993 WP_000717458.1 phosphate acyltransferase PlsX -
  R8559_RS00405 (PC0116_00780) - 71077..71310 (+) 234 WP_000136447.1 acyl carrier protein -
  R8559_RS00410 (PC0116_00790) - 71347..71646 (+) 300 Protein_75 transposase family protein -
  R8559_RS00415 (PC0116_00800) blpU 71849..72079 (+) 231 WP_001093075.1 bacteriocin-like peptide BlpU -
  R8559_RS00420 - 72082..72207 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  R8559_RS00425 - 72618..72812 (+) 195 WP_000756776.1 hypothetical protein -
  R8559_RS00430 (PC0116_00810) - 72833..73726 (+) 894 WP_000343888.1 XRE family transcriptional regulator -
  R8559_RS00435 (PC0116_00820) - 74269..74565 (+) 297 WP_000837726.1 uberolysin/carnocyclin family circular bacteriocin -
  R8559_RS00440 - 74572..75708 (+) 1137 WP_000698828.1 hypothetical protein -
  R8559_RS00445 (PC0116_00830) - 75705..76187 (+) 483 WP_001232094.1 stage II sporulation protein M -
  R8559_RS00450 (PC0116_00840) - 76184..76780 (+) 597 WP_000687787.1 ATP-binding cassette domain-containing protein -
  R8559_RS00455 (PC0116_00850) - 76761..77252 (+) 492 WP_000671990.1 hypothetical protein -
  R8559_RS00460 (PC0116_00860) comA 77632..79785 (+) 2154 WP_317638280.1 peptide cleavage/export ABC transporter ComA Regulator
  R8559_RS00465 (PC0116_00870) comB 79798..81147 (+) 1350 WP_219599548.1 competence pheromone export protein ComB Regulator
  R8559_RS00470 (PC0116_00880) purC 81317..82024 (+) 708 WP_219599549.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  R8559_RS00475 (PC0116_00890) - 82081..85806 (+) 3726 WP_000361203.1 phosphoribosylformylglycinamidine synthase -
  R8559_RS00480 (PC0116_00900) purF 85899..87341 (+) 1443 WP_219599550.1 amidophosphoribosyltransferase -
  R8559_RS00485 (PC0116_00910) purM 87378..88400 (+) 1023 WP_317638281.1 phosphoribosylformylglycinamidine cyclo-ligase -
  R8559_RS00490 (PC0116_00920) purN 88397..88942 (+) 546 WP_000717506.1 phosphoribosylglycinamide formyltransferase -
  R8559_RS00495 (PC0116_00930) - 89026..89535 (+) 510 WP_000894018.1 VanZ family protein -
  R8559_RS00500 (PC0116_00940) purH 89560..91107 (+) 1548 WP_219599551.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -

Sequence


Protein


Length: 449 a.a.        Molecular weight: 49609.51 Da        Isoelectric Point: 5.7846

>NTDB_id=98403 R8559_RS00465 WP_219599548.1 79798..81147(+) (comB) [Streptococcus pneumoniae strain PZ900700119]
MKPEFLESAEFYNRRYHNFSSSVIVPMALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSNNRILVNHLEENKL
VKKGDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFRDYISQAGSLRASTSQQN
KTIASQNAAASQTQSEIGNLISQTEAKIRDYQTAKSAIETGASLAGQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQIS
QLESNLATYRVQYAGSGTQQAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKITASEDGVLH
LNPETSDSSMVAEGALLAQLYPSLEREGKAKLTAYLSSKDVARIKVGDSVRYTTTHDAGNQLFLDSTITSIDATATKTEK
GNFFKIEAETNLTSEQAEKLRYGVEGRLQMITGKKSYLRYYLDQFLNKE

Nucleotide


Length: 1350 bp

>NTDB_id=98403 R8559_RS00465 WP_219599548.1 79798..81147(+) (comB) [Streptococcus pneumoniae strain PZ900700119]
ATGAAACCAGAATTTTTAGAAAGTGCGGAGTTTTATAATCGTCGTTACCATAATTTTTCCAGTAGTGTGATTGTACCCAT
GGCCCTTCTGCTCGTGTTTTTACTTGGCTTTGCAACTGTTGCAGAGAAGGAGATGAGTTTGTCCACTAGAGCTACTGTCG
AACCCAGTCGTATCCTTGCAAATATCCAGTCAACTAGCAACAATCGTATTCTTGTCAATCATTTGGAAGAAAATAAGCTG
GTTAAGAAGGGGGATCTTTTGGTTCAATACCAAGAAGGGGCAGAGGGTGTCCAAGCGGAGTCCTATGCCAGTCAGTTGGA
CATGCTAAAGGATCAAAAAAAGCAATTGGAGTATCTGCAAAAGAGCCTGCAAGAAGGGGAGAACCACTTTCCAGAGGAGG
ATAAGTTTGGCTACCAAGCCACCTTTCGCGACTACATCAGTCAAGCAGGCAGTCTTAGGGCTAGTACATCGCAACAAAAT
AAGACCATCGCGTCCCAGAATGCAGCAGCTAGCCAAACCCAATCCGAAATCGGCAACCTCATCAGTCAAACAGAGGCTAA
AATTCGCGATTACCAGACAGCTAAGTCAGCTATTGAAACAGGTGCTTCCTTGGCCGGTCAGAATCTAGCCTACTCTCTTT
ACCAGTCCTACAAGTCTCAGGGCGAGGAAAATCCCCAAACTAAGGTTCAGGCAGTTGCACAGGTTGAAGCACAGATTTCT
CAGTTAGAATCTAATCTTGCTACTTACCGTGTCCAGTATGCAGGTTCAGGTACCCAGCAAGCCTATGCGTCAGGGTTAAG
CAGTCAATTGGAATCCCTTAAATCCCAACACTTGGCAAAGGTTGGTCAGGAATTGACCCTTCTAGCCCAGAAAATCTTGG
AGGCAGAGTCAGGTAAGAAGGTACAGGGAAATCTTTTAGACAAGGGGAAAATTACGGCGAGTGAGGATGGGGTGCTTCAT
CTTAATCCTGAGACCAGTGATTCTAGCATGGTTGCAGAAGGTGCCCTACTAGCCCAACTTTATCCATCTTTGGAAAGAGA
AGGGAAAGCCAAACTCACAGCTTATCTAAGTTCAAAAGATGTAGCAAGAATCAAGGTCGGTGATTCTGTTCGCTATACTA
CGACTCATGATGCCGGGAATCAACTTTTCCTAGATTCTACTATTACAAGTATTGATGCGACAGCTACTAAGACTGAGAAA
GGGAATTTCTTTAAAATCGAGGCGGAGACTAATCTAACTTCGGAGCAGGCTGAAAAACTTAGGTATGGGGTGGAAGGCCG
TTTACAGATGATTACGGGCAAGAAAAGTTATCTACGTTATTATTTGGATCAATTTTTGAACAAAGAGTAA
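
The reported lengths are mutually consistent: the coordinates 79798..81147 span 1350 bp, which is 450 codons, and the final codon (TAA) is a stop codon, leaving 449 amino acids. A minimal Python sketch of this check, assuming inclusive 1-based coordinates and a single terminal stop codon; the function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start, end):
    """Return (gene_length_bp, protein_length_aa) for an inclusive 1-based CDS.

    Assumes the CDS length is a multiple of 3 and ends with one stop codon.
    """
    gene_length_bp = end - start + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return gene_length_bp, codons - 1  # the stop codon is not translated


bp, aa = expected_lengths(79798, 81147)
print(bp, aa)
```

With the coordinates of this record the function gives 1350 bp and 449 a.a., in agreement with the lengths stated in the Protein and Nucleotide sections.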

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comB Streptococcus pneumoniae TIGR4 98.886 100 0.989
  comB Streptococcus pneumoniae D39 98.664 100 0.987
  comB Streptococcus pneumoniae R6 98.664 100 0.987
  comB Streptococcus pneumoniae Rx1 98.664 100 0.987
  comB Streptococcus mitis SK321 94.432 100 0.944
  comB Streptococcus mitis NCTC 12261 93.987 100 0.94
  comB Streptococcus gordonii str. Challis substr. CH1 55.333 100 0.555

