Detailed information    

In silico: bioinformatically predicted

Overview


Name   comB
Type   Regulator
Locus tag   EQH35_RS00460
Genome accession   NZ_CP035245
Coordinates   76438..77787 (+)
Length   449 a.a.
NCBI ID   WP_000801618.1
UniProt ID   -
Organism   Streptococcus pneumoniae strain TVO_1901943
Function   transport of ComC (predicted from homology)
Competence regulation
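
The coordinates and the protein length listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_len, nt_len // 3 - 1


# comB: 76438..77787 gives 1350 bp and 449 a.a.
print(cds_lengths(76438, 77787))
```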

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) predicted in the genome by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Prophage   12174..87892   76438..77787   within   0
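
As a rough illustration of how a relative position and distance like those in the summary could be derived from two coordinate ranges, here is a minimal sketch. It is not VRprofile2's method: it assumes "within" means the gene lies entirely inside the MGE interval (distance 0), and the labels `upstream`, `downstream`, and `overlapping` are hypothetical names chosen for this example.

```python
def relative_position(mge: tuple[int, int], gene: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene interval against an MGE interval (1-based, inclusive).

    Returns a (label, distance in bp) pair. Only "within" appears in the
    source table; the other labels are assumptions for illustration.
    """
    mge_start, mge_end = mge
    gene_start, gene_end = gene
    if mge_start <= gene_start and gene_end <= mge_end:
        return "within", 0
    if gene_end < mge_start:
        return "upstream", mge_start - gene_end
    if gene_start > mge_end:
        return "downstream", gene_start - mge_end
    return "overlapping", 0


# comB against the predicted prophage region
print(relative_position((12174, 87892), (76438, 77787)))
```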


Gene organization within MGE regions


Location: 12174..87892
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH35_RS00060 (EQH35_00060) ftsH 12174..14132 (+) 1959 WP_000744554.1 ATP-dependent zinc metalloprotease FtsH -
  EQH35_RS00065 (EQH35_00065) comX/comX2 14254..14733 (+) 480 WP_000588866.1 sigma-70 family RNA polymerase sigma factor Regulator
  EQH35_RS00100 (EQH35_00105) - 20283..20492 (+) 210 Protein_14 transposase -
  EQH35_RS00105 (EQH35_00110) - 20527..21324 (-) 798 Protein_15 transposase -
  EQH35_RS00110 (EQH35_00115) comW 21590..21826 (+) 237 WP_000939546.1 sigma(X)-activator ComW Regulator
  EQH35_RS00115 (EQH35_00120) - 22057..23343 (+) 1287 WP_000205044.1 adenylosuccinate synthase -
  EQH35_RS00120 (EQH35_00125) - 23585..24733 (-) 1149 WP_000876732.1 site-specific integrase -
  EQH35_RS00125 (EQH35_00135) - 24919..25842 (-) 924 WP_000122591.1 exonuclease domain-containing protein -
  EQH35_RS00130 (EQH35_00140) - 25855..26238 (-) 384 WP_000136459.1 ImmA/IrrE family metallo-endopeptidase -
  EQH35_RS00135 (EQH35_00145) - 26251..26616 (-) 366 WP_000492031.1 helix-turn-helix domain-containing protein -
  EQH35_RS00140 (EQH35_00150) - 26993..27214 (-) 222 WP_000041097.1 hypothetical protein -
  EQH35_RS00145 - 27333..27479 (+) 147 WP_000389576.1 hypothetical protein -
  EQH35_RS00150 (EQH35_00155) - 27491..27694 (+) 204 WP_000032097.1 helix-turn-helix transcriptional regulator -
  EQH35_RS00155 (EQH35_00160) - 27711..27908 (+) 198 WP_001057654.1 hypothetical protein -
  EQH35_RS00160 - 27919..28080 (+) 162 WP_001002946.1 hypothetical protein -
  EQH35_RS00165 (EQH35_00165) - 28075..28500 (-) 426 WP_000386249.1 hypothetical protein -
  EQH35_RS00170 (EQH35_00170) - 28554..29264 (+) 711 WP_001002359.1 ORF6C domain-containing protein -
  EQH35_RS00175 (EQH35_00175) - 29278..29535 (+) 258 WP_000370959.1 hypothetical protein -
  EQH35_RS00180 (EQH35_00180) - 29621..29941 (+) 321 WP_000462824.1 hypothetical protein -
  EQH35_RS00185 (EQH35_00185) - 29957..30253 (+) 297 WP_000391805.1 hypothetical protein -
  EQH35_RS00190 (EQH35_00190) - 30246..31052 (+) 807 WP_001289771.1 phage replisome organizer N-terminal domain-containing protein -
  EQH35_RS00195 (EQH35_00195) - 31192..31962 (+) 771 WP_000228219.1 ATP-binding protein -
  EQH35_RS10680 (EQH35_00200) - 31977..32171 (+) 195 WP_000470307.1 hypothetical protein -
  EQH35_RS00200 (EQH35_00205) - 32171..32389 (+) 219 WP_000891962.1 hypothetical protein -
  EQH35_RS00205 (EQH35_00210) - 32771..32869 (+) 99 Protein_36 single-stranded DNA-binding protein -
  EQH35_RS00210 (EQH35_00215) - 32883..33050 (+) 168 WP_000233203.1 hypothetical protein -
  EQH35_RS00215 (EQH35_00220) - 33037..33246 (+) 210 WP_000872740.1 hypothetical protein -
  EQH35_RS00220 (EQH35_00225) - 33218..33535 (+) 318 WP_000969665.1 hypothetical protein -
  EQH35_RS00225 - 33537..33611 (+) 75 Protein_40 DUF1642 domain-containing protein -
  EQH35_RS00230 (EQH35_00230) - 33788..34189 (+) 402 WP_000736390.1 transcriptional activator -
  EQH35_RS00235 (EQH35_00235) - 34377..34919 (+) 543 WP_000397549.1 site-specific integrase -
  EQH35_RS00240 (EQH35_00240) - 35296..35616 (+) 321 WP_000282427.1 HNH endonuclease -
  EQH35_RS00245 (EQH35_00245) - 35753..36145 (+) 393 WP_001118283.1 P27 family phage terminase small subunit -
  EQH35_RS00250 (EQH35_00250) - 36138..37868 (+) 1731 WP_000527299.1 terminase large subunit -
  EQH35_RS00255 (EQH35_00255) - 37876..38094 (+) 219 WP_001002923.1 hypothetical protein -
  EQH35_RS00260 (EQH35_00260) - 38112..39314 (+) 1203 WP_000510803.1 phage portal protein -
  EQH35_RS00265 (EQH35_00265) - 39298..39873 (+) 576 WP_001172115.1 HK97 family phage prohead protease -
  EQH35_RS00270 (EQH35_00270) - 39870..41036 (+) 1167 WP_001030357.1 phage major capsid protein -
  EQH35_RS00275 (EQH35_00275) - 41048..41317 (+) 270 WP_000262606.1 hypothetical protein -
  EQH35_RS00280 (EQH35_00280) - 41320..41601 (+) 282 WP_000370976.1 hypothetical protein -
  EQH35_RS00285 (EQH35_00285) - 41588..41887 (+) 300 WP_000267055.1 phage head closure protein -
  EQH35_RS00290 (EQH35_00290) - 41884..42231 (+) 348 WP_000063886.1 HK97 gp10 family phage protein -
  EQH35_RS00295 (EQH35_00295) - 42228..42551 (+) 324 WP_000777003.1 hypothetical protein -
  EQH35_RS00300 (EQH35_00300) - 42563..43141 (+) 579 WP_000191279.1 major tail protein -
  EQH35_RS00305 (EQH35_00305) - 43153..43572 (+) 420 WP_001227146.1 hypothetical protein -
  EQH35_RS00310 (EQH35_00310) - 43850..46945 (+) 3096 WP_000918318.1 hypothetical protein -
  EQH35_RS00315 (EQH35_00315) - 46942..47664 (+) 723 WP_000589856.1 hypothetical protein -
  EQH35_RS00320 - 47665..54297 (+) 6633 WP_000966215.1 phage tail spike protein -
  EQH35_RS10685 - 54294..54410 (+) 117 WP_001063632.1 hypothetical protein -
  EQH35_RS00325 (EQH35_00335) - 54391..54594 (+) 204 WP_001091113.1 hypothetical protein -
  EQH35_RS00330 (EQH35_00340) - 54597..54947 (+) 351 WP_000852241.1 hypothetical protein -
  EQH35_RS00335 (EQH35_00345) - 54956..55372 (+) 417 WP_001165344.1 phage holin family protein -
  EQH35_RS00340 (EQH35_00350) - 55376..55708 (+) 333 WP_001186219.1 phage holin -
  EQH35_RS00345 (EQH35_00355) - 55712..56668 (+) 957 WP_000350505.1 N-acetylmuramoyl-L-alanine amidase family protein -
  EQH35_RS00350 (EQH35_00360) - 56889..57077 (-) 189 WP_000109850.1 hypothetical protein -
  EQH35_RS00355 (EQH35_00365) tadA 57645..58112 (+) 468 WP_000291870.1 tRNA adenosine(34) deaminase TadA -
  EQH35_RS00365 (EQH35_00375) - 58298..58741 (+) 444 WP_000701992.1 dUTP diphosphatase -
  EQH35_RS00370 (EQH35_00380) - 58743..59258 (+) 516 WP_001838385.1 histidine phosphatase family protein -
  EQH35_RS00375 (EQH35_00385) radA 59272..60633 (+) 1362 WP_074017595.1 DNA repair protein RadA Machinery gene
  EQH35_RS00380 (EQH35_00390) - 60706..61203 (+) 498 WP_001809263.1 carbonic anhydrase -
  EQH35_RS00385 (EQH35_00400) - 61228..62058 (+) 831 Protein_72 PrsW family glutamic-type intramembrane protease -
  EQH35_RS00390 (EQH35_00405) - 62203..63171 (+) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  EQH35_RS00395 (EQH35_00410) - 63308..63586 (-) 279 Protein_74 transposase family protein -
  EQH35_RS00400 - 63630..64555 (-) 926 Protein_75 Rpn family recombination-promoting nuclease/putative transposase -
  EQH35_RS00405 (EQH35_00430) polA 64811..67444 (+) 2634 WP_001809267.1 DNA polymerase I -
  EQH35_RS00410 (EQH35_00435) - 67529..67966 (+) 438 WP_000076479.1 CoA-binding protein -
  EQH35_RS00415 - 68007..68435 (+) 429 WP_000693134.1 hypothetical protein -
  EQH35_RS00420 (EQH35_00440) - 68464..69474 (-) 1011 WP_000009171.1 YeiH family protein -
  EQH35_RS00425 (EQH35_00445) - 69623..70792 (+) 1170 WP_000366345.1 pyridoxal phosphate-dependent aminotransferase -
  EQH35_RS00430 (EQH35_00450) recO 70789..71559 (+) 771 WP_000616162.1 DNA repair protein RecO -
  EQH35_RS00435 (EQH35_00455) plsX 71556..72548 (+) 993 WP_000717457.1 phosphate acyltransferase PlsX -
  EQH35_RS00440 (EQH35_00460) - 72554..72787 (+) 234 WP_000659556.1 acyl carrier protein -
  EQH35_RS00445 (EQH35_00465) - 72824..73124 (+) 301 Protein_84 transposase family protein -
  EQH35_RS00450 (EQH35_00470) blpU 73327..73557 (+) 231 Protein_85 bacteriocin-like peptide BlpU -
  EQH35_RS10585 - 73560..73685 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  EQH35_RS00455 (EQH35_00475) comA 74272..76425 (+) 2154 WP_000668304.1 peptide cleavage/export ABC transporter ComA Regulator
  EQH35_RS00460 (EQH35_00480) comB 76438..77787 (+) 1350 WP_000801618.1 competence pheromone export protein ComB Regulator
  EQH35_RS00465 (EQH35_00485) purC 77957..78664 (+) 708 WP_000043304.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  EQH35_RS10690 (EQH35_00490) - 78666..78809 (+) 144 WP_050167432.1 hypothetical protein -
  EQH35_RS00470 (EQH35_00495) - 78866..82591 (+) 3726 WP_000361178.1 phosphoribosylformylglycinamidine synthase -
  EQH35_RS00475 (EQH35_00500) purF 82684..84126 (+) 1443 WP_000220633.1 amidophosphoribosyltransferase -
  EQH35_RS00480 (EQH35_00505) purM 84163..85185 (+) 1023 WP_000182558.1 phosphoribosylformylglycinamidine cyclo-ligase -
  EQH35_RS00485 (EQH35_00510) purN 85182..85727 (+) 546 WP_000717506.1 phosphoribosylglycinamide formyltransferase -
  EQH35_RS00490 (EQH35_00515) - 85811..86320 (+) 510 WP_000894018.1 VanZ family protein -
  EQH35_RS00495 (EQH35_00520) purH 86345..87892 (+) 1548 WP_000167082.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -
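
The rows above follow a regular layout (locus tag, optional old locus tag in parentheses, gene name, coordinates, strand, size, protein ID, then product and description), so they can be parsed with a regular expression. The sketch below is one way to do that under the assumption that fields are separated by single spaces exactly as shown; `ROW_RE`, `parse_row`, and `size_matches` are hypothetical names. It also checks that the listed size equals `end - start + 1`.

```python
import re

# Pattern for one row of the gene organization table as it appears in this text.
ROW_RE = re.compile(
    r"^\s*(?P<locus_tag>\S+)(?: \((?P<old_locus_tag>[^)]+)\))? (?P<gene>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+) \((?P<strand>[+-])\) (?P<size>\d+) "
    r"(?P<protein_id>\S+) (?P<rest>.*)$"
)


def parse_row(line: str) -> dict:
    """Parse one table row into a dict; numeric fields become ints."""
    match = ROW_RE.match(line)
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    row = match.groupdict()
    for key in ("start", "end", "size"):
        row[key] = int(row[key])
    return row


def size_matches(row: dict) -> bool:
    """True if the listed size equals the span of the coordinates."""
    return row["end"] - row["start"] + 1 == row["size"]


example = ("  EQH35_RS00460 (EQH35_00480) comB 76438..77787 (+) 1350 "
           "WP_000801618.1 competence pheromone export protein ComB Regulator")
row = parse_row(example)
print(row["gene"], row["size"], size_matches(row))
```

The product and description are left together in `rest` because the product name itself contains spaces, so the two cannot be split reliably from this flattened text alone.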

Sequence


Protein


Length: 449 a.a.   Molecular weight: 49615.49 Da   Isoelectric point: 5.3895

>NTDB_id=337339 EQH35_RS00460 WP_000801618.1 76438..77787(+) (comB) [Streptococcus pneumoniae strain TVO_1901943]
MKPEFLESAEFYNRRYHNFSSSVIVPMALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSNNRILVNHLEENKL
VKKGDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFRDYISQAGSLRASTSQQN
ETIASQNAAASQTQAEIGNLISQTEAKIRDYQTAKSAMETGASLASQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQIS
QLESSLATYRVQYAGSGTQQAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKITASEDGVLH
LNPETSDSSMVAEGALLAQLYPSLEREGKAKLTAYLSSKDVARIKVGDSVRYTTTHDAGNQLFLDSTITSIDATATKTEK
GNFFKIEAETNLTSEQAEKLRYGVEGRLQMITGKKSYLRYYLDQFLNKE
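
The FASTA header used for this record packs several fields into one line. A minimal sketch of pulling them apart is shown below; the pattern is inferred from the single header shown here, so other records in the database may need a looser expression. `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

# Pattern inferred from the header line of this record only.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>.+)\]$"
)


def parse_header(line: str) -> dict:
    """Split the FASTA header into named fields; coordinates become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (">NTDB_id=337339 EQH35_RS00460 WP_000801618.1 76438..77787(+) "
          "(comB) [Streptococcus pneumoniae strain TVO_1901943]")
print(parse_header(header))
```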

Nucleotide


Length: 1350 bp

>NTDB_id=337339 EQH35_RS00460 WP_000801618.1 76438..77787(+) (comB) [Streptococcus pneumoniae strain TVO_1901943]
ATGAAACCAGAATTTTTAGAAAGTGCGGAGTTTTATAATCGTCGTTACCATAATTTTTCCAGTAGTGTGATTGTACCCAT
GGCCCTTCTGCTCGTGTTTTTACTTGGCTTTGCAACTGTTGCAGAAAAGGAGATGAGTTTGTCCACTAGAGCTACTGTCG
AACCTAGTCGTATCCTTGCAAATATCCAGTCAACTAGCAACAATCGTATTCTTGTCAATCATTTGGAAGAAAATAAGCTG
GTTAAGAAGGGGGATCTTTTGGTTCAATACCAAGAAGGGGCAGAGGGTGTCCAAGCGGAGTCCTATGCCAGTCAGTTGGA
CATGCTAAAGGATCAAAAAAAGCAATTGGAGTATCTGCAAAAGAGCCTGCAAGAAGGGGAGAACCACTTTCCAGAGGAGG
ATAAGTTTGGCTACCAAGCCACCTTTCGCGACTACATCAGTCAAGCAGGCAGTCTTAGGGCTAGTACATCGCAACAAAAT
GAGACCATCGCGTCCCAGAATGCAGCAGCTAGCCAAACCCAAGCCGAAATCGGCAACCTCATCAGTCAAACAGAGGCTAA
AATTCGCGATTACCAGACAGCTAAGTCAGCTATGGAAACAGGTGCTTCCTTGGCCAGTCAGAATCTAGCCTACTCTCTTT
ACCAGTCCTACAAGTCTCAGGGCGAGGAAAATCCCCAAACTAAGGTTCAGGCAGTTGCACAGGTTGAAGCACAGATTTCT
CAGTTAGAATCTAGTCTTGCTACTTACCGTGTCCAGTATGCAGGTTCAGGTACCCAGCAAGCCTATGCGTCAGGGTTAAG
CAGTCAATTGGAATCCCTTAAATCCCAACATTTGGCAAAGGTTGGTCAGGAATTGACCCTTCTAGCCCAGAAAATCTTGG
AGGCAGAGTCAGGTAAGAAGGTACAGGGAAATCTTTTAGACAAGGGGAAAATTACGGCGAGTGAGGATGGGGTGCTTCAT
CTTAATCCTGAGACCAGTGATTCTAGCATGGTTGCAGAAGGTGCCCTACTAGCCCAACTTTATCCATCTTTGGAAAGAGA
AGGGAAAGCCAAACTCACAGCCTATCTAAGTTCAAAAGATGTAGCAAGAATCAAGGTCGGTGATTCTGTTCGCTATACTA
CGACTCATGATGCCGGGAATCAACTTTTCCTAGATTCAACTATTACAAGTATTGATGCGACAGCTACTAAGACTGAGAAA
GGGAATTTCTTTAAAATCGAGGCGGAGACTAATCTAACTTCGGAGCAGGCTGAAAAACTTAGGTACGGGGTGGAAGGCCG
CTTGCAGATGATTACGGGCAAGAAAAGTTATCTACGTTATTATTTGGATCAATTTTTGAACAAAGAGTAA
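
The nucleotide and protein sequences above can be cross-checked by translating the coding sequence. The sketch below builds the standard codon table by hand (the amino acid assignments are the same in the bacterial table, which differs only in its start codons) and translates until the first stop codon. `translate` is a hypothetical helper written for this example; a library such as Biopython could be used instead.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    nt = nt.upper()
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the comB sequence shown above
print(translate("ATGAAACCAGAATTTTTAGAAAGTGCGGAG"))  # MKPEFLESAE
print(translate("AACAAAGAGTAA"))                    # NKE
```

Both fragments agree with the start (`MKPEFLESAE`) and end (`NKE`) of the protein sequence listed in this record.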

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comB   Streptococcus pneumoniae TIGR4   99.109   100   0.991
comB   Streptococcus pneumoniae Rx1   98.886   100   0.989
comB   Streptococcus pneumoniae D39   98.886   100   0.989
comB   Streptococcus pneumoniae R6   98.886   100   0.989
comB   Streptococcus mitis SK321   95.1   100   0.951
comB   Streptococcus mitis NCTC 12261   94.655   100   0.947
comB   Streptococcus gordonii str. Challis substr. CH1   55.333   100   0.555


Multiple sequence alignment