Detailed information    

In silico: bioinformatically predicted

Overview


Name: comB
Type: Regulator
Locus tag: E0F31_RS00200
Genome accession: NZ_LR216064
Coordinates: 27592..28941 (+)
Length: 449 a.a.
NCBI ID: WP_000801591.1
UniProt ID: A0A4V0IRK4
Organism: Streptococcus pneumoniae strain GPSC13 substr. ST473 isolate 569492b0-41bd-11e5-998e-3c4a9275d6c6
Function: transport of ComC (predicted from homology)
Competence regulation
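
The coordinates, nucleotide length, and protein length listed in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the coding span ends in a single stop codon that is not counted in the protein length. The function name `span_length` is hypothetical and used for illustration only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


# Coordinates of comB as listed in this record.
gene_bp = span_length(27592, 28941)

# Assumption: the span contains one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
protein_aa = gene_bp // 3 - 1

print(gene_bp, protein_aa)  # 1350 449
```

Both values match the lengths reported for this entry (1350 bp and 449 a.a.).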

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1720..39046 27592..28941 within 0
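
The "Relative position" and "Distance (bp)" columns can be illustrated with a small interval comparison. This is only a sketch under stated assumptions: coordinates are treated as 1-based and inclusive, and the labels other than "within" (as well as the function name `relative_position`) are hypothetical. The rule actually used by the database or by VRprofile2 is not given in this record.

```python
from typing import Tuple

Interval = Tuple[int, int]  # (start, end), 1-based inclusive (assumed)


def relative_position(gene: Interval, mge: Interval) -> Tuple[str, int]:
    """Classify a gene relative to an MGE region and report the gap in bp.

    Labels other than "within" are illustrative, not taken from the source.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return ("within", 0)                    # gene fully inside the MGE
    if g_end < m_start:
        return ("upstream", m_start - g_end)    # gene ends before the MGE starts
    if g_start > m_end:
        return ("downstream", g_start - m_end)  # gene starts after the MGE ends
    return ("overlapping", 0)                   # partial overlap


# comB versus the predicted prophage region from the table above.
print(relative_position((27592, 28941), (1720, 39046)))  # ('within', 0)
```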


Gene organization within MGE regions


Location: 1720..39046
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E0F31_RS00025 (SAMEA3714520_00005) - 1762..3048 (+) 1287 WP_000205044.1 adenylosuccinate synthase -
  E0F31_RS00030 (SAMEA3714520_00006) tadA 3249..3716 (+) 468 WP_000291874.1 tRNA adenosine(34) deaminase TadA -
  E0F31_RS00040 (SAMEA3714520_00007) - 3925..5067 (-) 1143 WP_000266841.1 tyrosine-type recombinase/integrase -
  E0F31_RS00045 (SAMEA3714520_00008) - 5122..6192 (-) 1071 WP_000401841.1 type I restriction endonuclease -
  E0F31_RS00050 (SAMEA3714520_00009) - 6209..6589 (-) 381 WP_000170931.1 ImmA/IrrE family metallo-endopeptidase -
  E0F31_RS00055 (SAMEA3714520_00010) - 6602..6865 (-) 264 WP_000285962.1 type II toxin-antitoxin system RelE family toxin -
  E0F31_RS00060 (SAMEA3714520_00011) - 6865..7098 (-) 234 WP_000156419.1 hypothetical protein -
  E0F31_RS00065 (SAMEA3714520_00012) - 7098..7466 (-) 369 WP_000464160.1 helix-turn-helix domain-containing protein -
  E0F31_RS11985 (SAMEA3714520_00013) - 7763..7894 (+) 132 WP_000253628.1 hypothetical protein -
  E0F31_RS00075 (SAMEA3714520_00014) - 8184..8375 (+) 192 WP_001112859.1 DNA-binding protein -
  E0F31_RS00080 (SAMEA3714520_00015) - 8398..8601 (+) 204 WP_001247549.1 hypothetical protein -
  E0F31_RS00085 (SAMEA3714520_00017) - 8756..8923 (-) 168 WP_000024181.1 YjzC family protein -
  E0F31_RS00090 (SAMEA3714520_00018) - 8928..9308 (+) 381 Protein_14 autolysin -
  E0F31_RS00095 (SAMEA3714520_00019) - 9582..10025 (+) 444 WP_000701990.1 dUTP diphosphatase -
  E0F31_RS00100 (SAMEA3714520_00020) - 10027..10542 (+) 516 WP_000691237.1 histidine phosphatase family protein -
  E0F31_RS00105 (SAMEA3714520_00021) radA 10556..11917 (+) 1362 WP_074017595.1 DNA repair protein RadA Machinery gene
  E0F31_RS00110 (SAMEA3714520_00022) - 11990..12487 (+) 498 WP_001809263.1 beta-class carbonic anhydrase -
  E0F31_RS00115 (SAMEA3714520_00023) - 12512..13327 (+) 816 WP_000749763.1 PrsW family intramembrane metalloprotease -
  E0F31_RS00120 (SAMEA3714520_00024) - 13472..14440 (+) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  E0F31_RS00125 - 14574..14855 (-) 282 Protein_21 transposase family protein -
  E0F31_RS11580 - 14982..15889 (-) 908 Protein_22 Rpn family recombination-promoting nuclease/putative transposase -
  E0F31_RS00145 (SAMEA3714520_00028) polA 16145..18778 (+) 2634 WP_130889213.1 DNA polymerase I -
  E0F31_RS00150 (SAMEA3714520_00029) - 18863..19300 (+) 438 WP_000076479.1 CoA-binding protein -
  E0F31_RS11585 (SAMEA3714520_00030) - 19341..19589 (+) 249 WP_000692961.1 hypothetical protein -
  E0F31_RS00160 (SAMEA3714520_00031) - 19618..20628 (-) 1011 WP_000009170.1 YeiH family protein -
  E0F31_RS00165 (SAMEA3714520_00032) - 20777..21946 (+) 1170 WP_000366342.1 pyridoxal phosphate-dependent aminotransferase -
  E0F31_RS00170 (SAMEA3714520_00033) recO 21943..22713 (+) 771 WP_000616122.1 DNA repair protein RecO -
  E0F31_RS00175 (SAMEA3714520_00034) plsX 22710..23702 (+) 993 WP_000717458.1 phosphate acyltransferase PlsX -
  E0F31_RS00180 (SAMEA3714520_00035) - 23708..23941 (+) 234 WP_000136449.1 acyl carrier protein -
  E0F31_RS00185 - 23978..24278 (+) 301 Protein_31 transposase family protein -
  E0F31_RS00190 (SAMEA3714520_00036) blpU 24481..24711 (+) 231 WP_001093075.1 bacteriocin-like peptide BlpU -
  E0F31_RS11990 (SAMEA3714520_00037) - 24714..24839 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  E0F31_RS00195 (SAMEA3714520_00038) comA 25426..27579 (+) 2154 WP_000668294.1 peptide cleavage/export ABC transporter ComA Regulator
  E0F31_RS00200 (SAMEA3714520_00039) comB 27592..28941 (+) 1350 WP_000801591.1 competence pheromone export protein ComB Regulator
  E0F31_RS00205 (SAMEA3714520_00040) purC 29111..29818 (+) 708 WP_000043304.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  E0F31_RS12145 - 29829..29963 (+) 135 WP_000429436.1 hypothetical protein -
  E0F31_RS00215 (SAMEA3714520_00041) - 30020..33745 (+) 3726 WP_000361217.1 phosphoribosylformylglycinamidine synthase -
  E0F31_RS00220 (SAMEA3714520_00042) purF 33838..35280 (+) 1443 WP_050206437.1 amidophosphoribosyltransferase -
  E0F31_RS00225 (SAMEA3714520_00043) purM 35317..36339 (+) 1023 WP_000182575.1 phosphoribosylformylglycinamidine cyclo-ligase -
  E0F31_RS00230 (SAMEA3714520_00044) purN 36336..36881 (+) 546 WP_050289164.1 phosphoribosylglycinamide formyltransferase -
  E0F31_RS00235 (SAMEA3714520_00045) - 36965..37474 (+) 510 WP_000894018.1 VanZ family protein -
  E0F31_RS00240 (SAMEA3714520_00046) purH 37499..39046 (+) 1548 WP_000167084.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -

Sequence


Protein


Length: 449 a.a.   Molecular weight: 49694.60 Da   Isoelectric point: 5.5684

>NTDB_id=1126243 E0F31_RS00200 WP_000801591.1 27592..28941(+) (comB) [Streptococcus pneumoniae strain GPSC13 substr. ST473 isolate 569492b0-41bd-11e5-998e-3c4a9275d6c6]
MKPEFLESAEFYNRRYHNFSSRVIVPMSLLLVFLLGFATFAEKEISLSTRATVEPSRILANIQSTSNNRILVNHLEENKL
VKKGDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFRDYISQAGSLRASTSQQN
ETIASQNAAASQTQAEIGNLISQTEAKIRDYQTAKSAIETGASLAGQNLAYSLYQSYKSQGEENPQIKVQAVAQVEAQIS
QLESSLATYRVQYAGSGTQQAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKITASEDGVLH
LNPETSDSSMVAEGALLAQLYPSLEREGKAKLTAYLSSKDVARIKVGDSVRYTTTHDAGNQLFLDSTITSIDATATKTEK
GNFFKIEAETNLTSEQAEKLRYGVEGRLQMITGKKSYLRYYLDQFLNKE

Nucleotide


Length: 1350 bp

>NTDB_id=1126243 E0F31_RS00200 WP_000801591.1 27592..28941(+) (comB) [Streptococcus pneumoniae strain GPSC13 substr. ST473 isolate 569492b0-41bd-11e5-998e-3c4a9275d6c6]
ATGAAACCAGAATTTTTAGAAAGTGCGGAGTTTTATAATCGTCGTTACCATAATTTTTCCAGTCGGGTGATTGTACCCAT
GTCCCTTCTGCTCGTGTTTTTACTTGGCTTTGCAACATTTGCGGAGAAGGAGATAAGTTTATCAACTAGAGCTACTGTCG
AGCCTAGTCGTATCCTTGCAAATATCCAGTCAACTAGCAACAATCGTATTCTTGTCAATCATTTGGAAGAAAATAAGCTG
GTTAAGAAGGGGGATCTTTTGGTTCAATACCAAGAAGGGGCAGAGGGTGTCCAAGCGGAGTCCTATGCCAGTCAGTTGGA
CATGCTAAAGGATCAAAAAAAGCAATTGGAGTATCTGCAAAAGAGCCTGCAAGAAGGGGAGAACCACTTTCCAGAGGAGG
ATAAGTTTGGCTACCAAGCCACCTTTCGCGACTACATCAGTCAAGCAGGCAGTCTTAGGGCTAGTACATCGCAACAAAAT
GAGACCATCGCGTCCCAGAATGCAGCAGCTAGCCAAACCCAAGCCGAAATCGGCAACCTCATCAGTCAAACAGAGGCTAA
AATTCGCGATTACCAGACAGCTAAGTCAGCTATTGAAACAGGTGCTTCCTTGGCCGGTCAGAATCTAGCCTACTCTCTTT
ACCAGTCCTACAAGTCTCAGGGCGAAGAAAATCCCCAAATTAAGGTTCAGGCAGTTGCACAGGTTGAAGCACAGATTTCT
CAGTTAGAATCTAGTCTTGCTACTTACCGTGTCCAGTATGCAGGTTCAGGTACCCAGCAAGCCTATGCGTCAGGGTTAAG
CAGTCAATTGGAATCCCTTAAATCCCAACATTTGGCAAAGGTTGGTCAGGAATTGACCCTTCTAGCCCAGAAAATCTTGG
AGGCAGAGTCAGGTAAGAAGGTACAGGGAAATCTTTTAGACAAGGGGAAAATTACGGCGAGTGAGGATGGGGTGCTTCAT
CTTAATCCTGAGACCAGTGATTCTAGCATGGTTGCAGAAGGTGCCCTACTAGCCCAACTTTATCCATCTTTGGAAAGAGA
AGGGAAAGCCAAACTCACAGCCTATCTAAGTTCAAAAGATGTAGCAAGAATCAAGGTCGGTGATTCTGTTCGCTATACTA
CGACTCATGATGCCGGGAATCAACTTTTCCTAGATTCTACTATTACAAGTATTGATGCGACAGCTACTAAGACTGAGAAA
GGGAATTTCTTTAAAATCGAGGCGGAGACTAATCTAACTTCGGAGCAGGCTGAAAAACTTAGGTACGGGGTGGAAGGCCG
CTTGCAGATGATTACGGGCAAGAAAAGTTACCTACGTTATTATTTGGATCAATTTTTGAACAAAGAGTAA

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4V0IRK4

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comB Streptococcus pneumoniae TIGR4 98.441 100 0.984
  comB Streptococcus pneumoniae D39 98.218 100 0.982
  comB Streptococcus pneumoniae R6 98.218 100 0.982
  comB Streptococcus pneumoniae Rx1 98.218 100 0.982
  comB Streptococcus mitis SK321 95.546 100 0.955
  comB Streptococcus mitis NCTC 12261 94.655 100 0.947
  comB Streptococcus gordonii str. Challis substr. CH1 55.778 100 0.559


Multiple sequence alignment