Detailed information    

In silico: bioinformatically predicted

Overview


Name: comB
Type: Regulator
Locus tag: E0F40_RS00145
Genome accession: NZ_LR216065
Coordinates: 24140..25489 (+)
Length: 449 a.a.
NCBI ID: WP_000801611.1
UniProt ID: -
Organism: Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6
Function: transport of ComC (predicted from homology)
Competence regulation
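
The coordinates and the two lengths reported in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon; the helper name `parse_coords` is hypothetical and not part of the database.

```python
def parse_coords(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")

start, end, strand = parse_coords("24140..25489 (+)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_len_bp = end - start + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_len_bp // 3
protein_len = codons - 1

print(gene_len_bp, protein_len, strand)
```

Under these assumptions the 1350 bp gene gives 450 codons, which agrees with the 449 a.a. protein length listed in the record.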

Genomic Context


Location: 19140..30489
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E0F40_RS00120 (SAMEA3714487_00023) plsX 19259..20251 (+) 993 WP_000717456.1 phosphate acyltransferase PlsX -
  E0F40_RS00125 (SAMEA3714487_00024) - 20257..20490 (+) 234 WP_000136449.1 acyl carrier protein -
  E0F40_RS00130 - 20527..20827 (+) 301 Protein_22 transposase family protein -
  E0F40_RS00135 (SAMEA3714487_00025) blpU 21029..21259 (+) 231 WP_001093075.1 bacteriocin-like peptide BlpU -
  E0F40_RS12275 (SAMEA3714487_00026) - 21262..21387 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  E0F40_RS00140 (SAMEA3714487_00027) comA 21974..24127 (+) 2154 WP_000668272.1 peptide cleavage/export ABC transporter ComA Regulator
  E0F40_RS00145 (SAMEA3714487_00028) comB 24140..25489 (+) 1350 WP_000801611.1 competence pheromone export protein ComB Regulator
  E0F40_RS00150 (SAMEA3714487_00029) purC 25659..26366 (+) 708 WP_000043309.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  E0F40_RS12380 - 26368..26511 (+) 144 WP_050167432.1 hypothetical protein -
  E0F40_RS00160 (SAMEA3714487_00030) - 26568..30293 (+) 3726 WP_000361173.1 phosphoribosylformylglycinamidine synthase -
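
The listed location 19140..30489 is consistent with the comB coordinates extended by 5000 bp on each side. The sketch below reproduces that window and checks a few of the neighbouring genes against it. The 5000 bp flank is inferred from the numbers in this record, not stated by the source, and the function names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the listed numbers, not documented

def context_window(start, end, flank=FLANK_BP):
    """Return the (start, end) window around a gene, clamped at position 1."""
    return max(1, start - flank), end + flank

def within(window, start, end):
    """True if a feature lies entirely inside the window."""
    return window[0] <= start and end <= window[1]

window = context_window(24140, 25489)

# A few neighbours copied from the table above.
neighbours = {
    "plsX": (19259, 20251),
    "comA": (21974, 24127),
    "purC": (25659, 26366),
}
all_inside = all(within(window, s, e) for s, e in neighbours.values())

# Intergenic gap between the end of comA and the start of comB.
gap_comA_comB = 24140 - 24127 - 1

print(window, all_inside, gap_comA_comB)
```

With these inputs the window matches the listed location, and comA ends 12 bp upstream of comB on the same strand.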

Sequence


Protein


Length: 449 a.a.        Molecular weight: 49601.49 Da        Isoelectric point: 5.5642

>NTDB_id=1126318 E0F40_RS00145 WP_000801611.1 24140..25489(+) (comB) [Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6]
MKPEFLESAEFYNRRYHNFSSSVIVPMALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSNNRILVNHLEENKL
VKKGDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFRDYISQAGSLRASTSQQN
ETIASQNAAASQTQAEIGNLISQTEAKIRDYQTAKSAIETGASLAGQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQIS
QLESSLATYRVQYAGSGTQQAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKVTASEDGVLH
LNPETSDSSMVAEGALLAQLYPSLEREGKAKLTAYLSSKYVARIKVGDSVRYTTTHDAGNQLFLDSTITSIDATATKTEK
GNFFKIEAETNLTSEQAEKLRYGVEGRLQMITGKKSYLRYYLDQFLNKE

Nucleotide


Length: 1350 bp

>NTDB_id=1126318 E0F40_RS00145 WP_000801611.1 24140..25489(+) (comB) [Streptococcus pneumoniae strain GPSC18 substr. ST13 isolate 55896440-41bd-11e5-998e-3c4a9275d6c6]
ATGAAACCAGAATTTTTAGAAAGTGCGGAGTTTTATAATCGTCGTTACCATAATTTTTCCAGTAGTGTGATTGTACCCAT
GGCCCTTCTGCTTGTGTTTTTACTTGGCTTTGCAACTGTTGCAGAGAAGGAGATGAGTTTGTCCACTAGAGCTACTGTCG
AACCTAGTCGTATCCTTGCAAATATCCAGTCAACTAGCAACAATCGTATTCTTGTCAATCATTTGGAAGAAAATAAGCTG
GTTAAGAAGGGGGATCTTTTGGTTCAATACCAAGAAGGGGCAGAGGGTGTCCAAGCGGAGTCCTATGCCAGTCAGTTGGA
CATGCTAAAGGATCAAAAAAAGCAATTGGAGTATCTGCAAAAGAGCCTGCAAGAAGGGGAGAACCACTTTCCAGAGGAGG
ATAAGTTTGGCTACCAAGCCACCTTTCGCGACTACATCAGTCAAGCAGGCAGTCTTAGGGCTAGTACATCGCAACAAAAT
GAGACCATCGCGTCCCAGAATGCAGCAGCTAGCCAAACCCAAGCCGAAATCGGCAACCTCATCAGTCAAACAGAGGCTAA
AATTCGCGATTACCAGACAGCTAAGTCAGCTATTGAAACAGGTGCTTCCTTGGCCGGTCAGAATCTAGCCTACTCTCTTT
ACCAGTCCTACAAGTCTCAGGGCGAGGAAAATCCCCAAACTAAGGTTCAGGCAGTTGCACAGGTTGAAGCACAGATTTCT
CAGTTAGAATCTAGTCTTGCTACTTACCGTGTCCAGTATGCAGGTTCAGGTACCCAGCAAGCCTATGCGTCAGGGTTAAG
CAGTCAATTGGAATCCCTTAAATCCCAACACTTGGCAAAGGTTGGTCAGGAATTGACCCTTCTAGCCCAGAAAATTTTGG
AGGCAGAGTCAGGTAAGAAGGTACAGGGAAATCTTTTAGACAAGGGGAAAGTTACGGCGAGTGAGGATGGGGTGCTTCAT
CTTAATCCTGAGACCAGTGATTCTAGCATGGTTGCAGAAGGTGCCCTACTAGCCCAACTTTATCCATCTTTGGAAAGAGA
AGGGAAAGCCAAACTCACAGCTTATCTAAGTTCAAAATATGTAGCAAGAATCAAGGTCGGTGATTCTGTTCGCTATACTA
CGACTCATGATGCCGGGAATCAACTTTTCCTAGATTCTACTATTACAAGTATTGATGCGACAGCTACTAAGACTGAGAAA
GGGAATTTCTTTAAAATCGAGGCGGAGACTAATCTAACTTCGGAGCAGGCTGAAAAACTTAGGTACGGGGTGGAAGGCCG
CTTGCAGATGATTACGGGCAAGAAAAGTTACCTACGTTATTATTTGGATCAATTTTTGAACAAAGAGTAA
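
As a consistency check between the two sequences above, the following minimal sketch translates the first 30 and last 21 nucleotides of the gene with the standard codon assignments (which bacterial translation table 11 shares for all sense and stop codons). The names `CODON_TABLE` and `translate` are illustrative, and only short excerpts of the sequence are used.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate an in-frame DNA string, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

head = "ATGAAACCAGAATTTTTAGAAAGTGCGGAG"   # first 30 nt of the nucleotide record
tail = "CAATTTTTGAACAAAGAGTAA"            # last 21 nt, ending in the stop codon

print(translate(head))  # MKPEFLESAE
print(translate(tail))  # QFLNKE*
```

Both excerpts agree with the start (MKPEFLESAE) and end (QFLNKE) of the protein sequence, and the gene terminates in a TAA stop codon.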

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comB Streptococcus pneumoniae TIGR4 100 100 1
  comB Streptococcus pneumoniae Rx1 98.886 100 0.989
  comB Streptococcus pneumoniae D39 98.886 100 0.989
  comB Streptococcus pneumoniae R6 98.886 100 0.989
  comB Streptococcus mitis SK321 94.655 100 0.947
  comB Streptococcus mitis NCTC 12261 94.209 100 0.942
  comB Streptococcus gordonii str. Challis substr. CH1 55.111 100 0.552

