Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comB
Type: Regulator
Locus tag: EQH36_RS00220
Genome accession: NZ_CP035244
Coordinates: 41946..43295 (+)
Length: 449 a.a.
NCBI ID: WP_000801623.1
UniProt ID: A0A064C431
Organism: Streptococcus pneumoniae strain TVO_1901945
Function: transport of ComC (predicted from homology)
Competence regulation
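
The coordinates and length above are consistent with each other, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of the database:

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(41946, 43295)   # comB coordinates from this record
print(cds_bp, protein_length(cds_bp))  # 1350 449
```

The result matches the 1350 bp nucleotide length and the 449 a.a. protein length reported in this record.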

Genomic Context


Location: 36946..48295
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH36_RS00195 (EQH36_00210) plsX 37065..38057 (+) 993 WP_000717460.1 phosphate acyltransferase PlsX -
  EQH36_RS00200 (EQH36_00215) - 38063..38296 (+) 234 WP_000136449.1 acyl carrier protein -
  EQH36_RS00205 (EQH36_00220) - 38333..38632 (+) 300 Protein_35 transposase family protein -
  EQH36_RS00210 (EQH36_00225) blpU 38835..39065 (+) 231 WP_001093071.1 bacteriocin-like peptide BlpU -
  EQH36_RS10785 - 39068..39193 (+) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  EQH36_RS00215 (EQH36_00230) comA 39780..41933 (+) 2154 WP_000668302.1 peptide cleavage/export ABC transporter ComA Regulator
  EQH36_RS00220 (EQH36_00235) comB 41946..43295 (+) 1350 WP_000801623.1 competence pheromone export protein ComB Regulator
  EQH36_RS00225 (EQH36_00240) purC 43465..44172 (+) 708 WP_000043312.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  EQH36_RS10880 (EQH36_00245) - 44174..44317 (+) 144 WP_050167432.1 hypothetical protein -
  EQH36_RS00230 (EQH36_00250) - 44374..48099 (+) 3726 WP_000361175.1 phosphoribosylformylglycinamidine synthase -
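
The Location range appears to be the comB coordinates extended by 5,000 bp on each side (41946 - 5000 = 36946 and 43295 + 5000 = 48295). The flank size is inferred from these numbers and is not stated in the record. A small sketch under that assumption, with hypothetical helper names:

```python
FLANK_BP = 5000  # assumption: inferred from the Location range, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend an inclusive gene range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def within(window: tuple[int, int], start: int, end: int) -> bool:
    """True if a feature lies entirely inside the window."""
    return window[0] <= start and end <= window[1]


window = context_window(41946, 43295)
print(window)                        # (36946, 48295)
print(within(window, 37065, 38057))  # plsX: True
```

Every gene listed in the table above falls entirely inside this window.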

Sequence


Protein


Length: 449 a.a.        Molecular weight: 49617.44 Da        Isoelectric Point: 5.3895

>NTDB_id=337262 EQH36_RS00220 WP_000801623.1 41946..43295(+) (comB) [Streptococcus pneumoniae strain TVO_1901945]
MKPEFLESAEFYNRRYHNFSSSVIVPMALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSNNRILVNHLEENKL
VKKGDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFRDYISQAGSLRASTSQQN
ETIASQNAAASQTQSEIGNLISQTEAKIRDYQTAKSAIETGASLAGQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQIS
QLESSLATYRVQYAGSGTQQAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKITASEDGVLH
LNPETSDSSMVAEGALLAQLYPSLEREGKAKLTAYLSSKDVARIKVGDSVRYTTTHDAGNQFFLDSTITSIDATATKTEK
GNFFKIEAETNLTSEQAEKLRYGVEGRLQMITGKKSYLRYYLDQFLNKE
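
The FASTA header of this record packs several fields into one line. The sketch below splits it with a regular expression. The pattern is inferred from this single header, the group names are illustrative, and it is not a documented format specification:

```python
import re

# Pattern inferred from the one header shown in this record (assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict[str, str]:
    """Return the header fields as a dict, or raise if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header layout")
    return match.groupdict()


header = (">NTDB_id=337262 EQH36_RS00220 WP_000801623.1 41946..43295(+) "
          "(comB) [Streptococcus pneumoniae strain TVO_1901945]")
fields = parse_header(header)
print(fields["gene"], fields["start"], fields["end"], fields["strand"])
```

Headers from other records may deviate from this layout, so the parser raises an error rather than guessing.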

Nucleotide


Length: 1350 bp

>NTDB_id=337262 EQH36_RS00220 WP_000801623.1 41946..43295(+) (comB) [Streptococcus pneumoniae strain TVO_1901945]
ATGAAACCAGAATTTTTAGAAAGTGCGGAGTTTTATAATCGTCGTTACCATAATTTTTCCAGTAGTGTGATTGTACCCAT
GGCCCTTCTGCTCGTGTTTTTACTTGGCTTTGCAACTGTTGCAGAGAAGGAGATGAGTTTGTCCACTAGAGCTACTGTCG
AACCCAGTCGTATCCTTGCAAATATCCAGTCAACTAGCAACAATCGTATTCTTGTCAATCATTTGGAAGAAAATAAGCTG
GTTAAGAAGGGGGATCTTTTGGTTCAATACCAAGAAGGGGCAGAGGGTGTCCAAGCGGAGTCCTATGCCAGTCAGTTGGA
CATGCTAAAGGATCAAAAAAAGCAATTGGAGTATCTGCAAAAGAGCCTGCAAGAAGGGGAGAACCACTTTCCAGAGGAGG
ATAAGTTTGGCTACCAAGCCACCTTTCGCGACTACATCAGTCAAGCAGGCAGTCTTAGGGCTAGTACATCGCAACAAAAT
GAGACCATCGCGTCCCAGAATGCAGCAGCTAGCCAAACCCAATCCGAAATCGGCAACCTCATCAGTCAAACAGAGGCTAA
AATTCGCGATTACCAGACAGCTAAGTCAGCTATTGAAACAGGTGCTTCCTTGGCCGGTCAGAATCTAGCCTACTCTCTTT
ACCAGTCCTACAAGTCTCAGGGCGAGGAAAATCCCCAAACTAAGGTTCAGGCAGTTGCACAGGTTGAAGCACAGATTTCT
CAGTTAGAATCTAGTCTTGCTACTTACCGTGTCCAGTATGCAGGTTCAGGTACCCAGCAAGCCTATGCGTCAGGGTTAAG
CAGTCAATTGGAATCCCTTAAATCCCAACACTTGGCAAAGGTTGGTCAGGAATTGACCCTTCTAGCCCAGAAAATCTTGG
AGGCAGAGTCAGGTAAGAAGGTACAGGGAAATCTTTTAGACAAGGGGAAAATTACGGCGAGTGAGGATGGGGTGCTTCAT
CTTAATCCTGAGACCAGTGATTCTAGCATGGTTGCAGAAGGTGCCCTACTAGCCCAACTTTATCCATCTTTGGAAAGAGA
AGGGAAAGCCAAACTCACAGCCTATCTAAGTTCAAAAGATGTAGCAAGAATCAAGGTCGGTGATTCTGTTCGCTATACTA
CGACTCATGATGCCGGGAATCAATTTTTTCTAGATTCTACTATTACAAGTATTGATGCGACAGCTACTAAGACTGAGAAA
GGGAATTTCTTTAAAATCGAGGCGGAGACTAATCTAACTTCGGAGCAGGCTGAAAAACTTAGGTACGGGGTGGAAGGCCG
TCTACAGATGATTACGGGCAAGAAAAGTTATCTACGTTATTATTTGGATCAATTTTTGAACAAAGAGTAA

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A064C431

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                               Identities (%)   Coverage (%)   Ha-value
  comB    Streptococcus pneumoniae TIGR4                         99.109           100            0.991
  comB    Streptococcus pneumoniae Rx1                           98.886           100            0.989
  comB    Streptococcus pneumoniae D39                           98.886           100            0.989
  comB    Streptococcus pneumoniae R6                            98.886           100            0.989
  comB    Streptococcus mitis SK321                              94.878           100            0.949
  comB    Streptococcus mitis NCTC 12261                         94.209           100            0.942
  comB    Streptococcus gordonii str. Challis substr. CH1        55.333           100            0.555


Multiple sequence alignment