Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comB   Type   Regulator
Locus tag   EQH37_RS00520   Genome accession   NZ_CP035243
Coordinates   87874..89223 (+)   Length   449 a.a.
NCBI ID   WP_000801610.1   UniProt ID   A0A0X9S2V2
Organism   Streptococcus pneumoniae strain TVO_1901946
Function   transport of ComC (predicted from homology); competence regulation
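The coordinates and lengths in this overview can be cross-checked with simple arithmetic: the inclusive interval 87874..89223 spans 1350 bp, which is 450 codons, or 449 amino acids plus one stop codon. A minimal Python sketch of that check follows; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate interval."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(87874, 89223)   # comB coordinates from the record
aa = protein_length(cds_bp)
print(cds_bp, aa)
```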

Genomic Context


Location: 82874..94223
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
EQH37_RS00500 (EQH37_00510)   -   83781..84263 (+)   483   WP_001232094.1   stage II sporulation protein M   -
EQH37_RS00505 (EQH37_00515)   -   84260..84856 (+)   597   WP_000687787.1   ATP-binding cassette domain-containing protein   -
EQH37_RS00510 (EQH37_00520)   -   84837..85328 (+)   492   WP_000671990.1   hypothetical protein   -
EQH37_RS00515 (EQH37_00525)   comA   85708..87861 (+)   2154   WP_000668290.1   peptide cleavage/export ABC transporter ComA   Regulator
EQH37_RS00520 (EQH37_00530)   comB   87874..89223 (+)   1350   WP_000801610.1   competence pheromone export protein ComB   Regulator
EQH37_RS00525 (EQH37_00535)   purC   89393..90100 (+)   708   WP_000043300.1   phosphoribosylaminoimidazolesuccinocarboxamide synthase   -
EQH37_RS10895 (EQH37_00540)   -   90111..90245 (+)   135   WP_000429436.1   hypothetical protein   -
EQH37_RS00530 (EQH37_00545)   -   90302..94027 (+)   3726   WP_000361213.1   phosphoribosylformylglycinamidine synthase   -
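The Location interval equals the comB coordinates extended by 5000 bp on each side (87874 - 5000 = 82874 and 89223 + 5000 = 94223). The sketch below reproduces that window and checks the tabulated sizes. The 5000 bp flank is inferred from these numbers rather than stated in the record, and the helper name is hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from 87874 - 82874 and 94223 - 89223


def context_window(start, end, flank=FLANK_BP):
    """Extend an inclusive interval by `flank` bp on each side (never below 1)."""
    return max(1, start - flank), end + flank


# (locus tag, start, end, size in bp) copied from the genomic context table
GENES = [
    ("EQH37_RS00500", 83781, 84263, 483),
    ("EQH37_RS00505", 84260, 84856, 597),
    ("EQH37_RS00510", 84837, 85328, 492),
    ("EQH37_RS00515", 85708, 87861, 2154),
    ("EQH37_RS00520", 87874, 89223, 1350),
    ("EQH37_RS00525", 89393, 90100, 708),
    ("EQH37_RS10895", 90111, 90245, 135),
    ("EQH37_RS00530", 90302, 94027, 3726),
]

window = context_window(87874, 89223)
# Genes lying entirely inside the window
inside = [tag for tag, s, e, _ in GENES if window[0] <= s and e <= window[1]]
# Each listed size should equal the inclusive span of its coordinates
sizes_ok = all(e - s + 1 == size for _, s, e, size in GENES)
print(window, len(inside), sizes_ok)
```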

Sequence


Protein


Length: 449 a.a.   Molecular weight: 49567.43 Da   Isoelectric point: 5.3895

>NTDB_id=337185 EQH37_RS00520 WP_000801610.1 87874..89223(+) (comB) [Streptococcus pneumoniae strain TVO_1901946]
MKPEFLESAEFYNRRYHNFSSSVIVPMALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSNNRILVNHLEENKL
VKKGDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFRDYISQAGSLRASTSQQN
ETIASQNAAASQTQAEIGNLISQTEAKIRDYQTAKSAIETGASLAGQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQIS
QLESSLATYRVQYAGSGTQQAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKITASEDGVLH
LNPETSDSSMVAEGALLAQLYPSLEREGKAKLTAYLSSKDVARIKVGDSVRYTTTHDAGNQLFLDSTITSIDATATKTEK
GNFFKIEAETNLTSEQAEKLRYGVEGRLQMITGKKSYLRYYLDQFLNKE

Nucleotide


Length: 1350 bp

>NTDB_id=337185 EQH37_RS00520 WP_000801610.1 87874..89223(+) (comB) [Streptococcus pneumoniae strain TVO_1901946]
ATGAAACCAGAATTTTTAGAAAGTGCGGAGTTTTATAATCGTCGTTACCATAATTTTTCCAGTAGTGTGATTGTACCCAT
GGCCCTTCTGCTCGTGTTTTTACTTGGCTTTGCAACTGTTGCAGAGAAGGAGATGAGTTTGTCCACTAGAGCTACTGTCG
AACCCAGTCGTATCCTTGCAAATATCCAGTCAACTAGCAACAATCGTATTCTTGTCAATCATTTGGAAGAAAATAAGCTG
GTTAAGAAGGGGGATCTTTTGGTTCAATACCAAGAAGGGGCAGAGGGTGTCCAAGCGGAGTCCTATGCCAGTCAGTTGGA
CATGCTAAAGGATCAAAAAAAGCAATTGGAGTATCTGCAAAAGAGCCTGCAAGAAGGGGAGAACCACTTTCCAGAGGAGG
ATAAGTTTGGCTACCAAGCCACCTTTCGCGACTACATCAGTCAAGCAGGCAGTCTTAGGGCTAGTACATCGCAACAAAAT
GAGACCATCGCGTCCCAGAATGCAGCAGCTAGCCAAACCCAAGCCGAAATCGGCAACCTCATCAGTCAAACAGAGGCTAA
AATTCGCGATTACCAGACAGCTAAGTCAGCTATTGAAACAGGTGCTTCCTTGGCCGGTCAGAATCTAGCCTACTCTCTTT
ACCAGTCCTACAAGTCTCAGGGCGAAGAAAATCCCCAAACTAAGGTTCAGGCAGTTGCACAGGTTGAAGCACAGATTTCT
CAGTTAGAATCTAGTCTTGCTACTTACCGTGTCCAGTATGCAGGTTCAGGTACCCAGCAAGCCTATGCGTCAGGGTTAAG
CAGTCAATTGGAATCCCTTAAATCCCAACATTTGGCAAAGGTTGGTCAGGAATTGACCCTTCTAGCCCAGAAAATCTTGG
AGGCAGAGTCAGGTAAGAAGGTACAGGGAAATCTTTTAGACAAGGGGAAAATTACGGCGAGTGAGGATGGGGTGCTTCAT
CTTAATCCTGAGACCAGTGATTCTAGCATGGTTGCAGAAGGTGCCCTACTAGCCCAACTTTATCCATCTTTGGAAAGAGA
AGGGAAAGCCAAACTCACAGCCTATCTAAGTTCAAAAGATGTAGCAAGAATCAAGGTCGGTGATTCTGTTCGCTATACTA
CGACTCATGATGCCGGGAATCAACTTTTCCTAGATTCTACTATTACAAGTATTGATGCGACAGCTACTAAGACTGAGAAA
GGGAATTTCTTTAAAATCGAGGCGGAGACTAATCTAACTTCGGAGCAGGCTGAAAAACTTAGGTACGGGGTGGAAGGCCG
CTTGCAGATGATTACGGGCAAGAAAAGTTATCTACGTTATTATTTGGATCAATTTTTGAACAAAGAGTAA
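The nucleotide record should translate to the protein record. The sketch below checks this on the first 30 and last 12 nucleotides of the sequence above, using the standard genetic code laid out in the NCBI base order TCAG. The `translate` helper is illustrative, and stopping at the first stop codon is a design choice of this sketch.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    nt = nt.upper()
    protein = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGAAACCAGAATTTTTAGAAAGTGCGGAG"  # start of the comB CDS
last_12 = "AACAAAGAGTAA"                     # end of the comB CDS, includes TAA

print(translate(first_30))  # compare with the protein start, MKPEFLESAE
print(translate(last_12))   # compare with the protein end, ...NKE
```

Passing the full 1350 bp sequence to `translate` in the same way allows the complete 449-residue protein to be compared.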

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
AlphaFold DB   A0A0X9S2V2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comB   Streptococcus pneumoniae TIGR4   99.555   100   0.996
comB   Streptococcus pneumoniae D39   99.332   100   0.993
comB   Streptococcus pneumoniae R6   99.332   100   0.993
comB   Streptococcus pneumoniae Rx1   99.332   100   0.993
comB   Streptococcus mitis SK321   95.1   100   0.951
comB   Streptococcus mitis NCTC 12261   94.655   100   0.947
comB   Streptococcus gordonii str. Challis substr. CH1   55.556   100   0.557


Multiple sequence alignment