Detailed information    

insolico Bioinformatically predicted

Overview


Name   comB   Type   Regulator
Locus tag   ABC810_RS10820 Genome accession   NZ_CP155532
Coordinates   2137144..2138493 (-) Length   449 a.a.
NCBI ID   WP_000801611.1    Uniprot ID   -
Organism   Streptococcus pneumoniae strain SP264     
Function   transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2127184..2176655 2137144..2138493 within 0


Gene organization within MGE regions


Location: 2127184..2176655
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ABC810_RS10785 (ABC810_10790) purH 2127184..2128731 (-) 1548 WP_000167083.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -
  ABC810_RS10790 (ABC810_10795) - 2128756..2129265 (-) 510 WP_000894018.1 VanZ family protein -
  ABC810_RS10795 (ABC810_10800) purN 2129349..2129894 (-) 546 WP_000717506.1 phosphoribosylglycinamide formyltransferase -
  ABC810_RS10800 (ABC810_10805) purM 2129891..2130913 (-) 1023 WP_000182575.1 phosphoribosylformylglycinamidine cyclo-ligase -
  ABC810_RS10805 (ABC810_10810) purF 2130950..2132392 (-) 1443 WP_000220632.1 amidophosphoribosyltransferase -
  ABC810_RS10810 (ABC810_10815) - 2132485..2136210 (-) 3726 WP_000361217.1 phosphoribosylformylglycinamidine synthase -
  ABC810_RS10815 (ABC810_10820) purC 2136267..2136974 (-) 708 WP_000043310.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  ABC810_RS10820 (ABC810_10825) comB 2137144..2138493 (-) 1350 WP_000801611.1 competence pheromone export protein ComB Regulator
  ABC810_RS10825 (ABC810_10830) comA 2138506..2140659 (-) 2154 WP_000668272.1 peptide cleavage/export ABC transporter ComA Regulator
  ABC810_RS10830 (ABC810_10835) - 2141246..2141371 (-) 126 WP_000346297.1 PncF family bacteriocin immunity protein -
  ABC810_RS10835 (ABC810_10840) blpU 2141374..2141604 (-) 231 WP_001093075.1 bacteriocin-like peptide BlpU -
  ABC810_RS10840 (ABC810_10845) - 2141806..2142097 (-) 292 Protein_2101 IS5/IS1182 family transposase -
  ABC810_RS10845 (ABC810_10850) - 2142143..2142376 (-) 234 WP_000136449.1 acyl carrier protein -
  ABC810_RS10850 (ABC810_10855) plsX 2142382..2143374 (-) 993 WP_000717451.1 phosphate acyltransferase PlsX -
  ABC810_RS10855 (ABC810_10860) recO 2143371..2144141 (-) 771 WP_000616164.1 DNA repair protein RecO -
  ABC810_RS10860 (ABC810_10865) - 2144138..2145307 (-) 1170 WP_000366348.1 pyridoxal phosphate-dependent aminotransferase -
  ABC810_RS10865 (ABC810_10870) - 2145456..2146466 (+) 1011 WP_000009180.1 YeiH family protein -
  ABC810_RS10870 (ABC810_10875) - 2146485..2146700 (-) 216 WP_001814139.1 hypothetical protein -
  ABC810_RS10875 (ABC810_10880) - 2146741..2147178 (-) 438 WP_000076479.1 CoA-binding protein -
  ABC810_RS10880 (ABC810_10885) polA 2147263..2149896 (-) 2634 WP_001812055.1 DNA polymerase I -
  ABC810_RS10885 (ABC810_10890) - 2150505..2151432 (-) 928 Protein_2110 Rpn family recombination-promoting nuclease/putative transposase -
  ABC810_RS10890 (ABC810_10895) - 2151550..2152518 (-) 969 WP_000010163.1 ribose-phosphate diphosphokinase -
  ABC810_RS10895 (ABC810_10900) - 2152663..2153446 (-) 784 Protein_2112 PrsW family glutamic-type intramembrane protease -
  ABC810_RS10900 (ABC810_10905) - 2153471..2153968 (-) 498 WP_001809263.1 carbonic anhydrase -
  ABC810_RS10905 (ABC810_10910) radA 2154041..2155402 (-) 1362 WP_075213698.1 DNA repair protein RadA Machinery gene
  ABC810_RS10910 (ABC810_10915) - 2155416..2155931 (-) 516 WP_000691236.1 histidine phosphatase family protein -
  ABC810_RS10915 (ABC810_10920) - 2155933..2156376 (-) 444 WP_000701992.1 dUTP diphosphatase -
  ABC810_RS10920 (ABC810_10925) - 2156681..2156830 (+) 150 WP_001030863.1 hypothetical protein -
  ABC810_RS10925 (ABC810_10930) - 2156972..2157151 (+) 180 WP_001209433.1 hypothetical protein -
  ABC810_RS10930 (ABC810_10935) - 2157371..2157736 (-) 366 Protein_2119 autolysin -
  ABC810_RS10935 (ABC810_10940) - 2157756..2157923 (+) 168 WP_000024181.1 YjzC family protein -
  ABC810_RS10940 (ABC810_10945) - 2158078..2158281 (-) 204 WP_001247549.1 hypothetical protein -
  ABC810_RS10945 (ABC810_10950) - 2158304..2158495 (-) 192 WP_001112859.1 DNA-binding protein -
  ABC810_RS10950 (ABC810_10955) - 2159067..2159435 (+) 369 WP_000464160.1 helix-turn-helix transcriptional regulator -
  ABC810_RS10955 (ABC810_10960) - 2159435..2159668 (+) 234 WP_000156419.1 hypothetical protein -
  ABC810_RS10960 (ABC810_10965) - 2159668..2159931 (+) 264 WP_000285962.1 type II toxin-antitoxin system RelE/ParE family toxin -
  ABC810_RS10965 (ABC810_10970) - 2159944..2160324 (+) 381 WP_000170931.1 ImmA/IrrE family metallo-endopeptidase -
  ABC810_RS10970 (ABC810_10975) - 2160341..2161411 (+) 1071 WP_000401841.1 type I restriction endonuclease -
  ABC810_RS10975 (ABC810_10980) - 2161472..2161819 (+) 348 WP_001839379.1 hypothetical protein -
  ABC810_RS10980 (ABC810_10985) - 2161909..2162607 (+) 699 WP_001106362.1 site-specific integrase -
  ABC810_RS10990 (ABC810_10995) tadA 2162816..2163283 (-) 468 WP_000291870.1 tRNA adenosine(34) deaminase TadA -
  ABC810_RS10995 (ABC810_11000) - 2163484..2164770 (-) 1287 WP_000205044.1 adenylosuccinate synthase -
  ABC810_RS11000 (ABC810_11005) comW 2165001..2165237 (-) 237 WP_000939545.1 sigma(X)-activator ComW Regulator
  ABC810_RS11005 (ABC810_11010) - 2165503..2166300 (+) 798 Protein_2133 transposase -
  ABC810_RS11010 (ABC810_11015) - 2166335..2167181 (-) 847 Protein_2134 IS630 family transposase -
  ABC810_RS11045 (ABC810_11050) comX/comX2 2172673..2173152 (-) 480 WP_000588897.1 sigma-70 family RNA polymerase sigma factor Regulator
  ABC810_RS11050 (ABC810_11055) - 2173375..2174631 (+) 1257 WP_000436644.1 ISL3 family transposase -
  ABC810_RS11055 (ABC810_11060) ftsH 2174697..2176655 (-) 1959 WP_000744545.1 ATP-dependent zinc metalloprotease FtsH -

Sequence


Protein


Download         Length: 449 a.a.        Molecular weight: 49601.49 Da        Isoelectric Point: 5.5642

>NTDB_id=997138 ABC810_RS10820 WP_000801611.1 2137144..2138493(-) (comB) [Streptococcus pneumoniae strain SP264]
MKPEFLESAEFYNRRYHNFSSSVIVPMALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSNNRILVNHLEENKL
VKKGDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFRDYISQAGSLRASTSQQN
ETIASQNAAASQTQAEIGNLISQTEAKIRDYQTAKSAIETGASLAGQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQIS
QLESSLATYRVQYAGSGTQQAYASGLSSQLESLKSQHLAKVGQELTLLAQKILEAESGKKVQGNLLDKGKVTASEDGVLH
LNPETSDSSMVAEGALLAQLYPSLEREGKAKLTAYLSSKYVARIKVGDSVRYTTTHDAGNQLFLDSTITSIDATATKTEK
GNFFKIEAETNLTSEQAEKLRYGVEGRLQMITGKKSYLRYYLDQFLNKE

Nucleotide


Download         Length: 1350 bp        

>NTDB_id=997138 ABC810_RS10820 WP_000801611.1 2137144..2138493(-) (comB) [Streptococcus pneumoniae strain SP264]
ATGAAACCAGAATTTTTAGAAAGTGCGGAGTTTTATAATCGTCGTTACCATAATTTTTCCAGTAGTGTGATTGTACCCAT
GGCCCTTCTGCTTGTGTTTTTACTTGGCTTTGCAACTGTTGCAGAGAAGGAGATGAGTTTGTCCACTAGAGCTACTGTCG
AACCTAGTCGTATCCTTGCAAATATCCAGTCAACTAGCAACAATCGTATTCTTGTCAATCATTTGGAAGAAAATAAGCTG
GTTAAGAAGGGGGATCTTTTGGTTCAATACCAAGAAGGGGCAGAGGGTGTCCAAGCGGAGTCCTATGCCAGTCAGTTGGA
CATGCTAAAGGATCAAAAAAAGCAATTGGAGTATCTGCAAAAGAGCCTGCAAGAAGGGGAGAACCACTTTCCAGAGGAGG
ATAAGTTTGGCTACCAAGCCACCTTTCGCGACTACATCAGTCAAGCAGGCAGTCTTAGGGCTAGTACATCGCAACAAAAT
GAGACCATCGCGTCCCAGAATGCAGCAGCTAGCCAAACCCAAGCCGAAATCGGCAACCTCATCAGTCAAACAGAGGCTAA
AATTCGCGATTACCAGACAGCTAAGTCAGCTATTGAAACAGGTGCTTCCTTGGCCGGTCAGAATCTAGCCTACTCTCTTT
ACCAGTCCTACAAGTCTCAGGGCGAGGAAAATCCCCAAACTAAGGTTCAGGCAGTTGCACAGGTTGAAGCACAGATTTCT
CAGTTAGAATCTAGTCTTGCTACTTACCGTGTCCAGTATGCAGGTTCAGGTACCCAGCAAGCCTATGCGTCAGGGTTAAG
CAGTCAATTGGAATCCCTTAAATCCCAACACTTGGCAAAGGTTGGTCAGGAATTGACCCTTCTAGCCCAGAAAATTTTGG
AGGCAGAGTCAGGTAAGAAGGTACAGGGAAATCTTTTAGACAAGGGGAAAGTTACGGCGAGTGAGGATGGGGTGCTTCAT
CTTAATCCTGAGACCAGTGATTCTAGCATGGTTGCAGAAGGTGCCCTACTAGCCCAACTTTATCCATCTTTGGAAAGAGA
AGGGAAAGCCAAACTCACAGCTTATCTAAGTTCAAAATATGTAGCAAGAATCAAGGTCGGTGATTCTGTTCGCTATACTA
CGACTCATGATGCCGGGAATCAACTTTTCCTAGATTCTACTATTACAAGTATTGATGCGACAGCTACTAAGACTGAGAAA
GGGAATTTCTTTAAAATCGAGGCGGAGACTAATCTAACTTCGGAGCAGGCTGAAAAACTTAGGTACGGGGTGGAAGGCCG
CTTGCAGATGATTACGGGCAAGAAAAGTTACCTACGTTATTATTTGGATCAATTTTTGAACAAAGAGTAA

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comB Streptococcus pneumoniae TIGR4

100

100

1

  comB Streptococcus pneumoniae Rx1

98.886

100

0.989

  comB Streptococcus pneumoniae D39

98.886

100

0.989

  comB Streptococcus pneumoniae R6

98.886

100

0.989

  comB Streptococcus mitis SK321

94.655

100

0.947

  comB Streptococcus mitis NCTC 12261

94.209

100

0.942

  comB Streptococcus gordonii str. Challis substr. CH1

55.111

100

0.552


Multiple sequence alignment