Detailed information    

insolico Bioinformatically predicted

Overview


Name   comB   Type   Regulator
Locus tag   DQM54_RS10445 Genome accession   NZ_LS483375
Coordinates   2104703..2106061 (-) Length   452 a.a.
NCBI ID   WP_111724244.1    Uniprot ID   -
Organism   Streptococcus gordonii strain NCTC3165     
Function   transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2106071..2139803 2104703..2106061 flank 10


Gene organization within MGE regions


Location: 2104703..2139803
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQM54_RS10445 (NCTC3165_02079) comB 2104703..2106061 (-) 1359 WP_111724244.1 competence pheromone export protein ComB Regulator
  DQM54_RS10450 (NCTC3165_02080) comA 2106071..2108224 (-) 2154 WP_111724245.1 peptide cleavage/export ABC transporter ComA Regulator
  DQM54_RS10455 (NCTC3165_02081) - 2108926..2109135 (-) 210 WP_111724246.1 hypothetical protein -
  DQM54_RS10465 (NCTC3165_02083) - 2109997..2110500 (-) 504 WP_111724247.1 hypothetical protein -
  DQM54_RS10470 (NCTC3165_02084) - 2110573..2111097 (-) 525 WP_111724248.1 hypothetical protein -
  DQM54_RS10475 (NCTC3165_02085) - 2111462..2112901 (-) 1440 WP_111724323.1 phage/plasmid primase, P4 family -
  DQM54_RS10480 (NCTC3165_02086) - 2112981..2113841 (-) 861 WP_111724249.1 primase alpha helix C-terminal domain-containing protein -
  DQM54_RS10485 (NCTC3165_02087) - 2113982..2114258 (-) 277 Protein_2012 XRE family transcriptional regulator -
  DQM54_RS10490 (NCTC3165_02088) - 2114251..2114592 (-) 342 WP_111724250.1 DNA-binding protein -
  DQM54_RS10495 (NCTC3165_02089) - 2114589..2114804 (-) 216 WP_111724251.1 hypothetical protein -
  DQM54_RS10500 (NCTC3165_02090) - 2114806..2115009 (-) 204 WP_111724252.1 hypothetical protein -
  DQM54_RS10505 (NCTC3165_02091) - 2115291..2115917 (-) 627 WP_111724253.1 Rha family transcriptional regulator -
  DQM54_RS10510 (NCTC3165_02092) - 2115932..2116129 (-) 198 WP_111724254.1 helix-turn-helix domain-containing protein -
  DQM54_RS10515 (NCTC3165_02093) - 2116293..2117066 (+) 774 WP_111724255.1 helix-turn-helix domain-containing protein -
  DQM54_RS10520 (NCTC3165_02094) - 2117110..2118099 (+) 990 WP_084849558.1 DUF3644 domain-containing protein -
  DQM54_RS10525 (NCTC3165_02095) - 2118445..2119611 (+) 1167 WP_111724256.1 tyrosine-type recombinase/integrase -
  DQM54_RS10530 (NCTC3165_02096) rpsD 2119700..2120311 (-) 612 WP_008810013.1 30S ribosomal protein S4 -
  DQM54_RS10535 (NCTC3165_02097) - 2120577..2121299 (-) 723 WP_111724257.1 ABC transporter ATP-binding protein -
  DQM54_RS10540 (NCTC3165_02098) - 2121299..2122300 (-) 1002 WP_111724258.1 ABC transporter substrate-binding protein -
  DQM54_RS10545 (NCTC3165_02099) - 2122341..2123093 (-) 753 WP_060554273.1 ABC transporter permease -
  DQM54_RS10550 (NCTC3165_02100) - 2123056..2123346 (-) 291 WP_045504760.1 MTH1187 family thiamine-binding protein -
  DQM54_RS10555 (NCTC3165_02102) - 2123686..2124345 (-) 660 WP_061597425.1 HAD family hydrolase -
  DQM54_RS10560 (NCTC3165_02103) srtB 2124424..2125281 (-) 858 WP_045504765.1 class B sortase, LPKTxAVK-specific -
  DQM54_RS10565 (NCTC3165_02104) abpA 2125369..2125956 (-) 588 WP_045504766.1 amylase-binding adhesin AbpA -
  DQM54_RS10570 (NCTC3165_02105) - 2126173..2127138 (-) 966 WP_111724259.1 ribose-phosphate diphosphokinase -
  DQM54_RS10575 (NCTC3165_02106) pcsB 2127298..2128488 (-) 1191 WP_111724260.1 peptidoglycan hydrolase PcsB -
  DQM54_RS10580 (NCTC3165_02107) mreD 2128588..2129088 (-) 501 WP_111724261.1 rod shape-determining protein MreD -
  DQM54_RS10585 (NCTC3165_02108) mreC 2129090..2129905 (-) 816 WP_111724262.1 rod shape-determining protein MreC -
  DQM54_RS10695 (NCTC3165_02130) comR/comR2 2136810..2137292 (-) 483 WP_045772576.1 sigma-70 family RNA polymerase sigma factor Regulator
  DQM54_RS10710 (NCTC3165_02133) ftsH 2137821..2139803 (-) 1983 WP_046164766.1 ATP-dependent zinc metalloprotease FtsH -

Sequence


Protein


Download         Length: 452 a.a.        Molecular weight: 49835.12 Da        Isoelectric Point: 6.1758

>NTDB_id=1139160 DQM54_RS10445 WP_111724244.1 2104703..2106061(-) (comB) [Streptococcus gordonii strain NCTC3165]
MNEQFLESAEFYQKRYHNFASCLIVPSLILLVFLVGFSMLAKKEITISSRASVEASRVLAQIQSTSNQAIIANHLAENKE
VKKGDLLIQYAVEGEGAQEQKFSSQLDLLKDQKGKLETLRSSLESGRNQFTEPDSYGYEQSFKDYQNQVASMTSSVNQQN
ATIASQNAAASQSQAELGGVISDVDSKLNDYRNLKNAIQSGVGIDASHPLYSLYQSYRDQLSLAEDKTTAQSQIVAQLDG
QISQLEATAATYRVQYAGAGAQQAYASNLSSQLASLKAQYLVKVGQELTTLTQQILEAESNLKLQETVSKRGQILAEMDG
LLHLNPEVQGSTLVAEGTALAQIYPKITSERKIKIVTYVSSKDVSTIKNGDKVRFITADDANKQMILTSQISSIDANATQ
TKQGNFFKVECEMAVSKDQAKKLRYGLEGKFVMVTGQKTYFSYYMEKFFNLG

Nucleotide


Download         Length: 1359 bp        

>NTDB_id=1139160 DQM54_RS10445 WP_111724244.1 2104703..2106061(-) (comB) [Streptococcus gordonii strain NCTC3165]
ATGAATGAACAATTTTTAGAAAGTGCAGAGTTTTATCAGAAACGTTATCATAATTTTGCTAGTTGTTTAATTGTGCCAAG
TCTTATTTTACTAGTATTTCTAGTTGGTTTTTCCATGCTGGCTAAAAAGGAAATTACGATTTCTAGCCGTGCTTCTGTAG
AAGCTAGCCGAGTGCTAGCTCAGATTCAGTCGACTAGTAACCAGGCTATCATTGCGAATCATTTGGCAGAAAATAAGGAG
GTCAAAAAAGGAGACCTGTTAATTCAGTATGCAGTAGAAGGAGAGGGAGCTCAGGAACAGAAGTTTTCCAGCCAGCTTGA
CTTACTCAAAGATCAAAAGGGCAAACTAGAAACTTTGCGTTCCAGTTTAGAAAGCGGGCGTAATCAATTCACTGAGCCAG
ATAGTTATGGCTACGAACAAAGCTTTAAAGACTACCAGAATCAAGTAGCGAGTATGACTAGTTCTGTGAATCAGCAAAAT
GCTACGATTGCTTCACAAAATGCAGCGGCCAGTCAGTCTCAGGCGGAACTGGGCGGTGTTATCAGTGATGTGGATAGCAA
ACTAAATGACTATCGAAACTTGAAAAATGCCATTCAAAGTGGGGTAGGTATAGACGCATCGCACCCTCTTTACTCCTTGT
ATCAGTCCTATCGTGATCAGTTGAGTTTGGCAGAGGATAAGACTACGGCTCAAAGTCAGATTGTAGCTCAATTAGATGGG
CAAATTTCTCAATTAGAAGCAACAGCTGCGACTTATCGTGTCCAATATGCTGGTGCAGGCGCTCAACAAGCCTATGCTAG
CAATTTATCTAGTCAGTTGGCTTCACTGAAAGCTCAGTATCTGGTCAAGGTAGGACAAGAATTGACCACTCTGACTCAGC
AAATCTTGGAAGCCGAGAGCAATCTTAAACTTCAAGAAACAGTCTCTAAGCGAGGTCAGATTTTGGCTGAAATGGATGGC
TTGCTTCACTTAAATCCAGAAGTACAAGGATCTACTCTAGTGGCAGAAGGAACAGCTTTGGCTCAGATTTATCCTAAAAT
CACCAGTGAACGAAAAATCAAAATTGTCACATATGTCTCTTCCAAGGATGTATCTACTATTAAAAATGGAGATAAGGTGC
GCTTTATCACAGCTGATGATGCTAACAAACAAATGATTTTGACATCCCAAATTTCCAGTATTGACGCCAATGCTACTCAA
ACCAAACAAGGAAATTTCTTTAAAGTAGAATGCGAAATGGCTGTTAGTAAAGATCAAGCTAAGAAGCTCCGCTATGGCTT
GGAAGGTAAATTCGTCATGGTAACTGGGCAAAAGACTTATTTCAGCTATTATATGGAGAAGTTTTTTAATCTTGGTTGA

Domains


Predicted by InterproScan.

(314-426)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comB Streptococcus gordonii str. Challis substr. CH1

98.23

100

0.982

  comB Streptococcus pneumoniae Rx1

56.222

99.558

0.56

  comB Streptococcus pneumoniae D39

56.222

99.558

0.56

  comB Streptococcus pneumoniae R6

56.222

99.558

0.56

  comB Streptococcus mitis NCTC 12261

56

99.558

0.558

  comB Streptococcus pneumoniae TIGR4

55.778

99.558

0.555

  comB Streptococcus mitis SK321

55.333

99.558

0.551


Multiple sequence alignment