Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYB
Type: Machinery gene
Locus tag: BB164_RS01045
Genome accession: NZ_CP021867
Coordinates: 173002..174045 (+)
Length: 347 a.a.
NCBI ID: WP_223295889.1
UniProt ID: -
Organism: Streptococcus agalactiae strain SG-M25
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
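
The coordinates and lengths in this record agree with each other under one assumption: the coordinates are 1-based and inclusive, and the span includes the terminal stop codon. A minimal Python sketch of that check is given here; the function name cds_lengths is hypothetical and not part of the database.

```python
def cds_lengths(start, end):
    """Return (nucleotide length, protein length) for a coding region.

    Assumes 1-based inclusive coordinates and a span that ends with a
    stop codon, which is not counted in the protein length.
    """
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    aa_length = nt_length // 3 - 1  # drop the stop codon
    return nt_length, aa_length


# comYB (BB164_RS01045), coordinates taken from this record
print(cds_lengths(173002, 174045))  # (1044, 347)
```

The result matches the 1044 bp and 347 a.a. reported for this gene.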

Related MGE


Note: This gene co-localizes with a putative mobile genetic element (MGE) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Prophage   174119..214121   173002..174045   flank   74
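
The distance of 74 bp matches the difference between the gene's end coordinate and the prophage's start coordinate. The sketch below reproduces that arithmetic; the helper name interval_gap is hypothetical, and the convention of a plain coordinate difference (rather than a count of intervening bases) is an assumption inferred from the numbers in this record.

```python
def interval_gap(gene, mge):
    """Return the coordinate difference between two non-overlapping
    intervals given as (start, end) tuples, or 0 if they overlap."""
    gene_start, gene_end = gene
    mge_start, mge_end = mge
    if gene_end < mge_start:   # gene lies upstream of the MGE
        return mge_start - gene_end
    if mge_end < gene_start:   # gene lies downstream of the MGE
        return gene_start - mge_end
    return 0                   # intervals overlap


comYB = (173002, 174045)
prophage = (174119, 214121)
print(interval_gap(comYB, prophage))  # 74
```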


Gene organization within MGE regions


Location: 173002..214121
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BB164_RS01045 (BB164_01045) comYB 173002..174045 (+) 1044 WP_223295889.1 competence type IV pilus assembly protein ComGB Machinery gene
  BB164_RS01050 (BB164_01050) - 174042..174110 (+) 69 Protein_150 competence protein ComGF -
  BB164_RS01055 (BB164_01055) - 174119..175573 (-) 1455 WP_001044789.1 recombinase family protein -
  BB164_RS01060 (BB164_01060) - 175731..176069 (-) 339 WP_000390807.1 hypothetical protein -
  BB164_RS01065 (BB164_01065) - 176073..176501 (-) 429 WP_000200806.1 hypothetical protein -
  BB164_RS01070 (BB164_01070) - 176513..177097 (-) 585 WP_000823130.1 hypothetical protein -
  BB164_RS01075 (BB164_01075) - 177100..177297 (-) 198 WP_000880827.1 hypothetical protein -
  BB164_RS01080 (BB164_01080) - 177316..178056 (-) 741 WP_000580170.1 XRE family transcriptional regulator -
  BB164_RS01085 (BB164_01085) - 178210..178431 (+) 222 WP_000215930.1 helix-turn-helix transcriptional regulator -
  BB164_RS11815 - 178686..178835 (+) 150 WP_001074679.1 BOW99_gp33 family protein -
  BB164_RS01090 (BB164_01090) - 178875..179207 (+) 333 WP_000072970.1 hypothetical protein -
  BB164_RS01095 (BB164_01095) - 179428..180099 (+) 672 WP_000131170.1 ERF family protein -
  BB164_RS01100 (BB164_01100) - 180103..181098 (+) 996 WP_000159058.1 DUF1351 domain-containing protein -
  BB164_RS01105 (BB164_01105) - 181100..181651 (+) 552 WP_001010334.1 MazG-like family protein -
  BB164_RS01110 (BB164_01110) ssbA 181742..182215 (+) 474 WP_000609569.1 single-stranded DNA-binding protein Machinery gene
  BB164_RS01115 (BB164_01115) - 182212..182490 (+) 279 WP_001082484.1 hypothetical protein -
  BB164_RS01120 (BB164_01120) - 182504..183295 (+) 792 WP_029667750.1 DNA methyltransferase -
  BB164_RS01125 (BB164_01125) - 183282..183485 (+) 204 WP_000163592.1 hypothetical protein -
  BB164_RS01130 (BB164_01130) - 183482..183895 (+) 414 WP_000862186.1 helix-turn-helix domain-containing protein -
  BB164_RS01135 (BB164_01135) - 183908..184339 (+) 432 WP_000412398.1 hypothetical protein -
  BB164_RS01140 (BB164_01140) - 184341..185075 (+) 735 WP_001293548.1 hypothetical protein -
  BB164_RS01150 (BB164_01150) - 185718..186044 (+) 327 WP_000836878.1 DUF1372 family protein -
  BB164_RS01155 (BB164_01155) - 186177..186785 (+) 609 WP_001019158.1 DUF1642 domain-containing protein -
  BB164_RS01160 (BB164_01160) - 186772..187020 (+) 249 WP_000958011.1 hypothetical protein -
  BB164_RS01165 (BB164_01165) - 187031..187426 (+) 396 WP_000667207.1 hypothetical protein -
  BB164_RS01170 (BB164_01170) - 187423..187812 (+) 390 WP_000614972.1 YopX family protein -
  BB164_RS01180 (BB164_01180) - 187982..188362 (+) 381 WP_000164677.1 hypothetical protein -
  BB164_RS01185 (BB164_01185) - 188405..188605 (+) 201 WP_001103600.1 hypothetical protein -
  BB164_RS01190 (BB164_01190) - 188628..188843 (+) 216 WP_223875938.1 hypothetical protein -
  BB164_RS01195 (BB164_01195) - 188929..190008 (+) 1080 WP_001038279.1 DUF4417 domain-containing protein -
  BB164_RS01200 (BB164_01200) - 189998..190408 (+) 411 WP_000509432.1 hypothetical protein -
  BB164_RS01205 (BB164_01205) - 190601..191044 (+) 444 WP_001136854.1 hypothetical protein -
  BB164_RS01210 (BB164_01210) - 191031..192344 (+) 1314 WP_001167688.1 PBSX family phage terminase large subunit -
  BB164_RS01215 (BB164_01215) - 192356..193939 (+) 1584 WP_000052338.1 phage portal protein -
  BB164_RS01220 (BB164_01220) - 194038..195207 (+) 1170 WP_000140092.1 phage minor capsid protein -
  BB164_RS01225 (BB164_01225) - 195343..195921 (+) 579 WP_000011256.1 phage scaffolding protein -
  BB164_RS01230 (BB164_01230) - 195944..196828 (+) 885 WP_000224752.1 hypothetical protein -
  BB164_RS01235 (BB164_01235) - 196839..197096 (+) 258 WP_001269647.1 hypothetical protein -
  BB164_RS01240 (BB164_01240) - 197138..197527 (+) 390 WP_000221832.1 hypothetical protein -
  BB164_RS01245 (BB164_01245) - 197517..197843 (+) 327 WP_000565942.1 putative minor capsid protein -
  BB164_RS01250 (BB164_01250) - 197843..198205 (+) 363 WP_000077535.1 minor capsid protein -
  BB164_RS01255 (BB164_01255) - 198206..198613 (+) 408 WP_001180646.1 minor capsid protein -
  BB164_RS01260 (BB164_01260) - 198616..199071 (+) 456 WP_000251918.1 phage tail tube protein -
  BB164_RS01265 (BB164_01265) - 199103..199501 (+) 399 WP_000069050.1 hypothetical protein -
  BB164_RS01270 (BB164_01270) - 199504..200088 (+) 585 WP_000459714.1 Gp15 family bacteriophage protein -
  BB164_RS01275 (BB164_01275) - 200107..203883 (+) 3777 WP_000763172.1 tape measure protein -
  BB164_RS01280 (BB164_01280) - 203876..205324 (+) 1449 WP_001293185.1 distal tail protein Dit -
  BB164_RS01285 (BB164_01285) - 205324..208125 (+) 2802 WP_001882115.1 phage tail protein -
  BB164_RS01290 (BB164_01290) - 208136..210148 (+) 2013 WP_000536261.1 DUF859 family phage minor structural protein -
  BB164_RS01295 (BB164_01295) - 210162..210488 (+) 327 WP_000404431.1 DUF1366 domain-containing protein -
  BB164_RS01300 (BB164_01300) - 210517..210675 (+) 159 WP_418287329.1 CD1375 family protein -
  BB164_RS01305 (BB164_01305) - 210688..210990 (+) 303 WP_000215499.1 hypothetical protein -
  BB164_RS01310 (BB164_01310) - 210992..211327 (+) 336 WP_001001960.1 phage holin -
  BB164_RS01315 (BB164_01315) - 211329..212048 (+) 720 WP_000236381.1 peptidoglycan amidohydrolase family protein -
  BB164_RS01320 (BB164_01320) comGF 212261..212683 (+) 423 WP_001264244.1 competence type IV pilus minor pilin ComGF -
  BB164_RS01325 (BB164_01325) comGG 212661..213032 (+) 372 WP_000601104.1 competence type IV pilus minor pilin ComGG -
  BB164_RS01330 (BB164_01330) comYH 213147..214121 (+) 975 WP_001008574.1 class I SAM-dependent methyltransferase Machinery gene

Sequence


Protein


Length: 347 a.a.   Molecular weight: 39936.65 Da   Isoelectric point: 9.9002

>NTDB_id=234109 BB164_RS01045 WP_223295889.1 173002..174045(+) (comYB) [Streptococcus agalactiae strain SG-M25]
MDKWISWLKKDISVRNRHKSKKLSLKKQRKVVQLFNNLFASGFSLTDMVTFLKRSKLLSDCYTDSMNKALLEGKDLSKML
GELGFSDTVITQVALADLHGNISRSLLKIESYLANLLLVRKKVIEVATYPLILLSFLVLIMIGLRNYLMPQLGENNFATR
LITNVPNIFLLLLAVVLIFSLIFYIIQKRLSRIKVACFLTTIPLVGSYVKLYLTAYYAREWGNLLSQGIELDQIVKVMQN
QKSKLFREIGYDMEEGFLSGKAFHQKVLDYPFFLTELSLMIEYGQVKAKLGTELDIYADEKWEDFFTKLARATQLIQPVI
FIFVALIIVMIYAAMLLPMYQNMEILS

Nucleotide


Length: 1044 bp

>NTDB_id=234109 BB164_RS01045 WP_223295889.1 173002..174045(+) (comYB) [Streptococcus agalactiae strain SG-M25]
ATAGACAAGTGGATATCTTGGCTGAAGAAGGACATATCAGTAAGAAACAGGCACAAGTCGAAAAAATTATCCCTCAAGAA
ACAACGGAAAGTAGTCCAACTTTTTAATAATCTTTTTGCTAGTGGTTTTTCTTTAACTGATATGGTCACCTTCTTAAAGA
GGAGTAAGTTATTGTCTGATTGTTATACAGATAGTATGAATAAGGCATTATTAGAGGGAAAAGATTTATCAAAAATGTTA
GGAGAGTTAGGTTTTTCTGACACTGTTATCACACAGGTTGCATTAGCTGATTTGCATGGTAACATTTCAAGGAGCCTACT
AAAGATTGAGTCTTATTTAGCTAATCTTTTGTTAGTTAGAAAAAAAGTAATAGAAGTAGCTACTTACCCATTGATATTAT
TGTCTTTTCTGGTGCTAATTATGATTGGCCTTAGGAATTATTTAATGCCCCAATTAGGAGAAAATAATTTTGCAACTAGA
CTGATTACAAATGTGCCGAATATTTTCTTATTACTTTTAGCAGTTGTACTTATTTTTAGTTTAATATTTTATATTATTCA
AAAGCGATTGTCGCGCATTAAAGTAGCTTGTTTTTTAACAACAATTCCCTTAGTTGGATCATATGTTAAGCTTTATTTAA
CTGCTTACTATGCCCGTGAGTGGGGAAATTTATTAAGTCAAGGTATTGAATTGGACCAAATTGTTAAAGTAATGCAAAAT
CAGAAATCCAAACTTTTTAGGGAAATAGGATATGACATGGAAGAAGGTTTTCTATCAGGTAAAGCATTTCACCAAAAAGT
ATTAGACTATCCGTTTTTCTTAACTGAGCTTAGTTTAATGATTGAATATGGCCAAGTTAAGGCGAAATTAGGAACAGAGT
TAGATATATATGCTGATGAGAAGTGGGAGGATTTTTTTACAAAATTAGCTAGAGCGACCCAGTTAATCCAACCCGTTATT
TTTATTTTTGTAGCTCTTATCATTGTTATGATTTATGCAGCAATGCTGTTACCAATGTATCAAAATATGGAGATATTATC
ATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comYB   Streptococcus mutans UA140   62.17   98.271   0.611
comYB   Streptococcus mutans UA159   61.877   98.271   0.608
comYB   Streptococcus gordonii str. Challis substr. CH1   56.305   98.271   0.553
comGB/cglB   Streptococcus mitis NCTC 12261   55.224   96.542   0.533
comGB/cglB   Streptococcus mitis SK321   54.627   96.542   0.527
comGB/cglB   Streptococcus pneumoniae Rx1   53.731   96.542   0.519
comGB/cglB   Streptococcus pneumoniae D39   53.731   96.542   0.519
comGB/cglB   Streptococcus pneumoniae R6   53.731   96.542   0.519
comGB/cglB   Streptococcus pneumoniae TIGR4   53.731   96.542   0.519
comGB   Lactococcus lactis subsp. cremoris KW2   50.148   97.118   0.487
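
The Ha-values listed for these proteins are numerically consistent with the product of the identity and coverage fractions, that is, identity (%) × coverage (%) / 10000, rounded to three decimals. The record does not state this definition, so treat it as an assumption inferred from the values. A short Python sketch of the calculation, with the hypothetical helper name ha_value:

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: product of identity and coverage, each
    expressed as a fraction, rounded to three decimals."""
    return round(identity_pct * coverage_pct / 10000, 3)


# (identity %, coverage %) pairs taken from this record
hits = {
    "S. mutans UA140": (62.17, 98.271),
    "S. pneumoniae R6": (53.731, 96.542),
    "L. lactis subsp. cremoris KW2": (50.148, 97.118),
}
for organism, (identity, coverage) in hits.items():
    print(organism, ha_value(identity, coverage))
```

For the three hits shown, the sketch gives 0.611, 0.519 and 0.487, matching the listed values.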


Multiple sequence alignment