Detailed information
Overview
| Name | comYB | Type | Machinery gene |
| Locus tag | I6H73_RS00050 | Genome accession | NZ_CP066073 |
| Coordinates | 14743..15774 (+) | Length | 343 a.a. |
| NCBI ID | WP_015057201.1 | Uniprot ID | - |
| Organism | Streptococcus dysgalactiae strain FDAARGOS_1016 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 14788..65553 | 14743..15774 | flank | -986 |
Gene organization within MGE regions
Location: 14743..65553
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| I6H73_RS00050 (I6H73_00050) | comYB | 14743..15774 (+) | 1032 | WP_015057201.1 | competence type IV pilus assembly protein ComGB | Machinery gene |
| I6H73_RS00055 (I6H73_00055) | comYC | 15776..16102 (+) | 327 | WP_003055259.1 | competence type IV pilus major pilin ComGC | Machinery gene |
| I6H73_RS00060 (I6H73_00060) | - | 16139..17593 (-) | 1455 | WP_003055278.1 | recombinase family protein | - |
| I6H73_RS00065 (I6H73_00065) | - | 17730..18221 (-) | 492 | WP_003055290.1 | hypothetical protein | - |
| I6H73_RS00070 (I6H73_00070) | - | 18232..19002 (-) | 771 | WP_003055282.1 | XRE family transcriptional regulator | - |
| I6H73_RS00075 (I6H73_00075) | - | 19203..19406 (+) | 204 | WP_003058516.1 | helix-turn-helix transcriptional regulator | - |
| I6H73_RS00080 (I6H73_00080) | - | 19695..19844 (+) | 150 | WP_003058552.1 | hypothetical protein | - |
| I6H73_RS00085 (I6H73_00085) | - | 19884..20231 (+) | 348 | WP_003058582.1 | hypothetical protein | - |
| I6H73_RS00090 (I6H73_00090) | - | 20492..20740 (+) | 249 | WP_003058590.1 | hypothetical protein | - |
| I6H73_RS00095 (I6H73_00095) | - | 20740..21432 (+) | 693 | WP_003058502.1 | ERF family protein | - |
| I6H73_RS00100 (I6H73_00100) | - | 21436..21762 (+) | 327 | WP_003058496.1 | hypothetical protein | - |
| I6H73_RS00105 (I6H73_00105) | - | 21776..22771 (+) | 996 | WP_003058518.1 | DUF1351 domain-containing protein | - |
| I6H73_RS00110 (I6H73_00110) | - | 22771..23325 (+) | 555 | WP_003058546.1 | MazG-like family protein | - |
| I6H73_RS00115 (I6H73_00115) | - | 23312..23506 (+) | 195 | WP_115261906.1 | hypothetical protein | - |
| I6H73_RS00120 (I6H73_00120) | - | 23493..24014 (+) | 522 | WP_003058526.1 | hypothetical protein | - |
| I6H73_RS00125 (I6H73_00125) | - | 24023..24721 (+) | 699 | WP_003058504.1 | site-specific DNA-methyltransferase | - |
| I6H73_RS00130 (I6H73_00130) | - | 24744..25022 (+) | 279 | WP_037588195.1 | hypothetical protein | - |
| I6H73_RS00135 (I6H73_00135) | - | 25099..25395 (+) | 297 | Protein_26 | methylase | - |
| I6H73_RS00140 (I6H73_00140) | - | 25382..25585 (+) | 204 | WP_003058550.1 | hypothetical protein | - |
| I6H73_RS00145 (I6H73_00145) | - | 25582..26019 (+) | 438 | WP_003058575.1 | helix-turn-helix domain-containing protein | - |
| I6H73_RS00150 (I6H73_00150) | - | 26032..26463 (+) | 432 | WP_003058500.1 | hypothetical protein | - |
| I6H73_RS00155 (I6H73_00155) | - | 26465..27202 (+) | 738 | WP_003058554.1 | hypothetical protein | - |
| I6H73_RS00160 (I6H73_00160) | - | 27222..27593 (+) | 372 | WP_003058541.1 | hypothetical protein | - |
| I6H73_RS00165 (I6H73_00165) | - | 27749..27925 (+) | 177 | WP_165737297.1 | hypothetical protein | - |
| I6H73_RS00170 (I6H73_00170) | - | 27927..28268 (+) | 342 | WP_003058522.1 | DUF1372 family protein | - |
| I6H73_RS00175 (I6H73_00175) | - | 28268..28408 (+) | 141 | WP_003058562.1 | hypothetical protein | - |
| I6H73_RS00180 (I6H73_00180) | - | 28410..28601 (+) | 192 | WP_003058511.1 | hypothetical protein | - |
| I6H73_RS00185 (I6H73_00185) | - | 28594..28824 (+) | 231 | WP_003058513.1 | hypothetical protein | - |
| I6H73_RS00190 (I6H73_00190) | - | 28821..29231 (+) | 411 | WP_037588173.1 | endodeoxyribonuclease RusA | - |
| I6H73_RS00195 (I6H73_00195) | - | 29224..29604 (+) | 381 | WP_003058484.1 | hypothetical protein | - |
| I6H73_RS00200 (I6H73_00200) | - | 29671..30348 (+) | 678 | WP_003058558.1 | DUF4417 domain-containing protein | - |
| I6H73_RS00205 (I6H73_00205) | - | 30345..30728 (+) | 384 | WP_003058537.1 | hypothetical protein | - |
| I6H73_RS00210 (I6H73_00210) | - | 30921..31400 (+) | 480 | WP_003058573.1 | terminase small subunit | - |
| I6H73_RS00215 (I6H73_00215) | - | 31387..32700 (+) | 1314 | WP_003058577.1 | PBSX family phage terminase large subunit | - |
| I6H73_RS00220 (I6H73_00220) | - | 32712..34289 (+) | 1578 | WP_003058556.1 | phage portal protein | - |
| I6H73_RS00225 (I6H73_00225) | - | 34292..35440 (+) | 1149 | WP_003058578.1 | phage minor capsid protein | - |
| I6H73_RS00230 (I6H73_00230) | - | 35587..36150 (+) | 564 | WP_003058544.1 | phage scaffolding protein | - |
| I6H73_RS00235 (I6H73_00235) | - | 36169..37047 (+) | 879 | WP_003058535.1 | hypothetical protein | - |
| I6H73_RS00240 (I6H73_00240) | - | 37058..37294 (+) | 237 | WP_003058596.1 | hypothetical protein | - |
| I6H73_RS00245 (I6H73_00245) | - | 37338..37730 (+) | 393 | WP_003058548.1 | hypothetical protein | - |
| I6H73_RS00250 (I6H73_00250) | - | 37720..38046 (+) | 327 | WP_198465271.1 | putative minor capsid protein | - |
| I6H73_RS00255 (I6H73_00255) | - | 38046..38396 (+) | 351 | WP_003058592.1 | minor capsid protein | - |
| I6H73_RS00260 (I6H73_00260) | - | 38396..38803 (+) | 408 | WP_003058570.1 | minor capsid protein | - |
| I6H73_RS00265 (I6H73_00265) | - | 38808..39263 (+) | 456 | WP_003058598.1 | hypothetical protein | - |
| I6H73_RS00270 (I6H73_00270) | - | 39306..39686 (+) | 381 | WP_003058580.1 | hypothetical protein | - |
| I6H73_RS00275 (I6H73_00275) | - | 39686..40273 (+) | 588 | WP_003058588.1 | Gp15 family bacteriophage protein | - |
| I6H73_RS00280 (I6H73_00280) | - | 40293..43829 (+) | 3537 | WP_003058479.1 | tape measure protein | - |
| I6H73_RS00285 (I6H73_00285) | - | 43826..45328 (+) | 1503 | WP_003058481.1 | distal tail protein Dit | - |
| I6H73_RS00290 (I6H73_00290) | - | 45332..48256 (+) | 2925 | WP_198465272.1 | phage tail spike protein | - |
| I6H73_RS00295 (I6H73_00295) | - | 48268..50133 (+) | 1866 | WP_003058489.1 | DUF859 family phage minor structural protein | - |
| I6H73_RS00300 (I6H73_00300) | - | 50184..50612 (+) | 429 | WP_003058584.1 | DUF1366 domain-containing protein | - |
| I6H73_RS00305 (I6H73_00305) | - | 50578..50805 (+) | 228 | WP_037588167.1 | hypothetical protein | - |
| I6H73_RS00310 (I6H73_00310) | - | 50813..51280 (+) | 468 | WP_003058491.1 | phage holin family protein | - |
| I6H73_RS00315 (I6H73_00315) | - | 51273..51608 (+) | 336 | WP_003058565.1 | phage holin | - |
| I6H73_RS00320 (I6H73_00320) | - | 51610..53055 (+) | 1446 | WP_003058531.1 | peptidoglycan amidohydrolase family protein | - |
| I6H73_RS00325 (I6H73_00325) | - | 53193..53384 (+) | 192 | WP_003058494.1 | hypothetical protein | - |
| I6H73_RS00330 (I6H73_00330) | comGD | 53399..53791 (+) | 393 | WP_003058533.1 | competence type IV pilus minor pilin ComGD | - |
| I6H73_RS00335 (I6H73_00335) | comGE | 53799..54044 (+) | 246 | WP_003058569.1 | competence type IV pilus minor pilin ComGE | - |
| I6H73_RS00340 (I6H73_00340) | comGF | 54025..54465 (+) | 441 | WP_003058520.1 | competence type IV pilus minor pilin ComGF | - |
| I6H73_RS00345 (I6H73_00345) | comGG | 54488..54817 (+) | 330 | WP_003058539.1 | competence type IV pilus minor pilin ComGG | - |
| I6H73_RS00350 (I6H73_00350) | comYH | 54880..55833 (+) | 954 | WP_003058509.1 | class I SAM-dependent methyltransferase | Machinery gene |
| I6H73_RS00355 (I6H73_00355) | - | 55892..57088 (+) | 1197 | WP_003058485.1 | acetate kinase | - |
| I6H73_RS00360 (I6H73_00360) | - | 57394..57600 (+) | 207 | WP_003058508.1 | helix-turn-helix transcriptional regulator | - |
| I6H73_RS00365 (I6H73_00365) | - | 57703..58383 (+) | 681 | WP_003058487.1 | type II CAAX endopeptidase family protein | - |
| I6H73_RS00370 (I6H73_00370) | - | 58401..58838 (+) | 438 | WP_003058498.1 | hypothetical protein | - |
| I6H73_RS00375 (I6H73_00375) | proC | 58978..59754 (-) | 777 | WP_003058540.1 | pyrroline-5-carboxylate reductase | - |
| I6H73_RS00380 (I6H73_00380) | pepA | 59801..60868 (-) | 1068 | WP_003058528.1 | glutamyl aminopeptidase | - |
| I6H73_RS00385 (I6H73_00385) | - | 61426..61710 (+) | 285 | WP_003055815.1 | DUF4651 domain-containing protein | - |
| I6H73_RS00390 (I6H73_00390) | - | 61707..62024 (+) | 318 | WP_003055817.1 | thioredoxin family protein | - |
| I6H73_RS00395 (I6H73_00395) | ytpR | 62042..62668 (+) | 627 | WP_003055812.1 | YtpR family tRNA-binding protein | - |
| I6H73_RS00400 (I6H73_00400) | - | 62783..64131 (+) | 1349 | WP_100206400.1 | IS3 family transposase | - |
| I6H73_RS00405 (I6H73_00405) | ssbA | 64257..64652 (+) | 396 | WP_003053955.1 | single-stranded DNA-binding protein | Machinery gene |
| I6H73_RS00410 (I6H73_00410) | - | 64912..65553 (-) | 642 | WP_003053950.1 | deoxynucleoside kinase | - |
Sequence
Protein
Download Length: 343 a.a. Molecular weight: 38953.34 Da Isoelectric Point: 9.2055
>NTDB_id=517153 I6H73_RS00050 WP_015057201.1 14743..15774(+) (comYB) [Streptococcus dysgalactiae strain FDAARGOS_1016]
MTFLQRDISVRSKQGLKKLSSKEQYKLIQLLANLLLTGFNLAEIIAFLTKSHLIPEPYVLRMEEALLKGRGLADMLAGIG
FSDAILTQISLADGHGNIETTLVNIQGYLNQMAKLKRKTIEVVTYPIILLVFLFVMILGLRHYVMPQLEAQNQLTIFLTH
FPAIFFGIILFIGVLLGGIWLHWRSQSRLKLFTHFSAYPVLGKLLQQYLTAYYAREWGTLIGQGLELPVILDIMAMEKSL
FMQELAEDIRVSLLAGQAFHDKVATYPFFKKELSLMITYGEIKSKLGSELEIYAQESWTQFFSQLHQATQFIQPIIFLVI
ALTIVMIYAAILLPIYQNMGGVL
MTFLQRDISVRSKQGLKKLSSKEQYKLIQLLANLLLTGFNLAEIIAFLTKSHLIPEPYVLRMEEALLKGRGLADMLAGIG
FSDAILTQISLADGHGNIETTLVNIQGYLNQMAKLKRKTIEVVTYPIILLVFLFVMILGLRHYVMPQLEAQNQLTIFLTH
FPAIFFGIILFIGVLLGGIWLHWRSQSRLKLFTHFSAYPVLGKLLQQYLTAYYAREWGTLIGQGLELPVILDIMAMEKSL
FMQELAEDIRVSLLAGQAFHDKVATYPFFKKELSLMITYGEIKSKLGSELEIYAQESWTQFFSQLHQATQFIQPIIFLVI
ALTIVMIYAAILLPIYQNMGGVL
Nucleotide
Download Length: 1032 bp
>NTDB_id=517153 I6H73_RS00050 WP_015057201.1 14743..15774(+) (comYB) [Streptococcus dysgalactiae strain FDAARGOS_1016]
ATGACCTTCTTGCAGCGGGACATCTCAGTCCGGAGCAAGCAAGGTTTGAAAAAATTATCCAGTAAGGAACAATACAAACT
AATTCAGCTATTAGCTAACCTTTTGTTAACAGGATTTAACTTGGCTGAGATAATAGCTTTCTTAACAAAAAGTCATTTGA
TTCCCGAGCCCTATGTTCTCCGCATGGAAGAAGCACTCTTAAAAGGTCGAGGGCTGGCAGATATGTTAGCTGGGATAGGT
TTTTCGGATGCTATTTTAACTCAAATTAGTTTAGCAGATGGTCACGGAAACATTGAGACGACGCTTGTTAATATTCAGGG
CTATCTCAACCAAATGGCTAAACTTAAACGAAAAACCATTGAAGTCGTCACCTATCCGATTATTCTCCTGGTATTTCTTT
TTGTGATGATATTAGGGTTAAGGCATTATGTTATGCCTCAATTAGAAGCTCAAAATCAGCTTACTATTTTTCTGACTCAT
TTTCCAGCTATTTTCTTTGGTATTATCCTTTTTATAGGTGTGCTGTTAGGGGGAATTTGGCTGCATTGGCGATCGCAAAG
TCGCCTCAAGCTTTTTACTCACTTCAGTGCTTATCCTGTTTTAGGGAAACTCTTACAACAGTATCTGACGGCTTATTATG
CTAGAGAATGGGGAACATTGATTGGACAGGGACTTGAATTACCAGTCATTCTGGATATTATGGCCATGGAGAAATCTTTA
TTCATGCAAGAATTGGCTGAAGATATTCGAGTTAGTTTATTGGCAGGACAAGCTTTTCATGATAAGGTAGCTACCTATCC
TTTCTTTAAAAAAGAGTTAAGTTTGATGATTACTTATGGAGAGATTAAATCCAAGTTAGGAAGTGAACTAGAGATTTATG
CTCAAGAGAGTTGGACTCAGTTTTTCAGCCAACTTCATCAAGCAACTCAGTTTATTCAACCGATTATCTTTTTAGTGATT
GCTCTGACGATTGTCATGATTTATGCAGCAATTTTATTACCAATTTATCAAAATATGGGAGGCGTTTTATAA
ATGACCTTCTTGCAGCGGGACATCTCAGTCCGGAGCAAGCAAGGTTTGAAAAAATTATCCAGTAAGGAACAATACAAACT
AATTCAGCTATTAGCTAACCTTTTGTTAACAGGATTTAACTTGGCTGAGATAATAGCTTTCTTAACAAAAAGTCATTTGA
TTCCCGAGCCCTATGTTCTCCGCATGGAAGAAGCACTCTTAAAAGGTCGAGGGCTGGCAGATATGTTAGCTGGGATAGGT
TTTTCGGATGCTATTTTAACTCAAATTAGTTTAGCAGATGGTCACGGAAACATTGAGACGACGCTTGTTAATATTCAGGG
CTATCTCAACCAAATGGCTAAACTTAAACGAAAAACCATTGAAGTCGTCACCTATCCGATTATTCTCCTGGTATTTCTTT
TTGTGATGATATTAGGGTTAAGGCATTATGTTATGCCTCAATTAGAAGCTCAAAATCAGCTTACTATTTTTCTGACTCAT
TTTCCAGCTATTTTCTTTGGTATTATCCTTTTTATAGGTGTGCTGTTAGGGGGAATTTGGCTGCATTGGCGATCGCAAAG
TCGCCTCAAGCTTTTTACTCACTTCAGTGCTTATCCTGTTTTAGGGAAACTCTTACAACAGTATCTGACGGCTTATTATG
CTAGAGAATGGGGAACATTGATTGGACAGGGACTTGAATTACCAGTCATTCTGGATATTATGGCCATGGAGAAATCTTTA
TTCATGCAAGAATTGGCTGAAGATATTCGAGTTAGTTTATTGGCAGGACAAGCTTTTCATGATAAGGTAGCTACCTATCC
TTTCTTTAAAAAAGAGTTAAGTTTGATGATTACTTATGGAGAGATTAAATCCAAGTTAGGAAGTGAACTAGAGATTTATG
CTCAAGAGAGTTGGACTCAGTTTTTCAGCCAACTTCATCAAGCAACTCAGTTTATTCAACCGATTATCTTTTTAGTGATT
GCTCTGACGATTGTCATGATTTATGCAGCAATTTTATTACCAATTTATCAAAATATGGGAGGCGTTTTATAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comYB | Streptococcus mutans UA140 |
53.392 |
98.834 |
0.528 |
| comYB | Streptococcus mutans UA159 |
53.392 |
98.834 |
0.528 |
| comYB | Streptococcus gordonii str. Challis substr. CH1 |
48.673 |
98.834 |
0.481 |
| comGB/cglB | Streptococcus mitis NCTC 12261 |
45.97 |
97.668 |
0.449 |
| comGB/cglB | Streptococcus mitis SK321 |
45.345 |
97.085 |
0.44 |
| comGB/cglB | Streptococcus pneumoniae Rx1 |
43.844 |
97.085 |
0.426 |
| comGB/cglB | Streptococcus pneumoniae D39 |
43.844 |
97.085 |
0.426 |
| comGB/cglB | Streptococcus pneumoniae R6 |
43.844 |
97.085 |
0.426 |
| comGB/cglB | Streptococcus pneumoniae TIGR4 |
43.844 |
97.085 |
0.426 |
| comGB | Lactococcus lactis subsp. cremoris KW2 |
42.773 |
98.834 |
0.423 |