Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYB   Type   Machinery gene
Locus tag   I6H73_RS00050 Genome accession   NZ_CP066073
Coordinates   14743..15774 (+) Length   343 a.a.
NCBI ID   WP_015057201.1    Uniprot ID   -
Organism   Streptococcus dysgalactiae strain FDAARGOS_1016     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 14788..65553 14743..15774 flank -986


Gene organization within MGE regions


Location: 14743..65553
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6H73_RS00050 (I6H73_00050) comYB 14743..15774 (+) 1032 WP_015057201.1 competence type IV pilus assembly protein ComGB Machinery gene
  I6H73_RS00055 (I6H73_00055) comYC 15776..16102 (+) 327 WP_003055259.1 competence type IV pilus major pilin ComGC Machinery gene
  I6H73_RS00060 (I6H73_00060) - 16139..17593 (-) 1455 WP_003055278.1 recombinase family protein -
  I6H73_RS00065 (I6H73_00065) - 17730..18221 (-) 492 WP_003055290.1 hypothetical protein -
  I6H73_RS00070 (I6H73_00070) - 18232..19002 (-) 771 WP_003055282.1 XRE family transcriptional regulator -
  I6H73_RS00075 (I6H73_00075) - 19203..19406 (+) 204 WP_003058516.1 helix-turn-helix transcriptional regulator -
  I6H73_RS00080 (I6H73_00080) - 19695..19844 (+) 150 WP_003058552.1 hypothetical protein -
  I6H73_RS00085 (I6H73_00085) - 19884..20231 (+) 348 WP_003058582.1 hypothetical protein -
  I6H73_RS00090 (I6H73_00090) - 20492..20740 (+) 249 WP_003058590.1 hypothetical protein -
  I6H73_RS00095 (I6H73_00095) - 20740..21432 (+) 693 WP_003058502.1 ERF family protein -
  I6H73_RS00100 (I6H73_00100) - 21436..21762 (+) 327 WP_003058496.1 hypothetical protein -
  I6H73_RS00105 (I6H73_00105) - 21776..22771 (+) 996 WP_003058518.1 DUF1351 domain-containing protein -
  I6H73_RS00110 (I6H73_00110) - 22771..23325 (+) 555 WP_003058546.1 MazG-like family protein -
  I6H73_RS00115 (I6H73_00115) - 23312..23506 (+) 195 WP_115261906.1 hypothetical protein -
  I6H73_RS00120 (I6H73_00120) - 23493..24014 (+) 522 WP_003058526.1 hypothetical protein -
  I6H73_RS00125 (I6H73_00125) - 24023..24721 (+) 699 WP_003058504.1 site-specific DNA-methyltransferase -
  I6H73_RS00130 (I6H73_00130) - 24744..25022 (+) 279 WP_037588195.1 hypothetical protein -
  I6H73_RS00135 (I6H73_00135) - 25099..25395 (+) 297 Protein_26 methylase -
  I6H73_RS00140 (I6H73_00140) - 25382..25585 (+) 204 WP_003058550.1 hypothetical protein -
  I6H73_RS00145 (I6H73_00145) - 25582..26019 (+) 438 WP_003058575.1 helix-turn-helix domain-containing protein -
  I6H73_RS00150 (I6H73_00150) - 26032..26463 (+) 432 WP_003058500.1 hypothetical protein -
  I6H73_RS00155 (I6H73_00155) - 26465..27202 (+) 738 WP_003058554.1 hypothetical protein -
  I6H73_RS00160 (I6H73_00160) - 27222..27593 (+) 372 WP_003058541.1 hypothetical protein -
  I6H73_RS00165 (I6H73_00165) - 27749..27925 (+) 177 WP_165737297.1 hypothetical protein -
  I6H73_RS00170 (I6H73_00170) - 27927..28268 (+) 342 WP_003058522.1 DUF1372 family protein -
  I6H73_RS00175 (I6H73_00175) - 28268..28408 (+) 141 WP_003058562.1 hypothetical protein -
  I6H73_RS00180 (I6H73_00180) - 28410..28601 (+) 192 WP_003058511.1 hypothetical protein -
  I6H73_RS00185 (I6H73_00185) - 28594..28824 (+) 231 WP_003058513.1 hypothetical protein -
  I6H73_RS00190 (I6H73_00190) - 28821..29231 (+) 411 WP_037588173.1 endodeoxyribonuclease RusA -
  I6H73_RS00195 (I6H73_00195) - 29224..29604 (+) 381 WP_003058484.1 hypothetical protein -
  I6H73_RS00200 (I6H73_00200) - 29671..30348 (+) 678 WP_003058558.1 DUF4417 domain-containing protein -
  I6H73_RS00205 (I6H73_00205) - 30345..30728 (+) 384 WP_003058537.1 hypothetical protein -
  I6H73_RS00210 (I6H73_00210) - 30921..31400 (+) 480 WP_003058573.1 terminase small subunit -
  I6H73_RS00215 (I6H73_00215) - 31387..32700 (+) 1314 WP_003058577.1 PBSX family phage terminase large subunit -
  I6H73_RS00220 (I6H73_00220) - 32712..34289 (+) 1578 WP_003058556.1 phage portal protein -
  I6H73_RS00225 (I6H73_00225) - 34292..35440 (+) 1149 WP_003058578.1 phage minor capsid protein -
  I6H73_RS00230 (I6H73_00230) - 35587..36150 (+) 564 WP_003058544.1 phage scaffolding protein -
  I6H73_RS00235 (I6H73_00235) - 36169..37047 (+) 879 WP_003058535.1 hypothetical protein -
  I6H73_RS00240 (I6H73_00240) - 37058..37294 (+) 237 WP_003058596.1 hypothetical protein -
  I6H73_RS00245 (I6H73_00245) - 37338..37730 (+) 393 WP_003058548.1 hypothetical protein -
  I6H73_RS00250 (I6H73_00250) - 37720..38046 (+) 327 WP_198465271.1 putative minor capsid protein -
  I6H73_RS00255 (I6H73_00255) - 38046..38396 (+) 351 WP_003058592.1 minor capsid protein -
  I6H73_RS00260 (I6H73_00260) - 38396..38803 (+) 408 WP_003058570.1 minor capsid protein -
  I6H73_RS00265 (I6H73_00265) - 38808..39263 (+) 456 WP_003058598.1 hypothetical protein -
  I6H73_RS00270 (I6H73_00270) - 39306..39686 (+) 381 WP_003058580.1 hypothetical protein -
  I6H73_RS00275 (I6H73_00275) - 39686..40273 (+) 588 WP_003058588.1 Gp15 family bacteriophage protein -
  I6H73_RS00280 (I6H73_00280) - 40293..43829 (+) 3537 WP_003058479.1 tape measure protein -
  I6H73_RS00285 (I6H73_00285) - 43826..45328 (+) 1503 WP_003058481.1 distal tail protein Dit -
  I6H73_RS00290 (I6H73_00290) - 45332..48256 (+) 2925 WP_198465272.1 phage tail spike protein -
  I6H73_RS00295 (I6H73_00295) - 48268..50133 (+) 1866 WP_003058489.1 DUF859 family phage minor structural protein -
  I6H73_RS00300 (I6H73_00300) - 50184..50612 (+) 429 WP_003058584.1 DUF1366 domain-containing protein -
  I6H73_RS00305 (I6H73_00305) - 50578..50805 (+) 228 WP_037588167.1 hypothetical protein -
  I6H73_RS00310 (I6H73_00310) - 50813..51280 (+) 468 WP_003058491.1 phage holin family protein -
  I6H73_RS00315 (I6H73_00315) - 51273..51608 (+) 336 WP_003058565.1 phage holin -
  I6H73_RS00320 (I6H73_00320) - 51610..53055 (+) 1446 WP_003058531.1 peptidoglycan amidohydrolase family protein -
  I6H73_RS00325 (I6H73_00325) - 53193..53384 (+) 192 WP_003058494.1 hypothetical protein -
  I6H73_RS00330 (I6H73_00330) comGD 53399..53791 (+) 393 WP_003058533.1 competence type IV pilus minor pilin ComGD -
  I6H73_RS00335 (I6H73_00335) comGE 53799..54044 (+) 246 WP_003058569.1 competence type IV pilus minor pilin ComGE -
  I6H73_RS00340 (I6H73_00340) comGF 54025..54465 (+) 441 WP_003058520.1 competence type IV pilus minor pilin ComGF -
  I6H73_RS00345 (I6H73_00345) comGG 54488..54817 (+) 330 WP_003058539.1 competence type IV pilus minor pilin ComGG -
  I6H73_RS00350 (I6H73_00350) comYH 54880..55833 (+) 954 WP_003058509.1 class I SAM-dependent methyltransferase Machinery gene
  I6H73_RS00355 (I6H73_00355) - 55892..57088 (+) 1197 WP_003058485.1 acetate kinase -
  I6H73_RS00360 (I6H73_00360) - 57394..57600 (+) 207 WP_003058508.1 helix-turn-helix transcriptional regulator -
  I6H73_RS00365 (I6H73_00365) - 57703..58383 (+) 681 WP_003058487.1 type II CAAX endopeptidase family protein -
  I6H73_RS00370 (I6H73_00370) - 58401..58838 (+) 438 WP_003058498.1 hypothetical protein -
  I6H73_RS00375 (I6H73_00375) proC 58978..59754 (-) 777 WP_003058540.1 pyrroline-5-carboxylate reductase -
  I6H73_RS00380 (I6H73_00380) pepA 59801..60868 (-) 1068 WP_003058528.1 glutamyl aminopeptidase -
  I6H73_RS00385 (I6H73_00385) - 61426..61710 (+) 285 WP_003055815.1 DUF4651 domain-containing protein -
  I6H73_RS00390 (I6H73_00390) - 61707..62024 (+) 318 WP_003055817.1 thioredoxin family protein -
  I6H73_RS00395 (I6H73_00395) ytpR 62042..62668 (+) 627 WP_003055812.1 YtpR family tRNA-binding protein -
  I6H73_RS00400 (I6H73_00400) - 62783..64131 (+) 1349 WP_100206400.1 IS3 family transposase -
  I6H73_RS00405 (I6H73_00405) ssbA 64257..64652 (+) 396 WP_003053955.1 single-stranded DNA-binding protein Machinery gene
  I6H73_RS00410 (I6H73_00410) - 64912..65553 (-) 642 WP_003053950.1 deoxynucleoside kinase -

Sequence


Protein


Download         Length: 343 a.a.        Molecular weight: 38953.34 Da        Isoelectric Point: 9.2055

>NTDB_id=517153 I6H73_RS00050 WP_015057201.1 14743..15774(+) (comYB) [Streptococcus dysgalactiae strain FDAARGOS_1016]
MTFLQRDISVRSKQGLKKLSSKEQYKLIQLLANLLLTGFNLAEIIAFLTKSHLIPEPYVLRMEEALLKGRGLADMLAGIG
FSDAILTQISLADGHGNIETTLVNIQGYLNQMAKLKRKTIEVVTYPIILLVFLFVMILGLRHYVMPQLEAQNQLTIFLTH
FPAIFFGIILFIGVLLGGIWLHWRSQSRLKLFTHFSAYPVLGKLLQQYLTAYYAREWGTLIGQGLELPVILDIMAMEKSL
FMQELAEDIRVSLLAGQAFHDKVATYPFFKKELSLMITYGEIKSKLGSELEIYAQESWTQFFSQLHQATQFIQPIIFLVI
ALTIVMIYAAILLPIYQNMGGVL

Nucleotide


Download         Length: 1032 bp        

>NTDB_id=517153 I6H73_RS00050 WP_015057201.1 14743..15774(+) (comYB) [Streptococcus dysgalactiae strain FDAARGOS_1016]
ATGACCTTCTTGCAGCGGGACATCTCAGTCCGGAGCAAGCAAGGTTTGAAAAAATTATCCAGTAAGGAACAATACAAACT
AATTCAGCTATTAGCTAACCTTTTGTTAACAGGATTTAACTTGGCTGAGATAATAGCTTTCTTAACAAAAAGTCATTTGA
TTCCCGAGCCCTATGTTCTCCGCATGGAAGAAGCACTCTTAAAAGGTCGAGGGCTGGCAGATATGTTAGCTGGGATAGGT
TTTTCGGATGCTATTTTAACTCAAATTAGTTTAGCAGATGGTCACGGAAACATTGAGACGACGCTTGTTAATATTCAGGG
CTATCTCAACCAAATGGCTAAACTTAAACGAAAAACCATTGAAGTCGTCACCTATCCGATTATTCTCCTGGTATTTCTTT
TTGTGATGATATTAGGGTTAAGGCATTATGTTATGCCTCAATTAGAAGCTCAAAATCAGCTTACTATTTTTCTGACTCAT
TTTCCAGCTATTTTCTTTGGTATTATCCTTTTTATAGGTGTGCTGTTAGGGGGAATTTGGCTGCATTGGCGATCGCAAAG
TCGCCTCAAGCTTTTTACTCACTTCAGTGCTTATCCTGTTTTAGGGAAACTCTTACAACAGTATCTGACGGCTTATTATG
CTAGAGAATGGGGAACATTGATTGGACAGGGACTTGAATTACCAGTCATTCTGGATATTATGGCCATGGAGAAATCTTTA
TTCATGCAAGAATTGGCTGAAGATATTCGAGTTAGTTTATTGGCAGGACAAGCTTTTCATGATAAGGTAGCTACCTATCC
TTTCTTTAAAAAAGAGTTAAGTTTGATGATTACTTATGGAGAGATTAAATCCAAGTTAGGAAGTGAACTAGAGATTTATG
CTCAAGAGAGTTGGACTCAGTTTTTCAGCCAACTTCATCAAGCAACTCAGTTTATTCAACCGATTATCTTTTTAGTGATT
GCTCTGACGATTGTCATGATTTATGCAGCAATTTTATTACCAATTTATCAAAATATGGGAGGCGTTTTATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYB Streptococcus mutans UA140

53.392

98.834

0.528

  comYB Streptococcus mutans UA159

53.392

98.834

0.528

  comYB Streptococcus gordonii str. Challis substr. CH1

48.673

98.834

0.481

  comGB/cglB Streptococcus mitis NCTC 12261

45.97

97.668

0.449

  comGB/cglB Streptococcus mitis SK321

45.345

97.085

0.44

  comGB/cglB Streptococcus pneumoniae Rx1

43.844

97.085

0.426

  comGB/cglB Streptococcus pneumoniae D39

43.844

97.085

0.426

  comGB/cglB Streptococcus pneumoniae R6

43.844

97.085

0.426

  comGB/cglB Streptococcus pneumoniae TIGR4

43.844

97.085

0.426

  comGB Lactococcus lactis subsp. cremoris KW2

42.773

98.834

0.423