Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYD   Type   Machinery gene
Locus tag   P6F23_RS01090 Genome accession   NZ_CP120953
Coordinates   188966..189394 (+) Length   142 a.a.
NCBI ID   WP_000793381.1    Uniprot ID   A0A8B4RCN7
Organism   Streptococcus agalactiae strain GBSIR0001     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 190255..235935 188966..189394 flank 861


Gene organization within MGE regions


Location: 188966..235935
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  P6F23_RS01090 (P6F23_01090) comYD 188966..189394 (+) 429 WP_000793381.1 competence type IV pilus minor pilin ComGD Machinery gene
  P6F23_RS01095 (P6F23_01095) comGE 189366..189665 (+) 300 WP_001867089.1 competence type IV pilus minor pilin ComGE -
  P6F23_RS01100 (P6F23_01100) comGF 189619..190080 (+) 462 WP_001874060.1 competence type IV pilus minor pilin ComGF -
  P6F23_RS01105 (P6F23_01105) comGG 190058..190429 (+) 372 WP_000601104.1 competence type IV pilus minor pilin ComGG -
  P6F23_RS01110 (P6F23_01110) comYH 190544..191518 (+) 975 WP_001008570.1 class I SAM-dependent methyltransferase Machinery gene
  P6F23_RS01115 (P6F23_01115) - 191550..192743 (+) 1194 WP_000047535.1 acetate kinase -
  P6F23_RS01120 (P6F23_01120) - 192894..193100 (+) 207 WP_000798242.1 helix-turn-helix transcriptional regulator -
  P6F23_RS01125 (P6F23_01125) - 193159..193296 (+) 138 WP_001867090.1 hypothetical protein -
  P6F23_RS01130 (P6F23_01130) - 193337..193792 (+) 456 WP_000905674.1 hypothetical protein -
  P6F23_RS01135 (P6F23_01135) - 193861..194526 (+) 666 WP_000008111.1 type II CAAX endopeptidase family protein -
  P6F23_RS01140 (P6F23_01140) proC 194547..195317 (-) 771 WP_001867096.1 pyrroline-5-carboxylate reductase -
  P6F23_RS01145 (P6F23_01145) pepA 195387..196454 (-) 1068 WP_001281321.1 glutamyl aminopeptidase -
  P6F23_RS01150 (P6F23_01150) - 196639..196878 (-) 240 WP_000660181.1 hypothetical protein -
  P6F23_RS01155 (P6F23_01155) - 197039..197323 (+) 285 WP_000791272.1 DUF4651 domain-containing protein -
  P6F23_RS01160 (P6F23_01160) - 197320..197643 (+) 324 WP_000601792.1 thioredoxin family protein -
  P6F23_RS01165 (P6F23_01165) ytpR 197676..198302 (+) 627 WP_000578331.1 YtpR family tRNA-binding protein -
  P6F23_RS01170 (P6F23_01170) - 198356..199072 (-) 717 WP_000186183.1 class I SAM-dependent methyltransferase -
  P6F23_RS01175 (P6F23_01175) ssbA 199153..199548 (+) 396 WP_000282450.1 single-stranded DNA-binding protein Machinery gene
  P6F23_RS01180 (P6F23_01180) - 199671..200315 (+) 645 WP_000416612.1 HAD family hydrolase -
  P6F23_RS01185 (P6F23_01185) - 200342..202087 (+) 1746 WP_000930334.1 LytS/YhcK type 5TM receptor domain-containing protein -
  P6F23_RS01190 (P6F23_01190) - 202068..202808 (+) 741 WP_000697630.1 LytTR family transcriptional regulator DNA-binding domain-containing protein -
  P6F23_RS01195 (P6F23_01195) - 202978..203433 (+) 456 WP_000683316.1 CidA/LrgA family protein -
  P6F23_RS01200 (P6F23_01200) lrgB 203435..204163 (+) 729 WP_000421727.1 antiholin-like protein LrgB -
  P6F23_RS01205 (P6F23_01205) - 204406..206034 (+) 1629 WP_000170504.1 ABC transporter substrate-binding protein -
  P6F23_RS01210 (P6F23_01210) - 206147..207124 (+) 978 WP_000680644.1 ABC transporter permease -
  P6F23_RS01215 (P6F23_01215) - 207121..207942 (+) 822 WP_000603397.1 ABC transporter permease -
  P6F23_RS01220 (P6F23_01220) - 207954..208757 (+) 804 WP_000140979.1 ABC transporter ATP-binding protein -
  P6F23_RS01225 (P6F23_01225) - 208741..209367 (+) 627 WP_000171304.1 ABC transporter ATP-binding protein -
  P6F23_RS01230 (P6F23_01230) treP 209649..211679 (+) 2031 WP_000434610.1 PTS system trehalose-specific EIIBC component -
  P6F23_RS01235 (P6F23_01235) treC 211901..213526 (+) 1626 WP_000151014.1 alpha,alpha-phosphotrehalase -
  P6F23_RS01240 (P6F23_01240) - 213746..215782 (+) 2037 WP_000228178.1 BglG family transcription antiterminator -
  P6F23_RS01245 (P6F23_01245) - 215785..216069 (+) 285 WP_000944235.1 PTS sugar transporter subunit IIB -
  P6F23_RS01250 (P6F23_01250) - 216082..217437 (+) 1356 WP_000677351.1 PTS ascorbate transporter subunit IIC -
  P6F23_RS01255 (P6F23_01255) - 217440..218297 (+) 858 WP_000203492.1 transketolase -
  P6F23_RS01260 (P6F23_01260) - 218294..219223 (+) 930 WP_001203828.1 transketolase C-terminal domain-containing protein -
  P6F23_RS01265 (P6F23_01265) - 219332..220591 (+) 1260 WP_001203074.1 ferric reductase-like transmembrane domain-containing protein -
  P6F23_RS01270 (P6F23_01270) rpsO 220679..220948 (+) 270 WP_001018249.1 30S ribosomal protein S15 -
  P6F23_RS01275 (P6F23_01275) pnp 221329..223458 (+) 2130 WP_000043857.1 polyribonucleotide nucleotidyltransferase -
  P6F23_RS01280 (P6F23_01280) - 223460..224212 (+) 753 WP_000204780.1 SseB family protein -
  P6F23_RS01285 (P6F23_01285) cysE 224221..224805 (+) 585 WP_000539954.1 serine O-acetyltransferase -
  P6F23_RS01290 (P6F23_01290) - 224815..224997 (+) 183 WP_000656477.1 hypothetical protein -
  P6F23_RS01295 (P6F23_01295) cysS 224994..226337 (+) 1344 WP_000591129.1 cysteine--tRNA ligase -
  P6F23_RS01300 (P6F23_01300) - 226330..226716 (+) 387 WP_000568029.1 Mini-ribonuclease 3 -
  P6F23_RS01305 (P6F23_01305) rlmB 226819..227574 (+) 756 WP_000178023.1 23S rRNA (guanosine(2251)-2'-O)-methyltransferase RlmB -
  P6F23_RS01310 (P6F23_01310) - 227571..228089 (+) 519 WP_000716636.1 NYN domain-containing protein -
  P6F23_RS01315 (P6F23_01315) - 228182..229042 (+) 861 WP_000143135.1 DegV family protein -
  P6F23_RS01320 (P6F23_01320) - 229580..229699 (+) 120 Protein_207 helix-turn-helix domain-containing protein -
  P6F23_RS01325 (P6F23_01325) - 230017..231183 (-) 1167 WP_000160598.1 IS30-like element ISSag9 family transposase -
  P6F23_RS01330 (P6F23_01330) rplM 231484..231930 (+) 447 WP_001867156.1 50S ribosomal protein L13 -
  P6F23_RS01335 (P6F23_01335) rpsI 231951..232343 (+) 393 WP_000035940.1 30S ribosomal protein S9 -
  P6F23_RS01340 (P6F23_01340) - 232489..233643 (-) 1155 WP_000110711.1 site-specific integrase -
  P6F23_RS01345 (P6F23_01345) - 233698..234216 (-) 519 WP_000181342.1 helix-turn-helix transcriptional regulator -
  P6F23_RS01350 (P6F23_01350) - 234363..234647 (+) 285 WP_001287945.1 hypothetical protein -
  P6F23_RS01355 (P6F23_01355) - 234659..235513 (+) 855 WP_000005759.1 phage replisome organizer N-terminal domain-containing protein -
  P6F23_RS01360 (P6F23_01360) - 235524..235814 (+) 291 WP_000158581.1 DUF5962 family protein -

Sequence


Protein


Download         Length: 142 a.a.        Molecular weight: 16493.15 Da        Isoelectric Point: 10.0345

>NTDB_id=809538 P6F23_RS01090 WP_000793381.1 188966..189394(+) (comYD) [Streptococcus agalactiae strain GBSIR0001]
MKNLLLKCKDKKVKAFTLLESLIVLSVVAFMTLVFSTSFNNIFRQVEETIFFISFEHLYRDTQKLSAFGQKKQTLTISHN
YLENTYERLYLPKTVKVVKSDTLAFDANGGNSSLAKIQFECYRKTVTYQLYIGSGNYRKKEN

Nucleotide


Download         Length: 429 bp        

>NTDB_id=809538 P6F23_RS01090 WP_000793381.1 188966..189394(+) (comYD) [Streptococcus agalactiae strain GBSIR0001]
ATGAAAAATTTATTGTTAAAATGTAAGGATAAGAAGGTTAAAGCATTTACACTTTTAGAGAGCCTTATTGTATTATCAGT
AGTGGCATTTATGACGTTAGTATTTTCAACATCATTTAATAATATTTTTAGGCAGGTTGAAGAAACAATTTTCTTCATAT
CCTTTGAACATCTTTATAGAGATACTCAGAAATTGAGTGCATTTGGTCAGAAGAAACAAACCCTTACAATCTCTCATAAT
TATCTCGAAAATACTTATGAGAGACTTTATTTACCTAAAACTGTAAAAGTAGTCAAAAGTGACACACTTGCATTTGACGC
TAATGGAGGGAATTCAAGCTTGGCAAAAATTCAATTTGAATGTTATAGAAAAACTGTTACGTATCAATTATATATAGGAA
GTGGTAATTATCGTAAGAAAGAAAATTAG

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A8B4RCN7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYD Streptococcus mutans UA140

52.273

92.958

0.486

  comYD Streptococcus mutans UA159

52.273

92.958

0.486

  comYD Streptococcus gordonii str. Challis substr. CH1

43.662

100

0.437

  comGD/cglD Streptococcus mitis NCTC 12261

41.045

94.366

0.387

  comGD/cglD Streptococcus pneumoniae TIGR4

43.307

89.437

0.387

  comGD/cglD Streptococcus mitis SK321

43.307

89.437

0.387

  comGD/cglD Streptococcus pneumoniae Rx1

42.52

89.437

0.38

  comGD/cglD Streptococcus pneumoniae D39

42.52

89.437

0.38

  comGD/cglD Streptococcus pneumoniae R6

42.52

89.437

0.38