Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   ACG3JJ_RS00965 Genome accession   NZ_CP171733
Coordinates   190905..191861 (-) Length   318 a.a.
NCBI ID   WP_003103712.1    Uniprot ID   A0A0E2UAX9
Organism   Streptococcus parauberis strain DB-M5     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 190905..232117 190905..191861 within 0


Gene organization within MGE regions


Location: 190905..232117
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACG3JJ_RS00965 (ACG3JJ_00965) comYH 190905..191861 (-) 957 WP_003103712.1 class I SAM-dependent methyltransferase Machinery gene
  ACG3JJ_RS00970 (ACG3JJ_00970) comGG 191968..192465 (-) 498 WP_003108640.1 competence type IV pilus minor pilin ComGG -
  ACG3JJ_RS00975 (ACG3JJ_00975) comGF 192443..192880 (-) 438 WP_003102915.1 competence type IV pilus minor pilin ComGF -
  ACG3JJ_RS00980 (ACG3JJ_00980) comGE 192858..193157 (-) 300 WP_259966510.1 competence type IV pilus minor pilin ComGE -
  ACG3JJ_RS00985 (ACG3JJ_00985) comGD 193129..193572 (-) 444 WP_307877782.1 competence type IV pilus minor pilin ComGD -
  ACG3JJ_RS00990 (ACG3JJ_00990) comYC 193526..193846 (-) 321 WP_013794397.1 competence type IV pilus major pilin ComGC Machinery gene
  ACG3JJ_RS00995 (ACG3JJ_00995) - 194499..195224 (-) 726 WP_013794398.1 peptidoglycan amidohydrolase family protein -
  ACG3JJ_RS01000 (ACG3JJ_01000) - 195217..195681 (-) 465 WP_013794399.1 phage holin family protein -
  ACG3JJ_RS01005 (ACG3JJ_01005) - 195685..195834 (-) 150 WP_259966511.1 hypothetical protein -
  ACG3JJ_RS01010 (ACG3JJ_01010) - 195897..196235 (-) 339 WP_013794400.1 DUF1366 domain-containing protein -
  ACG3JJ_RS01015 (ACG3JJ_01015) - 196284..198140 (-) 1857 WP_013794401.1 DUF859 family phage minor structural protein -
  ACG3JJ_RS01020 (ACG3JJ_01020) - 198150..201695 (-) 3546 WP_259966512.1 phage tail protein -
  ACG3JJ_RS01025 (ACG3JJ_01025) - 201695..203140 (-) 1446 WP_013794404.1 distal tail protein Dit -
  ACG3JJ_RS01030 (ACG3JJ_01030) - 203137..206943 (-) 3807 WP_013794405.1 tape measure protein -
  ACG3JJ_RS01035 (ACG3JJ_01035) - 206963..207550 (-) 588 WP_041828613.1 Gp15 family bacteriophage protein -
  ACG3JJ_RS01040 (ACG3JJ_01040) - 207553..207951 (-) 399 WP_070030361.1 hypothetical protein -
  ACG3JJ_RS01045 (ACG3JJ_01045) - 207976..208431 (-) 456 WP_041828614.1 phage tail tube protein -
  ACG3JJ_RS01050 (ACG3JJ_01050) - 208434..208841 (-) 408 WP_003108657.1 minor capsid protein -
  ACG3JJ_RS01055 (ACG3JJ_01055) - 208842..209198 (-) 357 WP_003108659.1 minor capsid protein -
  ACG3JJ_RS01060 (ACG3JJ_01060) - 209198..209524 (-) 327 WP_041828616.1 putative minor capsid protein -
  ACG3JJ_RS01065 (ACG3JJ_01065) - 209514..209903 (-) 390 WP_013794407.1 hypothetical protein -
  ACG3JJ_RS01070 (ACG3JJ_01070) - 209944..210195 (-) 252 WP_003108665.1 hypothetical protein -
  ACG3JJ_RS01075 (ACG3JJ_01075) - 210207..211091 (-) 885 WP_003108667.1 hypothetical protein -
  ACG3JJ_RS01080 (ACG3JJ_01080) - 211114..211692 (-) 579 WP_013794408.1 phage scaffolding protein -
  ACG3JJ_RS01085 (ACG3JJ_01085) - 211828..212997 (-) 1170 WP_259966513.1 phage minor capsid protein -
  ACG3JJ_RS01090 (ACG3JJ_01090) - 212997..214565 (-) 1569 WP_259966514.1 phage portal protein -
  ACG3JJ_RS01095 (ACG3JJ_01095) - 214577..215887 (-) 1311 WP_013794411.1 PBSX family phage terminase large subunit -
  ACG3JJ_RS01100 (ACG3JJ_01100) - 215884..216330 (-) 447 WP_259966515.1 stress-induced protein -
  ACG3JJ_RS01105 (ACG3JJ_01105) - 216462..217277 (-) 816 WP_013794412.1 hypothetical protein -
  ACG3JJ_RS01110 (ACG3JJ_01110) - 217281..217607 (-) 327 WP_041828618.1 hypothetical protein -
  ACG3JJ_RS01115 (ACG3JJ_01115) - 217651..218751 (-) 1101 WP_013794413.1 DUF4417 domain-containing protein -
  ACG3JJ_RS01120 (ACG3JJ_01120) - 218871..219257 (-) 387 WP_041828619.1 hypothetical protein -
  ACG3JJ_RS01125 (ACG3JJ_01125) - 219244..219423 (-) 180 WP_041828620.1 hypothetical protein -
  ACG3JJ_RS01130 (ACG3JJ_01130) - 219501..219758 (-) 258 WP_259966516.1 hypothetical protein -
  ACG3JJ_RS01135 (ACG3JJ_01135) - 219755..219973 (-) 219 WP_041828623.1 hypothetical protein -
  ACG3JJ_RS01140 (ACG3JJ_01140) - 219966..220367 (-) 402 WP_041828624.1 endodeoxyribonuclease RusA -
  ACG3JJ_RS01145 (ACG3JJ_01145) - 220357..220902 (-) 546 WP_013794414.1 DUF1642 domain-containing protein -
  ACG3JJ_RS01150 (ACG3JJ_01150) - 220892..221134 (-) 243 WP_041828626.1 hypothetical protein -
  ACG3JJ_RS01155 (ACG3JJ_01155) - 221186..221359 (-) 174 WP_259966517.1 hypothetical protein -
  ACG3JJ_RS01160 (ACG3JJ_01160) - 221362..221547 (-) 186 WP_041828628.1 hypothetical protein -
  ACG3JJ_RS01165 (ACG3JJ_01165) - 221547..221873 (-) 327 WP_003108696.1 DUF1372 family protein -
  ACG3JJ_RS01170 (ACG3JJ_01170) - 222216..222464 (-) 249 WP_041828629.1 hypothetical protein -
  ACG3JJ_RS01175 (ACG3JJ_01175) - 222457..222945 (-) 489 WP_003108699.1 helix-turn-helix domain-containing protein -
  ACG3JJ_RS01180 (ACG3JJ_01180) ssbA 222960..223487 (-) 528 WP_013794415.1 single-stranded DNA-binding protein Machinery gene
  ACG3JJ_RS01185 (ACG3JJ_01185) - 223480..223842 (-) 363 WP_080560929.1 helix-turn-helix transcriptional regulator -
  ACG3JJ_RS01190 (ACG3JJ_01190) - 223839..224372 (-) 534 WP_041828630.1 MazG-like family protein -
  ACG3JJ_RS01195 (ACG3JJ_01195) - 224374..225405 (-) 1032 WP_013794416.1 DUF1351 domain-containing protein -
  ACG3JJ_RS01200 (ACG3JJ_01200) - 225409..226080 (-) 672 WP_080560930.1 ERF family protein -
  ACG3JJ_RS01205 (ACG3JJ_01205) - 226150..226428 (-) 279 WP_041828632.1 hypothetical protein -
  ACG3JJ_RS01210 (ACG3JJ_01210) - 226430..227383 (-) 954 WP_151191588.1 phage replisome organizer N-terminal domain-containing protein -
  ACG3JJ_RS01215 (ACG3JJ_01215) - 227380..227571 (-) 192 WP_003108723.1 hypothetical protein -
  ACG3JJ_RS01220 (ACG3JJ_01220) - 227646..227828 (-) 183 WP_003108726.1 hypothetical protein -
  ACG3JJ_RS01225 (ACG3JJ_01225) - 227865..228068 (-) 204 WP_003108729.1 helix-turn-helix domain-containing protein -
  ACG3JJ_RS01230 (ACG3JJ_01230) - 228337..229026 (+) 690 WP_013794418.1 LexA family transcriptional regulator -
  ACG3JJ_RS01235 (ACG3JJ_01235) - 229040..229213 (+) 174 WP_192813189.1 hypothetical protein -
  ACG3JJ_RS01240 (ACG3JJ_01240) - 229215..229805 (+) 591 WP_013794419.1 hypothetical protein -
  ACG3JJ_RS01245 (ACG3JJ_01245) - 229831..230238 (+) 408 WP_041828634.1 hypothetical protein -
  ACG3JJ_RS01250 (ACG3JJ_01250) - 230243..230560 (+) 318 WP_041828635.1 hypothetical protein -
  ACG3JJ_RS01255 (ACG3JJ_01255) - 230681..232117 (+) 1437 WP_003108747.1 recombinase family protein -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 36032.20 Da        Isoelectric Point: 4.9473

>NTDB_id=1063808 ACG3JJ_RS00965 WP_003103712.1 190905..191861(-) (comYH) [Streptococcus parauberis strain DB-M5]
MNFEKIEKAFELILENSQLIETDLHTHIYDAIIEQNSYLLGAKGANKQVEGNIQALKELHLTKEEWRRAFQFVFIKASQT
ERLQANHQFTPDSLGFILLYMMETLVSNHSFDLLEIGSGTGNLAQTILNNTSKSIDYLGIELDDLLIDLSASISEIMDSS
AKFLQEDAVRPQILKVSDVIISDLPVGYYPNDDIASRYKVASKEEHTYAHHLLMEQSLKYLKEGGYAIFLAPTNILTSPQ
SDLLKQWLKNYAQVMAVVTLPETMFGNPNNAKSIFVLGKTKNQAVETFVFPITNIQSTELIQDFMNKFKNWKLANVIS

Nucleotide


Download         Length: 957 bp        

>NTDB_id=1063808 ACG3JJ_RS00965 WP_003103712.1 190905..191861(-) (comYH) [Streptococcus parauberis strain DB-M5]
ATGAATTTTGAAAAAATAGAAAAAGCCTTTGAGCTTATTTTAGAGAATAGTCAGCTTATTGAAACGGACTTACATACACA
TATTTATGATGCAATTATTGAACAAAATTCCTACTTATTGGGTGCAAAAGGGGCAAACAAACAAGTTGAAGGAAATATTC
AAGCATTAAAAGAATTACACTTAACAAAAGAAGAATGGCGACGTGCCTTTCAATTTGTATTTATTAAGGCCTCACAGACC
GAGAGATTGCAAGCCAATCATCAATTTACTCCAGACAGTCTTGGTTTTATCTTGCTCTATATGATGGAGACTTTAGTATC
TAATCATTCATTTGATTTATTAGAAATTGGTAGTGGAACTGGTAATTTGGCTCAAACTATCTTAAACAATACGAGTAAGT
CTATTGACTATCTTGGTATTGAGCTGGATGACTTGCTTATCGATTTATCAGCAAGTATTTCTGAAATAATGGATTCTTCT
GCAAAATTTTTACAGGAAGATGCTGTGAGACCACAAATTCTTAAAGTAAGTGATGTTATCATTAGTGACTTACCAGTTGG
TTATTACCCAAATGACGATATAGCAAGTCGTTATAAAGTAGCTAGTAAGGAAGAGCATACTTATGCTCACCATTTATTAA
TGGAACAGTCATTGAAGTATCTAAAAGAAGGTGGTTATGCTATATTTTTAGCTCCAACTAATATTTTAACTAGTCCACAA
AGCGATCTGTTGAAGCAATGGTTAAAAAACTATGCCCAGGTTATGGCAGTTGTTACTTTACCTGAAACGATGTTTGGCAA
TCCAAATAATGCCAAGTCGATTTTTGTCTTAGGAAAAACGAAAAATCAAGCGGTTGAAACATTTGTCTTTCCAATTACTA
ACATTCAATCCACAGAGCTTATTCAAGATTTCATGAACAAGTTTAAAAATTGGAAACTTGCTAATGTCATTTCATAG

Domains


Predicted by InterproScan.

(78-310)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0E2UAX9

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

63.608

99.371

0.632

  comYH Streptococcus mutans UA159

63.291

99.371

0.629


Multiple sequence alignment