Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   H7679_RS01070 Genome accession   NZ_AP023392
Coordinates   163938..164891 (+) Length   317 a.a.
NCBI ID   WP_184495879.1    Uniprot ID   -
Organism   Streptococcus suis strain DAT300     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 123402..175862 163938..164891 within 0


Gene organization within MGE regions


Location: 123402..175862
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H7679_RS00760 (DAT300_01170) - 123402..124841 (-) 1440 WP_172077518.1 recombinase family protein -
  H7679_RS00765 (DAT300_01180) - 124976..125545 (-) 570 WP_197970253.1 hypothetical protein -
  H7679_RS00770 (DAT300_01190) - 125559..126317 (-) 759 WP_105131553.1 helix-turn-helix domain-containing protein -
  H7679_RS00775 (DAT300_01200) - 126517..126720 (+) 204 WP_105131554.1 helix-turn-helix domain-containing protein -
  H7679_RS00780 (DAT300_01210) - 126775..127032 (+) 258 WP_105131555.1 hypothetical protein -
  H7679_RS00785 (DAT300_01220) - 127008..127154 (+) 147 WP_184495818.1 BOW99_gp33 family protein -
  H7679_RS00790 (DAT300_01230) - 127197..127544 (+) 348 WP_105131556.1 transcriptional regulator -
  H7679_RS00795 (DAT300_01240) - 127755..128387 (+) 633 WP_184495821.1 polymer-forming cytoskeletal protein -
  H7679_RS00800 (DAT300_01250) - 128391..129083 (+) 693 WP_074412752.1 ERF family protein -
  H7679_RS00805 (DAT300_01260) - 129087..130109 (+) 1023 WP_184495823.1 DUF1351 domain-containing protein -
  H7679_RS00810 (DAT300_01270) - 130109..130756 (+) 648 WP_184495824.1 hypothetical protein -
  H7679_RS00815 (DAT300_01280) - 130758..131291 (+) 534 WP_184495827.1 MazG-like family protein -
  H7679_RS00820 (DAT300_01290) - 131291..131467 (+) 177 WP_171841111.1 hypothetical protein -
  H7679_RS00825 (DAT300_01300) ssbA 131457..131987 (+) 531 WP_184495829.1 single-stranded DNA-binding protein Machinery gene
  H7679_RS00830 (DAT300_01310) - 132003..132281 (+) 279 WP_184495831.1 hypothetical protein -
  H7679_RS00835 (DAT300_01320) - 132282..133070 (+) 789 WP_184495833.1 DNA methyltransferase -
  H7679_RS00840 (DAT300_01330) - 133067..133270 (+) 204 WP_074392405.1 hypothetical protein -
  H7679_RS00845 (DAT300_01340) - 133267..133746 (+) 480 WP_184495836.1 helix-turn-helix domain-containing protein -
  H7679_RS00850 (DAT300_01350) - 133758..134507 (+) 750 WP_029752016.1 DNA-methyltransferase -
  H7679_RS00855 (DAT300_01360) - 134524..134961 (+) 438 WP_024383995.1 hypothetical protein -
  H7679_RS00860 (DAT300_01370) - 134963..135697 (+) 735 WP_184495839.1 hypothetical protein -
  H7679_RS00865 (DAT300_01380) - 135718..136134 (+) 417 WP_105119255.1 hypothetical protein -
  H7679_RS11505 (DAT300_01390) - 136191..136316 (+) 126 WP_269088801.1 hypothetical protein -
  H7679_RS00870 (DAT300_01400) - 136329..137180 (+) 852 WP_024410597.1 prohibitin family protein -
  H7679_RS00875 (DAT300_01410) - 137181..137483 (+) 303 WP_014736101.1 DUF1372 family protein -
  H7679_RS00880 (DAT300_01420) - 137484..137714 (+) 231 WP_014736100.1 hypothetical protein -
  H7679_RS00885 (DAT300_01430) - 137707..137937 (+) 231 WP_074412312.1 hypothetical protein -
  H7679_RS00890 (DAT300_01440) - 137934..138344 (+) 411 WP_074415751.1 endodeoxyribonuclease RusA -
  H7679_RS00895 (DAT300_01450) - 138337..138675 (+) 339 WP_044767980.1 hypothetical protein -
  H7679_RS00900 (DAT300_01460) - 138668..138904 (+) 237 WP_044767978.1 hypothetical protein -
  H7679_RS00905 (DAT300_01470) - 138905..139285 (+) 381 WP_014736097.1 hypothetical protein -
  H7679_RS00910 (DAT300_01480) - 139354..140025 (+) 672 WP_184495841.1 DUF4417 domain-containing protein -
  H7679_RS00915 (DAT300_01490) - 140022..140405 (+) 384 WP_014736095.1 hypothetical protein -
  H7679_RS00920 (DAT300_01500) - 140595..141074 (+) 480 WP_044767977.1 terminase small subunit -
  H7679_RS00925 (DAT300_01510) - 141061..142374 (+) 1314 WP_184495844.1 PBSX family phage terminase large subunit -
  H7679_RS00930 (DAT300_01520) - 142386..143975 (+) 1590 WP_184495846.1 phage portal protein -
  H7679_RS00935 (DAT300_01530) - 143959..144213 (+) 255 WP_415669128.1 hypothetical protein -
  H7679_RS00940 (DAT300_01540) - 144216..145364 (+) 1149 WP_184495850.1 phage minor capsid protein -
  H7679_RS00945 (DAT300_01550) - 145511..146128 (+) 618 WP_184495852.1 phage scaffolding protein -
  H7679_RS00950 (DAT300_01560) - 146132..146971 (+) 840 WP_016056180.1 N4-gp56 family major capsid protein -
  H7679_RS00955 (DAT300_01570) - 146971..147156 (+) 186 WP_074412321.1 Rho termination factor N-terminal domain-containing protein -
  H7679_RS00960 (DAT300_01580) - 147134..147361 (+) 228 WP_184495855.1 hypothetical protein -
  H7679_RS00965 (DAT300_01590) - 147403..147798 (+) 396 WP_184495858.1 hypothetical protein -
  H7679_RS00970 (DAT300_01600) - 147788..148114 (+) 327 WP_184495860.1 putative minor capsid protein -
  H7679_RS00975 (DAT300_01610) - 148114..148464 (+) 351 WP_029751987.1 minor capsid protein -
  H7679_RS00980 (DAT300_01620) - 148464..148871 (+) 408 WP_014736083.1 minor capsid protein -
  H7679_RS00985 (DAT300_01630) - 148875..149333 (+) 459 WP_014736082.1 phage tail tube protein -
  H7679_RS00990 (DAT300_01640) - 149356..149724 (+) 369 WP_029751986.1 hypothetical protein -
  H7679_RS00995 (DAT300_01650) - 149724..150311 (+) 588 WP_184495862.1 Gp15 family bacteriophage protein -
  H7679_RS01000 (DAT300_01660) - 150331..153744 (+) 3414 WP_184495864.1 hypothetical protein -
  H7679_RS01005 (DAT300_01670) - 153741..155243 (+) 1503 WP_174854417.1 distal tail protein Dit -
  H7679_RS01010 (DAT300_01680) - 155243..157927 (+) 2685 WP_184495866.1 phage tail spike protein -
  H7679_RS01015 (DAT300_01690) - 157941..160013 (+) 2073 WP_184495869.1 DUF859 family phage minor structural protein -
  H7679_RS01020 (DAT300_01700) - 160075..160341 (+) 267 WP_184495871.1 hypothetical protein -
  H7679_RS01025 (DAT300_01710) - 160354..160809 (+) 456 WP_184495873.1 hypothetical protein -
  H7679_RS01030 (DAT300_01720) - 160784..160987 (+) 204 WP_029751977.1 phage holin -
  H7679_RS01035 (DAT300_01730) - 161117..161827 (+) 711 WP_184495875.1 CHAP domain-containing protein -
  H7679_RS01040 (DAT300_01740) prx 161973..162158 (+) 186 WP_016056161.1 hypothetical protein Regulator
  H7679_RS01045 (DAT300_01750) comYC 162159..162419 (+) 261 WP_184495877.1 competence type IV pilus major pilin ComGC Machinery gene
  H7679_RS01050 (DAT300_01760) comGD 162400..162807 (+) 408 WP_024395446.1 competence type IV pilus minor pilin ComGD -
  H7679_RS01055 (DAT300_01770) comYE 162779..163072 (+) 294 WP_024394703.1 competence type IV pilus minor pilin ComGE Machinery gene
  H7679_RS01060 (DAT300_01780) comGF/cglF 163059..163493 (+) 435 WP_014637316.1 competence type IV pilus minor pilin ComGF Machinery gene
  H7679_RS01065 (DAT300_01790) comGG 163471..163881 (+) 411 WP_044676305.1 competence type IV pilus minor pilin ComGG -
  H7679_RS01070 (DAT300_01800) comYH 163938..164891 (+) 954 WP_184495879.1 class I SAM-dependent methyltransferase Machinery gene
  H7679_RS01075 (DAT300_01810) - 164941..166128 (+) 1188 WP_024401510.1 acetate kinase -
  H7679_RS01080 (DAT300_01820) - 166443..166994 (+) 552 WP_024395442.1 folate family ECF transporter S component -
  H7679_RS01085 (DAT300_01830) - 167050..168306 (+) 1257 WP_024395441.1 bifunctional folylpolyglutamate synthase/dihydrofolate synthase -
  H7679_RS01090 (DAT300_01840) pepA 168351..169412 (-) 1062 WP_024395440.1 glutamyl aminopeptidase -
  H7679_RS01095 (DAT300_01850) - 169559..169843 (+) 285 WP_024395439.1 DUF4651 domain-containing protein -
  H7679_RS01100 (DAT300_01860) - 169840..170160 (+) 321 WP_105143421.1 thioredoxin family protein -
  H7679_RS01105 (DAT300_01870) - 170205..171161 (+) 957 WP_024395438.1 DUF1002 domain-containing protein -
  H7679_RS01110 (DAT300_01880) ytpR 171180..171803 (+) 624 WP_105104958.1 YtpR family tRNA-binding protein -
  H7679_RS01115 (DAT300_01890) - 171836..172615 (-) 780 WP_105104959.1 DUF2785 domain-containing protein -
  H7679_RS01120 (DAT300_01900) ssbA 172670..173065 (+) 396 WP_011921737.1 single-stranded DNA-binding protein Machinery gene
  H7679_RS01125 (DAT300_01910) - 173539..173751 (-) 213 WP_162495426.1 hypothetical protein -
  H7679_RS01130 (DAT300_01920) groES 173948..174229 (+) 282 WP_014637330.1 co-chaperone GroES -
  H7679_RS01135 (DAT300_01930) groL 174240..175862 (+) 1623 WP_044692542.1 chaperonin GroEL -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 35774.88 Da        Isoelectric Point: 4.4442

>NTDB_id=82934 H7679_RS01070 WP_184495879.1 163938..164891(+) (comYH) [Streptococcus suis strain DAT300]
MNFEKIEQAYDLLLENVQTIQNQLGTNIYDAMIEQNAAYVANQHETDLVVNNNQTLKQLDLTKEEWRRAYQFLLIKANQT
EPMQYNHQFTPDSIGFILSFLVDQLVPTQKVTVLEIGSGTGNLAQTILNASQKELDYLGIEVDDLLIDLSASIADVMQAD
ISFAQGDAVRPQILKESQVILGDLPIGYYPDDQIASRYQVASPNEHTYAHHLLMEQSLKYLEKNGFAILLAPNDLLTSPQ
SDLLKGWLQEQANIVAMIALPPNLFGKAAMAKSIFVLQKKAARSLMPFVYPLQSLQEPEAIQKFMLNFKNWKQENAI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=82934 H7679_RS01070 WP_184495879.1 163938..164891(+) (comYH) [Streptococcus suis strain DAT300]
ATGAATTTTGAAAAGATCGAACAGGCTTACGACCTGCTATTAGAAAACGTACAGACTATCCAAAACCAGCTAGGTACCAA
TATCTATGATGCCATGATTGAGCAAAATGCTGCTTACGTAGCTAATCAGCATGAGACGGACCTTGTTGTCAATAACAATC
AGACCTTGAAACAACTAGATTTAACCAAGGAAGAATGGCGTCGTGCCTACCAATTCCTGCTCATCAAGGCCAATCAGACT
GAACCCATGCAGTATAATCACCAGTTCACACCAGACTCTATCGGATTTATCCTATCTTTTCTAGTAGACCAATTGGTGCC
GACTCAAAAGGTGACGGTTCTGGAAATTGGTTCGGGGACAGGCAATCTAGCTCAGACCATTCTCAACGCCAGCCAGAAAG
AATTGGATTACTTGGGGATCGAAGTGGACGACCTCTTGATTGATTTGTCGGCAAGTATTGCGGATGTCATGCAGGCAGAT
ATTTCTTTTGCTCAGGGAGATGCGGTACGTCCGCAGATTTTGAAGGAAAGTCAAGTAATTTTGGGAGATTTGCCTATTGG
CTACTATCCAGATGACCAGATTGCTAGCCGCTATCAGGTCGCCAGCCCAAATGAACATACCTACGCCCATCATTTACTCA
TGGAACAATCCCTCAAATATCTGGAAAAAAATGGTTTTGCGATTTTGTTGGCTCCAAATGATTTATTGACTAGTCCGCAA
AGCGATTTGCTGAAAGGTTGGTTACAGGAGCAAGCCAATATTGTTGCCATGATTGCCCTGCCACCAAATCTCTTTGGGAA
GGCTGCTATGGCCAAGTCTATTTTTGTCTTGCAAAAGAAAGCTGCAAGATCGTTGATGCCGTTTGTTTATCCCTTACAAA
GTCTTCAAGAACCAGAAGCTATTCAGAAGTTCATGCTCAATTTCAAAAATTGGAAGCAAGAGAATGCAATTTAA

Domains


Predicted by InterproScan.

(68-282)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

60.443

99.685

0.603

  comYH Streptococcus mutans UA159

60.127

99.685

0.599


Multiple sequence alignment