Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   PFZ59_RS01895 Genome accession   NZ_CP116393
Coordinates   391224..392177 (-) Length   317 a.a.
NCBI ID   WP_277697384.1    Uniprot ID   -
Organism   Streptococcus suis strain SS/UPM/MY/F001     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 319717..402487 391224..392177 within 0


Gene organization within MGE regions


Location: 319717..402487
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PFZ59_RS01560 (PFZ59_01560) rnhC 320137..321027 (+) 891 WP_136671550.1 ribonuclease HIII -
  PFZ59_RS01565 (PFZ59_01565) lepB 321037..321666 (+) 630 WP_063077047.1 signal peptidase I -
  PFZ59_RS01570 (PFZ59_01570) - 321731..324223 (+) 2493 WP_208581203.1 ATP-dependent RecD-like DNA helicase -
  PFZ59_RS01575 (PFZ59_01575) - 324341..325045 (+) 705 WP_277697357.1 hypothetical protein -
  PFZ59_RS01580 (PFZ59_01580) - 325215..326135 (-) 921 WP_277697358.1 PfkB family carbohydrate kinase -
  PFZ59_RS01585 (PFZ59_01585) - 326240..326866 (-) 627 WP_208582643.1 NAD(P)-dependent oxidoreductase -
  PFZ59_RS01590 (PFZ59_01590) - 327002..327421 (-) 420 WP_208582645.1 Rrf2 family transcriptional regulator -
  PFZ59_RS01595 (PFZ59_01595) dinB 327464..328531 (-) 1068 WP_208582647.1 DNA polymerase IV -
  PFZ59_RS01600 (PFZ59_01600) pflB 328847..331192 (+) 2346 WP_024376804.1 formate C-acetyltransferase -
  PFZ59_RS01605 (PFZ59_01605) - 331376..332627 (+) 1252 Protein_296 ISL3 family transposase -
  PFZ59_RS01610 (PFZ59_01610) - 333064..334011 (-) 948 WP_277697359.1 serine hydrolase domain-containing protein -
  PFZ59_RS01615 (PFZ59_01615) - 334008..334739 (-) 732 WP_244229275.1 CppA family protein -
  PFZ59_RS01620 (PFZ59_01620) - 334924..337251 (+) 2328 WP_277697360.1 Xaa-Pro dipeptidyl-peptidase -
  PFZ59_RS01625 (PFZ59_01625) - 337236..338082 (+) 847 WP_277697006.1 IS630 family transposase -
  PFZ59_RS01630 (PFZ59_01630) - 338125..339291 (-) 1167 WP_277697361.1 SIS domain-containing protein -
  PFZ59_RS01635 (PFZ59_01635) - 339557..339856 (+) 300 WP_002941492.1 YbaB/EbfC family nucleoid-associated protein -
  PFZ59_RS01640 (PFZ59_01640) - 339897..341663 (-) 1767 WP_277697362.1 glycerophosphodiester phosphodiesterase -
  PFZ59_RS01645 (PFZ59_01645) - 341848..342396 (+) 549 Protein_304 GNAT family protein -
  PFZ59_RS01650 (PFZ59_01650) - 342417..342911 (-) 495 WP_015646325.1 DUF536 domain-containing protein -
  PFZ59_RS01655 (PFZ59_01655) - 343081..344481 (-) 1401 WP_277697363.1 glycoside hydrolase family 1 protein -
  PFZ59_RS01660 (PFZ59_01660) - 344631..345478 (+) 848 WP_208580884.1 IS630 family transposase -
  PFZ59_RS01665 (PFZ59_01665) - 345617..346789 (+) 1173 WP_277697364.1 NAD(P)/FAD-dependent oxidoreductase -
  PFZ59_RS01670 (PFZ59_01670) - 346799..347491 (+) 693 WP_277697365.1 GNAT family protein -
  PFZ59_RS01675 (PFZ59_01675) - 347572..348270 (+) 699 WP_277697366.1 ABC transporter ATP-binding protein -
  PFZ59_RS01680 (PFZ59_01680) - 348280..349905 (+) 1626 WP_277697367.1 hypothetical protein -
  PFZ59_RS01685 (PFZ59_01685) - 350048..351070 (-) 1023 WP_277697368.1 YeiH family protein -
  PFZ59_RS01690 (PFZ59_01690) - 351080..351403 (-) 324 WP_208581183.1 AzlD domain-containing protein -
  PFZ59_RS01695 (PFZ59_01695) - 351390..352091 (-) 702 WP_208581186.1 AzlC family ABC transporter permease -
  PFZ59_RS01700 (PFZ59_01700) - 352266..354467 (-) 2202 WP_277697369.1 alpha-galactosidase -
  PFZ59_RS01705 (PFZ59_01705) - 354477..355307 (-) 831 WP_044756438.1 carbohydrate ABC transporter permease -
  PFZ59_RS01710 (PFZ59_01710) - 355318..356211 (-) 894 WP_277697761.1 sugar ABC transporter permease -
  PFZ59_RS01715 (PFZ59_01715) - 356279..357553 (-) 1275 WP_277697370.1 sugar ABC transporter substrate-binding protein -
  PFZ59_RS01720 (PFZ59_01720) - 357768..358604 (+) 837 WP_014637344.1 AraC family transcriptional regulator -
  PFZ59_RS01725 (PFZ59_01725) tsaD 358782..359789 (-) 1008 WP_002938526.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD -
  PFZ59_RS01730 (PFZ59_01730) rimI 359779..360219 (-) 441 WP_277697371.1 ribosomal protein S18-alanine N-acetyltransferase -
  PFZ59_RS01735 (PFZ59_01735) tsaB 360216..360899 (-) 684 WP_277697372.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex dimerization subunit type 1 TsaB -
  PFZ59_RS01740 (PFZ59_01740) - 361308..361592 (-) 285 WP_277697373.1 hypothetical protein -
  PFZ59_RS01745 (PFZ59_01745) - 361645..362492 (+) 848 WP_208580884.1 IS630 family transposase -
  PFZ59_RS01750 (PFZ59_01750) - 362738..363742 (-) 1005 WP_277696681.1 IS5 family transposase -
  PFZ59_RS01755 (PFZ59_01755) - 363793..364056 (-) 264 WP_277697374.1 hypothetical protein -
  PFZ59_RS01760 (PFZ59_01760) - 364059..364271 (-) 213 WP_208581196.1 hypothetical protein -
  PFZ59_RS01765 (PFZ59_01765) - 364646..365515 (-) 870 WP_208582853.1 Rgg/GadR/MutR family transcriptional regulator -
  PFZ59_RS01770 (PFZ59_01770) - 365814..366044 (+) 231 WP_002938523.1 DNA-dependent RNA polymerase subunit epsilon -
  PFZ59_RS01775 (PFZ59_01775) - 366048..367727 (+) 1680 WP_002938522.1 ribonuclease J -
  PFZ59_RS01780 (PFZ59_01780) glnA 368104..369450 (-) 1347 WP_011921751.1 type I glutamate--ammonia ligase -
  PFZ59_RS01785 (PFZ59_01785) - 369479..369850 (-) 372 WP_002940041.1 MerR family transcriptional regulator -
  PFZ59_RS01790 (PFZ59_01790) - 369926..370441 (-) 516 WP_277697375.1 aromatic acid exporter family protein -
  PFZ59_RS01795 (PFZ59_01795) - 371198..372454 (+) 1257 WP_277697762.1 ISL3 family transposase -
  PFZ59_RS01800 (PFZ59_01800) - 372549..373748 (-) 1200 WP_014637337.1 phosphoglycerate kinase -
  PFZ59_RS01805 (PFZ59_01805) gap 374008..375018 (-) 1011 WP_002938507.1 type I glyceraldehyde-3-phosphate dehydrogenase -
  PFZ59_RS01810 (PFZ59_01810) fusA 375225..377306 (-) 2082 WP_011921743.1 elongation factor G -
  PFZ59_RS01815 (PFZ59_01815) rpsG 377736..378206 (-) 471 WP_044775067.1 30S ribosomal protein S7 -
  PFZ59_RS01820 (PFZ59_01820) rpsL 378223..378636 (-) 414 WP_002940030.1 30S ribosomal protein S12 -
  PFZ59_RS01825 (PFZ59_01825) - 378942..380105 (+) 1164 WP_277697376.1 IS30 family transposase -
  PFZ59_RS01830 (PFZ59_01830) groL 380421..382043 (-) 1623 WP_277697377.1 chaperonin GroEL -
  PFZ59_RS01835 (PFZ59_01835) groES 382055..382336 (-) 282 WP_014637330.1 co-chaperone GroES -
  PFZ59_RS01840 (PFZ59_01840) - 382535..382780 (+) 246 WP_277697378.1 hypothetical protein -
  PFZ59_RS01845 (PFZ59_01845) ssbA 382899..383294 (-) 396 WP_277697379.1 single-stranded DNA-binding protein Machinery gene
  PFZ59_RS01850 (PFZ59_01850) - 383349..384128 (+) 780 WP_277697380.1 DUF2785 domain-containing protein -
  PFZ59_RS01855 (PFZ59_01855) ytpR 384161..384784 (-) 624 WP_277697381.1 YtpR family tRNA-binding protein -
  PFZ59_RS01860 (PFZ59_01860) - 384803..385759 (-) 957 WP_277697382.1 DUF1002 domain-containing protein -
  PFZ59_RS01865 (PFZ59_01865) - 385957..386277 (-) 321 WP_044669967.1 thioredoxin family protein -
  PFZ59_RS01870 (PFZ59_01870) - 386274..386558 (-) 285 WP_015646299.1 DUF4651 domain-containing protein -
  PFZ59_RS01875 (PFZ59_01875) pepA 386705..387766 (+) 1062 WP_208580731.1 glutamyl aminopeptidase -
  PFZ59_RS01880 (PFZ59_01880) - 387811..389067 (-) 1257 WP_277697383.1 folylpolyglutamate synthase/dihydrofolate synthase family protein -
  PFZ59_RS01885 (PFZ59_01885) - 389123..389674 (-) 552 WP_208580735.1 folate family ECF transporter S component -
  PFZ59_RS01890 (PFZ59_01890) - 389987..391174 (-) 1188 WP_208580737.1 acetate kinase -
  PFZ59_RS01895 (PFZ59_01895) comYH 391224..392177 (-) 954 WP_277697384.1 class I SAM-dependent methyltransferase Machinery gene
  PFZ59_RS01900 (PFZ59_01900) comGG 392227..392745 (-) 519 WP_277697385.1 competence type IV pilus minor pilin ComGG -
  PFZ59_RS01905 (PFZ59_01905) comGF/cglF 392723..393157 (-) 435 WP_277697386.1 competence type IV pilus minor pilin ComGF Machinery gene
  PFZ59_RS01910 (PFZ59_01910) comYE 393144..393437 (-) 294 WP_024405248.1 competence type IV pilus minor pilin ComGE Machinery gene
  PFZ59_RS01915 (PFZ59_01915) comGD 393409..393816 (-) 408 WP_277697387.1 competence type IV pilus minor pilin ComGD -
  PFZ59_RS01920 (PFZ59_01920) comYC 393797..394078 (-) 282 WP_024387069.1 competence type IV pilus major pilin ComGC Machinery gene
  PFZ59_RS01925 (PFZ59_01925) comYB 394080..395117 (-) 1038 WP_277697388.1 competence type IV pilus assembly protein ComGB Machinery gene
  PFZ59_RS01930 (PFZ59_01930) comYA 395029..395979 (-) 951 WP_105156412.1 competence type IV pilus ATPase ComGA Machinery gene
  PFZ59_RS01935 (PFZ59_01935) - 396065..397015 (+) 951 WP_277697389.1 S66 peptidase family protein -
  PFZ59_RS01940 (PFZ59_01940) - 397050..397415 (-) 366 WP_277697390.1 DUF1033 family protein -
  PFZ59_RS01945 (PFZ59_01945) - 397488..397943 (-) 456 WP_347176380.1 transposase -
  PFZ59_RS01950 (PFZ59_01950) - 398008..398337 (-) 330 WP_277696633.1 IS630 transposase-related protein -
  PFZ59_RS01955 (PFZ59_01955) - 398360..398815 (-) 456 WP_347176380.1 transposase -
  PFZ59_RS01960 (PFZ59_01960) - 398880..399209 (-) 330 WP_277696633.1 IS630 transposase-related protein -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 35754.88 Da        Isoelectric Point: 4.4571

>NTDB_id=777073 PFZ59_RS01895 WP_277697384.1 391224..392177(-) (comYH) [Streptococcus suis strain SS/UPM/MY/F001]
MNFEKIEQAYDLLLENVQTIQNQLGTNIYDAMIEQNAAYVANQHETDLIINNNKTLKQLDLTKEEWRRAYQFLLIKANQT
EPMQYNHQFTPDSIGFILSFLVDQLVPTQKVTVLEIGSGTGNLAQTILNASQKELDYLGIEVDDLLIDLSASIADVMQAD
ISFAQGDAVRPQILKESQVILGDLPIGYYPDDQIASRYQVASPNEHTYAHHLLMEQSLKYLEKDGFAILLAPNDLLTSPQ
SDLLKGWLQEQANIVAMIALPPSLFGKAAMAKSIFVLQRKAARPLAPFVYPLQSLQEPEAIQKFMLNFKNWKQENAI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=777073 PFZ59_RS01895 WP_277697384.1 391224..392177(-) (comYH) [Streptococcus suis strain SS/UPM/MY/F001]
ATGAATTTTGAAAAGATCGAACAGGCTTACGACCTGCTATTAGAAAACGTACAGACTATCCAAAACCAGCTAGGTACCAA
TATCTATGATGCCATGATTGAGCAAAATGCTGCTTACGTAGCTAATCAGCATGAGACGGACCTTATTATCAATAATAACA
AGACCTTGAAACAACTAGATTTAACCAAGGAAGAATGGCGTCGTGCCTACCAATTCCTGCTCATCAAGGCCAATCAGACT
GAACCCATGCAGTACAATCACCAGTTCACACCAGACTCTATCGGATTTATCCTATCTTTTCTAGTAGACCAATTGGTGCC
GACTCAAAAGGTGACGGTTCTGGAAATTGGTTCGGGGACAGGCAATCTAGCGCAGACCATTCTCAACGCCAGCCAGAAAG
AATTGGATTACTTGGGGATTGAAGTGGACGACCTCTTGATTGATTTGTCGGCAAGTATTGCTGATGTCATGCAGGCAGAT
ATTTCTTTTGCTCAGGGAGATGCGGTACGTCCGCAGATTTTGAAGGAAAGTCAAGTAATTTTGGGAGATTTGCCTATTGG
TTACTATCCAGATGACCAGATTGCTAGCCGCTATCAGGTCGCCAGCCCAAATGAACATACCTACGCCCATCATTTACTCA
TGGAACAATCCCTGAAATATCTGGAAAAAGATGGCTTTGCGATTTTGTTGGCTCCAAATGATTTATTGACTAGCCCGCAA
AGCGATTTGCTGAAAGGTTGGTTACAGGAGCAAGCCAATATTGTTGCCATGATTGCCCTGCCACCAAGTCTCTTTGGGAA
GGCTGCTATGGCCAAGTCTATTTTTGTCTTGCAAAGGAAAGCAGCTAGACCTCTAGCGCCGTTTGTTTATCCCTTGCAAA
GTCTTCAAGAACCAGAAGCTATTCAGAAGTTCATGCTCAATTTCAAAAATTGGAAGCAAGAGAATGCAATTTAA

Domains


Predicted by InterproScan.

(68-282)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

60.443

99.685

0.603

  comYH Streptococcus mutans UA159

60.127

99.685

0.599