Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: NQZ88_RS00880
Genome accession: NZ_CP102152
Coordinates: 135704..136657 (+)
Length: 317 a.a.
NCBI ID: WP_257116215.1
UniProt ID: -
Organism: Streptococcus suis strain DNS11
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
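
The coordinates and the protein length in this record can be cross-checked with a few lines of Python. This is a minimal sketch: the function names are hypothetical, and it assumes coordinates are 1-based and inclusive and that the annotated interval includes the stop codon (which is consistent with the 954 bp and 317 a.a. reported in this record).

```python
import re


def parse_location(text):
    """Parse 'start..end (strand)' into (start, end, strand).

    Assumes 1-based, inclusive coordinates, as used in this record.
    """
    m = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return int(m.group(1)), int(m.group(2)), m.group(3)


def span_bp(start, end):
    """Length in bp of an inclusive interval."""
    return end - start + 1


def protein_length(start, end):
    """Amino acid count, assuming the interval ends with a stop codon."""
    bp = span_bp(start, end)
    if bp % 3 != 0:
        raise ValueError("interval length is not a multiple of 3")
    return bp // 3 - 1


start, end, strand = parse_location("135704..136657 (+)")
print(span_bp(start, end), protein_length(start, end), strand)
```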

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 107215..171199 135704..136657 within 0
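
The relative position and distance in the summary above can be derived from the two coordinate pairs. The following is a minimal sketch, not VRprofile2's actual logic: the function name is hypothetical, and only the label "within" appears in this record; the other labels ("upstream", "downstream", "overlapping") are assumptions added for illustration.

```python
def relate_gene_to_mge(gene, mge):
    """Classify a gene interval relative to an MGE interval.

    Both arguments are (start, end) tuples with 1-based, inclusive
    coordinates. Returns (relative_position, distance_bp).
    Labels other than 'within' are illustrative assumptions.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    # Partial overlap: the gene crosses one MGE boundary.
    return "overlapping", 0


print(relate_gene_to_mge((135704, 136657), (107215, 171199)))
```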


Gene organization within MGE regions


Location: 107215..171199
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NQZ88_RS00735 (NQZ88_00735) - 107425..107634 (+) 210 WP_023369068.1 heavy-metal-associated domain-containing protein -
  NQZ88_RS00740 (NQZ88_00740) - 107879..108253 (-) 375 WP_023369069.1 HIT family protein -
  NQZ88_RS00745 (NQZ88_00745) - 108307..109089 (-) 783 WP_023369071.1 TMEM175 family protein -
  NQZ88_RS00750 (NQZ88_00750) tyrS 109093..110349 (-) 1257 WP_023369073.1 tyrosine--tRNA ligase -
  NQZ88_RS00755 (NQZ88_00755) pbp1b 110503..112926 (+) 2424 WP_023369075.1 penicillin-binding protein PBP1B -
  NQZ88_RS00760 (NQZ88_00760) rpoB 113275..116847 (+) 3573 WP_002936570.1 DNA-directed RNA polymerase subunit beta -
  NQZ88_RS00765 (NQZ88_00765) rpoC 117025..120672 (+) 3648 WP_012774906.1 DNA-directed RNA polymerase subunit beta' -
  NQZ88_RS00770 (NQZ88_00770) - 120823..121188 (+) 366 WP_023369079.1 DUF1033 family protein -
  NQZ88_RS00775 (NQZ88_00775) - 121223..122173 (-) 951 Protein_119 S66 peptidase family protein -
  NQZ88_RS00780 (NQZ88_00780) comYA 122259..123209 (+) 951 WP_044760311.1 competence type IV pilus ATPase ComGA Machinery gene
  NQZ88_RS00785 (NQZ88_00785) comYB 123121..124158 (+) 1038 WP_257116212.1 competence type IV pilus assembly protein ComGB Machinery gene
  NQZ88_RS00790 (NQZ88_00790) comYC 124160..124441 (+) 282 WP_074389246.1 competence type IV pilus major pilin ComGC Machinery gene
  NQZ88_RS00795 (NQZ88_00795) comGD 124422..124829 (+) 408 WP_074389247.1 competence type IV pilus minor pilin ComGD -
  NQZ88_RS00800 (NQZ88_00800) comYE 124801..125094 (+) 294 WP_024413203.1 competence type IV pilus minor pilin ComGE Machinery gene
  NQZ88_RS00805 (NQZ88_00805) comGF/cglF 125081..125515 (+) 435 WP_074389248.1 competence type IV pilus minor pilin ComGF Machinery gene
  NQZ88_RS00810 (NQZ88_00810) comGG 125493..125903 (+) 411 WP_074389249.1 competence type IV pilus minor pilin ComGG -
  NQZ88_RS00815 (NQZ88_00815) - 126384..126809 (-) 426 WP_226319165.1 ArpU family phage packaging/lysis transcriptional regulator -
  NQZ88_RS00820 (NQZ88_00820) - 127133..127708 (-) 576 WP_074392204.1 hypothetical protein -
  NQZ88_RS00825 (NQZ88_00825) - 127729..127902 (-) 174 WP_257116213.1 hypothetical protein -
  NQZ88_RS00830 (NQZ88_00830) - 128207..129643 (-) 1437 WP_257116214.1 phage/plasmid primase, P4 family -
  NQZ88_RS00835 (NQZ88_00835) - 129706..130584 (-) 879 WP_074389575.1 primase alpha helix C-terminal domain-containing protein -
  NQZ88_RS00840 (NQZ88_00840) - 130719..131003 (-) 285 WP_044766582.1 hypothetical protein -
  NQZ88_RS00845 (NQZ88_00845) - 130993..131331 (-) 339 WP_074390405.1 hypothetical protein -
  NQZ88_RS00850 (NQZ88_00850) - 131343..131537 (-) 195 WP_225620783.1 hypothetical protein -
  NQZ88_RS00855 (NQZ88_00855) - 131548..131991 (-) 444 WP_074390406.1 hypothetical protein -
  NQZ88_RS00860 (NQZ88_00860) - 132463..133089 (-) 627 WP_074390407.1 Rha family transcriptional regulator -
  NQZ88_RS00865 (NQZ88_00865) - 133106..133306 (-) 201 WP_015647585.1 helix-turn-helix transcriptional regulator -
  NQZ88_RS00870 (NQZ88_00870) - 133492..134001 (+) 510 WP_370695883.1 helix-turn-helix domain-containing protein -
  NQZ88_RS00875 (NQZ88_00875) - 134462..135607 (+) 1146 WP_074390408.1 site-specific integrase -
  NQZ88_RS00880 (NQZ88_00880) comYH 135704..136657 (+) 954 WP_257116215.1 class I SAM-dependent methyltransferase Machinery gene
  NQZ88_RS00885 (NQZ88_00885) - 136707..137894 (+) 1188 WP_074389251.1 acetate kinase -
  NQZ88_RS00890 (NQZ88_00890) - 138209..138760 (+) 552 WP_024389479.1 folate family ECF transporter S component -
  NQZ88_RS00895 (NQZ88_00895) - 138816..140072 (+) 1257 WP_074389253.1 folylpolyglutamate synthase/dihydrofolate synthase family protein -
  NQZ88_RS00900 (NQZ88_00900) pepA 140117..141178 (-) 1062 WP_074389254.1 glutamyl aminopeptidase -
  NQZ88_RS00905 (NQZ88_00905) - 141325..141609 (+) 285 WP_044674517.1 DUF4651 domain-containing protein -
  NQZ88_RS00910 (NQZ88_00910) - 141606..141926 (+) 321 WP_024386832.1 thioredoxin family protein -
  NQZ88_RS00915 (NQZ88_00915) - 141971..142927 (+) 957 WP_044754802.1 DUF1002 domain-containing protein -
  NQZ88_RS00920 (NQZ88_00920) ytpR 142946..143569 (+) 624 WP_074389255.1 YtpR family tRNA-binding protein -
  NQZ88_RS00925 (NQZ88_00925) - 143602..144381 (-) 780 WP_074389256.1 DUF2785 domain-containing protein -
  NQZ88_RS00930 (NQZ88_00930) ssbA 144436..144831 (+) 396 WP_011921737.1 single-stranded DNA-binding protein Machinery gene
  NQZ88_RS00935 (NQZ88_00935) - 144950..145195 (-) 246 WP_074389257.1 hypothetical protein -
  NQZ88_RS00940 (NQZ88_00940) groES 145392..145673 (+) 282 WP_074389258.1 co-chaperone GroES -
  NQZ88_RS00945 (NQZ88_00945) groL 145685..147307 (+) 1623 WP_074389259.1 chaperonin GroEL -
  NQZ88_RS00950 (NQZ88_00950) rpsL 147539..147952 (+) 414 WP_002940030.1 30S ribosomal protein S12 -
  NQZ88_RS00955 (NQZ88_00955) rpsG 147969..148439 (+) 471 WP_002940031.1 30S ribosomal protein S7 -
  NQZ88_RS00960 (NQZ88_00960) fusA 148767..150848 (+) 2082 WP_002938501.1 elongation factor G -
  NQZ88_RS00965 (NQZ88_00965) - 151837..153021 (+) 1185 Protein_157 M13 family peptidase -
  NQZ88_RS00970 (NQZ88_00970) - 153229..153828 (+) 600 WP_228476909.1 M13-type metalloendopeptidase -
  NQZ88_RS00975 (NQZ88_00975) gap 154195..155205 (+) 1011 WP_012775479.1 type I glyceraldehyde-3-phosphate dehydrogenase -
  NQZ88_RS00980 (NQZ88_00980) pgk 155461..156660 (+) 1200 WP_014637337.1 phosphoglycerate kinase -
  NQZ88_RS00985 (NQZ88_00985) - 157440..157955 (+) 516 WP_024403309.1 aromatic acid exporter family protein -
  NQZ88_RS00990 (NQZ88_00990) - 158031..158402 (+) 372 WP_002940041.1 MerR family transcriptional regulator -
  NQZ88_RS00995 (NQZ88_00995) glnA 158431..159777 (+) 1347 WP_074389261.1 type I glutamate--ammonia ligase -
  NQZ88_RS01000 (NQZ88_01000) rnjA 160104..161783 (-) 1680 WP_002938522.1 ribonuclease J1 -
  NQZ88_RS01005 (NQZ88_01005) - 161787..162017 (-) 231 WP_002938523.1 DNA-dependent RNA polymerase subunit epsilon -
  NQZ88_RS01010 (NQZ88_01010) tsaB 162469..163152 (+) 684 WP_074389262.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex dimerization subunit type 1 TsaB -
  NQZ88_RS01015 (NQZ88_01015) rimI 163149..163589 (+) 441 WP_074389263.1 ribosomal protein S18-alanine N-acetyltransferase -
  NQZ88_RS01020 (NQZ88_01020) tsaD 163579..164586 (+) 1008 WP_074389264.1 tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex transferase subunit TsaD -
  NQZ88_RS01025 (NQZ88_01025) - 164763..165599 (-) 837 WP_024382108.1 AraC family transcriptional regulator -
  NQZ88_RS01030 (NQZ88_01030) - 165814..167088 (+) 1275 WP_074389265.1 sugar ABC transporter substrate-binding protein -
  NQZ88_RS01035 (NQZ88_01035) - 167157..168050 (+) 894 WP_074389266.1 carbohydrate ABC transporter permease -
  NQZ88_RS01040 (NQZ88_01040) - 168061..168891 (+) 831 WP_074389267.1 carbohydrate ABC transporter permease -
  NQZ88_RS01045 (NQZ88_01045) - 168901..171102 (+) 2202 WP_074389268.1 alpha-galactosidase -
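
Rows of the gene listing above can be checked for internal consistency (size versus coordinates) and for spacing between neighbouring genes. This is a minimal sketch with hypothetical helper names; it assumes each row contains a "start..end (strand) size" run as shown in the table, with 1-based, inclusive coordinates.

```python
import re

# Matches the "start..end (strand) size" part of a row.
ROW_COORDS = re.compile(r"(\d+)\.\.(\d+) \(([+-])\) (\d+)")


def parse_row(row):
    """Extract locus tag, coordinates, strand and size from one table row."""
    m = ROW_COORDS.search(row)
    if m is None:
        raise ValueError("no coordinates found in row")
    return {
        "locus_tag": row.split()[0],
        "start": int(m.group(1)),
        "end": int(m.group(2)),
        "strand": m.group(3),
        "size_bp": int(m.group(4)),
    }


def size_is_consistent(gene):
    """True if the listed size equals end - start + 1."""
    return gene["end"] - gene["start"] + 1 == gene["size_bp"]


def gap_bp(first, second):
    """Bases between two genes; a negative value means they overlap."""
    return second["start"] - first["end"] - 1


com_ya = parse_row("NQZ88_RS00780 (NQZ88_00780) comYA 122259..123209 (+) 951 WP_044760311.1")
com_yb = parse_row("NQZ88_RS00785 (NQZ88_00785) comYB 123121..124158 (+) 1038 WP_257116212.1")
print(size_is_consistent(com_ya), gap_bp(com_ya, com_yb))
```

A negative gap, as between comYA and comYB here, indicates that the two annotated intervals overlap.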

Sequence


Protein


Length: 317 a.a.        Molecular weight: 35794.98 Da        Isoelectric Point: 4.4133

>NTDB_id=715044 NQZ88_RS00880 WP_257116215.1 135704..136657(+) (comYH) [Streptococcus suis strain DNS11]
MNFEKIEQAYDLLLENVQTIQNQLGTNIYDAMIEQNAAYVADQHEMDLVVNNNKTLKQLDLTKEEWRRAFQFLFIKANQT
EPMQYNHQFTPDSIGFILSFLVDQLVPTQKVMVLEIGSGTGNLAQTILNASQKELDYLGIEVDDLLIDLSASIADVMQAD
ISFAQGDAVRPQILKESQVILGDLPIGYYPDDQIASRYQVASPNEHTYAHHLLMEQSLKYLEKDGFAILLAPNDLLTSPQ
SDLLKGWLQEQANIVAMIALPPNLFGKAAMAKSIFVLQKKAARSLAPFVYPLQSLQEPEAIQKFMLNFKNWKQENAI

Nucleotide


Length: 954 bp

>NTDB_id=715044 NQZ88_RS00880 WP_257116215.1 135704..136657(+) (comYH) [Streptococcus suis strain DNS11]
ATGAATTTTGAAAAGATCGAACAGGCTTACGACCTGCTATTAGAAAACGTACAGACTATCCAAAACCAGCTAGGTACCAA
TATCTATGATGCCATGATTGAGCAAAATGCTGCTTACGTAGCTGATCAGCATGAGATGGACCTTGTTGTCAATAATAACA
AGACCTTGAAACAACTAGATTTAACCAAGGAAGAATGGCGTCGTGCCTTCCAATTCCTGTTCATCAAGGCCAATCAGACT
GAACCCATGCAGTACAATCACCAGTTCACACCAGACTCTATCGGATTTATCCTATCTTTTCTAGTAGACCAATTGGTGCC
GACTCAAAAGGTGATGGTTCTGGAAATTGGTTCGGGGACAGGCAATCTAGCTCAGACCATTCTCAACGCCAGCCAGAAAG
AATTGGATTACTTGGGGATCGAAGTGGACGACCTCTTGATTGATTTGTCGGCAAGTATTGCGGATGTCATGCAGGCAGAT
ATTTCTTTTGCTCAGGGAGATGCGGTACGTCCGCAGATTTTGAAGGAAAGTCAAGTAATTTTGGGAGATTTGCCTATTGG
CTACTATCCAGATGACCAGATTGCTAGCCGCTATCAGGTCGCCAGCCCAAATGAACATACCTACGCCCATCATTTGCTCA
TGGAACAATCCCTCAAATATCTGGAAAAAGATGGCTTTGCGATTTTGTTGGCTCCAAATGATTTATTGACTAGTCCGCAA
AGCGATTTGCTGAAAGGTTGGTTACAGGAGCAAGCCAATATTGTTGCCATGATTGCCCTGCCACCAAATCTCTTTGGGAA
GGCTGCTATGGCCAAGTCTATTTTTGTCTTGCAAAAGAAAGCTGCAAGATCGTTGGCGCCGTTTGTTTATCCCTTACAAA
GTCTTCAAGAACCAGAAGCTATTCAGAAGTTCATGCTCAATTTCAAAAATTGGAAGCAAGAGAATGCAATTTAA
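
The nucleotide and protein records above can be compared by translating the coding sequence. The sketch below uses the standard genetic code built from the conventional TCAG ordering. The function name is hypothetical, and only the first 30 nt of the sequence are used as a short example; the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the comYH nucleotide record above.
prefix = "ATGAATTTTGAAAAGATCGAACAGGCTTAC"
print(translate(prefix))
```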

Domains


Predicted by InterProScan.

(78-283)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
comYH Streptococcus mutans UA140 61.392 99.685 0.612
comYH Streptococcus mutans UA159 61.076 99.685 0.609
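
The Ha-values listed for the similar proteins are numerically consistent with the product of identity and coverage, each taken as a fraction. The record does not define the Ha-value, so the formula in this sketch is an assumption inferred from the listed numbers, and the function name is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


hits = [
    ("Streptococcus mutans UA140", 61.392, 99.685),
    ("Streptococcus mutans UA159", 61.076, 99.685),
]
for organism, identity, coverage in hits:
    print(organism, ha_value(identity, coverage))
```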