Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   I2438_RS01030 Genome accession   NZ_CP065190
Coordinates   162727..163680 (+) Length   317 a.a.
NCBI ID   WP_231146412.1    Uniprot ID   -
Organism   Streptococcus equi subsp. zooepidemicus strain SEZ36     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 118146..163680 162727..163680 within 0


Gene organization within MGE regions


Location: 118146..163680
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I2438_RS00710 (HCFMJIKG_00139) - 118146..118511 (+) 366 WP_012514777.1 DUF1033 family protein -
  I2438_RS00715 - 118793..118984 (+) 192 Protein_107 lactococcin 972 family bacteriocin -
  I2438_RS00720 (HCFMJIKG_00140) - 119049..121052 (+) 2004 WP_021320258.1 DUF1430 domain-containing protein -
  I2438_RS00725 (HCFMJIKG_00141) - 121054..121722 (+) 669 WP_012514779.1 ATP-binding cassette domain-containing protein -
  I2438_RS00730 (HCFMJIKG_00142) comGA/cglA 121878..122816 (+) 939 WP_021320259.1 competence type IV pilus ATPase ComGA Machinery gene
  I2438_RS00735 (HCFMJIKG_00143) comYB 122749..123783 (+) 1035 WP_229297307.1 competence type IV pilus assembly protein ComGB Machinery gene
  I2438_RS00740 (HCFMJIKG_00144) - 123864..125309 (-) 1446 WP_231146423.1 recombinase family protein -
  I2438_RS00745 (HCFMJIKG_00145) - 125433..126164 (-) 732 WP_231146422.1 XRE family transcriptional regulator -
  I2438_RS00750 (HCFMJIKG_00146) - 126326..126538 (+) 213 WP_214504394.1 helix-turn-helix domain-containing protein -
  I2438_RS00755 (HCFMJIKG_00147) - 126778..126954 (+) 177 WP_231146421.1 BOW99_gp33 family protein -
  I2438_RS00760 - 126973..127143 (+) 171 Protein_116 excisionase -
  I2438_RS00765 (HCFMJIKG_00148) - 127146..127286 (+) 141 WP_173476844.1 hypothetical protein -
  I2438_RS00770 (HCFMJIKG_00149) - 127286..127432 (+) 147 WP_154244050.1 hypothetical protein -
  I2438_RS00775 (HCFMJIKG_00150) - 127435..127776 (+) 342 WP_012679297.1 hypothetical protein -
  I2438_RS00780 (HCFMJIKG_00151) - 127776..128258 (+) 483 WP_012679298.1 siphovirus Gp157 family protein -
  I2438_RS00785 (HCFMJIKG_00152) - 128255..128728 (+) 474 WP_012679299.1 hypothetical protein -
  I2438_RS00790 (HCFMJIKG_00153) - 128688..129863 (+) 1176 WP_050316389.1 DEAD/DEAH box helicase family protein -
  I2438_RS00795 (HCFMJIKG_00154) - 129873..130583 (+) 711 WP_012679301.1 ERF family protein -
  I2438_RS00800 (HCFMJIKG_00155) ssb 130584..130976 (+) 393 WP_050316388.1 single-stranded DNA-binding protein Machinery gene
  I2438_RS00805 (HCFMJIKG_00156) - 130990..131817 (+) 828 WP_050316387.1 bifunctional DNA primase/polymerase -
  I2438_RS00810 (HCFMJIKG_00157) - 131801..133201 (+) 1401 WP_050316386.1 virulence-associated E family protein -
  I2438_RS00815 (HCFMJIKG_00158) - 133547..133741 (+) 195 WP_050316385.1 DUF7204 family protein -
  I2438_RS00820 (HCFMJIKG_00159) - 133734..133961 (+) 228 WP_050316384.1 hypothetical protein -
  I2438_RS00825 (HCFMJIKG_00160) - 133951..134661 (+) 711 WP_231147179.1 DNA-methyltransferase -
  I2438_RS00830 (HCFMJIKG_00161) - 134712..134942 (+) 231 WP_050316383.1 DUF1642 domain-containing protein -
  I2438_RS00835 (HCFMJIKG_00162) - 134939..135226 (+) 288 WP_050316382.1 hypothetical protein -
  I2438_RS00840 (HCFMJIKG_00163) - 135226..135441 (+) 216 WP_050316381.1 crAss001_48 related protein -
  I2438_RS00845 (HCFMJIKG_00164) - 135515..135952 (+) 438 WP_050316380.1 DUF1492 domain-containing protein -
  I2438_RS00850 (HCFMJIKG_00165) - 136198..136641 (+) 444 WP_230373400.1 terminase small subunit -
  I2438_RS00855 (HCFMJIKG_00166) - 136631..137929 (+) 1299 WP_050315874.1 PBSX family phage terminase large subunit -
  I2438_RS00860 (HCFMJIKG_00167) - 137939..139438 (+) 1500 WP_050316378.1 phage portal protein -
  I2438_RS00865 (HCFMJIKG_00168) - 139438..141048 (+) 1611 WP_050315872.1 phage minor capsid protein -
  I2438_RS00870 (HCFMJIKG_00169) - 141014..141235 (+) 222 WP_050315871.1 CPCC family cysteine-rich protein -
  I2438_RS00875 (HCFMJIKG_00170) - 141287..141571 (+) 285 WP_050315870.1 hypothetical protein -
  I2438_RS00880 (HCFMJIKG_00171) - 141657..141896 (+) 240 WP_050318689.1 hypothetical protein -
  I2438_RS00885 (HCFMJIKG_00172) - 141881..142471 (+) 591 WP_050318688.1 hypothetical protein -
  I2438_RS00890 (HCFMJIKG_00173) - 142485..143357 (+) 873 WP_050318686.1 hypothetical protein -
  I2438_RS00895 (HCFMJIKG_00174) - 143371..143610 (+) 240 WP_050318684.1 hypothetical protein -
  I2438_RS00900 (HCFMJIKG_00175) - 143613..143999 (+) 387 WP_050318682.1 hypothetical protein -
  I2438_RS00905 (HCFMJIKG_00176) - 143993..144337 (+) 345 WP_050318680.1 putative minor capsid protein -
  I2438_RS00910 (HCFMJIKG_00177) - 144334..144687 (+) 354 WP_050318679.1 minor capsid protein -
  I2438_RS00915 (HCFMJIKG_00178) - 144690..145058 (+) 369 WP_050318677.1 hypothetical protein -
  I2438_RS00920 (HCFMJIKG_00179) - 145062..145559 (+) 498 WP_050316279.1 phage tail tube protein -
  I2438_RS00925 (HCFMJIKG_00180) - 145579..145986 (+) 408 WP_050315860.1 hypothetical protein -
  I2438_RS00930 (HCFMJIKG_00181) - 145994..146611 (+) 618 WP_050316278.1 Gp15 family bacteriophage protein -
  I2438_RS00935 (HCFMJIKG_00182) - 146612..149059 (+) 2448 WP_050316374.1 phage tail tape measure protein -
  I2438_RS00940 (HCFMJIKG_00183) - 149063..149824 (+) 762 WP_050315857.1 phage tail domain-containing protein -
  I2438_RS00945 (HCFMJIKG_00184) - 149835..151832 (+) 1998 WP_231146417.1 phage tail protein -
  I2438_RS00950 (HCFMJIKG_00185) - 151832..152506 (+) 675 WP_231146416.1 collagen-like protein -
  I2438_RS00955 (HCFMJIKG_00186) - 152508..153122 (+) 615 WP_231146415.1 hypothetical protein -
  I2438_RS00960 (HCFMJIKG_00187) - 153133..155016 (+) 1884 WP_231146414.1 gp58-like family protein -
  I2438_RS00965 (HCFMJIKG_00188) - 155025..155456 (+) 432 WP_050322829.1 DUF1617 family protein -
  I2438_RS00970 (HCFMJIKG_00189) - 155460..156071 (+) 612 WP_050317086.1 DUF1366 domain-containing protein -
  I2438_RS00975 (HCFMJIKG_00190) - 156082..156453 (+) 372 WP_012678881.1 phage holin family protein -
  I2438_RS00980 (HCFMJIKG_00192) - 156569..157765 (+) 1197 WP_050317189.1 glucosaminidase domain-containing protein -
  I2438_RS00985 (HCFMJIKG_00193) - 157904..158428 (+) 525 WP_037578457.1 Panacea domain-containing protein -
  I2438_RS00990 (HCFMJIKG_00194) - 158416..159279 (+) 864 WP_050321679.1 DUF334 domain-containing protein -
  I2438_RS00995 (HCFMJIKG_00195) - 159393..160544 (-) 1152 WP_037578452.1 DNA/RNA non-specific endonuclease -
  I2438_RS01000 (HCFMJIKG_00196) prx 160785..160976 (+) 192 WP_050316475.1 hypothetical protein Regulator
  I2438_RS01005 (HCFMJIKG_00197) comYC 160991..161254 (+) 264 WP_050316474.1 competence type IV pilus major pilin ComGC Machinery gene
  I2438_RS01010 (HCFMJIKG_00198) comGD 161232..161657 (+) 426 WP_231146413.1 competence type IV pilus minor pilin ComGD -
  I2438_RS01015 (HCFMJIKG_00199) comGE 161662..161904 (+) 243 WP_012677236.1 competence type IV pilus minor pilin ComGE -
  I2438_RS01020 (HCFMJIKG_00200) comYF 161891..162325 (+) 435 WP_014622144.1 competence type IV pilus minor pilin ComGF Machinery gene
  I2438_RS01025 (HCFMJIKG_00201) comGG 162303..162665 (+) 363 WP_012514785.1 competence type IV pilus minor pilin ComGG -
  I2438_RS01030 (HCFMJIKG_00202) comYH 162727..163680 (+) 954 WP_231146412.1 class I SAM-dependent methyltransferase Machinery gene

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 35708.69 Da        Isoelectric Point: 4.6973

>NTDB_id=507745 I2438_RS01030 WP_231146412.1 162727..163680(+) (comYH) [Streptococcus equi subsp. zooepidemicus strain SEZ36]
MNFEKIEQAYELILENSQLIENDLKTHIYDAIVEQNSFYLGAQGASPQVAKNIETLKALQLTKEEWRQAYQFVLIKAGKT
EPLQANHQFTPDAIGFIMLYILETLSSQESLDVLEIGSGTGNLAQTILNHSHKSIDYLGIELDDLLIDLSASIAEIMGSS
AQFIQEDAVRPQLLKESDLIISDLPVGFYPNDDIASRYQVASSDEHTYAHHLLMEQALKYLKKDGFAIFLAPVNLLSSPQ
SHLLKQWLKGYAQVAALITLPEAVFGNPANAKSIIVLCKQSNRFAETFVYPIRDLKSVDNVRDFMENFKNWKRDNVI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=507745 I2438_RS01030 WP_231146412.1 162727..163680(+) (comYH) [Streptococcus equi subsp. zooepidemicus strain SEZ36]
ATGAATTTTGAAAAGATTGAACAAGCCTATGAGCTTATATTAGAAAATAGCCAGCTGATTGAAAATGATTTAAAAACGCA
TATCTATGATGCTATTGTTGAGCAGAATTCTTTTTATCTGGGAGCCCAAGGAGCAAGCCCTCAGGTTGCTAAAAATATTG
AGACGTTGAAGGCCTTGCAGCTAACCAAGGAGGAGTGGCGTCAGGCTTATCAGTTTGTTTTGATCAAGGCTGGGAAAACA
GAGCCATTACAGGCCAACCACCAATTTACCCCAGACGCGATCGGCTTTATCATGCTTTATATTTTGGAGACCTTGAGTTC
ACAAGAGTCACTTGATGTGCTTGAGATTGGCAGTGGAACAGGTAATCTAGCTCAAACTATTTTAAACCATTCACATAAGA
GCATTGATTATTTAGGCATTGAGCTTGATGATTTACTAATTGATCTATCAGCTAGTATCGCTGAAATCATGGGGTCATCA
GCCCAGTTTATTCAAGAGGATGCTGTCAGACCTCAGCTGTTAAAAGAAAGTGATCTCATCATTAGCGATTTGCCAGTTGG
TTTTTATCCTAACGACGATATTGCTAGCAGGTATCAGGTGGCTAGCTCAGATGAGCATACCTATGCCCATCATTTACTGA
TGGAACAGGCCTTAAAGTACCTGAAAAAAGATGGTTTTGCTATTTTCTTAGCACCGGTAAATCTTTTGAGCAGCCCACAA
AGTCACCTCTTAAAACAATGGCTGAAGGGCTATGCTCAGGTTGCGGCCTTGATTACTTTGCCTGAGGCTGTGTTTGGCAA
TCCAGCTAATGCAAAATCGATTATTGTTCTTTGTAAGCAATCGAATCGCTTTGCAGAAACCTTTGTTTACCCCATTAGGG
ATTTGAAATCTGTTGATAATGTTCGTGATTTTATGGAAAACTTCAAAAATTGGAAACGGGATAATGTTATTTAA

Domains


Predicted by InterproScan.

(68-294)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

63.608

99.685

0.634

  comYH Streptococcus mutans UA140

63.608

99.685

0.634