Detailed information
Overview
| Name | comYH | Type | Machinery gene |
| Locus tag | GU336_RS10980 | Genome accession | NZ_CP047616 |
| Coordinates | 2231325..2232260 (-) | Length | 311 a.a. |
| NCBI ID | WP_167372300.1 | Uniprot ID | - |
| Organism | Lactococcus raffinolactis strain Lr_19_5 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| ICE | 2226184..2264602 | 2231325..2232260 | within | 0 |
Gene organization within MGE regions
Location: 2226184..2264602
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GU336_RS10955 (GU336_10945) | - | 2226272..2226973 (-) | 702 | WP_167839068.1 | hypothetical protein | - |
| GU336_RS10960 (GU336_10950) | - | 2227058..2228008 (+) | 951 | WP_068164079.1 | IS30 family transposase | - |
| GU336_RS10965 (GU336_10955) | frr | 2228276..2228833 (-) | 558 | WP_061774678.1 | ribosome recycling factor | - |
| GU336_RS10970 (GU336_10960) | pyrH | 2228864..2229583 (-) | 720 | WP_061774677.1 | UMP kinase | - |
| GU336_RS10975 (GU336_10965) | - | 2229823..2231016 (-) | 1194 | WP_096039558.1 | acetate kinase | - |
| GU336_RS10980 (GU336_10970) | comYH | 2231325..2232260 (-) | 936 | WP_167372300.1 | class I SAM-dependent methyltransferase | Machinery gene |
| GU336_RS10985 (GU336_10975) | - | 2232422..2232910 (-) | 489 | WP_167839069.1 | GNAT family N-acetyltransferase | - |
| GU336_RS10990 (GU336_10980) | murC | 2233011..2234345 (-) | 1335 | WP_138492113.1 | UDP-N-acetylmuramate--L-alanine ligase | - |
| GU336_RS10995 (GU336_10985) | - | 2234614..2235225 (-) | 612 | WP_138492112.1 | hypothetical protein | - |
| GU336_RS11000 (GU336_10990) | - | 2235361..2238474 (-) | 3114 | WP_096039562.1 | DEAD/DEAH box helicase | - |
| GU336_RS11005 (GU336_10995) | - | 2238740..2239744 (-) | 1005 | WP_138492110.1 | serine hydrolase domain-containing protein | - |
| GU336_RS11010 (GU336_11000) | - | 2239741..2240409 (-) | 669 | WP_167839070.1 | CppA N-terminal domain-containing protein | - |
| GU336_RS11015 (GU336_11005) | - | 2240680..2241390 (+) | 711 | WP_061774669.1 | type II CAAX endopeptidase family protein | - |
| GU336_RS11020 (GU336_11010) | gla | 2241475..2242374 (-) | 900 | WP_167839071.1 | aquaglyceroporin Gla | - |
| GU336_RS11025 (GU336_11015) | - | 2242645..2244957 (+) | 2313 | WP_167839072.1 | Xaa-Pro dipeptidyl-peptidase | - |
| GU336_RS11030 (GU336_11020) | - | 2244954..2245676 (+) | 723 | WP_167839073.1 | gamma-glutamyl-gamma-aminobutyrate hydrolase family protein | - |
| GU336_RS11035 (GU336_11025) | - | 2245895..2246026 (-) | 132 | WP_096039567.1 | putative holin-like toxin | - |
| GU336_RS11040 (GU336_11030) | - | 2246187..2246675 (-) | 489 | WP_167839074.1 | GNAT family N-acetyltransferase | - |
| GU336_RS11045 (GU336_11035) | gltX | 2246678..2248126 (-) | 1449 | WP_096039569.1 | glutamate--tRNA ligase | - |
| GU336_RS11050 (GU336_11040) | ispF | 2248271..2248753 (-) | 483 | WP_096039570.1 | 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase | - |
| GU336_RS11055 (GU336_11045) | - | 2248740..2249501 (-) | 762 | WP_061774662.1 | TIGR00266 family protein | - |
| GU336_RS11060 (GU336_11050) | ispD | 2249557..2250237 (-) | 681 | WP_096039571.1 | 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase | - |
| GU336_RS11065 (GU336_11055) | - | 2250234..2251307 (-) | 1074 | WP_167839075.1 | TRAM domain-containing protein | - |
| GU336_RS11070 (GU336_11060) | radA | 2251536..2252897 (-) | 1362 | WP_096039573.1 | DNA repair protein RadA | Machinery gene |
| GU336_RS11075 (GU336_11065) | - | 2252899..2253345 (-) | 447 | WP_061774658.1 | dUTP diphosphatase | - |
| GU336_RS11080 (GU336_11070) | - | 2253428..2255179 (-) | 1752 | WP_167839076.1 | ABC transporter ATP-binding protein | - |
| GU336_RS11085 (GU336_11075) | - | 2255172..2256890 (-) | 1719 | WP_167839077.1 | ABC transporter ATP-binding protein | - |
| GU336_RS11090 (GU336_11080) | - | 2257288..2257890 (+) | 603 | Protein_2164 | IS5-like element IS1194 family transposase | - |
| GU336_RS11095 (GU336_11085) | - | 2257964..2258644 (+) | 681 | WP_031367160.1 | IS6-like element ISS1N family transposase | - |
| GU336_RS13140 | - | 2259109..2259189 (+) | 81 | WP_220437321.1 | putative holin-like toxin | - |
| GU336_RS11105 (GU336_11095) | - | 2259462..2260067 (-) | 606 | WP_167839078.1 | helical hairpin domain-containing protein | - |
| GU336_RS11110 (GU336_11100) | - | 2260120..2261070 (+) | 951 | WP_167838214.1 | IS30 family transposase | - |
| GU336_RS11115 (GU336_11105) | - | 2261030..2262184 (-) | 1155 | WP_167839079.1 | relaxase/mobilization nuclease domain-containing protein | - |
| GU336_RS11120 (GU336_11110) | mobC | 2262200..2262547 (-) | 348 | WP_167839080.1 | plasmid mobilization relaxosome protein MobC | - |
| GU336_RS11125 (GU336_11115) | - | 2262740..2262937 (-) | 198 | WP_167839081.1 | hypothetical protein | - |
| GU336_RS11130 (GU336_11120) | - | 2263576..2264454 (-) | 879 | WP_167839190.1 | IS3 family transposase | - |
Sequence
Protein
Download Length: 311 a.a. Molecular weight: 34665.65 Da Isoelectric Point: 4.4804
>NTDB_id=414664 GU336_RS10980 WP_167372300.1 2231325..2232260(-) (comYH) [Lactococcus raffinolactis strain Lr_19_5]
MNMEKIETAFGLLLANVQQLETRLATHFYDALIEQNVSYLGKAVSEDLQQRNEQLRALNLTKQEWQKVYQFALIKGAKDM
HLQANHQLTPDAIGYIINFMIETLSTETNLSILELGSGTGNLAETLLTSMSDKALTYTGFEVDDLMIDLSASIADVMQTS
AQFLQIDAVRPQVIEPVDLLLSDLPVGYYPDDAIAQRSVVGSQSEHTYAHHLLMAQGFKYLKADGYAIFIAPSDLLSSPQ
SDLLKKWLQDYASVAAVITLPEDIVTENHTKAIFVLQKSAQGKAPFVFPLISLTNPEIVQSFMTQFRQNMI
MNMEKIETAFGLLLANVQQLETRLATHFYDALIEQNVSYLGKAVSEDLQQRNEQLRALNLTKQEWQKVYQFALIKGAKDM
HLQANHQLTPDAIGYIINFMIETLSTETNLSILELGSGTGNLAETLLTSMSDKALTYTGFEVDDLMIDLSASIADVMQTS
AQFLQIDAVRPQVIEPVDLLLSDLPVGYYPDDAIAQRSVVGSQSEHTYAHHLLMAQGFKYLKADGYAIFIAPSDLLSSPQ
SDLLKKWLQDYASVAAVITLPEDIVTENHTKAIFVLQKSAQGKAPFVFPLISLTNPEIVQSFMTQFRQNMI
Nucleotide
Download Length: 936 bp
>NTDB_id=414664 GU336_RS10980 WP_167372300.1 2231325..2232260(-) (comYH) [Lactococcus raffinolactis strain Lr_19_5]
ATGAATATGGAAAAAATAGAAACGGCATTTGGCCTATTATTAGCCAACGTTCAGCAACTTGAAACACGCTTGGCAACACA
TTTTTACGATGCCTTGATTGAGCAAAATGTGAGCTATCTCGGTAAAGCTGTATCAGAAGACTTGCAGCAACGCAATGAGC
AGTTGCGTGCGCTCAATTTGACAAAACAAGAGTGGCAAAAGGTCTATCAGTTTGCCTTGATTAAGGGTGCTAAGGACATG
CACCTGCAAGCCAATCATCAGTTAACACCGGATGCAATTGGGTATATCATCAATTTCATGATTGAGACCTTATCTACCGA
AACTAACTTGTCTATTTTGGAATTAGGGTCTGGGACAGGTAATTTAGCCGAGACATTATTGACTAGCATGTCAGATAAAG
CACTAACCTATACTGGCTTTGAAGTTGATGATTTAATGATTGACCTGTCGGCTAGCATTGCCGATGTCATGCAAACTTCA
GCCCAATTTTTGCAGATTGATGCTGTGCGCCCTCAGGTTATCGAACCTGTGGATCTGTTATTGTCAGATTTACCGGTAGG
CTATTATCCAGATGATGCGATTGCGCAACGTTCAGTTGTTGGCAGTCAGAGTGAGCATACCTACGCCCATCACTTGCTGA
TGGCGCAAGGATTCAAATATCTAAAAGCAGATGGTTATGCGATTTTTATTGCACCGAGTGATTTGTTGTCTAGTCCGCAA
TCCGATTTATTAAAAAAATGGTTGCAGGATTATGCCAGCGTCGCTGCTGTGATTACTTTACCAGAAGACATTGTCACTGA
AAATCATACTAAGGCAATCTTTGTTTTACAAAAGTCTGCACAAGGTAAAGCACCCTTTGTTTTTCCTTTGATAAGTCTAA
CCAATCCTGAAATTGTGCAGTCTTTCATGACGCAATTTCGTCAGAATATGATATAA
ATGAATATGGAAAAAATAGAAACGGCATTTGGCCTATTATTAGCCAACGTTCAGCAACTTGAAACACGCTTGGCAACACA
TTTTTACGATGCCTTGATTGAGCAAAATGTGAGCTATCTCGGTAAAGCTGTATCAGAAGACTTGCAGCAACGCAATGAGC
AGTTGCGTGCGCTCAATTTGACAAAACAAGAGTGGCAAAAGGTCTATCAGTTTGCCTTGATTAAGGGTGCTAAGGACATG
CACCTGCAAGCCAATCATCAGTTAACACCGGATGCAATTGGGTATATCATCAATTTCATGATTGAGACCTTATCTACCGA
AACTAACTTGTCTATTTTGGAATTAGGGTCTGGGACAGGTAATTTAGCCGAGACATTATTGACTAGCATGTCAGATAAAG
CACTAACCTATACTGGCTTTGAAGTTGATGATTTAATGATTGACCTGTCGGCTAGCATTGCCGATGTCATGCAAACTTCA
GCCCAATTTTTGCAGATTGATGCTGTGCGCCCTCAGGTTATCGAACCTGTGGATCTGTTATTGTCAGATTTACCGGTAGG
CTATTATCCAGATGATGCGATTGCGCAACGTTCAGTTGTTGGCAGTCAGAGTGAGCATACCTACGCCCATCACTTGCTGA
TGGCGCAAGGATTCAAATATCTAAAAGCAGATGGTTATGCGATTTTTATTGCACCGAGTGATTTGTTGTCTAGTCCGCAA
TCCGATTTATTAAAAAAATGGTTGCAGGATTATGCCAGCGTCGCTGCTGTGATTACTTTACCAGAAGACATTGTCACTGA
AAATCATACTAAGGCAATCTTTGTTTTACAAAAGTCTGCACAAGGTAAAGCACCCTTTGTTTTTCCTTTGATAAGTCTAA
CCAATCCTGAAATTGTGCAGTCTTTCATGACGCAATTTCGTCAGAATATGATATAA
Domains
No domain identified.
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comYH | Streptococcus mutans UA140 |
54.662 |
100 |
0.547 |
| comYH | Streptococcus mutans UA159 |
54.662 |
100 |
0.547 |