Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | QAO20_RS00200 | Genome accession | NZ_CP123106 |
| Coordinates | 31272..33473 (+) | Length | 733 a.a. |
| NCBI ID | WP_043054889.1 | Uniprot ID | - |
| Organism | Staphylococcus aureus strain IT-MSSA50 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
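The coordinates and lengths in the overview can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def cds_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


bp = cds_length_bp(31272, 33473)
aa = protein_length_aa(bp)
print(bp, aa)  # 2202 733
```

Under these assumptions, the 2202 bp span corresponds to 734 codons, one of which is the stop codon, giving the 733 a.a. listed above.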
Related MGE
Note: This gene co-localizes with a putative mobile genetic element (MGE) predicted in the genome by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 1..33473 | 31272..33473 | within | 0 |
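The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate pairs. This is a minimal sketch, not the database's own method: only the label "within" appears in the source, so the other labels and the distance convention (gap between the nearest ends) are assumptions made for illustration.

```python
from typing import Tuple


def relate(gene: Tuple[int, int], mge: Tuple[int, int]) -> Tuple[str, int]:
    """Classify a gene relative to an MGE using 1-based, inclusive coordinates."""
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0                    # gene fully contained in the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end    # hypothetical label
    if g_start > m_end:
        return "downstream", g_start - m_end  # hypothetical label
    return "overlapping", 0                   # hypothetical label


print(relate((31272, 33473), (1, 33473)))  # ('within', 0)
```

Note that comEC ends exactly at the MGE boundary (33473), so it counts as contained only because the comparison is inclusive.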
Gene organization within MGE regions
Location: 1..33473
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| QAO20_RS00010 (QAO20_00010) | - | 1..1695 (+) | 1695 | WP_000568449.1 | terminase large subunit | - |
| QAO20_RS00015 (QAO20_00015) | - | 1709..1912 (+) | 204 | WP_001052526.1 | hypothetical protein | - |
| QAO20_RS00020 (QAO20_00020) | - | 1915..3171 (+) | 1257 | WP_031882625.1 | phage portal protein | - |
| QAO20_RS00025 (QAO20_00025) | - | 3590..4174 (+) | 585 | WP_025173991.1 | HK97 family phage prohead protease | - |
| QAO20_RS00030 (QAO20_00030) | - | 4261..5496 (+) | 1236 | WP_098683529.1 | phage major capsid protein | - |
| QAO20_RS00035 (QAO20_00035) | - | 5533..5691 (+) | 159 | WP_001240639.1 | hypothetical protein | - |
| QAO20_RS00040 (QAO20_00040) | - | 5700..6032 (+) | 333 | WP_001177664.1 | head-tail connector protein | - |
| QAO20_RS00045 (QAO20_00045) | - | 6022..6354 (+) | 333 | WP_000671501.1 | head-tail adaptor protein | - |
| QAO20_RS00050 (QAO20_00050) | - | 6354..6731 (+) | 378 | WP_000501001.1 | HK97-gp10 family putative phage morphogenesis protein | - |
| QAO20_RS00055 (QAO20_00055) | - | 6728..7108 (+) | 381 | WP_000608369.1 | hypothetical protein | - |
| QAO20_RS00060 (QAO20_00060) | - | 7109..8062 (+) | 954 | WP_031882624.1 | major tail protein | - |
| QAO20_RS00065 (QAO20_00065) | gpG | 8127..8573 (+) | 447 | WP_000442602.1 | phage tail assembly chaperone G | - |
| QAO20_RS00070 (QAO20_00070) | gpGT | 8633..8755 (+) | 123 | WP_000570353.1 | phage tail assembly chaperone GT | - |
| QAO20_RS00075 (QAO20_00075) | - | 8811..13460 (+) | 4650 | WP_049316081.1 | phage tail tape measure protein | - |
| QAO20_RS00080 (QAO20_00080) | - | 13460..14950 (+) | 1491 | WP_049316082.1 | phage distal tail protein | - |
| QAO20_RS00085 (QAO20_00085) | - | 14966..18751 (+) | 3786 | WP_411907505.1 | phage tail spike protein | - |
| QAO20_RS00090 (QAO20_00090) | - | 18741..18893 (+) | 153 | WP_001153681.1 | hypothetical protein | - |
| QAO20_RS00095 (QAO20_00095) | - | 18940..19227 (+) | 288 | WP_001040261.1 | hypothetical protein | - |
| QAO20_RS00100 (QAO20_00100) | - | 19285..19581 (+) | 297 | WP_000539688.1 | DUF2951 domain-containing protein | - |
| QAO20_RS00105 (QAO20_00105) | pepG1 | 19773..19907 (+) | 135 | WP_000226108.1 | type I toxin-antitoxin system toxin PepG1 | - |
| QAO20_RS00110 (QAO20_00110) | - | 19960..20067 (-) | 108 | WP_001791821.1 | hypothetical protein | - |
| QAO20_RS00115 (QAO20_00115) | - | 20119..20373 (+) | 255 | WP_000611512.1 | phage holin | - |
| QAO20_RS00120 (QAO20_00120) | - | 20385..21140 (+) | 756 | WP_411907507.1 | CHAP domain-containing protein | - |
| QAO20_RS00125 (QAO20_00125) | - | 21631..22425 (+) | 795 | WP_000238963.1 | HipA family kinase | - |
| QAO20_RS00130 (QAO20_00130) | - | 22432..23169 (+) | 738 | WP_000278830.1 | hypothetical protein | - |
| QAO20_RS00135 (QAO20_00135) | - | 23394..23537 (+) | 144 | Protein_26 | exotoxin beta-grasp domain-containing protein | - |
| QAO20_RS00140 (QAO20_00140) | - | 23739..24008 (-) | 270 | WP_000829753.1 | hypothetical protein | - |
| QAO20_RS00145 (QAO20_00145) | mtnN | 24321..25007 (+) | 687 | WP_411907509.1 | 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase | - |
| QAO20_RS00150 (QAO20_00150) | - | 25027..25554 (+) | 528 | WP_000524902.1 | YqeG family HAD IIIA-type phosphatase | - |
| QAO20_RS00155 (QAO20_00155) | yqeH | 25555..26655 (+) | 1101 | WP_001280141.1 | ribosome biogenesis GTPase YqeH | - |
| QAO20_RS00160 (QAO20_00160) | aroE | 26669..27475 (+) | 807 | WP_000666750.1 | shikimate dehydrogenase | - |
| QAO20_RS00165 (QAO20_00165) | yhbY | 27479..27769 (+) | 291 | WP_000955234.1 | ribosome assembly RNA-binding protein YhbY | - |
| QAO20_RS00170 (QAO20_00170) | nadD | 27772..28341 (+) | 570 | WP_000725157.1 | nicotinate (nicotinamide) nucleotide adenylyltransferase | - |
| QAO20_RS00175 (QAO20_00175) | yqeK | 28331..28915 (+) | 585 | WP_001017838.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | - |
| QAO20_RS00180 (QAO20_00180) | rsfS | 28916..29269 (+) | 354 | WP_001088020.1 | ribosome silencing factor | - |
| QAO20_RS00185 (QAO20_00185) | - | 29272..29988 (+) | 717 | WP_000084829.1 | class I SAM-dependent DNA methyltransferase | - |
| QAO20_RS00190 (QAO20_00190) | comEA | 30028..30714 (+) | 687 | WP_072491736.1 | ComEA family DNA-binding protein | Machinery gene |
| QAO20_RS00195 (QAO20_00195) | - | 30806..31267 (+) | 462 | WP_000439693.1 | ComE operon protein 2 | - |
| QAO20_RS00200 (QAO20_00200) | comEC | 31272..33473 (+) | 2202 | WP_043054889.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
Sequence
Protein
Length: 733 a.a.; Molecular weight: 84819.88 Da; Isoelectric point: 9.9910
>NTDB_id=818796 QAO20_RS00200 WP_043054889.1 31272..33473(+) (comEC) [Staphylococcus aureus strain IT-MSSA50]
MLYVALSMIVGVLWNSSKVLSTFLFILLLYIAYRKNKIVYAPISLFLIIFSSWYLHYSQQAIFNYINYIERNSQFNERAQ
VIHIQRQGSDTYKGRLSLKNEIYPFFLTNKMNFDLKKIESHNCIVKGQFKVNDNKFVTLKLQSIVVQSCLESNRSNLIEK
HKQFIMNRIYDSGIKFPDRIMALITGDVKEINEQFKERVKEIGIYHLLAVSGSHIAAIVFLIYQPLKRLNLPLFVIKGIT
IIVLALFAQYTNYAPSAVRAIIMTTLVLLITKQIKIKGIQLLAFAFIIMFILNPLVVYDIGFQFSFIISFFIMLLFPFLQ
QLSKLQSLFIITFIAQLASFIVAIPNFHQLQWVGFLSNLIFVPYYSIILFPLSILFFITSHFIVGLTPLNYLVDLSFNFH
DWLLDLFTRIKQSHFSVPKFNDWIFIIFIISVYYIFWLLAKRKYILVTFWTIIILTLLITFPTNSHHKITMLNVGQGDSI
LYEGGKNQNVLIDTGGKVIDDTKQPSYSISKYHILPTLNERGINELEYLILTHPHNDHIGEVEYIISHIKIKHIVIYNKG
YSSNTLMLLSKLSRKYNIKLMDVRQVSSFKLGDSSFLFFDSFIPNSRDKNEYSIITMITYQNKKVLLMGDASKNNESLLL
KKYNLPEIDILKVGHHGSKTSSSKEFIEMIKPKISLISSGKNNMYHLPNIEVVKRLQGIRSRIYNSQQNGQVTIDLDDNL
KVDSNSYGNASGL
Nucleotide
Length: 2202 bp
>NTDB_id=818796 QAO20_RS00200 WP_043054889.1 31272..33473(+) (comEC) [Staphylococcus aureus strain IT-MSSA50]
TTGCTGTATGTCGCGTTATCAATGATTGTAGGAGTACTTTGGAATTCTAGCAAAGTGCTCTCTACATTTCTTTTCATTTT
ACTTTTGTATATTGCTTATCGTAAAAATAAAATCGTTTATGCCCCTATTTCTCTCTTTTTAATCATTTTCTCCTCATGGT
ATTTACATTATTCACAACAAGCAATTTTTAATTATATCAATTATATTGAACGTAATTCTCAGTTTAATGAGCGTGCTCAA
GTAATCCACATTCAACGTCAAGGTAGTGACACATATAAAGGTAGGTTGAGTTTAAAAAATGAAATATATCCTTTCTTTTT
AACAAATAAAATGAATTTTGATTTAAAGAAAATTGAAAGTCATAATTGTATTGTTAAAGGACAATTCAAAGTTAATGACA
ATAAGTTTGTAACTCTTAAATTACAAAGTATAGTTGTACAAAGCTGCCTAGAATCAAACCGGTCTAATTTAATTGAGAAA
CATAAACAGTTTATAATGAATCGAATTTATGATTCGGGTATTAAGTTTCCGGATCGTATTATGGCATTGATTACTGGTGA
CGTAAAAGAAATTAATGAGCAATTTAAGGAACGTGTTAAAGAGATAGGTATATATCATTTGCTGGCAGTTAGTGGCTCGC
ATATAGCTGCAATTGTATTCTTAATTTACCAACCTTTAAAACGATTAAATTTACCTTTATTTGTCATTAAAGGAATTACA
ATCATTGTATTAGCTTTATTTGCTCAATACACAAATTATGCACCTAGTGCTGTAAGAGCTATAATAATGACAACTCTTGT
ACTGCTTATTACTAAGCAAATTAAAATAAAGGGTATTCAGCTATTAGCATTTGCATTTATAATTATGTTTATTTTAAATC
CGCTAGTTGTTTATGATATTGGATTTCAATTTTCATTCATCATTTCATTTTTTATTATGCTACTTTTCCCTTTTTTACAG
CAATTGTCAAAGTTACAATCATTATTCATAATTACGTTTATTGCACAATTAGCTTCATTTATCGTTGCCATTCCAAACTT
TCATCAACTTCAATGGGTGGGATTTTTATCTAATTTAATTTTTGTACCGTACTATTCGATTATATTGTTTCCGCTATCTA
TTTTATTCTTTATTACAAGTCATTTTATTGTGGGATTAACGCCGCTAAATTACTTGGTTGACCTAAGTTTTAATTTTCAT
GACTGGTTACTAGACCTATTCACAAGAATCAAGCAATCACATTTTTCTGTTCCCAAGTTTAATGATTGGATATTTATAAT
ATTTATAATTTCTGTTTATTACATATTTTGGTTATTGGCTAAACGTAAATATATATTGGTTACGTTTTGGACTATAATTA
TTCTGACATTATTAATTACGTTTCCAACAAATTCACATCACAAAATTACAATGTTAAATGTGGGGCAGGGAGACAGTATT
TTATATGAAGGTGGTAAGAACCAAAATGTCTTGATTGATACAGGTGGGAAAGTGATTGATGATACTAAACAACCTAGTTA
TTCAATTTCTAAATATCATATTTTACCAACGCTAAATGAAAGAGGGATAAATGAATTAGAGTATCTAATTTTAACACATC
CACACAATGACCATATTGGTGAAGTGGAATATATTATTAGTCATATTAAAATTAAACATATAGTGATATACAATAAGGGA
TATAGTAGTAATACATTGATGTTATTATCGAAATTAAGCCGTAAGTATAACATTAAACTTATGGATGTAAGACAAGTTAG
TAGTTTTAAACTTGGAGATAGTAGTTTTCTATTTTTTGATAGTTTTATTCCAAATAGCCGAGATAAAAATGAATATTCGA
TTATTACTATGATTACATATCAAAATAAAAAAGTTTTATTAATGGGCGATGCTAGTAAAAATAATGAATCTTTACTACTA
AAAAAATATAACTTGCCGGAGATTGATATTTTAAAAGTAGGACATCATGGGAGCAAGACAAGTAGTTCTAAAGAATTTAT
AGAGATGATTAAGCCTAAAATAAGTTTGATTTCTTCTGGAAAGAACAATATGTATCATCTTCCTAACATAGAAGTTGTTA
AACGATTGCAAGGGATTCGCAGTCGCATTTACAATAGCCAACAAAACGGTCAAGTTACAATAGACTTAGATGATAATTTA
AAAGTTGATTCAAACTCTTATGGAAATGCAAGTGGTTTATAG
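The nucleotide sequence begins with TTG rather than ATG, while the protein begins with M. In the bacterial genetic code (NCBI translation table 11), TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 and last 42 nucleotides of the record; the `translate` helper and the choice of start codons in `START_CODONS` are assumptions for illustration, not the database's pipeline.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment; '*' marks a stop codon."""
    cds = cds.upper()
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        codon = cds[pos:pos + 3]
        if pos == 0 and is_start and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


first = translate("TTGCTGTATGTCGCGTTATCAATGATTGTA")
last = translate("AAAGTTGATTCAAACTCTTATGGAAATGCAAGTGGTTTATAG", is_start=False)
print(first)  # MLYVALSMIV
print(last)   # KVDSNSYGNASGL*
```

Both fragments agree with the start and end of the protein sequence in this record, and the final TAG is the stop codon.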
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Staphylococcus aureus N315 | 98.636 | 100 | 0.986 |
| comEC | Staphylococcus aureus MW2 | 98.499 | 100 | 0.985 |
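This page does not define the Ha-value. The listed values are consistent with the product of fractional identity and fractional coverage, rounded to three decimals. The sketch below works under that assumption; `ha_value` is a hypothetical helper, not a documented formula from the database.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), rounded to 3 d.p."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


print(ha_value(98.636, 100))  # 0.986
print(ha_value(98.499, 100))  # 0.985
```

With 100% coverage, the Ha-value reduces to the identity expressed as a fraction, which is why both homologs score close to 1.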