Detailed information
Overview
| Name | comEC | Type | Machinery gene |
| Locus tag | NX050_RS08795 | Genome accession | NZ_CP103456 |
| Coordinates | 1664343..1666673 (+) | Length | 776 a.a. |
| NCBI ID | WP_033881047.1 | Uniprot ID | - |
| Organism | Bacillus subtilis strain PN176 (HK176) | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
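The coordinates and length in the overview are consistent with each other, which can be checked with a small sketch. It assumes inclusive, 1-based coordinates and a coding sequence whose final codon is a stop codon; the function names `parse_location` and `cds_lengths` are hypothetical helpers, not part of any database tooling.

```python
import re


def parse_location(location: str) -> tuple[int, int, str]:
    """Parse a location string such as '1664343..1666673 (+)'."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def cds_lengths(location: str) -> tuple[int, int]:
    """Return (nucleotide length, amino-acid length) for a CDS location.

    Assumes inclusive coordinates and that the last codon is a stop codon,
    so the protein has one residue fewer than the number of codons.
    """
    start, end, _strand = parse_location(location)
    nt_length = end - start + 1
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nt_length, nt_length // 3 - 1


print(cds_lengths("1664343..1666673 (+)"))  # (2331, 776)
```

The same arithmetic reproduces the "Size (bp)" column of the gene organization table, for example 1627505..1628326 gives 822 bp.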
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicts in this genome, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 1627505..1666673 | 1664343..1666673 | within | 0 |
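The "Relative position" and "Distance (bp)" columns can be reproduced from the two coordinate ranges. The sketch below is one plausible way to derive them, not the database's documented procedure: only the label "within" appears in this record, so the labels "upstream", "downstream" and "overlapping" and the function name `relative_position` are assumptions made for illustration.

```python
def relative_position(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene interval relative to an MGE interval.

    Both intervals are (start, end) with inclusive coordinates.
    Returns a (label, distance in bp) pair. Labels other than
    'within' are hypothetical.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


# comEC versus the predicted prophage region from the table above
print(relative_position((1664343, 1666673), (1627505, 1666673)))  # ('within', 0)
```

Note that the gene ends at exactly the same coordinate as the prophage region (1666673), so it sits at the region's boundary while still counting as contained.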
Gene organization within MGE regions
Location: 1627505..1666673
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| NX050_RS08530 (NX050_08530) | bltR | 1627505..1628326 (+) | 822 | WP_014480349.1 | multidrug efflux transcriptional regulator BltR | - |
| NX050_RS08535 (NX050_08535) | - | 1628534..1628714 (+) | 181 | Protein_1674 | hypothetical protein | - |
| NX050_RS08540 (NX050_08540) | - | 1629023..1629253 (+) | 231 | WP_224588644.1 | hypothetical protein | - |
| NX050_RS08545 (NX050_08545) | - | 1629398..1629646 (+) | 249 | Protein_1676 | hypothetical protein | - |
| NX050_RS08550 (NX050_08550) | - | 1629609..1629731 (+) | 123 | Protein_1677 | RusA family crossover junction endodeoxyribonuclease | - |
| NX050_RS08555 (NX050_08555) | - | 1629814..1629966 (+) | 153 | WP_049832653.1 | XtrA/YqaO family protein | - |
| NX050_RS08560 (NX050_08560) | - | 1630105..1630371 (-) | 267 | WP_033881358.1 | hypothetical protein | - |
| NX050_RS08565 (NX050_08565) | - | 1630988..1631467 (+) | 480 | WP_014480344.1 | hypothetical protein | - |
| NX050_RS08575 (NX050_08575) | - | 1631860..1632165 (-) | 306 | WP_123772463.1 | hypothetical protein | - |
| NX050_RS08580 (NX050_08580) | terS | 1632292..1632856 (+) | 565 | Protein_1682 | phage terminase small subunit | - |
| NX050_RS08585 (NX050_08585) | - | 1632816..1633291 (+) | 476 | Protein_1683 | phage tail tube protein | - |
| NX050_RS08590 (NX050_08590) | - | 1633578..1633664 (-) | 87 | WP_072592549.1 | putative holin-like toxin | - |
| NX050_RS22555 | - | 1633839..1634005 (+) | 167 | Protein_1685 | peptidoglycan-binding protein | - |
| NX050_RS08600 (NX050_08600) | istB | 1634257..1635015 (-) | 759 | WP_014479891.1 | IS21-like element helper ATPase IstB | - |
| NX050_RS08605 (NX050_08605) | istA | 1635012..1636559 (-) | 1548 | WP_014480339.1 | IS21 family transposase | - |
| NX050_RS08610 (NX050_08610) | - | 1637400..1637690 (-) | 291 | WP_014480337.1 | contact-dependent growth inhibition system immunity protein | - |
| NX050_RS08615 (NX050_08615) | atxG | 1637800..1638377 (-) | 578 | Protein_1689 | suppressor of fused domain protein | - |
| NX050_RS08620 (NX050_08620) | - | 1638645..1638878 (-) | 234 | WP_224588641.1 | hypothetical protein | - |
| NX050_RS08625 (NX050_08625) | - | 1638967..1639170 (+) | 204 | WP_123772462.1 | hypothetical protein | - |
| NX050_RS08630 (NX050_08630) | - | 1639478..1639957 (-) | 480 | WP_224588637.1 | hypothetical protein | - |
| NX050_RS08635 (NX050_08635) | cdiI | 1640062..1640421 (-) | 360 | WP_014480334.1 | ribonuclease toxin immunity protein CdiI | - |
| NX050_RS08640 (NX050_08640) | - | 1640518..1640970 (-) | 453 | WP_014480333.1 | SMI1/KNR4 family protein | - |
| NX050_RS08645 (NX050_08645) | - | 1641069..1641509 (-) | 441 | WP_014480332.1 | SMI1/KNR4 family protein | - |
| NX050_RS08650 (NX050_08650) | - | 1641912..1642199 (-) | 288 | WP_014480331.1 | hypothetical protein | - |
| NX050_RS22560 (NX050_08655) | - | 1642213..1642920 (-) | 708 | WP_014480330.1 | hypothetical protein | - |
| NX050_RS22565 (NX050_08660) | - | 1643306..1644139 (-) | 834 | Protein_1698 | ribonuclease YeeF family protein | - |
| NX050_RS08665 (NX050_08665) | - | 1644321..1645448 (+) | 1128 | WP_014480328.1 | Rap family tetratricopeptide repeat protein | - |
| NX050_RS08675 (NX050_08675) | - | 1646188..1646583 (-) | 396 | WP_014480327.1 | VOC family protein | - |
| NX050_RS08680 (NX050_08680) | - | 1647525..1648415 (-) | 891 | WP_014480326.1 | LysR family transcriptional regulator | - |
| NX050_RS08685 (NX050_08685) | fumC | 1648582..1649970 (+) | 1389 | WP_014480325.1 | class II fumarate hydratase | - |
| NX050_RS08690 (NX050_08690) | - | 1650189..1650401 (+) | 213 | Protein_1703 | recombinase family protein | - |
| NX050_RS08695 (NX050_08695) | - | 1650398..1650496 (-) | 99 | WP_031600702.1 | hypothetical protein | - |
| NX050_RS08700 (NX050_08700) | sigK | 1650496..1651224 (-) | 729 | WP_013308023.1 | RNA polymerase sporulation sigma factor SigK | - |
| NX050_RS08705 (NX050_08705) | nucA/comI | 1651420..1651830 (+) | 411 | WP_009967785.1 | sporulation-specific Dnase NucB | Machinery gene |
| NX050_RS08710 (NX050_08710) | yqeB | 1651863..1652585 (-) | 723 | WP_014480321.1 | hypothetical protein | - |
| NX050_RS08715 (NX050_08715) | gnd | 1652836..1653729 (+) | 894 | WP_014480320.1 | phosphogluconate dehydrogenase (NAD(+)-dependent, decarboxylating) | - |
| NX050_RS08720 (NX050_08720) | yqeD | 1653748..1654374 (-) | 627 | WP_014480319.1 | TVP38/TMEM64 family protein | - |
| NX050_RS08725 (NX050_08725) | cwlH | 1654561..1655313 (+) | 753 | WP_014480318.1 | N-acetylmuramoyl-L-alanine amidase CwlH | - |
| NX050_RS08730 (NX050_08730) | yqeF | 1655565..1656296 (+) | 732 | WP_003229964.1 | SGNH/GDSL hydrolase family protein | - |
| NX050_RS08735 (NX050_08735) | - | 1656602..1656742 (-) | 141 | WP_003226124.1 | sporulation histidine kinase inhibitor Sda | - |
| NX050_RS08740 (NX050_08740) | yqeG | 1657104..1657622 (+) | 519 | WP_003226126.1 | YqeG family HAD IIIA-type phosphatase | - |
| NX050_RS08745 (NX050_08745) | yqeH | 1657626..1658726 (+) | 1101 | WP_003229966.1 | ribosome biogenesis GTPase YqeH | - |
| NX050_RS08750 (NX050_08750) | aroE | 1658744..1659586 (+) | 843 | WP_014480317.1 | shikimate dehydrogenase | - |
| NX050_RS08755 (NX050_08755) | yhbY | 1659580..1659870 (+) | 291 | WP_003226133.1 | ribosome assembly RNA-binding protein YhbY | - |
| NX050_RS08760 (NX050_08760) | nadD | 1659882..1660451 (+) | 570 | WP_004398676.1 | nicotinate-nucleotide adenylyltransferase | - |
| NX050_RS08765 (NX050_08765) | yqeK | 1660441..1661001 (+) | 561 | WP_014480316.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | - |
| NX050_RS08770 (NX050_08770) | rsfS | 1661019..1661375 (+) | 357 | WP_014480315.1 | ribosome silencing factor | - |
| NX050_RS08775 (NX050_08775) | yqeM | 1661372..1662115 (+) | 744 | WP_014480314.1 | class I SAM-dependent methyltransferase | - |
| NX050_RS08780 (NX050_08780) | comER | 1662181..1663002 (-) | 822 | WP_014480313.1 | late competence protein ComER | - |
| NX050_RS08785 (NX050_08785) | comEA | 1663086..1663703 (+) | 618 | WP_014480312.1 | competence protein ComEA | Machinery gene |
| NX050_RS08790 (NX050_08790) | comEB | 1663770..1664339 (+) | 570 | WP_003229978.1 | ComE operon protein 2 | - |
| NX050_RS08795 (NX050_08795) | comEC | 1664343..1666673 (+) | 2331 | WP_033881047.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene |
Sequence
Protein
Length: 776 a.a.; Molecular weight: 86669.18 Da; Isoelectric point: 7.0687
>NTDB_id=722873 NX050_RS08795 WP_033881047.1 1664343..1666673(+) (comEC) [Bacillus subtilis strain PN176 (HK176)]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSVSFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPG
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN
Nucleotide
Length: 2331 bp
>NTDB_id=722873 NX050_RS08795 WP_033881047.1 1664343..1666673(+) (comEC) [Bacillus subtilis strain PN176 (HK176)]
ATGCGTAATTCGCGCTTATTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTCATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTGAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATTCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGTTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGGCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGGA
GCACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGACCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA
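The nucleotide record can be checked against the protein record by translating it. The following is a minimal sketch that builds the standard codon table (the amino-acid assignments are the same in the bacterial table, which differs only in permitted start codons) and translates two short fragments copied from the sequence above: the first 30 nucleotides and the last 12. The function name `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str) -> str:
    """Translate a DNA string codon by codon; stop codons become '*'."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt of the comEC coding sequence
print(translate("ATGCGTAATTCGCGCTTATTATTGCCTATG"))  # MRNSRLLLPM
# Last 12 nt, ending in the TAA stop codon
print(translate("GAGACGAACTAA"))  # ETN*
```

Both fragments agree with the protein sequence, which starts with MRNSRLLLPM and ends with ETN.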
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comEC | Bacillus subtilis subsp. subtilis str. 168 | 98.454 | 100 | 0.985 |
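The Ha-value in this row is numerically consistent with the product of identity and coverage expressed as fractions. The record itself does not define the metric, so the formula in the sketch below is an assumption, and `ha_value` is a hypothetical helper name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction.

    This formula is inferred from the numbers in the table, not taken
    from the record's documentation.
    """
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


print(ha_value(98.454, 100))  # 0.985
```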