Detailed information
Overview
| Name | endA | Type | Machinery gene |
| Locus tag | DK43_RS06775 | Genome accession | NZ_CP007573 |
| Coordinates | 1402587..1403432 (-) | Length | 281 a.a. |
| NCBI ID | WP_037608423.1 | Uniprot ID | - |
| Organism | Streptococcus anginosus strain SA1 | ||
| Function | cleavage of dsDNA into ssDNA (predicted from homology) DNA processing |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 1375372..1402297 | 1402587..1403432 | flank | 290 |
Gene organization within MGE regions
Location: 1375372..1403432
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DK43_RS06635 (DK43_06815) | - | 1375372..1376088 (-) | 717 | WP_022525104.1 | glycosyltransferase family 2 protein | - |
| DK43_RS06640 (DK43_06820) | - | 1376089..1377057 (-) | 969 | WP_003027705.1 | glycosyltransferase | - |
| DK43_RS06645 (DK43_06825) | - | 1377083..1378024 (-) | 942 | WP_003027706.1 | glycosyltransferase family 2 protein | - |
| DK43_RS06650 (DK43_06830) | - | 1378030..1379280 (-) | 1251 | WP_003027707.1 | polysaccharide biosynthesis C-terminal domain-containing protein | - |
| DK43_RS06655 (DK43_06835) | rfbD | 1379369..1380220 (-) | 852 | WP_003027708.1 | dTDP-4-dehydrorhamnose reductase | - |
| DK43_RS06660 (DK43_06845) | - | 1380341..1381549 (-) | 1209 | WP_022525105.1 | hypothetical protein | - |
| DK43_RS06665 (DK43_06850) | - | 1381683..1383167 (-) | 1485 | WP_003027710.1 | glucosyltransferase domain-containing protein | - |
| DK43_RS06670 (DK43_06855) | - | 1383164..1384090 (-) | 927 | WP_003027711.1 | glycosyltransferase family 2 protein | - |
| DK43_RS06675 (DK43_06860) | - | 1384220..1384810 (-) | 591 | WP_003027712.1 | class I SAM-dependent methyltransferase | - |
| DK43_RS06680 (DK43_06865) | galE | 1385117..1386136 (-) | 1020 | WP_003027713.1 | UDP-glucose 4-epimerase GalE | - |
| DK43_RS06685 (DK43_06870) | rfbB | 1386170..1387216 (-) | 1047 | WP_003039992.1 | dTDP-glucose 4,6-dehydratase | - |
| DK43_RS06690 (DK43_06875) | - | 1387247..1387840 (-) | 594 | WP_003027715.1 | dTDP-4-dehydrorhamnose 3,5-epimerase family protein | - |
| DK43_RS06695 (DK43_06880) | rfbA | 1387844..1388713 (-) | 870 | WP_003027717.1 | glucose-1-phosphate thymidylyltransferase RfbA | - |
| DK43_RS06700 (DK43_06885) | - | 1388914..1389740 (-) | 827 | Protein_1335 | ZIP family metal transporter | - |
| DK43_RS06710 (DK43_06890) | - | 1389759..1390235 (-) | 477 | WP_003027719.1 | 8-oxo-dGTP diphosphatase | - |
| DK43_RS06715 (DK43_06895) | - | 1390236..1391333 (-) | 1098 | WP_041331854.1 | FAD-dependent oxidoreductase | - |
| DK43_RS06720 (DK43_06900) | - | 1391346..1392143 (-) | 798 | WP_022525108.1 | Nif3-like dinuclear metal center hexameric protein | - |
| DK43_RS06725 (DK43_06905) | - | 1392130..1392819 (-) | 690 | WP_003027722.1 | tRNA (adenine(22)-N(1))-methyltransferase TrmK | - |
| DK43_RS06730 (DK43_06910) | - | 1392913..1393587 (-) | 675 | WP_003027723.1 | DnaD domain-containing protein | - |
| DK43_RS06735 (DK43_06915) | metA | 1393596..1394540 (-) | 945 | WP_003027724.1 | homoserine O-succinyltransferase | - |
| DK43_RS06740 (DK43_06920) | - | 1394640..1395152 (-) | 513 | WP_003025236.1 | adenine phosphoribosyltransferase | - |
| DK43_RS06745 (DK43_06925) | - | 1395270..1395965 (-) | 696 | WP_003027725.1 | LrgB family protein | - |
| DK43_RS06750 (DK43_06930) | - | 1395958..1396338 (-) | 381 | WP_022525109.1 | CidA/LrgA family protein | - |
| DK43_RS06760 (DK43_06940) | - | 1397068..1398069 (+) | 1002 | WP_003027727.1 | YdcF family protein | - |
| DK43_RS06765 (DK43_06945) | - | 1398771..1400543 (-) | 1773 | WP_003028080.1 | ABC transporter ATP-binding protein | - |
| DK43_RS06770 (DK43_06950) | - | 1400558..1402297 (-) | 1740 | WP_003028078.1 | ABC transporter ATP-binding protein | - |
| DK43_RS06775 (DK43_06955) | endA | 1402587..1403432 (-) | 846 | WP_037608423.1 | DNA/RNA non-specific endonuclease | Machinery gene |
Sequence
Protein
Download Length: 281 a.a. Molecular weight: 30717.39 Da Isoelectric Point: 10.5050
>NTDB_id=121215 DK43_RS06775 WP_037608423.1 1402587..1403432(-) (endA) [Streptococcus anginosus strain SA1]
MARKKSNQKVAQSVAGLVIALVLALGGYSFSNHHGSTKPSDSTAINRSIRTNHAAPSQELAQSVLTESVKRQLKGKIEWN
GAGAFTINENKTTLDAKVASVPYADNKTKLVRGQTVPTVANALLSKTTRQYRSREETGNRSTTWTPAGWHQVKHLSGEYN
HAVDRGHLLGYALIGNLKGFDASTSNPKNIAVQTAWANQANTSHSTGQNFYETKVRKALDNNKRVRYRVTLIYANEQDLV
PVGSHIEAKSSDSSLEMNVFVPNVQTGLRLNYQTGEVTVTN
MARKKSNQKVAQSVAGLVIALVLALGGYSFSNHHGSTKPSDSTAINRSIRTNHAAPSQELAQSVLTESVKRQLKGKIEWN
GAGAFTINENKTTLDAKVASVPYADNKTKLVRGQTVPTVANALLSKTTRQYRSREETGNRSTTWTPAGWHQVKHLSGEYN
HAVDRGHLLGYALIGNLKGFDASTSNPKNIAVQTAWANQANTSHSTGQNFYETKVRKALDNNKRVRYRVTLIYANEQDLV
PVGSHIEAKSSDSSLEMNVFVPNVQTGLRLNYQTGEVTVTN
Nucleotide
Download Length: 846 bp
>NTDB_id=121215 DK43_RS06775 WP_037608423.1 1402587..1403432(-) (endA) [Streptococcus anginosus strain SA1]
ATGGCGAGAAAGAAATCCAATCAGAAAGTAGCACAAAGTGTAGCTGGTTTGGTTATAGCACTGGTTCTTGCGCTGGGAGG
TTATTCTTTTAGCAATCATCATGGTAGTACGAAGCCTTCTGACAGCACTGCTATTAATCGAAGTATTCGAACGAATCATG
CAGCACCCAGTCAAGAATTAGCACAGAGTGTTTTGACAGAGTCCGTTAAACGGCAACTCAAAGGAAAGATTGAATGGAAT
GGGGCAGGAGCCTTCACAATCAATGAAAATAAAACAACATTGGATGCTAAAGTTGCAAGCGTTCCTTATGCGGATAATAA
GACTAAACTCGTCCGAGGTCAGACCGTTCCGACGGTTGCAAATGCTCTTTTATCCAAAACAACTCGCCAATATAGAAGTC
GTGAAGAAACAGGAAATCGCTCCACGACTTGGACGCCTGCTGGTTGGCATCAAGTCAAGCATTTATCAGGTGAATACAAC
CATGCAGTTGACCGAGGACATTTGTTGGGTTATGCTTTGATTGGTAACTTGAAAGGGTTTGATGCTTCGACCAGTAACCC
GAAAAATATAGCTGTGCAAACAGCTTGGGCTAATCAAGCAAATACTAGTCATTCTACGGGTCAAAATTTCTATGAAACAA
AGGTTCGCAAGGCACTAGACAATAATAAACGAGTTCGGTATCGGGTGACTTTGATTTATGCCAATGAACAGGATTTAGTG
CCAGTTGGTTCGCATATTGAAGCTAAATCAAGCGATAGTAGCTTGGAAATGAATGTCTTTGTTCCCAACGTACAGACAGG
ACTTCGGCTAAATTATCAAACAGGAGAAGTGACCGTTACCAACTAG
ATGGCGAGAAAGAAATCCAATCAGAAAGTAGCACAAAGTGTAGCTGGTTTGGTTATAGCACTGGTTCTTGCGCTGGGAGG
TTATTCTTTTAGCAATCATCATGGTAGTACGAAGCCTTCTGACAGCACTGCTATTAATCGAAGTATTCGAACGAATCATG
CAGCACCCAGTCAAGAATTAGCACAGAGTGTTTTGACAGAGTCCGTTAAACGGCAACTCAAAGGAAAGATTGAATGGAAT
GGGGCAGGAGCCTTCACAATCAATGAAAATAAAACAACATTGGATGCTAAAGTTGCAAGCGTTCCTTATGCGGATAATAA
GACTAAACTCGTCCGAGGTCAGACCGTTCCGACGGTTGCAAATGCTCTTTTATCCAAAACAACTCGCCAATATAGAAGTC
GTGAAGAAACAGGAAATCGCTCCACGACTTGGACGCCTGCTGGTTGGCATCAAGTCAAGCATTTATCAGGTGAATACAAC
CATGCAGTTGACCGAGGACATTTGTTGGGTTATGCTTTGATTGGTAACTTGAAAGGGTTTGATGCTTCGACCAGTAACCC
GAAAAATATAGCTGTGCAAACAGCTTGGGCTAATCAAGCAAATACTAGTCATTCTACGGGTCAAAATTTCTATGAAACAA
AGGTTCGCAAGGCACTAGACAATAATAAACGAGTTCGGTATCGGGTGACTTTGATTTATGCCAATGAACAGGATTTAGTG
CCAGTTGGTTCGCATATTGAAGCTAAATCAAGCGATAGTAGCTTGGAAATGAATGTCTTTGTTCCCAACGTACAGACAGG
ACTTCGGCTAAATTATCAAACAGGAGAAGTGACCGTTACCAACTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| endA | Streptococcus pneumoniae Rx1 |
76.106 |
80.427 |
0.612 |
| endA | Streptococcus pneumoniae D39 |
76.106 |
80.427 |
0.612 |
| endA | Streptococcus pneumoniae R6 |
76.106 |
80.427 |
0.612 |
| endA | Streptococcus pneumoniae TIGR4 |
76.106 |
80.427 |
0.612 |