Detailed information    

insolico Bioinformatically predicted

Overview


Name   endA   Type   Machinery gene
Locus tag   DK43_RS06775 Genome accession   NZ_CP007573
Coordinates   1402587..1403432 (-) Length   281 a.a.
NCBI ID   WP_037608423.1    Uniprot ID   -
Organism   Streptococcus anginosus strain SA1     
Function   cleavage of dsDNA into ssDNA (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1375372..1402297 1402587..1403432 flank 290


Gene organization within MGE regions


Location: 1375372..1403432
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DK43_RS06635 (DK43_06815) - 1375372..1376088 (-) 717 WP_022525104.1 glycosyltransferase family 2 protein -
  DK43_RS06640 (DK43_06820) - 1376089..1377057 (-) 969 WP_003027705.1 glycosyltransferase -
  DK43_RS06645 (DK43_06825) - 1377083..1378024 (-) 942 WP_003027706.1 glycosyltransferase family 2 protein -
  DK43_RS06650 (DK43_06830) - 1378030..1379280 (-) 1251 WP_003027707.1 polysaccharide biosynthesis C-terminal domain-containing protein -
  DK43_RS06655 (DK43_06835) rfbD 1379369..1380220 (-) 852 WP_003027708.1 dTDP-4-dehydrorhamnose reductase -
  DK43_RS06660 (DK43_06845) - 1380341..1381549 (-) 1209 WP_022525105.1 hypothetical protein -
  DK43_RS06665 (DK43_06850) - 1381683..1383167 (-) 1485 WP_003027710.1 glucosyltransferase domain-containing protein -
  DK43_RS06670 (DK43_06855) - 1383164..1384090 (-) 927 WP_003027711.1 glycosyltransferase family 2 protein -
  DK43_RS06675 (DK43_06860) - 1384220..1384810 (-) 591 WP_003027712.1 class I SAM-dependent methyltransferase -
  DK43_RS06680 (DK43_06865) galE 1385117..1386136 (-) 1020 WP_003027713.1 UDP-glucose 4-epimerase GalE -
  DK43_RS06685 (DK43_06870) rfbB 1386170..1387216 (-) 1047 WP_003039992.1 dTDP-glucose 4,6-dehydratase -
  DK43_RS06690 (DK43_06875) - 1387247..1387840 (-) 594 WP_003027715.1 dTDP-4-dehydrorhamnose 3,5-epimerase family protein -
  DK43_RS06695 (DK43_06880) rfbA 1387844..1388713 (-) 870 WP_003027717.1 glucose-1-phosphate thymidylyltransferase RfbA -
  DK43_RS06700 (DK43_06885) - 1388914..1389740 (-) 827 Protein_1335 ZIP family metal transporter -
  DK43_RS06710 (DK43_06890) - 1389759..1390235 (-) 477 WP_003027719.1 8-oxo-dGTP diphosphatase -
  DK43_RS06715 (DK43_06895) - 1390236..1391333 (-) 1098 WP_041331854.1 FAD-dependent oxidoreductase -
  DK43_RS06720 (DK43_06900) - 1391346..1392143 (-) 798 WP_022525108.1 Nif3-like dinuclear metal center hexameric protein -
  DK43_RS06725 (DK43_06905) - 1392130..1392819 (-) 690 WP_003027722.1 tRNA (adenine(22)-N(1))-methyltransferase TrmK -
  DK43_RS06730 (DK43_06910) - 1392913..1393587 (-) 675 WP_003027723.1 DnaD domain-containing protein -
  DK43_RS06735 (DK43_06915) metA 1393596..1394540 (-) 945 WP_003027724.1 homoserine O-succinyltransferase -
  DK43_RS06740 (DK43_06920) - 1394640..1395152 (-) 513 WP_003025236.1 adenine phosphoribosyltransferase -
  DK43_RS06745 (DK43_06925) - 1395270..1395965 (-) 696 WP_003027725.1 LrgB family protein -
  DK43_RS06750 (DK43_06930) - 1395958..1396338 (-) 381 WP_022525109.1 CidA/LrgA family protein -
  DK43_RS06760 (DK43_06940) - 1397068..1398069 (+) 1002 WP_003027727.1 YdcF family protein -
  DK43_RS06765 (DK43_06945) - 1398771..1400543 (-) 1773 WP_003028080.1 ABC transporter ATP-binding protein -
  DK43_RS06770 (DK43_06950) - 1400558..1402297 (-) 1740 WP_003028078.1 ABC transporter ATP-binding protein -
  DK43_RS06775 (DK43_06955) endA 1402587..1403432 (-) 846 WP_037608423.1 DNA/RNA non-specific endonuclease Machinery gene

Sequence


Protein


Download         Length: 281 a.a.        Molecular weight: 30717.39 Da        Isoelectric Point: 10.5050

>NTDB_id=121215 DK43_RS06775 WP_037608423.1 1402587..1403432(-) (endA) [Streptococcus anginosus strain SA1]
MARKKSNQKVAQSVAGLVIALVLALGGYSFSNHHGSTKPSDSTAINRSIRTNHAAPSQELAQSVLTESVKRQLKGKIEWN
GAGAFTINENKTTLDAKVASVPYADNKTKLVRGQTVPTVANALLSKTTRQYRSREETGNRSTTWTPAGWHQVKHLSGEYN
HAVDRGHLLGYALIGNLKGFDASTSNPKNIAVQTAWANQANTSHSTGQNFYETKVRKALDNNKRVRYRVTLIYANEQDLV
PVGSHIEAKSSDSSLEMNVFVPNVQTGLRLNYQTGEVTVTN

Nucleotide


Download         Length: 846 bp        

>NTDB_id=121215 DK43_RS06775 WP_037608423.1 1402587..1403432(-) (endA) [Streptococcus anginosus strain SA1]
ATGGCGAGAAAGAAATCCAATCAGAAAGTAGCACAAAGTGTAGCTGGTTTGGTTATAGCACTGGTTCTTGCGCTGGGAGG
TTATTCTTTTAGCAATCATCATGGTAGTACGAAGCCTTCTGACAGCACTGCTATTAATCGAAGTATTCGAACGAATCATG
CAGCACCCAGTCAAGAATTAGCACAGAGTGTTTTGACAGAGTCCGTTAAACGGCAACTCAAAGGAAAGATTGAATGGAAT
GGGGCAGGAGCCTTCACAATCAATGAAAATAAAACAACATTGGATGCTAAAGTTGCAAGCGTTCCTTATGCGGATAATAA
GACTAAACTCGTCCGAGGTCAGACCGTTCCGACGGTTGCAAATGCTCTTTTATCCAAAACAACTCGCCAATATAGAAGTC
GTGAAGAAACAGGAAATCGCTCCACGACTTGGACGCCTGCTGGTTGGCATCAAGTCAAGCATTTATCAGGTGAATACAAC
CATGCAGTTGACCGAGGACATTTGTTGGGTTATGCTTTGATTGGTAACTTGAAAGGGTTTGATGCTTCGACCAGTAACCC
GAAAAATATAGCTGTGCAAACAGCTTGGGCTAATCAAGCAAATACTAGTCATTCTACGGGTCAAAATTTCTATGAAACAA
AGGTTCGCAAGGCACTAGACAATAATAAACGAGTTCGGTATCGGGTGACTTTGATTTATGCCAATGAACAGGATTTAGTG
CCAGTTGGTTCGCATATTGAAGCTAAATCAAGCGATAGTAGCTTGGAAATGAATGTCTTTGTTCCCAACGTACAGACAGG
ACTTCGGCTAAATTATCAAACAGGAGAAGTGACCGTTACCAACTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  endA Streptococcus pneumoniae Rx1

76.106

80.427

0.612

  endA Streptococcus pneumoniae D39

76.106

80.427

0.612

  endA Streptococcus pneumoniae R6

76.106

80.427

0.612

  endA Streptococcus pneumoniae TIGR4

76.106

80.427

0.612


Multiple sequence alignment