Detailed information    

In silico: bioinformatically predicted

Overview


Name   endA
Type   Machinery gene
Locus tag   H1W98_RS05710
Genome accession   NZ_LR822026
Coordinates   1082483..1083364 (-)
Length   293 a.a.
NCBI ID   WP_084828863.1
UniProt ID   A0A0E2QH56
Organism   Streptococcus thermophilus isolate STH_CIRM_967
Function   cleavage of dsDNA into ssDNA (predicted from homology); DNA processing
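
The coordinates and the protein length in this overview can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Number of residues encoded by a CDS, excluding the terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates of endA (H1W98_RS05710) as listed in the overview.
length_bp = feature_length_bp(1082483, 1083364)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 882 293
```

Both values agree with the lengths reported in the Sequence section (882 bp and 293 a.a.).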

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1071841..1109258 1082483..1083364 within 0
IScluster/Tn 1078277..1082234 1082483..1083364 flank 249
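
The relative position and distance columns can be reproduced from the two coordinate pairs. The following is a minimal sketch, assuming the distance is the difference between the nearest edges of the gene and the MGE (which matches the 249 bp in the table); the `relate` function and the "overlap" label are hypothetical and do not come from VRprofile2.

```python
from typing import Tuple

Interval = Tuple[int, int]  # 1-based inclusive (start, end)


def relate(gene: Interval, mge: Interval) -> Tuple[str, int]:
    """Classify a gene relative to an MGE and return (relative position, distance in bp)."""
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return ("within", 0)
    if g_start > m_end:  # gene lies downstream of the MGE
        return ("flank", g_start - m_end)
    if g_end < m_start:  # gene lies upstream of the MGE
        return ("flank", m_start - g_end)
    return ("overlap", 0)  # partial overlap; label assumed for illustration


end_a = (1082483, 1083364)
print(relate(end_a, (1071841, 1109258)))  # ('within', 0)
print(relate(end_a, (1078277, 1082234)))  # ('flank', 249)
```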


Gene organization within MGE regions


Location: 1071841..1109258
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H1W98_RS05645 (STHERMO_1271) vicX 1071841..1072653 (-) 813 WP_179971933.1 MBL fold metallo-hydrolase Regulator
  H1W98_RS05650 (STHERMO_1272) vicK 1072659..1073999 (-) 1341 WP_179971934.1 cell wall metabolism sensor histidine kinase VicK Regulator
  H1W98_RS05655 (STHERMO_1273) vicR 1073992..1074699 (-) 708 WP_179971935.1 response regulator YycF Regulator
  H1W98_RS05660 (STHERMO_1275) - 1075223..1075990 (+) 768 WP_180482604.1 amino acid ABC transporter ATP-binding protein -
  H1W98_RS05665 (STHERMO_1276) - 1076002..1076835 (+) 834 WP_180482603.1 transporter substrate-binding domain-containing protein -
  H1W98_RS05670 (STHERMO_1277) - 1076853..1077551 (+) 699 WP_179971938.1 amino acid ABC transporter permease -
  H1W98_RS05675 (STHERMO_1278) - 1077563..1078213 (+) 651 WP_179971939.1 amino acid ABC transporter permease -
  H1W98_RS05680 (STHERMO_1280) - 1078277..1079533 (-) 1257 WP_180483860.1 ISL3 family transposase -
  H1W98_RS05685 (STHERMO_1281) - 1079665..1080141 (-) 477 WP_179971941.1 transposase -
  H1W98_RS11365 - 1080184..1080342 (+) 159 WP_179973363.1 hypothetical protein -
  H1W98_RS05695 - 1080395..1080922 (+) 528 WP_179971942.1 HdeD family acid-resistance protein -
  H1W98_RS10815 (STHERMO_1285) - 1081051..1082308 (-) 1258 Protein_1070 ISL3 family transposase -
  H1W98_RS05710 (STHERMO_1287) endA 1082483..1083364 (-) 882 WP_084828863.1 DNA/RNA non-specific endonuclease Machinery gene
  H1W98_RS05715 (STHERMO_1288) - 1083426..1083614 (-) 189 WP_011226090.1 DNA-directed RNA polymerase subunit beta -
  H1W98_RS05720 (STHERMO_1289) murA 1083616..1084887 (-) 1272 WP_180482602.1 UDP-N-acetylglucosamine 1-carboxyvinyltransferase -
  H1W98_RS05725 (STHERMO_1290) - 1084955..1085191 (-) 237 WP_002946959.1 DUF1146 family protein -
  H1W98_RS05730 (STHERMO_1291) galE 1085379..1086392 (-) 1014 WP_180482601.1 UDP-glucose 4-epimerase GalE -
  H1W98_RS05735 (STHERMO_1292) - 1086407..1086862 (-) 456 WP_180482600.1 DUF4832 domain-containing protein -
  H1W98_RS05740 (STHERMO_1293) metK 1087886..1089079 (-) 1194 WP_180482599.1 methionine adenosyltransferase -
  H1W98_RS05745 (STHERMO_1294) birA 1089359..1090297 (+) 939 WP_011681223.1 bifunctional biotin--[acetyl-CoA-carboxylase] ligase/biotin operon repressor BirA -
  H1W98_RS05750 (STHERMO_1295) - 1090284..1090469 (-) 186 WP_011681224.1 DUF3272 family protein -
  H1W98_RS05755 (STHERMO_1296) dnaX 1090608..1092260 (-) 1653 WP_179971950.1 DNA polymerase III subunit gamma/tau -
  H1W98_RS05760 (STHERMO_1297) - 1092260..1092760 (-) 501 WP_171815269.1 GAF domain-containing protein -
  H1W98_RS05765 (STHERMO_1298) miaA 1092805..1093704 (-) 900 WP_179971952.1 tRNA (adenosine(37)-N6)-dimethylallyltransferase MiaA -
  H1W98_RS05770 (STHERMO_1299) - 1093781..1093957 (+) 177 WP_014727550.1 DUF3042 family protein -
  H1W98_RS05775 (STHERMO_1301) - 1094353..1094547 (-) 195 WP_180482598.1 hypothetical protein -
  H1W98_RS05780 (STHERMO_1302) - 1094560..1094769 (-) 210 WP_180482597.1 helix-turn-helix domain-containing protein -
  H1W98_RS05785 (STHERMO_1303) - 1095076..1095426 (+) 351 WP_180482596.1 helix-turn-helix domain-containing protein -
  H1W98_RS05790 (STHERMO_1304) - 1095429..1096310 (+) 882 WP_180482595.1 hypothetical protein -
  H1W98_RS05795 (STHERMO_1306) - 1096556..1096924 (+) 369 WP_180482594.1 ImmA/IrrE family metallo-endopeptidase -
  H1W98_RS05800 (STHERMO_1307) - 1097008..1097619 (+) 612 WP_232086966.1 CD20-like domain-containing protein -
  H1W98_RS05805 (STHERMO_1308) - 1097724..1098461 (+) 738 WP_232086965.1 Arm DNA-binding domain-containing protein -
  H1W98_RS10820 - 1098589..1098765 (+) 177 WP_232086976.1 tyrosine-type recombinase/integrase -
  H1W98_RS05815 (STHERMO_1311) - 1098936..1100185 (+) 1250 Protein_1092 ISL3 family transposase -
  H1W98_RS05825 (STHERMO_1312) rplS 1100357..1100704 (-) 348 WP_002950935.1 50S ribosomal protein L19 -
  H1W98_RS05830 (STHERMO_1313) - 1100832..1102058 (-) 1227 WP_180482593.1 voltage-gated chloride channel family protein -
  H1W98_RS05835 (STHERMO_1314) - 1102068..1102340 (-) 273 WP_011226101.1 chorismate mutase -
  H1W98_RS05840 (STHERMO_1315) - 1102415..1103950 (-) 1536 WP_180482592.1 ClC family H(+)/Cl(-) exchange transporter -
  H1W98_RS05845 (STHERMO_1316) - 1104001..1104444 (-) 444 WP_002888085.1 flavodoxin -
  H1W98_RS05850 (STHERMO_1318) - 1104529..1104900 (-) 372 WP_179971956.1 hypothetical protein -
  H1W98_RS05855 (STHERMO_1319) - 1104912..1105631 (-) 720 WP_180482591.1 matrixin family metalloprotease -
  H1W98_RS05860 (STHERMO_1320) - 1105751..1106533 (-) 783 WP_180482590.1 alpha/beta hydrolase -
  H1W98_RS05865 (STHERMO_1321) - 1106658..1108319 (-) 1662 WP_179971958.1 ribonuclease J -
  H1W98_RS05870 (STHERMO_1322) - 1108500..1109258 (-) 759 WP_179971959.1 ABC transporter ATP-binding protein -

Sequence


Protein


Length: 293 a.a.   Molecular weight: 31768.48 Da   Isoelectric point: 10.0031

>NTDB_id=1131285 H1W98_RS05710 WP_084828863.1 1082483..1083364(-) (endA) [Streptococcus thermophilus isolate STH_CIRM_967]
MAKGKTSLTNKQKRQFVTLIIAALIAVLGYLGTSNKLSPDNPIRQVASLVSGKSNKSIKSNYLTNSQSTPQEQLAETVMT
SSVKSQLGNSLEWNGAGAYIINGNKTNLNAKVSSQPYANNQVKTVQGQTVPTVANALLSKATRQYKNRQETGNGSTSWTP
AGWHQVQNLSGEYSHAVDRGHLLGYALIGGLKGFDASTSNPENIAVQTAWANQAYSDNSTGQNYYESLVRKALDKNKRVR
YRVTLIYEGENLIPSGTHLEAKSADGSLEFNVFVPNVQEGITLDYYSGKVTVN

Nucleotide


Length: 882 bp

>NTDB_id=1131285 H1W98_RS05710 WP_084828863.1 1082483..1083364(-) (endA) [Streptococcus thermophilus isolate STH_CIRM_967]
ATGGCAAAAGGAAAAACATCTTTGACAAATAAACAGAAACGTCAGTTCGTCACATTAATAATAGCAGCTTTGATAGCTGT
TTTAGGTTACCTAGGAACCAGCAACAAGCTCTCGCCTGATAATCCTATCAGACAAGTAGCAAGCCTAGTTTCAGGAAAAT
CTAATAAATCCATTAAATCTAATTATTTAACAAATAGTCAGTCTACACCACAAGAGCAGCTTGCTGAGACAGTAATGACT
AGTTCAGTCAAAAGCCAACTGGGTAATTCCTTGGAATGGAATGGAGCTGGAGCCTATATTATTAATGGAAATAAGACGAA
TCTAAATGCTAAGGTATCAAGCCAACCTTATGCTAATAACCAAGTCAAAACGGTTCAAGGACAAACTGTTCCTACAGTTG
CTAATGCCTTATTAAGTAAAGCAACTAGACAGTACAAAAATCGTCAAGAAACAGGGAATGGTTCAACCTCTTGGACACCA
GCTGGATGGCACCAGGTTCAAAATTTATCAGGAGAGTATAGCCATGCCGTAGATAGAGGCCATTTGCTAGGTTATGCACT
GATAGGTGGTTTGAAAGGATTTGATGCCTCGACTTCAAATCCAGAGAATATTGCTGTTCAAACTGCTTGGGCAAATCAAG
CATATAGCGATAATTCTACAGGTCAAAACTATTATGAAAGCCTAGTACGAAAAGCTCTGGATAAAAACAAACGTGTTCGT
TATCGTGTGACCTTGATTTATGAAGGAGAGAACCTCATTCCATCAGGAACACATCTTGAAGCTAAGTCAGCAGACGGATC
ACTTGAGTTTAATGTCTTTGTTCCAAATGTACAAGAGGGAATAACCCTAGATTACTATTCAGGAAAAGTAACGGTGAATT
GA
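
Although the gene lies on the (-) strand, the nucleotide record above starts with ATG and ends with TGA, so it appears to be given in coding orientation already. As a rough consistency check, the sketch below translates the first and last codons with the standard codon assignments (the amino acid assignments are the same in the bacterial table 11) and compares them with the protein record. The `translate` helper is illustrative only; a full check would pass the entire 882 bp sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGCAAAAGGAAAAACATCTTTGACAAAT"  # nucleotides 1..30 of the record
last_codons = "GTAACGGTGAATTGA"                  # nucleotides 868..882 of the record
print(translate(first_codons))  # MAKGKTSLTN
print(translate(last_codons))   # VTVN
```

The two fragments match the start (MAKGKTSLTN) and the end (VTVN) of the listed protein sequence.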


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0E2QH56

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  endA Streptococcus pneumoniae Rx1 75.893 76.451 0.58
  endA Streptococcus pneumoniae D39 75.893 76.451 0.58
  endA Streptococcus pneumoniae R6 75.893 76.451 0.58
  endA Streptococcus pneumoniae TIGR4 75.893 76.451 0.58
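
The page does not define the Ha-value. The listed numbers are consistent with it being the product of fractional identity and fractional coverage, and the sketch below illustrates that reading. This definition is an assumption inferred from the table, and `ha_value` is a hypothetical helper, not a database function.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity multiplied by fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values listed for the S. pneumoniae endA hits.
ha = ha_value(75.893, 76.451)
print(round(ha, 2))  # 0.58
```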


Multiple sequence alignment