Detailed information    

In silico: bioinformatically predicted

Overview


Name   endA
Type   Machinery gene
Locus tag   NQ504_RS05135
Genome accession   NZ_CP102284
Coordinates   1017378..1018202 (+)
Length   274 a.a.
NCBI ID   WP_003698166.1
UniProt ID   E7FPE3
Organism   Ligilactobacillus ruminis strain ATCC 25644
Function   cleavage of dsDNA into ssDNA (predicted from homology); DNA processing
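
The coordinates and the protein length in this overview are mutually consistent: the inclusive range 1017378..1018202 spans 825 bp, that is 275 codons, and dropping the terminal stop codon leaves 274 residues. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1017378, 1018202)
print(cds_bp, protein_length(cds_bp))  # 825 274
```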

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Genomic island   1015729..1027655   1017378..1018202   within   0
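
The "within" and 0 bp entries follow directly from the coordinates: the gene range lies entirely inside the genomic island range. The sketch below shows one way such a classification could be derived. It is an assumption for illustration, not necessarily how VRprofile2 or the database computes it; only the label "within" appears in this record, and the other labels and the distance convention are hypothetical.

```python
def relative_position(gene, mge):
    """Classify a gene range against an MGE range (inclusive coordinates).

    Returns (label, distance_bp). Only "within" is attested in the record;
    the remaining labels and the gap-based distance are illustrative.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


print(relative_position((1017378, 1018202), (1015729, 1027655)))
```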


Gene organization within MGE regions


Location: 1015729..1027655
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
NQ504_RS05120 (NQ504_05120)   -   1015729..1016310 (+)   582   WP_003698160.1   GNAT family N-acetyltransferase   -
NQ504_RS05125 (NQ504_05125)   -   1016373..1016813 (+)   441   WP_225430485.1   hypothetical protein   -
NQ504_RS10860   tnpB   1016713..1017066 (-)   354   WP_004563577.1   IS66 family insertion sequence element accessory protein TnpB   -
NQ504_RS05130 (NQ504_05130)   -   1017047..1017247 (-)   201   WP_004563576.1   hypothetical protein   -
NQ504_RS05135 (NQ504_05135)   endA   1017378..1018202 (+)   825   WP_003698166.1   DNA/RNA non-specific endonuclease   Machinery gene
NQ504_RS05140 (NQ504_05140)   -   1018294..1018941 (+)   648   WP_003698168.1   nitroreductase family protein   -
NQ504_RS05145 (NQ504_05145)   -   1018998..1019810 (+)   813   WP_003698169.1   endo alpha-1,4 polygalactosaminidase   -
NQ504_RS05150 (NQ504_05150)   -   1019960..1020511 (+)   552   WP_003698171.1   NUMOD4 domain-containing protein   -
NQ504_RS05155 (NQ504_05155)   -   1020563..1021240 (+)   678   WP_003698172.1   DUF3862 domain-containing protein   -
NQ504_RS05160 (NQ504_05160)   cas9   1021403..1025530 (+)   4128   WP_004563574.1   type II CRISPR RNA-guided endonuclease Cas9   -
NQ504_RS05165 (NQ504_05165)   cas1   1025790..1026704 (+)   915   WP_003698174.1   type II CRISPR-associated endonuclease Cas1   -
NQ504_RS05170 (NQ504_05170)   cas2   1026682..1026993 (+)   312   WP_003698175.1   CRISPR-associated endonuclease Cas2   -
NQ504_RS05175 (NQ504_05175)   csn2   1026990..1027655 (+)   666   WP_003698177.1   type II-A CRISPR-associated protein Csn2   -

Sequence


Protein


Length: 274 a.a.   Molecular weight: 30539.93 Da   Isoelectric point: 9.7452

>NTDB_id=715868 NQ504_RS05135 WP_003698166.1 1017378..1018202(+) (endA) [Ligilactobacillus ruminis strain ATCC 25644]
MVKRRKQNKSFATLIIAIGLVLLSWTAFGKKIGLDEWLNSGGQSQTNRVQNDESTPERQLAESVLSDSIRQRLGQKIEWN
GHGAFIINDNKTDLNANVSSAPYAVNRKETGAMIGDAWLNRTTRQYKNRSETGQGRTDWRPQGFKQKLDLSGRYSHAYDR
GHLLAYALVGGLRGFDASEANPDNIATQTAWANEAQSANSTGQNYYEGLVRKALDQNERVRYRVTDLFEEGNLVPSGAHI
EAKSRSGSLEFNVFVPNVQKGIEINYQTGDVSVK

Nucleotide


Length: 825 bp

>NTDB_id=715868 NQ504_RS05135 WP_003698166.1 1017378..1018202(+) (endA) [Ligilactobacillus ruminis strain ATCC 25644]
ATGGTCAAAAGAAGAAAACAAAACAAAAGCTTTGCGACCCTGATCATTGCCATCGGGTTGGTCCTGCTGAGCTGGACTGC
CTTCGGCAAAAAAATCGGCTTGGACGAGTGGCTGAATTCAGGAGGTCAATCTCAAACAAACCGCGTCCAAAACGACGAAA
GCACACCGGAAAGACAGCTTGCGGAAAGCGTTTTGAGCGATTCAATCAGACAAAGACTCGGACAGAAAATCGAGTGGAAC
GGACATGGAGCCTTCATCATCAATGATAACAAGACCGATTTGAATGCCAATGTTTCAAGTGCTCCGTATGCGGTCAACAG
AAAAGAAACCGGGGCGATGATCGGGGATGCCTGGCTGAACAGGACGACCAGACAATACAAAAATCGCAGTGAAACCGGCC
AAGGCAGGACCGACTGGCGCCCCCAGGGCTTCAAACAAAAACTTGATTTGAGCGGTCGCTACTCTCATGCTTACGACAGA
GGTCACTTGCTGGCTTATGCTTTGGTCGGGGGACTGAGAGGGTTTGACGCATCTGAAGCCAATCCCGACAATATCGCAAC
GCAGACGGCCTGGGCAAATGAGGCCCAAAGCGCAAATTCGACCGGTCAAAACTACTATGAAGGACTTGTCCGAAAAGCGC
TTGATCAAAACGAACGGGTTCGCTATCGAGTTACCGATTTGTTTGAAGAAGGCAATCTGGTTCCTTCCGGGGCGCACATT
GAGGCCAAAAGCAGATCGGGATCGCTTGAATTCAACGTGTTCGTGCCAAACGTTCAAAAGGGCATCGAAATCAATTACCA
AACCGGAGACGTCAGCGTCAAATAA
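
The nucleotide and protein records can be cross-checked by translating the coding sequence. The sketch below builds a codon table in plain Python and translates only the first 30 and last 27 nucleotides of the sequence above, which should correspond to the first ten residues and the final eight residues plus the stop codon. It assumes the standard amino acid assignments (bacterial translation table 11 uses the same assignments and differs only in permitted start codons); the helper name `translate` is illustrative.

```python
from itertools import product

# Codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_30 = "ATGGTCAAAAGAAGAAAACAAAACAAAAGC"  # start of the CDS
last_27 = "CAAACCGGAGACGTCAGCGTCAAATAA"      # end of the CDS, including stop

print(translate(first_30))  # MVKRRKQNKS
print(translate(last_27))   # QTGDVSVK*
```

Applying the same function to the full 825 bp sequence should yield the 274-residue protein followed by a single terminal `*`.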


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
AlphaFold DB   E7FPE3

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
endA   Streptococcus pneumoniae Rx1   58.228   86.496   0.504
endA   Streptococcus pneumoniae D39   58.228   86.496   0.504
endA   Streptococcus pneumoniae R6   58.228   86.496   0.504
endA   Streptococcus pneumoniae TIGR4   58.228   86.496   0.504
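
This record does not define the Ha-value, but the values listed above are consistent with the product of the identity and coverage fractions: 0.58228 × 0.86496 ≈ 0.504. The short sketch below reproduces that number under this assumption; the formula is inferred from the listed values and the function name is illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed Ha-value: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


print(round(ha_value(58.228, 86.496), 3))  # 0.504
```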