Detailed information    

In silico: bioinformatically predicted

Overview


Name: endA
Type: Machinery gene
Locus tag: DQM63_RS03860
Genome accession: NZ_LS483385
Coordinates: 746339..747199 (+)
Length: 286 a.a.
NCBI ID: WP_002900735.1
UniProt ID: -
Organism: Streptococcus sanguinis strain NCTC7863
Function: cleavage of dsDNA into ssDNA (predicted from homology); DNA processing
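The coordinates and the protein length listed in the overview agree with each other, which a little arithmetic confirms. The sketch below assumes inclusive, 1-based coordinates (the usual GenBank convention) and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(746339, 747199)
print(cds_bp, protein_length(cds_bp))  # 861 286
```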

Genomic Context


Location: 741339..752199
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
DQM63_RS03830 (NCTC7863_00746) | - | 741595..742476 (+) | 882 | WP_002895995.1 | F0F1 ATP synthase subunit gamma | -
DQM63_RS03835 (NCTC7863_00747) | atpD | 742515..743921 (+) | 1407 | WP_002897903.1 | F0F1 ATP synthase subunit beta | -
DQM63_RS03840 (NCTC7863_00748) | - | 743929..744351 (+) | 423 | WP_002900740.1 | F0F1 ATP synthase subunit epsilon | -
DQM63_RS03845 (NCTC7863_00749) | - | 744525..744755 (+) | 231 | WP_002895990.1 | DUF1146 family protein | -
DQM63_RS03850 (NCTC7863_00750) | murA | 744823..746103 (+) | 1281 | WP_002900738.1 | UDP-N-acetylglucosamine 1-carboxyvinyltransferase | -
DQM63_RS03855 (NCTC7863_00751) | - | 746096..746278 (+) | 183 | WP_002900736.1 | DNA-directed RNA polymerase subunit beta | -
DQM63_RS03860 (NCTC7863_00752) | endA | 746339..747199 (+) | 861 | WP_002900735.1 | DNA/RNA non-specific endonuclease | Machinery gene
DQM63_RS03865 (NCTC7863_00753) | - | 747377..748096 (+) | 720 | WP_002902711.1 | matrixin family metalloprotease | -
DQM63_RS03870 (NCTC7863_00754) | - | 748101..748466 (+) | 366 | WP_002900732.1 | hypothetical protein | -
DQM63_RS03875 (NCTC7863_00755) | - | 748566..750461 (+) | 1896 | WP_032909515.1 | ABC-F family ATP-binding cassette domain-containing protein | -
DQM63_RS03880 (NCTC7863_00756) | - | 750621..751178 (+) | 558 | WP_002900729.1 | hypothetical protein | -
DQM63_RS03885 (NCTC7863_00757) | - | 751267..752010 (+) | 744 | WP_002913733.1 | hypothetical protein | -
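The Location range appears to be the endA coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers in this record, not stated by the source. A minimal sketch of the window calculation, with hypothetical function names, which also checks a Size (bp) value against its coordinates:

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a gene's inclusive range by `flank` bp on both sides.

    The lower bound is clamped at 1; the upper bound is not clamped because
    the replicon length is not known here.
    """
    return max(1, start - flank), end + flank


def size_bp(coords: str) -> int:
    """Size of an inclusive 'start..end' range such as '744823..746103'."""
    start, end = (int(part) for part in coords.split(".."))
    return end - start + 1


print(context_window(746339, 747199))  # (741339, 752199)
print(size_bp("744823..746103"))       # 1281 (murA)
```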

Sequence


Protein


Length: 286 a.a.; Molecular weight: 31314.98 Da; Isoelectric point: 10.0558

>NTDB_id=1139629 DQM63_RS03860 WP_002900735.1 746339..747199(+) (endA) [Streptococcus sanguinis strain NCTC7863]
MKRKTSQKRNSKSLQGWAGLTLALILALAGYFWGQDGQQPLQSPNSEIRVGQVQDQGTPSRELAESVLTDAVRAQLKGSI
EWNGAGAFIVNGNKTDLDAGISSKPYADNKTKLVQGQTLPTVANAFLSKSTRQYKKREETRNANTSWVPAGWHQLKNLPG
EYNHAVDRGHLLAYSLIGGLKGFDASTSNPANIAVQTAWSNQANEADSTGQNYYETKIRKALDKNKRVRYRVTLIYATET
DLVPVGSHLEAKSSDGSLEFNVFIPNVQKGIRLDYNSGKVSQETGS
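The FASTA header of this record packs the database ID, locus tag, protein accession, coordinates, strand, gene name, and organism into one line. The sketch below pulls those fields apart with a regular expression. The field layout is inferred from this single record and may not hold for every entry; `parse_header` and the field names are illustrative.

```python
import re

# Pattern inferred from the header of this one record (an assumption).
HEADER_RE = re.compile(
    r">NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1139629 DQM63_RS03860 WP_002900735.1 746339..747199(+) "
    "(endA) [Streptococcus sanguinis strain NCTC7863]"
)
info = parse_header(header)
print(info["gene"], info["start"], info["end"], info["strand"])
```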

Nucleotide


Length: 861 bp

>NTDB_id=1139629 DQM63_RS03860 WP_002900735.1 746339..747199(+) (endA) [Streptococcus sanguinis strain NCTC7863]
ATGAAAAGAAAGACTAGTCAGAAAAGGAATTCTAAGAGCCTGCAAGGCTGGGCTGGGCTGACTCTAGCCTTGATTTTGGC
GCTGGCTGGCTATTTCTGGGGACAGGATGGTCAGCAGCCACTTCAGAGTCCAAATTCAGAAATCAGAGTTGGGCAAGTTC
AAGATCAGGGCACCCCTAGTCGGGAACTGGCTGAGAGCGTTCTGACGGATGCTGTCCGAGCTCAGCTTAAAGGCAGCATT
GAATGGAATGGAGCTGGTGCCTTTATCGTGAATGGGAATAAGACAGACTTGGATGCGGGGATTTCTAGTAAGCCCTACGC
TGACAATAAGACCAAGCTTGTACAAGGGCAAACCCTTCCGACAGTGGCGAATGCCTTTTTATCCAAGTCTACCCGCCAGT
ACAAAAAACGGGAAGAAACTAGAAATGCCAACACTTCATGGGTACCGGCAGGCTGGCATCAACTCAAGAACCTCCCTGGT
GAGTACAATCACGCTGTGGACCGCGGACATCTCTTGGCCTATTCGCTGATCGGTGGTTTGAAGGGCTTTGATGCTTCTAC
TAGCAATCCTGCCAATATCGCTGTGCAGACGGCCTGGTCCAACCAAGCCAATGAAGCGGATTCGACAGGGCAGAATTACT
ATGAGACAAAGATCCGTAAGGCCCTGGATAAGAACAAGCGGGTGCGCTACCGGGTAACCTTGATTTATGCCACGGAGACT
GATTTAGTACCGGTTGGGTCACATTTAGAGGCCAAGTCCAGCGATGGCAGTCTAGAATTCAATGTCTTTATTCCGAATGT
GCAAAAAGGGATTCGTTTGGACTATAATAGTGGCAAGGTCAGTCAGGAGACGGGTTCTTAA
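The nucleotide record can be checked against the protein record by translating it with the standard codon assignments. The following is a small standard-library sketch, not the database's own method. It builds the codon table from the conventional TCAG ordering and translates the first 60 and last 15 bases of the sequence above; `translate` is a hypothetical helper.

```python
from itertools import product

# Standard codon assignments, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_60 = "ATGAAAAGAAAGACTAGTCAGAAAAGGAATTCTAAGAGCCTGCAAGGCTGGGCTGGGCTG"
last_15 = "GAGACGGGTTCTTAA"
print(translate(first_60))  # MKRKTSQKRNSKSLQGWAGL
print(translate(last_15))   # ETGS
```

Both fragments match the start (MKRKTSQKRNSKSLQGWAGL) and end (ETGS) of the protein sequence listed above.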


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
endA | Streptococcus pneumoniae Rx1 | 74.439 | 77.972 | 0.58
endA | Streptococcus pneumoniae D39 | 74.439 | 77.972 | 0.58
endA | Streptococcus pneumoniae R6 | 74.439 | 77.972 | 0.58
endA | Streptococcus pneumoniae TIGR4 | 74.439 | 77.972 | 0.58

