Detailed information    

insolico Bioinformatically predicted

Overview


Name   ssb   Type   Machinery gene
Locus tag   CRT37_RS21005 Genome accession   NZ_CP023834
Coordinates   4375238..4375774 (+) Length   178 a.a.
NCBI ID   WP_000168305.1    Uniprot ID   A0A9P2PPK2
Organism   Escherichia coli strain 4/2-1     
Function   ssDNA binding (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 4351562..4396088 4375238..4375774 within 0


Gene organization within MGE regions


Location: 4351562..4396088
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CRT37_RS20895 (CRT37_22160) - 4351816..4353396 (+) 1581 Protein_4071 SopA family protein -
  CRT37_RS20900 (CRT37_22170) ubiC 4353619..4354116 (+) 498 WP_001326644.1 chorismate lyase -
  CRT37_RS20905 (CRT37_22175) ubiA 4354129..4355001 (+) 873 WP_000455227.1 4-hydroxybenzoate octaprenyltransferase -
  CRT37_RS20910 (CRT37_22180) plsB 4355156..4357579 (-) 2424 WP_000017354.1 glycerol-3-phosphate 1-O-acyltransferase PlsB -
  CRT37_RS20915 (CRT37_22185) dgkA 4357750..4358118 (+) 369 WP_000002907.1 diacylglycerol kinase -
  CRT37_RS20920 (CRT37_22190) lexA 4358228..4358836 (+) 609 WP_000646078.1 transcriptional repressor LexA -
  CRT37_RS20925 (CRT37_22195) dinF 4358909..4360234 (+) 1326 WP_001326646.1 MATE family efflux transporter DinF -
  CRT37_RS20930 (CRT37_22200) yjbJ 4360350..4360559 (+) 210 WP_001030593.1 CsbD family protein -
  CRT37_RS20935 (CRT37_22205) zur 4360601..4361116 (-) 516 WP_001295691.1 zinc uptake transcriptional repressor Zur -
  CRT37_RS24855 (CRT37_22210) yjbL 4361434..4361688 (+) 255 WP_000912571.1 protein YjbL -
  CRT37_RS20945 (CRT37_22215) yjbM 4361712..4362419 (+) 708 WP_001311303.1 DUF2713 domain-containing protein -
  CRT37_RS20950 (CRT37_22220) dusA 4362782..4363819 (+) 1038 WP_001298868.1 tRNA dihydrouridine(20/20a) synthase DusA -
  CRT37_RS20955 (CRT37_22225) pspG 4363953..4364195 (+) 243 WP_000891404.1 envelope stress response protein PspG -
  CRT37_RS20960 (CRT37_22230) qorA 4364361..4365344 (-) 984 WP_000235508.1 quinone oxidoreductase -
  CRT37_RS20965 (CRT37_22235) dnaB 4365427..4366842 (+) 1416 WP_000918363.1 replicative DNA helicase -
  CRT37_RS20970 (CRT37_22240) alr 4366895..4367974 (+) 1080 WP_001147328.1 alanine racemase -
  CRT37_RS20975 (CRT37_22245) tyrB 4368227..4369420 (+) 1194 WP_000486985.1 aromatic amino acid transaminase -
  CRT37_RS24370 (CRT37_22250) yjbS 4369922..4370125 (-) 204 WP_001321562.1 protein YjbS -
  CRT37_RS20985 (CRT37_22255) aphA 4370527..4371240 (+) 714 WP_001226928.1 acid phosphatase AphA -
  CRT37_RS20990 (CRT37_22260) yjbQ 4371351..4371767 (+) 417 WP_000270375.1 secondary thiamine-phosphate synthase enzyme YjbQ -
  CRT37_RS20995 (CRT37_22265) yjbR 4371771..4372127 (+) 357 WP_000155657.1 MmcQ/YjbR family DNA-binding protein -
  CRT37_RS21000 (CRT37_22270) uvrA 4372162..4374984 (-) 2823 WP_000357740.1 excinuclease ABC subunit UvrA -
  CRT37_RS21005 (CRT37_22275) ssb 4375238..4375774 (+) 537 WP_000168305.1 single-stranded DNA-binding protein SSB1 Machinery gene
  CRT37_RS21010 (CRT37_22280) - 4375933..4377180 (+) 1248 WP_000414651.1 site-specific integrase -
  CRT37_RS21015 (CRT37_22285) - 4377167..4378705 (+) 1539 WP_001333349.1 site-specific integrase -
  CRT37_RS21020 (CRT37_22290) - 4378722..4380755 (+) 2034 WP_000807722.1 hypothetical protein -
  CRT37_RS24375 - 4380748..4381167 (+) 420 WP_001062122.1 hypothetical protein -
  CRT37_RS21030 (CRT37_22300) avs4 4381243..4386006 (+) 4764 WP_000240574.1 AVAST type 4 anti-phage nuclease Avs4 -
  CRT37_RS21040 (CRT37_22310) - 4386545..4387366 (-) 822 WP_001272932.1 abortive infection system antitoxin AbiGi family protein -
  CRT37_RS21045 (CRT37_22315) yjcB 4387774..4388055 (-) 282 WP_001295689.1 YjcB family protein -
  CRT37_RS21050 (CRT37_22325) pdeC 4388485..4390071 (+) 1587 WP_059328862.1 c-di-GMP phosphodiesterase PdeC -
  CRT37_RS21055 (CRT37_22330) soxS 4390074..4390397 (-) 324 WP_000019358.1 superoxide response transcriptional regulator SoxS -
  CRT37_RS21060 (CRT37_22335) soxR 4390483..4390947 (+) 465 WP_000412430.1 redox-sensitive transcriptional activator SoxR -
  CRT37_RS21065 (CRT37_22355) ghxP 4391493..4392842 (+) 1350 WP_000106882.1 guanine/hypoxanthine transporter GhxP -
  CRT37_RS21070 (CRT37_22360) yjcE 4392993..4394642 (+) 1650 WP_000402207.1 Na+/H+ antiporter -
  CRT37_RS21075 (CRT37_22365) espX5 4394796..4396088 (-) 1293 WP_001270095.1 T3SS effector pentapeptide repeat protein EspX5 -

Sequence


Protein


Download         Length: 178 a.a.        Molecular weight: 18975.00 Da        Isoelectric Point: 5.2358

>NTDB_id=250427 CRT37_RS21005 WP_000168305.1 4375238..4375774(+) (ssb) [Escherichia coli strain 4/2-1]
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMKEQTEWHRVVLFGKLAEVASEYLRKGSQVYI
EGQLRTRKWTDQSGQDRYTTEVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSGGAQSRPQQSA
PAAPSNEPPMDFDDDIPF

Nucleotide


Download         Length: 537 bp        

>NTDB_id=250427 CRT37_RS21005 WP_000168305.1 4375238..4375774(+) (ssb) [Escherichia coli strain 4/2-1]
ATGGCCAGCAGAGGCGTAAACAAGGTTATTCTCGTTGGTAATCTGGGTCAGGACCCGGAAGTACGCTACATGCCAAATGG
TGGCGCAGTTGCCAACATTACGCTGGCTACTTCCGAATCCTGGCGTGATAAAGCGACCGGCGAGATGAAAGAACAGACTG
AATGGCACCGCGTTGTGCTGTTCGGCAAACTGGCAGAAGTGGCGAGCGAATATCTGCGTAAAGGTTCTCAGGTTTATATC
GAAGGTCAGCTGCGTACCCGTAAATGGACCGATCAATCCGGTCAGGATCGCTACACCACAGAAGTCGTGGTGAACGTTGG
CGGCACCATGCAGATGCTGGGTGGTCGTCAGGGTGGTGGCGCTCCGGCAGGTGGCAATATCGGTGGTGGTCAGCCGCAGG
GCGGTTGGGGTCAGCCTCAGCAGCCGCAGGGTGGCAATCAGTTCAGCGGCGGCGCGCAGTCTCGCCCGCAGCAGTCCGCT
CCGGCAGCGCCGTCTAACGAGCCGCCGATGGACTTTGATGATGACATTCCGTTCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  ssb Vibrio cholerae strain A1552

74.444

100

0.753

  ssb Glaesserella parasuis strain SC1401

57.923

100

0.596

  ssb Neisseria meningitidis MC58

48.066

100

0.489

  ssb Neisseria gonorrhoeae MS11

48.066

100

0.489


Multiple sequence alignment