Detailed information    

insolico Bioinformatically predicted

Overview


Name   ssb   Type   Machinery gene
Locus tag   CR539_RS14910 Genome accession   NZ_CP024147
Coordinates   2849215..2849721 (+) Length   168 a.a.
NCBI ID   WP_000168274.1    Uniprot ID   A0A9X0Q0X8
Organism   Escherichia coli strain 14EC033     
Function   ssDNA binding (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2841436..2858258 2849215..2849721 within 0
IScluster/Tn 2847535..2848232 2849215..2849721 flank 983


Gene organization within MGE regions


Location: 2841436..2858258
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CR539_RS28600 - 2842060..2842362 (-) 303 Protein_2693 hypothetical protein -
  CR539_RS14865 (CR539_14815) ymfQ 2842366..2842950 (-) 585 WP_101969437.1 YmfQ family protein -
  CR539_RS14870 (CR539_14820) - 2842941..2843999 (-) 1059 WP_101969438.1 baseplate J/gp47 family protein -
  CR539_RS14875 (CR539_14825) - 2843986..2844411 (-) 426 WP_000424732.1 phage GP46 family protein -
  CR539_RS14880 (CR539_14830) - 2844411..2844959 (-) 549 WP_001259087.1 phage baseplate assembly protein V -
  CR539_RS14885 (CR539_14835) - 2844959..2846038 (-) 1080 WP_101969439.1 phage baseplate assembly protein -
  CR539_RS14890 (CR539_14840) - 2846035..2847363 (-) 1329 WP_101969569.1 DNA circularization N-terminal domain-containing protein -
  CR539_RS14905 (CR539_14855) - 2848507..2849214 (+) 708 WP_101969440.1 Rad52/Rad22 family DNA repair protein -
  CR539_RS14910 (CR539_14860) ssb 2849215..2849721 (+) 507 WP_000168274.1 single-stranded DNA-binding protein Machinery gene
  CR539_RS14915 (CR539_14865) - 2849735..2850031 (+) 297 WP_001111285.1 phage anti-RecBCD protein -
  CR539_RS14920 (CR539_14870) - 2850042..2850206 (+) 165 WP_101969570.1 DUF2737 family protein -
  CR539_RS14925 (CR539_14875) - 2850203..2850685 (+) 483 WP_101969441.1 hypothetical protein -
  CR539_RS28995 - 2850682..2850978 (+) 297 Protein_2706 ead/Ea22-like family protein -
  CR539_RS14935 (CR539_14885) - 2851158..2851670 (+) 513 WP_101969443.1 DUF551 domain-containing protein -
  CR539_RS14940 (CR539_14890) - 2851767..2851958 (+) 192 WP_101969444.1 Eag protein -
  CR539_RS14945 (CR539_14895) - 2851987..2852684 (+) 698 WP_095033700.1 IS1-like element IS1B family transposase -
  CR539_RS14950 (CR539_14900) - 2852824..2853168 (+) 345 WP_001281193.1 hypothetical protein -
  CR539_RS14955 (CR539_14905) - 2853246..2853437 (+) 192 WP_000132739.1 AlpA family phage regulatory protein -
  CR539_RS14960 (CR539_14910) - 2853418..2854596 (-) 1179 WP_097451079.1 site-specific integrase -
  CR539_RS14965 (CR539_14915) sbcB 2854778..2856202 (+) 1425 WP_101969445.1 exodeoxyribonuclease I -
  CR539_RS14970 (CR539_14920) tsuB 2856256..2856480 (-) 225 WP_001575873.1 thiosulfate utilization sulfurtransferase TsuB/YeeD -
  CR539_RS14975 (CR539_14925) tsuA 2856526..2857506 (-) 981 WP_283950833.1 thiosulfate utilization transporter TsuA/YeeE -
  CR539_RS14985 (CR539_14935) - 2857561..2858258 (+) 698 WP_094096600.1 IS1-like element IS1A family transposase -

Sequence


Protein


Download         Length: 168 a.a.        Molecular weight: 18731.03 Da        Isoelectric Point: 8.4877

>NTDB_id=252534 CR539_RS14910 WP_000168274.1 2849215..2849721(+) (ssb) [Escherichia coli strain 14EC033]
MASRGVNKVIILGRVGQDPEVRYSPSGTAFANLTIATSEQWRDKNTGEQKELTEWHRVAVSGKLAEVVGQYVKKGDQIYF
EGMLRTRKWKDQSGQDRYTTEVHVGINGVMQMLGGIGDSKQQAASRQSQKPQQQSSPAQHNEPPMDFDDDIPFAPVTLPF
PRHAIHAI

Nucleotide


Download         Length: 507 bp        

>NTDB_id=252534 CR539_RS14910 WP_000168274.1 2849215..2849721(+) (ssb) [Escherichia coli strain 14EC033]
ATGGCAAGCAGAGGCGTAAATAAGGTGATTATCCTTGGTCGGGTAGGACAAGACCCGGAAGTTCGATACTCACCATCAGG
AACAGCGTTCGCTAACCTGACAATAGCCACGTCAGAACAATGGCGAGATAAAAATACTGGCGAGCAAAAGGAATTGACTG
AATGGCATCGTGTTGCTGTATCCGGGAAACTGGCTGAGGTCGTGGGGCAGTATGTGAAAAAAGGTGATCAGATTTATTTC
GAGGGAATGCTGAGAACCAGAAAGTGGAAAGACCAGTCAGGGCAAGACCGTTACACAACCGAGGTTCATGTCGGAATTAA
TGGCGTGATGCAAATGCTTGGCGGCATTGGCGACAGCAAACAACAAGCAGCCAGCAGGCAATCACAGAAGCCACAGCAGC
AATCATCACCAGCACAACACAACGAACCTCCGATGGATTTTGACGACGATATACCCTTTGCACCAGTAACTCTCCCCTTC
CCTCGTCACGCTATTCACGCAATTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  ssb Vibrio cholerae strain A1552

62.147

100

0.655

  ssb Glaesserella parasuis strain SC1401

45.856

100

0.494

  ssb Neisseria meningitidis MC58

38.636

100

0.405

  ssb Neisseria gonorrhoeae MS11

38.636

100

0.405


Multiple sequence alignment