Detailed information    

insolico Bioinformatically predicted

Overview


Name   ssb   Type   Machinery gene
Locus tag   UNDKW_RS21270 Genome accession   NZ_AP018439
Coordinates   4758840..4759277 (+) Length   145 a.a.
NCBI ID   WP_162060351.1    Uniprot ID   A0A6N4TA35
Organism   Undibacterium sp. KW1     
Function   ssDNA binding (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 4744249..4759277 4758840..4759277 within 0


Gene organization within MGE regions


Location: 4744249..4759277
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  UNDKW_RS21180 (UNDKW_4251) - 4744249..4744611 (-) 363 WP_162060336.1 phage tail protein -
  UNDKW_RS21185 (UNDKW_4252) - 4744861..4745187 (-) 327 WP_197892963.1 hypothetical protein -
  UNDKW_RS21190 (UNDKW_4253) - 4745297..4745635 (-) 339 WP_162060337.1 hypothetical protein -
  UNDKW_RS21195 (UNDKW_4254) - 4746404..4747117 (+) 714 WP_162060338.1 GNAT family N-acetyltransferase -
  UNDKW_RS21200 (UNDKW_4255) - 4747426..4747938 (+) 513 WP_162060339.1 hypothetical protein -
  UNDKW_RS21205 (UNDKW_4256) - 4748182..4748613 (-) 432 WP_162060340.1 hypothetical protein -
  UNDKW_RS21210 (UNDKW_4257) - 4748692..4749123 (-) 432 WP_162060341.1 hypothetical protein -
  UNDKW_RS21215 (UNDKW_4258) - 4749638..4751290 (-) 1653 WP_162060342.1 hypothetical protein -
  UNDKW_RS21220 (UNDKW_4259) - 4751525..4751950 (-) 426 WP_162060343.1 hypothetical protein -
  UNDKW_RS21225 (UNDKW_4260) - 4752299..4753246 (-) 948 WP_162060344.1 ATP-binding protein -
  UNDKW_RS21230 (UNDKW_4261) - 4753233..4753925 (-) 693 WP_162060345.1 hypothetical protein -
  UNDKW_RS21235 (UNDKW_4262) - 4755305..4755997 (+) 693 WP_162042812.1 LexA family transcriptional regulator -
  UNDKW_RS21240 (UNDKW_4263) - 4756187..4756753 (+) 567 WP_162060346.1 GNAT family N-acetyltransferase -
  UNDKW_RS21245 - 4756658..4757107 (-) 450 WP_162060347.1 hypothetical protein -
  UNDKW_RS21250 (UNDKW_4264) - 4757311..4757526 (+) 216 WP_162042814.1 hypothetical protein -
  UNDKW_RS21255 - 4757747..4757971 (+) 225 WP_162060348.1 hypothetical protein -
  UNDKW_RS21260 (UNDKW_4265) - 4758037..4758360 (+) 324 WP_162060349.1 hypothetical protein -
  UNDKW_RS21265 (UNDKW_4266) - 4758357..4758671 (+) 315 WP_162060350.1 hypothetical protein -
  UNDKW_RS30875 - 4758668..4758796 (+) 129 WP_255431517.1 hypothetical protein -
  UNDKW_RS21270 (UNDKW_4267) ssb 4758840..4759277 (+) 438 WP_162060351.1 single-stranded DNA-binding protein Machinery gene

Sequence


Protein


Download         Length: 145 a.a.        Molecular weight: 16175.07 Da        Isoelectric Point: 7.2057

>NTDB_id=69616 UNDKW_RS21270 WP_162060351.1 4758840..4759277(+) (ssb) [Undibacterium sp. KW1]
MASVNKVILVGNLGRDPETRYAPSGEAVSNITIATTDNWKDRTTGERKERSEWHRISFFGRLAEVVAMHLKKGATVYVEG
SLRTRKYTDKDGQEKFITEVRGDTMQMLSSRSAAQADGADGTPAKPRQPARAPETWDNMDDDLPF

Nucleotide


Download         Length: 438 bp        

>NTDB_id=69616 UNDKW_RS21270 WP_162060351.1 4758840..4759277(+) (ssb) [Undibacterium sp. KW1]
ATGGCATCCGTCAATAAAGTGATCCTCGTTGGCAACCTGGGCCGCGACCCTGAAACCCGCTACGCCCCCAGTGGCGAAGC
CGTCAGCAACATCACCATTGCCACCACCGACAACTGGAAAGACAGAACAACTGGTGAACGCAAGGAGCGCAGTGAATGGC
ACCGCATCAGCTTCTTTGGCCGCCTGGCAGAAGTGGTCGCCATGCACCTCAAGAAAGGCGCGACCGTCTATGTAGAAGGC
AGCCTGCGCACCCGCAAATACACAGACAAAGATGGTCAGGAAAAATTCATCACTGAAGTACGCGGCGACACCATGCAGAT
GCTCAGCAGCAGATCAGCAGCCCAGGCAGACGGCGCGGATGGCACACCGGCAAAACCCCGCCAGCCCGCCCGCGCACCTG
AGACCTGGGACAACATGGATGATGATTTACCGTTTTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6N4TA35

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  ssb Vibrio cholerae strain A1552

46.328

100

0.566

  ssb Glaesserella parasuis strain SC1401

43.889

100

0.545

  ssb Neisseria gonorrhoeae MS11

40.805

100

0.49

  ssb Neisseria meningitidis MC58

40.23

100

0.483

  ssb Latilactobacillus sakei subsp. sakei 23K

31.977

100

0.379


Multiple sequence alignment