Detailed information    

insolico Bioinformatically predicted

Overview


Name   degU   Type   Regulator
Locus tag   S101267_RS18345 Genome accession   NZ_CP021505
Coordinates   3512910..3513599 (-) Length   229 a.a.
NCBI ID   WP_003219701.1    Uniprot ID   G4P1D5
Organism   Bacillus amyloliquefaciens strain SRCM101267     
Function   activation of comK (predicted from homology)   
Competence regulation

Genomic Context


Location: 3507910..3518599
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S101267_RS18310 (S101267_03681) - 3508109..3508591 (-) 483 WP_013353780.1 flagellar protein FlgN -
  S101267_RS18315 (S101267_03682) flgM 3508606..3508872 (-) 267 WP_013353781.1 flagellar biosynthesis anti-sigma factor FlgM -
  S101267_RS18320 (S101267_03683) - 3508942..3509361 (-) 420 WP_013353782.1 TIGR03826 family flagellar region protein -
  S101267_RS18325 (S101267_03684) comFC 3509435..3510124 (-) 690 WP_198315882.1 ComF family protein Machinery gene
  S101267_RS18330 (S101267_03685) - 3510130..3510414 (-) 285 WP_013353784.1 late competence development ComFB family protein -
  S101267_RS18335 (S101267_03686) comFA 3510473..3511858 (-) 1386 WP_013353785.1 DEAD/DEAH box helicase Machinery gene
  S101267_RS18340 (S101267_03687) - 3511965..3512813 (-) 849 WP_013353786.1 DegV family protein -
  S101267_RS18345 (S101267_03688) degU 3512910..3513599 (-) 690 WP_003219701.1 two-component system response regulator DegU Regulator
  S101267_RS18350 (S101267_03689) degS 3513676..3514839 (-) 1164 WP_013353787.1 two-component sensor histidine kinase DegS Regulator
  S101267_RS18355 (S101267_03690) - 3515062..3515709 (+) 648 WP_013353788.1 YigZ family protein -
  S101267_RS18360 (S101267_03691) - 3515714..3516988 (+) 1275 WP_013353789.1 LCP family protein -
  S101267_RS18365 (S101267_03692) - 3517088..3518164 (-) 1077 WP_013353790.1 glycosyltransferase family 4 protein -

Sequence


Protein


Download         Length: 229 a.a.        Molecular weight: 25866.57 Da        Isoelectric Point: 5.9446

>NTDB_id=231425 S101267_RS18345 WP_003219701.1 3512910..3513599(-) (degU) [Bacillus amyloliquefaciens strain SRCM101267]
MTKVNIVIIDDHQLFREGVKRILDFEPTFEVVAEGDDGDEAARIVEHYHPDVVIMDINMPNVNGVEATKQLVELYPESKV
IILSIHDDENYVTHALKTGARGYLLKEMDADTLIEAVKVVAEGGSYLHPKVTHNLVNEFRRLATSGVSAHPQHEVYPEIR
RPLHILTRRECEVLQMLADGKSNRGIGESLFISEKTVKNHVSNILQKMNVNDRTQAVVVAIKNGWVEMR

Nucleotide


Download         Length: 690 bp        

>NTDB_id=231425 S101267_RS18345 WP_003219701.1 3512910..3513599(-) (degU) [Bacillus amyloliquefaciens strain SRCM101267]
GTGACTAAAGTAAATATTGTTATTATCGACGACCATCAACTATTCCGTGAGGGTGTAAAAAGAATATTGGATTTTGAACC
TACCTTTGAAGTGGTAGCAGAAGGTGACGATGGAGATGAAGCGGCTCGTATCGTCGAGCATTATCATCCCGATGTCGTCA
TAATGGATATCAATATGCCGAATGTAAATGGAGTAGAGGCTACTAAGCAGCTTGTTGAGCTGTACCCTGAATCTAAGGTA
ATTATCCTCTCCATCCATGATGATGAAAACTATGTGACTCATGCTTTAAAAACAGGTGCAAGAGGCTATCTCCTGAAAGA
GATGGACGCGGATACTTTAATCGAGGCAGTGAAAGTGGTAGCCGAGGGCGGTTCTTACCTTCACCCGAAAGTAACCCACA
ATCTTGTCAACGAATTCCGCCGTCTTGCGACAAGCGGAGTTTCTGCTCACCCTCAGCATGAGGTTTATCCGGAAATTCGG
AGACCATTACATATATTAACGAGACGGGAATGTGAAGTGCTGCAAATGCTTGCAGACGGAAAAAGCAACCGCGGAATCGG
TGAATCATTGTTTATCAGTGAGAAAACAGTGAAAAACCACGTGAGTAATATTTTGCAGAAAATGAACGTTAATGACCGGA
CACAAGCTGTTGTCGTAGCCATTAAAAACGGCTGGGTAGAGATGCGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB G4P1D5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  degU Bacillus subtilis subsp. subtilis str. 168

100

100

1


Multiple sequence alignment