Detailed information    

In silico: bioinformatically predicted

Overview


Name: degU
Type: Regulator
Locus tag: S101395_RS03030
Genome accession: NZ_CP021920
Coordinates: 587453..588142 (+)
Length: 229 a.a.
NCBI ID: WP_003185730.1
UniProt ID: Q5QSL6
Organism: Bacillus sonorensis strain SRCM101395
Function: activation of comK (predicted from homology)
Competence regulation
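
The coordinates and the protein length listed above agree with each other, which can be checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a span that includes the stop codon; the function name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple:
    """Return (nucleotide length, protein length) for a 1-based inclusive CDS span.

    Assumes the span includes the stop codon, so the protein is one
    residue shorter than the number of codons.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # the final codon is the stop codon
    return nt_len, aa_len


nt_len, aa_len = cds_lengths(587453, 588142)
print(nt_len, aa_len)  # 690 229
```

Under these assumptions the span 587453..588142 gives 690 bp and 229 a.a., matching the lengths reported in the Sequence section.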

Genomic Context


Location: 582453..593142
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
S101395_RS03010 (S101395_00608) | - | 583051..584115 (+) | 1065 | WP_006639270.1 | glycosyltransferase family 4 protein | -
S101395_RS03015 (S101395_00609) | - | 584258..585337 (-) | 1080 | WP_006639269.1 | LCP family protein | -
S101395_RS03020 (S101395_00610) | - | 585354..585992 (-) | 639 | WP_006639268.1 | YigZ family protein | -
S101395_RS03025 (S101395_00611) | degS | 586214..587371 (+) | 1158 | WP_006639267.1 | sensor histidine kinase | Regulator
S101395_RS03030 (S101395_00612) | degU | 587453..588142 (+) | 690 | WP_003185730.1 | two-component system response regulator DegU | Regulator
S101395_RS03035 (S101395_00613) | - | 588262..589104 (+) | 843 | WP_006639266.1 | DegV family protein | -
S101395_RS03040 (S101395_00614) | comFA | 589297..590643 (+) | 1347 | WP_006639265.1 | DEAD/DEAH box helicase | Machinery gene
S101395_RS03045 (S101395_00615) | - | 590700..590984 (+) | 285 | WP_006639264.1 | late competence development ComFB family protein | -
S101395_RS26895 (S101395_00616) | comFC | 590941..591675 (+) | 735 | WP_006639263.1 | ComF family protein | Machinery gene
S101395_RS03055 (S101395_00617) | - | 591733..592152 (+) | 420 | WP_006639262.1 | TIGR03826 family flagellar region protein | -
S101395_RS03060 (S101395_00618) | flgM | 592232..592495 (+) | 264 | WP_006639261.1 | flagellar biosynthesis anti-sigma factor FlgM | -
S101395_RS03065 (S101395_00619) | - | 592510..592992 (+) | 483 | WP_029419581.1 | flagellar protein FlgN | -
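
The genomic context table can be sanity-checked programmatically. The sketch below parses the leading fields of each row, confirms that every listed size equals end - start + 1, reconstructs the displayed window, and reports adjacent genes whose coordinates overlap. The 5000 bp flank is an assumption inferred from the numbers in this record (587453 - 5000 and 588142 + 5000), not a documented database setting, and the helper names are illustrative.

```python
import re

# Leading fields of each table row, copied from the record above.
ROWS = [
    "S101395_RS03010 (S101395_00608) - 583051..584115 (+) 1065",
    "S101395_RS03015 (S101395_00609) - 584258..585337 (-) 1080",
    "S101395_RS03020 (S101395_00610) - 585354..585992 (-) 639",
    "S101395_RS03025 (S101395_00611) degS 586214..587371 (+) 1158",
    "S101395_RS03030 (S101395_00612) degU 587453..588142 (+) 690",
    "S101395_RS03035 (S101395_00613) - 588262..589104 (+) 843",
    "S101395_RS03040 (S101395_00614) comFA 589297..590643 (+) 1347",
    "S101395_RS03045 (S101395_00615) - 590700..590984 (+) 285",
    "S101395_RS26895 (S101395_00616) comFC 590941..591675 (+) 735",
    "S101395_RS03055 (S101395_00617) - 591733..592152 (+) 420",
    "S101395_RS03060 (S101395_00618) flgM 592232..592495 (+) 264",
    "S101395_RS03065 (S101395_00619) - 592510..592992 (+) 483",
]

ROW_RE = re.compile(r"^(\S+) \(\S+\) (\S+) (\d+)\.\.(\d+) \(([+-])\) (\d+)$")


def parse_row(row):
    """Split one row into locus tag, gene name, coordinates, strand and size."""
    m = ROW_RE.match(row)
    if m is None:
        raise ValueError(f"unrecognised row: {row!r}")
    locus, gene, start, end, strand, size = m.groups()
    return {"locus": locus, "gene": gene, "start": int(start),
            "end": int(end), "strand": strand, "size": int(size)}


genes = [parse_row(r) for r in ROWS]

# Each size should equal the inclusive coordinate span.
size_ok = all(g["end"] - g["start"] + 1 == g["size"] for g in genes)

# Assumed window rule: focal gene extended by 5000 bp on each side.
FLANK = 5000
focal = next(g for g in genes if g["gene"] == "degU")
window = (focal["start"] - FLANK, focal["end"] + FLANK)

# Adjacent genes (sorted by start) whose spans overlap, with overlap length in bp.
overlaps = [(a["locus"], b["locus"], a["end"] - b["start"] + 1)
            for a, b in zip(genes, genes[1:]) if b["start"] <= a["end"]]

print(size_ok, window, overlaps)
```

With the rows above, all sizes are consistent, the window reproduces 582453..593142, and the only overlapping pair is S101395_RS03045 and S101395_RS26895 (comFC), which share 44 bp.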

Sequence


Protein


Length: 229 a.a.   Molecular weight: 25842.50 Da   Isoelectric point: 5.9436

>NTDB_id=234870 S101395_RS03030 WP_003185730.1 587453..588142(+) (degU) [Bacillus sonorensis strain SRCM101395]
MTKVNIVIIDDHQLFREGVKRILDFEPTFEVVAEGDDGDEAARIVEHYHPDVVIMDINMPNVNGVEATKQLVDLYPESKV
IILSIHDDENYVTHALKTGARGYLLKEMDADTLIEAVKVVAEGGSYLHPKVTHNLVNEFRRLATSGVSSHAQHEVYPEIR
RPLHILTRRECEVLQMLADGKSNRGIGESLFISEKTVKNHVSNILQKMNVNDRTQAVVVAIKNGWVEMR
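
The FASTA header packs several of the record's fields into one line. As a minimal sketch, the regular expression below splits that header into named fields and joins the wrapped sequence lines. The pattern is written only against the header shown here and is an assumption about the layout, not a published format specification; `parse_record` is an illustrative name.

```python
import re

FASTA = """\
>NTDB_id=234870 S101395_RS03030 WP_003185730.1 587453..588142(+) (degU) [Bacillus sonorensis strain SRCM101395]
MTKVNIVIIDDHQLFREGVKRILDFEPTFEVVAEGDDGDEAARIVEHYHPDVVIMDINMPNVNGVEATKQLVDLYPESKV
IILSIHDDENYVTHALKTGARGYLLKEMDADTLIEAVKVVAEGGSYLHPKVTHNLVNEFRRLATSGVSSHAQHEVYPEIR
RPLHILTRRECEVLQMLADGKSNRGIGESLFISEKTVKNHVSNILQKMNVNDRTQAVVVAIKNGWVEMR
"""

# Header layout assumed from the single example above.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)


def parse_record(text):
    """Return (header fields as a dict, sequence as one string)."""
    lines = text.strip().splitlines()
    m = HEADER_RE.match(lines[0])
    if m is None:
        raise ValueError("header does not match the assumed layout")
    sequence = "".join(line.strip() for line in lines[1:])
    return m.groupdict(), sequence


fields, protein = parse_record(FASTA)
print(fields["gene"], fields["locus"], len(protein))  # degU S101395_RS03030 229
```

The joined sequence has 229 residues, in agreement with the length stated for the protein.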

Nucleotide


Length: 690 bp

>NTDB_id=234870 S101395_RS03030 WP_003185730.1 587453..588142(+) (degU) [Bacillus sonorensis strain SRCM101395]
GTGACTAAAGTAAATATTGTAATTATTGATGATCATCAGTTATTCCGAGAAGGTGTCAAACGGATTTTGGATTTTGAGCC
TACTTTTGAGGTAGTGGCTGAAGGAGACGACGGTGATGAAGCGGCTCGCATCGTCGAGCACTATCATCCTGATGTTGTGA
TCATGGATATTAATATGCCGAATGTAAACGGAGTTGAAGCGACAAAACAGCTTGTCGATTTGTATCCTGAATCAAAGGTC
ATTATTCTATCGATCCATGATGACGAAAACTATGTGACACATGCGTTGAAAACCGGAGCGCGCGGCTATCTGCTGAAAGA
AATGGACGCAGATACATTGATTGAAGCCGTTAAGGTAGTGGCAGAAGGCGGTTCTTATCTTCACCCTAAAGTAACGCACA
ATCTTGTCAATGAATTCCGCCGTCTGGCAACAAGCGGTGTGTCATCACACGCTCAGCACGAAGTATATCCGGAAATCCGC
AGACCTTTGCACATTTTGACAAGACGGGAATGCGAAGTTTTACAGATGCTCGCGGACGGAAAAAGCAACCGCGGCATAGG
AGAATCTTTATTTATCAGTGAAAAAACGGTTAAAAACCATGTCAGCAACATCCTGCAGAAAATGAATGTCAATGACAGAA
CTCAAGCTGTAGTTGTGGCCATTAAAAATGGCTGGGTAGAAATGAGATAA
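
The nucleotide sequence begins with GTG rather than ATG, so a plain codon lookup would yield valine at position 1 while the protein record starts with methionine. The sketch below translates the CDS with the standard codon assignments and decodes a recognised alternative start codon as Met, which is how bacterial initiation codons are read. Treating only ATG, GTG and TTG as start codons is a simplifying assumption, and `translate_cds` is an illustrative name.

```python
from itertools import product

# Standard codon assignments in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplifying assumption: the common bacterial initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}

CDS = "".join((
    "GTGACTAAAGTAAATATTGTAATTATTGATGATCATCAGTTATTCCGAGAAGGTGTCAAACGGATTTTGGATTTTGAGCC",
    "TACTTTTGAGGTAGTGGCTGAAGGAGACGACGGTGATGAAGCGGCTCGCATCGTCGAGCACTATCATCCTGATGTTGTGA",
    "TCATGGATATTAATATGCCGAATGTAAACGGAGTTGAAGCGACAAAACAGCTTGTCGATTTGTATCCTGAATCAAAGGTC",
    "ATTATTCTATCGATCCATGATGACGAAAACTATGTGACACATGCGTTGAAAACCGGAGCGCGCGGCTATCTGCTGAAAGA",
    "AATGGACGCAGATACATTGATTGAAGCCGTTAAGGTAGTGGCAGAAGGCGGTTCTTATCTTCACCCTAAAGTAACGCACA",
    "ATCTTGTCAATGAATTCCGCCGTCTGGCAACAAGCGGTGTGTCATCACACGCTCAGCACGAAGTATATCCGGAAATCCGC",
    "AGACCTTTGCACATTTTGACAAGACGGGAATGCGAAGTTTTACAGATGCTCGCGGACGGAAAAAGCAACCGCGGCATAGG",
    "AGAATCTTTATTTATCAGTGAAAAAACGGTTAAAAACCATGTCAGCAACATCCTGCAGAAAATGAATGTCAATGACAGAA",
    "CTCAAGCTGTAGTTGTGGCCATTAAAAATGGCTGGGTAGAAATGAGATAA",
))


def translate_cds(cds):
    """Translate a complete CDS (start codon through stop codon) to protein."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if CODON_TABLE[codons[-1]] != "*":
        raise ValueError("CDS does not end with a stop codon")
    residues = [CODON_TABLE[c] for c in codons[:-1]]
    if "*" in residues:
        raise ValueError("internal stop codon found")
    # An alternative start codon is still read as Met at position 1.
    if codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


protein_from_cds = translate_cds(CDS)
print(len(CDS), len(protein_from_cds), protein_from_cds[:10])
```

The result can be compared directly against the protein sequence listed above; the 690 bp CDS yields 229 residues followed by a TAA stop codon.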


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | Q5QSL6

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



[Figure: predicted transmembrane helix probability]


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
degU | Bacillus subtilis subsp. subtilis str. 168 | 98.69 | 100 | 0.987
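
The similarity figures can be turned into residue counts. The sketch below assumes that 100% coverage means the alignment spans all 229 residues, and it treats the Ha-value as identity multiplied by coverage (both as fractions). That formula is an assumption that happens to reproduce the 0.987 listed here; this record does not define the Ha-value, and both function names are illustrative.

```python
def identical_residues(identity_pct, aligned_len):
    """Number of identical positions implied by a percent identity."""
    return round(identity_pct / 100 * aligned_len)


def h_value(identity_pct, coverage_pct):
    """Assumed definition: fractional identity times fractional coverage."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


PROTEIN_LEN = 229  # length of DegU in this record

n_same = identical_residues(98.69, PROTEIN_LEN)
n_diff = PROTEIN_LEN - n_same
print(n_same, n_diff)        # 226 3
print(h_value(98.69, 100))   # 0.987
```

Under these assumptions the B. sonorensis and B. subtilis 168 DegU proteins differ at about 3 of 229 positions.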


Multiple sequence alignment