Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   GSY53_RS20750 Genome accession   NZ_CP047325
Coordinates   4043950..4046280 (-) Length   776 a.a.
NCBI ID   WP_032677118.1    Uniprot ID   -
Organism   Bacillus subtilis strain GOT9     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4038950..4051280
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GSY53_RS20715 (GSY53_20715) yqxA 4039289..4039627 (-) 339 WP_004399162.1 YqxA family protein -
  GSY53_RS20720 (GSY53_20720) spoIIP 4039644..4040849 (-) 1206 WP_003229993.1 spore autolysin SpoIIP -
  GSY53_RS20725 (GSY53_20725) gpr 4040912..4042018 (-) 1107 WP_003229991.1 GPR endopeptidase -
  GSY53_RS20730 (GSY53_20730) rpsT 4042222..4042488 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  GSY53_RS20735 (GSY53_20735) holA 4042503..4043546 (-) 1044 WP_003229987.1 DNA polymerase III subunit delta -
  GSY53_RS20740 (GSY53_20740) - 4043586..4043735 (-) 150 WP_003229985.1 hypothetical protein -
  GSY53_RS20745 (GSY53_20745) yqzM 4043776..4043910 (+) 135 WP_003229983.1 YqzM family protein -
  GSY53_RS20750 (GSY53_20750) comEC 4043950..4046280 (-) 2331 WP_032677118.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GSY53_RS20755 (GSY53_20755) comEB 4046284..4046853 (-) 570 WP_003229978.1 ComE operon protein 2 -
  GSY53_RS20760 (GSY53_20760) comEA 4046920..4047537 (-) 618 WP_004398514.1 competence protein ComEA Machinery gene
  GSY53_RS20765 (GSY53_20765) comER 4047621..4048442 (+) 822 WP_004398597.1 late competence protein ComER -
  GSY53_RS20770 (GSY53_20770) yqeM 4048508..4049251 (-) 744 WP_003229973.1 class I SAM-dependent methyltransferase -
  GSY53_RS20775 (GSY53_20775) rsfS 4049248..4049604 (-) 357 WP_003229971.1 ribosome silencing factor -
  GSY53_RS20780 (GSY53_20780) yqeK 4049622..4050182 (-) 561 WP_004399059.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  GSY53_RS20785 (GSY53_20785) nadD 4050172..4050741 (-) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  GSY53_RS20790 (GSY53_20790) yhbY 4050753..4051043 (-) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86600.06 Da        Isoelectric Point: 7.0494

>NTDB_id=412754 GSY53_RS20750 WP_032677118.1 4043950..4046280(-) (comEC) [Bacillus subtilis strain GOT9]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITNIADVDVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVLFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPG
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=412754 GSY53_RS20750 WP_032677118.1 4043950..4046280(-) (comEC) [Bacillus subtilis strain GOT9]
ATGCGTAATTCGCGCTTATTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATAGGCAGGGAACTTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTGAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATCCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCTGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCAGCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTGTATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGATGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGCTGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGGA
GCACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGACCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

98.711

100

0.987


Multiple sequence alignment