Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   S100333_RS13435 Genome accession   NZ_CP021892
Coordinates   2500713..2503043 (-) Length   776 a.a.
NCBI ID   WP_069837640.1    Uniprot ID   -
Organism   Bacillus subtilis subsp. subtilis strain SRCM100333     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2495713..2508043
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  S100333_RS13405 (S100333_02740) yqxA 2496052..2496390 (-) 339 WP_046160606.1 YqxA family protein -
  S100333_RS13410 (S100333_02741) spoIIP 2496407..2497612 (-) 1206 WP_014477383.1 spore autolysin SpoIIP -
  S100333_RS13415 (S100333_02742) gpr 2497675..2498781 (-) 1107 WP_014480309.1 GPR endopeptidase -
  S100333_RS13420 (S100333_02743) rpsT 2498985..2499251 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  S100333_RS13425 (S100333_02744) holA 2499266..2500309 (-) 1044 WP_029317888.1 DNA polymerase III subunit delta -
  S100333_RS22335 - 2500349..2500498 (-) 150 WP_003229985.1 hypothetical protein -
  S100333_RS13430 (S100333_02745) yqzM 2500539..2500673 (+) 135 WP_003229983.1 YqzM family protein -
  S100333_RS13435 (S100333_02746) comEC 2500713..2503043 (-) 2331 WP_069837640.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  S100333_RS13440 (S100333_02747) comEB 2503047..2503616 (-) 570 WP_003229978.1 ComE operon protein 2 -
  S100333_RS13445 (S100333_02748) comEA 2503683..2504300 (-) 618 WP_046160608.1 competence protein ComEA Machinery gene
  S100333_RS13450 (S100333_02749) comER 2504384..2505205 (+) 822 WP_032726195.1 late competence protein ComER -
  S100333_RS13455 (S100333_02750) yqeM 2505271..2506014 (-) 744 WP_069837641.1 class I SAM-dependent methyltransferase -
  S100333_RS13460 (S100333_02751) rsfS 2506011..2506367 (-) 357 WP_014480315.1 ribosome silencing factor -
  S100333_RS13465 (S100333_02752) yqeK 2506385..2506945 (-) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  S100333_RS13470 (S100333_02753) nadD 2506935..2507504 (-) 570 WP_041333756.1 nicotinate-nucleotide adenylyltransferase -
  S100333_RS13475 (S100333_02754) yhbY 2507516..2507806 (-) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86668.24 Da        Isoelectric Point: 7.2641

>NTDB_id=234543 S100333_RS13435 WP_069837640.1 2500713..2503043(-) (comEC) [Bacillus subtilis subsp. subtilis strain SRCM100333]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVKTPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSVSFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPG
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=234543 S100333_RS13435 WP_069837640.1 2500713..2503043(-) (comEC) [Bacillus subtilis subsp. subtilis strain SRCM100333]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATTCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTGTATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGTTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGGCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGGA
GCACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGACCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

98.325

100

0.983


Multiple sequence alignment