Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC
Type: Machinery gene
Locus tag: ES968_RS13005
Genome accession: NZ_CP035395
Coordinates: 2427310..2429640 (-)
Length: 776 a.a.
NCBI ID: WP_046160607.1
UniProt ID: -
Organism: Bacillus subtilis strain SRCM103697
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
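
The coordinates, the nucleotide length, and the protein length reported for this gene can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


cds_bp = gene_length_bp(2427310, 2429640)
print(cds_bp, protein_length_aa(cds_bp))  # 2331 776
```

Both values agree with the 2331 bp and 776 a.a. listed in this record.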

Genomic Context


Location: 2422310..2434640
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
ES968_RS12970 (ES968_12970) | yqxA | 2422648..2422986 (-) | 339 | WP_014477382.1 | YqxA family protein | -
ES968_RS12975 (ES968_12975) | spoIIP | 2423003..2424208 (-) | 1206 | WP_014477383.1 | spore autolysin SpoIIP | -
ES968_RS12980 (ES968_12980) | gpr | 2424271..2425377 (-) | 1107 | WP_014480309.1 | GPR endopeptidase | -
ES968_RS12985 (ES968_12985) | rpsT | 2425581..2425847 (+) | 267 | WP_003229989.1 | 30S ribosomal protein S20 | -
ES968_RS12990 (ES968_12990) | holA | 2425862..2426905 (-) | 1044 | WP_029317888.1 | DNA polymerase III subunit delta | -
ES968_RS12995 (ES968_12995) | - | 2426945..2427058 (-) | 114 | WP_122060479.1 | hypothetical protein | -
ES968_RS13000 (ES968_13000) | yqzM | 2427135..2427269 (+) | 135 | WP_003229983.1 | YqzM family protein | -
ES968_RS13005 (ES968_13005) | comEC | 2427310..2429640 (-) | 2331 | WP_046160607.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
ES968_RS13010 (ES968_13010) | comEB | 2429644..2430213 (-) | 570 | WP_003229978.1 | ComE operon protein 2 | -
ES968_RS13015 (ES968_13015) | comEA | 2430280..2430897 (-) | 618 | WP_046160608.1 | competence protein ComEA | Machinery gene
ES968_RS13020 (ES968_13020) | comER | 2430981..2431802 (+) | 822 | WP_032726195.1 | late competence protein ComER | -
ES968_RS13025 (ES968_13025) | yqeM | 2431868..2432611 (-) | 744 | WP_046160609.1 | class I SAM-dependent methyltransferase | -
ES968_RS13030 (ES968_13030) | rsfS | 2432608..2432964 (-) | 357 | WP_014480315.1 | ribosome silencing factor | -
ES968_RS13035 (ES968_13035) | yqeK | 2432982..2433542 (-) | 561 | WP_014480316.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | -
ES968_RS13040 (ES968_13040) | nadD | 2433532..2434101 (-) | 570 | WP_041333756.1 | nicotinate-nucleotide adenylyltransferase | -
ES968_RS13045 (ES968_13045) | yhbY | 2434113..2434403 (-) | 291 | WP_003226133.1 | ribosome assembly RNA-binding protein YhbY | -
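
The Location range above appears to be the comEC coordinates extended by 5000 bp on each side (2427310 - 5000 and 2429640 + 5000). That flank size is inferred from the numbers and is not stated in this record, so treat it as an assumption. The sketch below, with hypothetical helper names, reproduces the window and checks that the first and last listed genes fall inside it.

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates, not documented here


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a feature's 1-based coordinates by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def within(window: tuple[int, int], start: int, end: int) -> bool:
    """True if a feature lies entirely inside the window."""
    return window[0] <= start and end <= window[1]


window = context_window(2427310, 2429640)
print(window)                            # (2422310, 2434640)
print(within(window, 2422648, 2422986))  # yqxA -> True
print(within(window, 2434113, 2434403))  # yhbY -> True
```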

Sequence


Protein


Length: 776 a.a.
Molecular weight: 86846.48 Da
Isoelectric point: 7.2477

>NTDB_id=339871 ES968_RS13005 WP_046160607.1 2427310..2429640(-) (comEC) [Bacillus subtilis strain SRCM103697]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVKTPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLRQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAEILLKYHKVKRLVIPKGFVSEPKDEKVLQAAREEGVEIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMNVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN
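
The FASTA header above packs several fields into one line: database ID, locus tag, protein accession, coordinates with strand, gene name, and organism. The following sketch parses it with a regular expression. The pattern is written against this single header, so the field layout is an assumption and other records may differ. The names `HEADER_RE` and `parse_header` are illustrative.

```python
import re

# Pattern derived from the one header shown in this record (assumed layout).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=339871 ES968_RS13005 WP_046160607.1 "
          "2427310..2429640(-) (comEC) [Bacillus subtilis strain SRCM103697]")
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
# comEC - 2331
```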

Nucleotide


Length: 2331 bp

>NTDB_id=339871 ES968_RS13005 WP_046160607.1 2427310..2429640(-) (comEC) [Bacillus subtilis strain SRCM103697]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAATTACTCTGTCACGTCTATCCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTCTGCCTCCTGATTCGGCAGGAATTGTACAGGCACTTACAGTCGGTGACAGAT
TTTATGTGGAGGACGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCGATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATCCGTCTTGGTATAACAAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCTTTGAGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAGCAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGATTCTGCTGAAGTATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGGCAGCCAGGGAAG
AGGGAGTGGAAATTGAAGAGGTGAAGCGAGGCGATGTGTTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCGGAA
GCACCTGATCCGGCAAGCAAAAATAATTCATCTCTCGTTCTTTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAAGAGGTGATGAACGTGTTTCCGAATATAAAAGCAGATGTGTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGACCAAAA
CGGAACGATCCAATATAGATATAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA
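
Although the gene lies on the (-) strand, the nucleotide sequence above is given in coding orientation: it starts with ATG and ends with the stop codon TAA, so it can be translated directly without reverse-complementing. The sketch below translates the first and last few codons with the standard codon assignments and compares them with the ends of the protein sequence. The helper `translate` is illustrative, not a database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGCGTAATTCGCGTTTGTTATTGCCT"  # start of the nucleotide record
last_codons = "GAGACGAACTAA"                  # end of the nucleotide record
print(translate(first_codons))  # MRNSRLLLP
print(translate(last_codons))   # ETN*
```

The results match the start (MRNSRLLLP) and the end (ETN) of the protein sequence listed in this record.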


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEC | Bacillus subtilis subsp. subtilis str. 168 | 98.711 | 100 | 0.987


Multiple sequence alignment