Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comEC
Type   Machinery gene
Locus tag   I33_RS12340
Genome accession   NC_017195
Coordinates   2485171..2487501 (-)
Length   776 a.a.
NCBI ID   WP_041519416.1
UniProt ID   -
Organism   Bacillus subtilis subsp. subtilis str. RO-NN-1
Function   ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
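
The coordinates and the protein length in the overview can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the helper names are hypothetical and are not part of the database.

```python
def parse_location(location):
    """Split a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = location.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


def gene_length_bp(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


start, end, strand = parse_location("2485171..2487501 (-)")
length_bp = gene_length_bp(start, end)

# Assumption: one terminal stop codon that is not counted in the protein length.
codons = length_bp // 3
protein_length = codons - 1

print(length_bp, protein_length, strand)
```

Under these assumptions the span is 2331 bp, which corresponds to 777 codons, or 776 residues plus the stop codon. That agrees with the lengths listed for this entry.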

Genomic Context


Location: 2480171..2492501
Locus tag               Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                                    Description
I33_RS12310 (I33_2636)  yqxA       2480510..2480848 (-)  339        WP_014477382.1  YqxA family protein                                        -
I33_RS12315 (I33_2637)  spoIIP     2480865..2482070 (-)  1206       WP_014477383.1  spore autolysin SpoIIP                                     -
I33_RS12320 (I33_2638)  gpr        2482133..2483239 (-)  1107       WP_014477384.1  GPR endopeptidase                                          -
I33_RS12325 (I33_2639)  rpsT       2483443..2483709 (+)  267        WP_003229989.1  30S ribosomal protein S20                                  -
I33_RS12330 (I33_2640)  holA       2483724..2484767 (-)  1044       WP_014477385.1  DNA polymerase III subunit delta                           -
I33_RS20990 (I33_2641)  -          2484807..2484956 (-)  150        WP_003229985.1  hypothetical protein                                       -
I33_RS20480             yqzM       2484997..2485131 (+)  135        WP_003229983.1  YqzM family protein                                        -
I33_RS12340 (I33_2643)  comEC      2485171..2487501 (-)  2331       WP_041519416.1  DNA internalization-related competence protein ComEC/Rec2  Machinery gene
I33_RS12345 (I33_2645)  comEB      2487505..2488074 (-)  570        WP_014477387.1  ComE operon protein 2                                      -
I33_RS12350 (I33_2646)  comEA      2488141..2488758 (-)  618        WP_014477388.1  competence protein ComEA                                   Machinery gene
I33_RS12355 (I33_2647)  comER      2488842..2489663 (+)  822        WP_014477389.1  late competence protein ComER                              -
I33_RS12360 (I33_2648)  yqeM       2489729..2490472 (-)  744        WP_014477390.1  class I SAM-dependent methyltransferase                    -
I33_RS12365 (I33_2649)  rsfS       2490469..2490825 (-)  357        WP_014477391.1  ribosome silencing factor                                  -
I33_RS12370 (I33_2650)  yqeK       2490843..2491403 (-)  561        WP_014477392.1  bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK      -
I33_RS12375 (I33_2651)  nadD       2491393..2491962 (-)  570        WP_014477393.1  nicotinate-nucleotide adenylyltransferase                  -
I33_RS12380 (I33_2652)  yhbY       2491974..2492264 (-)  291        WP_014477394.1  ribosome assembly RNA-binding protein YhbY                 -
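
The Location range shown for the genomic context is consistent with extending the comEC coordinates by 5000 bp on each side. The sketch below reproduces that window. The 5000 bp flank is inferred from the numbers on this page, not documented here, and the function names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates shown, not documented


def context_window(gene_start, gene_end, flank=FLANK_BP):
    """Return the (start, end) window around a gene, clamped at position 1."""
    return max(1, gene_start - flank), gene_end + flank


def lies_within(window, start, end):
    """True if a feature is fully contained in the window."""
    return window[0] <= start and end <= window[1]


window = context_window(2485171, 2487501)

# A few neighbours copied from the table above: name -> (start, end).
neighbours = {
    "yqxA": (2480510, 2480848),
    "comEA": (2488141, 2488758),
    "yhbY": (2491974, 2492264),
}
listed = [name for name, (s, e) in neighbours.items() if lies_within(window, s, e)]

print(window, listed)
```

With that flank the window is 2480171..2492501, matching the Location line, and the first and last genes in the table (yqxA and yhbY) fall entirely inside it.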

Sequence


Protein


Length: 776 a.a.        Molecular weight: 86706.39 Da        Isoelectric point: 7.3798

>NTDB_id=45423 I33_RS12340 WP_041519416.1 2485171..2487501(-) (comEC) [Bacillus subtilis subsp. subtilis str. RO-NN-1]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIMLIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVKTPDREKWAAAYRIQSADEKEQLLYIEPGMSCELTGTLEEPKHATVPGAFGYNEYLYRQHI
HWNFSVKSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLAVVHLLAISGLHV
GILTAALFYIMIHLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLFFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTLCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITKIADVDVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVLVLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPQQRGHVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMNVFPNMKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Length: 2331 bp

>NTDB_id=45423 I33_RS12340 WP_041519416.1 2485171..2487501(-) (comEC) [Bacillus subtilis subsp. subtilis str. RO-NN-1]
ATGCGTAATTCGCGCTTATTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATGTTAATCAAAACGAGGCACGCTTTTCTCATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATCCCAAAAATTGACGGCGACCGTATGTCAATGATGGTTAAGACACCTGATAGGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGACGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACCTTGGAAGAACCGAAACACGCAACTGTGCCGGGTGCATTTGGTTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTTCTCTGTCAAGTCTATCCAAAACTGCAGCGAACCTGAAAATTTCAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTCTGCCTCCTGATTCGGCAGGAATTGTACAGGCACTTACAGTCGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGCCGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGCTTTGTTTTATATCATGATTCACCTTGGTATCACAAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGCCTTTCATACATCGTCCTTCTGTTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCAGCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGTTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTATGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGTTAATCACAAAAATTGCAGATGTTGATGTGTTCACGATTATGATCGCACATCCTGCACCTGTTCTGCTCT
TTTTATTCACAGTCACAATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGCGGC
ATTTGCTGCACGGTGCTGGTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTCGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCAGCAGCGGGGTCATGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCCTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGAA
GCACCTGATCCGGCAAGCAAAAATAATTCATCTCTTGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAGGAGGTGATGAACGTGTTTCCAAATATGAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATTCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA
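
Although comEC is annotated on the minus strand, the nucleotide record above starts with ATG and ends with TAA, so it appears to be given in coding orientation already. A minimal sketch of translating it with the standard codon assignments follows. The `translate` helper and the codon-table construction are illustrative, not the database's own pipeline, and only two short fragments copied from the record are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 21 nt of the nucleotide record above (both in frame).
head = "ATGCGTAATTCGCGCTTATTATTGCCTATG"
tail = "GATATAACAGAGACGAACTAA"

print(translate(head))  # MRNSRLLLPM
print(translate(tail))  # DITETN
```

Both fragments agree with the start (MRNSRLLLPM) and end (DITETN) of the protein sequence listed above. Passing the full 2331 bp sequence to the same function should yield the complete 776-residue protein.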


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                    Identities (%)  Coverage (%)  Ha-value
comEC    Bacillus subtilis subsp. subtilis str. 168  96.649          100           0.966
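
This page does not define the Ha-value. One reading that is consistent with the numbers shown is the identity fraction multiplied by the coverage fraction. The sketch below illustrates that reading only; the formula is an assumption and `ha_value` is a hypothetical helper.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed definition: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the comEC hit against B. subtilis subsp. subtilis str. 168.
score = round(ha_value(96.649, 100), 3)
print(score)  # 0.966
```

Under this assumed definition, a hit with full coverage has an Ha-value equal to its identity fraction, which is what the row above shows.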


Multiple sequence alignment