Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC
Type: Machinery gene
Locus tag: BSF20_RS14160
Genome accession: NZ_CP018152
Coordinates: 2716185..2718536 (+)
Length: 783 a.a.
NCBI ID: WP_072588441.1
UniProt ID: -
Organism: Bacillus amyloliquefaciens strain LM2303
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
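
The coordinates and the protein length listed above are consistent with each other. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon (the helper name `cds_lengths` is hypothetical, not part of the database):

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and one stop codon at the end,
    which is not counted as an amino acid.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    aa_len = nt_len // 3 - 1  # drop the stop codon
    return nt_len, aa_len


nt_len, aa_len = cds_lengths(2716185, 2718536)
print(nt_len, aa_len)  # 2352 783
```

The result matches the 2352 bp nucleotide length and 783 a.a. protein length reported for this entry.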

Genomic Context


Location: 2711185..2723536
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                                    Description
BSF20_RS14120 (BSF20_13830)  yhbY       2711457..2711747 (+)  291        WP_003152858.1  ribosome assembly RNA-binding protein YhbY                 -
BSF20_RS14125 (BSF20_13835)  -          2711758..2712327 (+)  570        WP_003152860.1  nicotinate-nucleotide adenylyltransferase                  -
BSF20_RS14130 (BSF20_13840)  yqeK       2712317..2712898 (+)  582        WP_003152863.1  bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK      -
BSF20_RS14135 (BSF20_13845)  rsfS       2712895..2713251 (+)  357        WP_003152864.1  ribosome silencing factor                                  -
BSF20_RS14140 (BSF20_13850)  -          2713248..2713985 (+)  738        WP_014305450.1  class I SAM-dependent DNA methyltransferase                -
BSF20_RS14145 (BSF20_13855)  comER      2714054..2714875 (-)  822        WP_044053476.1  late competence protein ComER                              -
BSF20_RS14150 (BSF20_13860)  comEA      2714934..2715548 (+)  615        WP_043867297.1  helix-hairpin-helix domain-containing protein              Machinery gene
BSF20_RS14155 (BSF20_13865)  -          2715615..2716184 (+)  570        WP_003152868.1  ComE operon protein 2                                      -
BSF20_RS14160 (BSF20_13870)  comEC      2716185..2718536 (+)  2352       WP_072588441.1  DNA internalization-related competence protein ComEC/Rec2  Machinery gene
BSF20_RS14165 (BSF20_13875)  -          2718555..2718689 (-)  135        WP_003152870.1  YqzM family protein                                        -
BSF20_RS20150                -          2718730..2718884 (+)  155        Protein_2720    hypothetical protein                                       -
BSF20_RS14170 (BSF20_13880)  holA       2718924..2719965 (+)  1042       Protein_2721    DNA polymerase III subunit delta                           -
BSF20_RS14175 (BSF20_13885)  rpsT       2719982..2720248 (-)  267        WP_003152876.1  30S ribosomal protein S20                                  -
BSF20_RS14180 (BSF20_13890)  gpr        2720451..2721557 (+)  1107       WP_003152878.1  GPR endopeptidase                                          -
BSF20_RS14185 (BSF20_13895)  spoIIP     2721625..2722818 (+)  1194       WP_024085623.1  stage II sporulation protein P                             -
BSF20_RS14190 (BSF20_13900)  -          2722835..2723173 (+)  339        WP_003152882.1  YqxA family protein                                        -
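
The Location range for this section appears to be the comEC coordinates extended by 5,000 bp on each side. That is an inference from the numbers on this page, not something the record states. A short sketch under that assumption, with hypothetical helper names and only three of the listed genes included for brevity:

```python
FLANK = 5000  # assumed flank size, inferred from this record's numbers


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a gene's coordinates by `flank` bp on both sides (1-based)."""
    return max(1, start - flank), end + flank


def genes_in_window(genes, window):
    """Return names of genes that lie entirely inside the window."""
    lo, hi = window
    return [name for name, start, end in genes if start >= lo and end <= hi]


# (label, start, end) taken from the first, target, and last table rows
genes = [
    ("yhbY", 2711457, 2711747),
    ("comEC", 2716185, 2718536),
    ("BSF20_RS14190", 2722835, 2723173),
]

window = context_window(2716185, 2718536)
print(window)  # (2711185, 2723536)
print(genes_in_window(genes, window))
```

Filtering on full containment is one possible design choice; a gene that only partially overlaps the window would be dropped by this sketch.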

Sequence


Protein


Length: 783 a.a.        Molecular weight: 86610.59 Da        Isoelectric point: 9.0078

>NTDB_id=206543 BSF20_RS14160 WP_072588441.1 2716185..2718536(+) (comEC) [Bacillus amyloliquefaciens strain LM2303]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIMIKTKQPAPVVVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATIPGGFDYKEYLYSQQI
HWLFTVTSIQQCEKSKQPLFKLLSIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGMLLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGNLVKRGIHSSAALSLSYLLLLLFNP
YLLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLASFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFNMFDLVMKPVHDFITYAASVDLFTMIVLKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTGEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFEKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Length: 2352 bp

>NTDB_id=206543 BSF20_RS14160 WP_072588441.1 2716185..2718536(+) (comEC) [Bacillus amyloliquefaciens strain LM2303]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATGATAAAAACAAAGCAGCCTGCTCCGGTTGTTGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGATTCCCGGGGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTACCGTGACTTCTATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAGCATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAAGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGACTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGGATGTTGCTGCTGCT
GTTTTTGCCGGTCTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAAATCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCTCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACCTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCCTCATTCATCGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAAATTTCACTTGCCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTTTCAAGGCAGATGGGAGAATGTTTGTTTAATATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTTAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTAGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGCCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCGCTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAGGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAGGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAAGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGGGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCACGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTGAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
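
Both FASTA records above share the same header layout: an NTDB identifier, locus tag, protein ID, coordinates with strand, gene name in parentheses, and organism in square brackets. A minimal parsing sketch, assuming that layout holds for other records too (the pattern and the function name `parse_header` are illustrative, not an official format specification):

```python
import re

# Pattern inferred from the two headers shown in this record.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]\s*$"
)


def parse_header(line: str) -> dict:
    """Split a FASTA header of the layout above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=206543 BSF20_RS14160 WP_072588441.1 "
    "2716185..2718536(+) (comEC) "
    "[Bacillus amyloliquefaciens strain LM2303]"
)
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```

Headers that deviate from this layout raise a `ValueError` rather than being parsed partially.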


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein  Organism                                    Identities (%)  Coverage (%)  Ha-value
comEC    Bacillus subtilis subsp. subtilis str. 168  56.606          98.595        0.558

