Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   CEG11_RS12355 Genome accession   NZ_CP021976
Coordinates   2517609..2519960 (-) Length   783 a.a.
NCBI ID   WP_003152869.1    Uniprot ID   -
Organism   Bacillus velezensis strain T20E-257     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2512609..2524960
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CEG11_RS12325 (CEG11_12325) - 2512972..2513310 (-) 339 WP_003152882.1 YqxA family protein -
  CEG11_RS12330 (CEG11_12330) spoIIP 2513327..2514520 (-) 1194 WP_003152880.1 stage II sporulation protein P -
  CEG11_RS12335 (CEG11_12335) gpr 2514588..2515694 (-) 1107 WP_003152878.1 GPR endopeptidase -
  CEG11_RS12340 (CEG11_12340) rpsT 2515897..2516163 (+) 267 WP_003152876.1 30S ribosomal protein S20 -
  CEG11_RS12345 (CEG11_12345) holA 2516180..2517221 (-) 1042 Protein_2369 DNA polymerase III subunit delta -
  CEG11_RS19605 - 2517261..2517415 (-) 155 Protein_2370 hypothetical protein -
  CEG11_RS12350 (CEG11_12350) - 2517456..2517590 (+) 135 WP_003152870.1 YqzM family protein -
  CEG11_RS12355 (CEG11_12355) comEC 2517609..2519960 (-) 2352 WP_003152869.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  CEG11_RS12360 (CEG11_12360) - 2519961..2520530 (-) 570 WP_003152868.1 ComE operon protein 2 -
  CEG11_RS12365 (CEG11_12365) comEA 2520597..2521211 (-) 615 WP_003152867.1 helix-hairpin-helix domain-containing protein Machinery gene
  CEG11_RS12370 (CEG11_12370) comER 2521270..2522091 (+) 822 WP_003152866.1 late competence protein ComER -
  CEG11_RS12375 (CEG11_12375) - 2522160..2522897 (-) 738 WP_003152865.1 class I SAM-dependent DNA methyltransferase -
  CEG11_RS12380 (CEG11_12380) rsfS 2522894..2523250 (-) 357 WP_003152864.1 ribosome silencing factor -
  CEG11_RS12385 (CEG11_12385) yqeK 2523247..2523828 (-) 582 WP_003152863.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  CEG11_RS12390 (CEG11_12390) - 2523818..2524387 (-) 570 WP_003152860.1 nicotinate-nucleotide adenylyltransferase -
  CEG11_RS12395 (CEG11_12395) yhbY 2524398..2524688 (-) 291 WP_003152858.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 86685.66 Da        Isoelectric Point: 9.0078

>NTDB_id=235509 CEG11_RS12355 WP_003152869.1 2517609..2519960(-) (comEC) [Bacillus velezensis strain T20E-257]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIMIKTKQPAPVVVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRRIEQLEPGMRCTFTGSLEQPAHATIPGGFDYKEYLYSQQI
HWLFTVTSIQQCEKSKQPLFKLLSIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVIHLMAISGMHV
GLITAGLFYALIRIGLTREKAGMLLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YLLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLASFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFDMFDLVMKPVHDFITYAASVDLFTMIVLKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFEKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=235509 CEG11_RS12355 WP_003152869.1 2517609..2519960(-) (comEC) [Bacillus velezensis strain T20E-257]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATGATAAAAACAAAGCAGCCTGCTCCGGTTGTTGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAAGAAAAGAGACGCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGATTCCCGGGGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTACCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAGCATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAAGACGATATACTGAGTGCATATCAAAATTTGGGAGTCATTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGACTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGGATGTTGCTGCTGCT
GTTTTTGCCGGTCTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCTCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACCTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCCTCATTCATCGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAAATTTCACTTGCCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTTTCAAGGCAGATGGGAGAATGTTTGTTTGATATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTTAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTAGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGCCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCGCTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAGGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAGGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAAGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCACGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTGAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

56.347

98.595

0.556


Multiple sequence alignment