Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC
Type: Machinery gene
Locus tag: CHN56_RS18645
Genome accession: NZ_CP022654
Coordinates: 3655369..3657720 (+)
Length: 783 a.a.
NCBI ID: WP_094247717.1
UniProt ID: -
Organism: Bacillus velezensis strain SCDB 291
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
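
The coordinates and the protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function name `feature_lengths` is hypothetical and not part of the database.

```python
def feature_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a 1-based inclusive CDS."""
    nt_len = end - start + 1      # both endpoints are counted
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    aa_len = nt_len // 3 - 1      # assumption: the last codon is a stop codon
    return nt_len, aa_len


# comEC coordinates taken from the overview
print(feature_lengths(3655369, 3657720))  # (2352, 783)
```

The result agrees with the 2352 bp and 783 a.a. reported for this entry.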

Genomic Context


Location: 3650369..3662720
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
CHN56_RS18605 (CHN56_03684) | yhbY | 3650641..3650931 (+) | 291 | WP_003152858.1 | ribosome assembly RNA-binding protein YhbY | -
CHN56_RS18610 (CHN56_03685) | - | 3650942..3651511 (+) | 570 | WP_003152860.1 | nicotinate-nucleotide adenylyltransferase | -
CHN56_RS18615 (CHN56_03686) | yqeK | 3651501..3652061 (+) | 561 | WP_032856887.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | -
CHN56_RS18620 (CHN56_03687) | rsfS | 3652079..3652435 (+) | 357 | WP_003152864.1 | ribosome silencing factor | -
CHN56_RS18625 (CHN56_03688) | - | 3652432..3653169 (+) | 738 | WP_031378935.1 | class I SAM-dependent methyltransferase | -
CHN56_RS18630 (CHN56_03689) | comER | 3653238..3654059 (-) | 822 | WP_014305449.1 | late competence protein ComER | -
CHN56_RS18635 (CHN56_03690) | comEA | 3654118..3654732 (+) | 615 | WP_013352942.1 | helix-hairpin-helix domain-containing protein | Machinery gene
CHN56_RS18640 (CHN56_03691) | - | 3654799..3655368 (+) | 570 | WP_003152868.1 | ComE operon protein 2 | -
CHN56_RS18645 (CHN56_03692) | comEC | 3655369..3657720 (+) | 2352 | WP_094247717.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
CHN56_RS18650 | - | 3657739..3657873 (-) | 135 | WP_003152870.1 | YqzM family protein | -
CHN56_RS21420 | - | 3657914..3658068 (+) | 155 | Protein_3585 | hypothetical protein | -
CHN56_RS18655 (CHN56_03693) | holA | 3658108..3659149 (+) | 1042 | Protein_3586 | DNA polymerase III subunit delta | -
CHN56_RS18660 (CHN56_03694) | rpsT | 3659166..3659432 (-) | 267 | WP_003152876.1 | 30S ribosomal protein S20 | -
CHN56_RS18665 (CHN56_03695) | gpr | 3659635..3660741 (+) | 1107 | WP_003152878.1 | GPR endopeptidase | -
CHN56_RS18670 (CHN56_03696) | - | 3660809..3662002 (+) | 1194 | WP_013352939.1 | stage II sporulation protein P | -
CHN56_RS18675 (CHN56_03697) | - | 3662019..3662357 (+) | 339 | WP_007408270.1 | YqxA family protein | -
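
The listed location, 3650369..3662720, is exactly the comEC coordinates extended by 5000 bp on each side. The sketch below reproduces that window and checks which neighbouring genes fall inside it. The 5000 bp flank is an inference from the numbers on this page, not a documented setting, and the helper names `context_window` and `overlaps` are hypothetical.

```python
FLANK = 5000  # assumption: inferred from the coordinates shown on this page


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a 1-based inclusive interval by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def overlaps(a_start: int, a_end: int, b_start: int, b_end: int) -> bool:
    """True if two inclusive intervals share at least one position."""
    return a_start <= b_end and b_start <= a_end


window = context_window(3655369, 3657720)

# A few neighbours copied from the table above (gene name -> coordinates)
neighbours = {
    "yhbY": (3650641, 3650931),
    "comEA": (3654118, 3654732),
    "gpr": (3659635, 3660741),
}
inside = [name for name, (s, e) in neighbours.items() if overlaps(s, e, *window)]

print(window)  # (3650369, 3662720)
print(inside)  # ['yhbY', 'comEA', 'gpr']
```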

Sequence


Protein


Length: 783 a.a.
Molecular weight: 86715.67 Da
Isoelectric point: 9.0078

>NTDB_id=241341 CHN56_RS18645 WP_094247717.1 3655369..3657720(+) (comEC) [Bacillus velezensis strain SCDB 291]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIMIKTKQPAPVVVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAALYKIRSLEEKRRIEQLEPGMRCTFTGSLEQPAHATIPGGFDYKEYLYSQQI
HWLFTVTSIQQCEKSKQPLFKLLSIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGMLLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLFKRGIHSSAALSLSYLLLLLFNP
YLLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQVGECLFDMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFEKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Length: 2352 bp

>NTDB_id=241341 CHN56_RS18645 WP_094247717.1 3655369..3657720(+) (comEC) [Bacillus velezensis strain SCDB 291]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATGATAAAAACAAAGCAGCCTGCTCCGGTTGTTGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTTGTACAAAATCCGGTCTCTTGAAGAAAAGAGACGCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGATTCCCGGGGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTACCGTGACTTCTATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAGCATCAGAAA
AAATTTGATTTCAATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCAGGACTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGGATGTTGCTGCTGCT
GTTTTTACCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTTTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCTCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACCTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCTTCATTCATTGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAAATTTCACTTGTCAGTTTTCCGATGAACATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTGCTTTCAAGGCAGGTGGGAGAATGTTTGTTTGATATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCTC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCGCTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAGGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAGGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCGGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCATCTGACGGAAAGAGTAAAAATGATTCATCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTGAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
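
The nucleotide record above translates to the protein record in the first reading frame. The sketch below is a minimal plain-Python translation using the standard codon assignments; it is an illustration, not the pipeline the database used. The names `CODON_TABLE` and `translate` are hypothetical, and unknown codons are written as `X` by assumption.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the comEC nucleotide record
print(translate("ATGAAATATAAATACCTTCTTCTGCCTCTG"))  # MKYKYLLLPL
print(translate("ACTGCCGATTGA"))                    # TAD
```

Both fragments match the start (MKYKYLLLPL) and the end (TAD) of the protein record.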


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEC | Bacillus subtilis subsp. subtilis str. 168 | 56.347 | 98.595 | 0.556
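
This page does not define the Ha-value. The listed figure (0.556) is numerically consistent with the product of identity and coverage expressed as fractions, so the sketch below reproduces it under that assumption. The function name `ha_value` is hypothetical, and the formula should be checked against the database documentation before it is relied on.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


# Values for the B. subtilis 168 comEC hit listed above
print(round(ha_value(56.347, 98.595), 3))  # 0.556
```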


Multiple sequence alignment