Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   NUW85_RS18675 Genome accession   NZ_CP102603
Coordinates   3786926..3789277 (+) Length   783 a.a.
NCBI ID   WP_207119962.1    Uniprot ID   -
Organism   Bacillus amyloliquefaciens strain MBLB0692     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3781926..3794277
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NUW85_RS18635 (NUW85_18640) yhbY 3782198..3782488 (+) 291 WP_139885324.1 ribosome assembly RNA-binding protein YhbY -
  NUW85_RS18640 (NUW85_18645) - 3782499..3783068 (+) 570 WP_003152860.1 nicotinate-nucleotide adenylyltransferase -
  NUW85_RS18645 (NUW85_18650) yqeK 3783058..3783639 (+) 582 WP_014305452.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  NUW85_RS18650 (NUW85_18655) rsfS 3783636..3783992 (+) 357 WP_003152864.1 ribosome silencing factor -
  NUW85_RS18655 (NUW85_18660) - 3783989..3784726 (+) 738 WP_014305450.1 class I SAM-dependent methyltransferase -
  NUW85_RS18660 (NUW85_18665) comER 3784795..3785616 (-) 822 WP_003152866.1 late competence protein ComER -
  NUW85_RS18665 (NUW85_18670) comEA 3785675..3786289 (+) 615 WP_095318411.1 helix-hairpin-helix domain-containing protein Machinery gene
  NUW85_RS18670 (NUW85_18675) - 3786356..3786925 (+) 570 WP_003152868.1 ComE operon protein 2 -
  NUW85_RS18675 (NUW85_18680) comEC 3786926..3789277 (+) 2352 WP_207119962.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  NUW85_RS18680 (NUW85_18685) - 3789296..3789430 (-) 135 WP_003152870.1 YqzM family protein -
  NUW85_RS18685 (NUW85_18690) - 3789471..3789625 (+) 155 Protein_3622 hypothetical protein -
  NUW85_RS18690 (NUW85_18695) holA 3789665..3790706 (+) 1042 Protein_3623 DNA polymerase III subunit delta -
  NUW85_RS18695 (NUW85_18700) rpsT 3790723..3790989 (-) 267 WP_003152876.1 30S ribosomal protein S20 -
  NUW85_RS18700 (NUW85_18705) gpr 3791192..3792298 (+) 1107 WP_139885323.1 GPR endopeptidase -
  NUW85_RS18705 (NUW85_18710) - 3792366..3793559 (+) 1194 WP_003152880.1 stage II sporulation protein P -
  NUW85_RS18710 (NUW85_18715) - 3793576..3793914 (+) 339 WP_003152882.1 YqxA family protein -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 86636.63 Da        Isoelectric Point: 9.1033

>NTDB_id=718078 NUW85_RS18675 WP_207119962.1 3786926..3789277(+) (comEC) [Bacillus amyloliquefaciens strain MBLB0692]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIMIKTKQPAPVVVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRRIEQLEPGMRCTFTGSLEQPAHATIPGGFDYKEYLYSQQI
HWLFTVTSIQQCEKSKQPLFKLLSIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVIHLMAISGMHV
GLITAGLFYALIRIGLTREKAGMLLLLFLPVYTLLSGAAPSVLRASLMLGVYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YLLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLASFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFNMFDLVMKPVHDFITYAASVDLFTMIVLKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFEKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=718078 NUW85_RS18675 WP_207119962.1 3786926..3789277(+) (comEC) [Bacillus amyloliquefaciens strain MBLB0692]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATGATAAAAACAAAGCAGCCTGCTCCGGTTGTTGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAAGAAAAGAGACGCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGATTCCCGGGGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTACCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAGCATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAAGACGATATACTGAGTGCATATCAAAATTTGGGAGTCATTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGACTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGGATGTTGCTGCTGCT
GTTTTTGCCGGTCTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGAGTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCTCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACCTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCCTCATTCATCGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAAATTTCACTTGCCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTTTCAAGGCAGATGGGAGAATGTTTGTTTAATATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTTAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTAGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGCCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCGCTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAGGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAGGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAAGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCACGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTGAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

56.477

98.595

0.557