Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   BAGQ_RS12235 Genome accession   NZ_CP021495
Coordinates   2515732..2518083 (-) Length   783 a.a.
NCBI ID   WP_032873958.1    Uniprot ID   -
Organism   Bacillus velezensis strain GQJK49     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2510732..2523083
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BAGQ_RS12205 (BAGQ_2583) - 2511098..2511436 (-) 339 WP_007408270.1 YqxA family protein -
  BAGQ_RS12210 (BAGQ_2584) spoIIP 2511453..2512646 (-) 1194 WP_032873961.1 stage II sporulation protein P -
  BAGQ_RS12215 (BAGQ_2585) gpr 2512714..2513820 (-) 1107 WP_007408268.1 GPR endopeptidase -
  BAGQ_RS12220 (BAGQ_2586) rpsT 2514023..2514289 (+) 267 WP_003152876.1 30S ribosomal protein S20 -
  BAGQ_RS12225 (BAGQ_2587) holA 2514306..2515347 (-) 1042 Protein_2339 DNA polymerase III subunit delta -
  BAGQ_RS19675 - 2515387..2515538 (-) 152 Protein_2340 hypothetical protein -
  BAGQ_RS12230 (BAGQ_2588) - 2515579..2515713 (+) 135 WP_003152870.1 YqzM family protein -
  BAGQ_RS12235 (BAGQ_2589) comEC 2515732..2518083 (-) 2352 WP_032873958.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  BAGQ_RS12240 (BAGQ_2590) - 2518084..2518653 (-) 570 WP_003152868.1 ComE operon protein 2 -
  BAGQ_RS12245 (BAGQ_2591) comEA 2518720..2519334 (-) 615 WP_032873955.1 helix-hairpin-helix domain-containing protein Machinery gene
  BAGQ_RS12250 (BAGQ_2592) comER 2519393..2520214 (+) 822 WP_012118027.1 late competence protein ComER -
  BAGQ_RS12255 (BAGQ_2593) - 2520283..2521020 (-) 738 WP_032873951.1 class I SAM-dependent DNA methyltransferase -
  BAGQ_RS12260 (BAGQ_2594) rsfS 2521017..2521373 (-) 357 WP_007408260.1 ribosome silencing factor -
  BAGQ_RS12265 (BAGQ_2595) yqeK 2521377..2521952 (-) 576 WP_032873949.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  BAGQ_RS12270 (BAGQ_2596) - 2521942..2522511 (-) 570 WP_007408258.1 nicotinate-nucleotide adenylyltransferase -
  BAGQ_RS12275 (BAGQ_2597) yhbY 2522522..2522812 (-) 291 WP_003152858.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 86681.60 Da        Isoelectric Point: 9.1106

>NTDB_id=231064 BAGQ_RS12235 WP_032873958.1 2515732..2518083(-) (comEC) [Bacillus velezensis strain GQJK49]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIIVKTKHHAPVIVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLHSQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGEKFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLMFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFLMNMVLVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFDMFDLVMKPVHDFITYAASVDLFTMIVSKPDFVSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVISYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGRSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKVYSVNVLRTDVSGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=231064 BAGQ_RS12235 WP_032873958.1 2515732..2518083(-) (comEC) [Bacillus velezensis strain GQJK49]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATTGTAAAAACAAAGCACCATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGTTCTCTTGAGGAAAAAAGACTTATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTCACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAAAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCGGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGATGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCGGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCTTCATTCATTGCGGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCTGATGAATATGGTGTTGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTTATCGGTTTCCTTCTTCTTTTACTTTCAAGGCAGATGGGAGAATGTTTGTTTGATATGTTCGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGATTTTGTTTCCC
TTCTTCTGCTTGCGGTTTCAGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTTCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAAGATCACATCGGAGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCTACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGTGTACTCTGTCAATGTGCTTCGCACCGATGT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

56.606

98.595

0.558


Multiple sequence alignment