Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   AAVB98_RS16825 Genome accession   NZ_CP154895
Coordinates   3205325..3207676 (+) Length   783 a.a.
NCBI ID   WP_041481655.1    Uniprot ID   -
Organism   Bacillus amyloliquefaciens strain Fad 77     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3200325..3212676
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AAVB98_RS16785 (AAVB98_16785) yhbY 3200597..3200887 (+) 291 WP_061573777.1 ribosome assembly RNA-binding protein YhbY -
  AAVB98_RS16790 (AAVB98_16790) - 3200898..3201467 (+) 570 WP_013352946.1 nicotinate-nucleotide adenylyltransferase -
  AAVB98_RS16795 (AAVB98_16795) yqeK 3201457..3202017 (+) 561 WP_013352945.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  AAVB98_RS16800 (AAVB98_16800) rsfS 3202035..3202391 (+) 357 WP_003152864.1 ribosome silencing factor -
  AAVB98_RS16805 (AAVB98_16805) - 3202388..3203125 (+) 738 WP_013352944.1 class I SAM-dependent methyltransferase -
  AAVB98_RS16810 (AAVB98_16810) comER 3203194..3204015 (-) 822 WP_013352943.1 late competence protein ComER -
  AAVB98_RS16815 (AAVB98_16815) comEA 3204074..3204688 (+) 615 WP_013352942.1 helix-hairpin-helix domain-containing protein Machinery gene
  AAVB98_RS16820 (AAVB98_16820) - 3204755..3205324 (+) 570 WP_003152868.1 ComE operon protein 2 -
  AAVB98_RS16825 (AAVB98_16825) comEC 3205325..3207676 (+) 2352 WP_041481655.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  AAVB98_RS16830 (AAVB98_16830) - 3207695..3207829 (-) 135 WP_003152870.1 YqzM family protein -
  AAVB98_RS16835 (AAVB98_16835) - 3207870..3208024 (+) 155 Protein_3252 hypothetical protein -
  AAVB98_RS16840 (AAVB98_16840) holA 3208064..3209105 (+) 1042 Protein_3253 DNA polymerase III subunit delta -
  AAVB98_RS16845 (AAVB98_16845) rpsT 3209122..3209388 (-) 267 WP_003152876.1 30S ribosomal protein S20 -
  AAVB98_RS16850 (AAVB98_16850) gpr 3209591..3210697 (+) 1107 WP_003152878.1 GPR endopeptidase -
  AAVB98_RS16855 (AAVB98_16855) - 3210765..3211958 (+) 1194 WP_013352939.1 stage II sporulation protein P -
  AAVB98_RS16860 (AAVB98_16860) - 3211975..3212313 (+) 339 WP_007408270.1 YqxA family protein -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 86760.68 Da        Isoelectric Point: 8.9023

>NTDB_id=994802 AAVB98_RS16825 WP_041481655.1 3205325..3207676(+) (comEC) [Bacillus amyloliquefaciens strain Fad 77]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIMIKTKQPAPVVVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATIPGGFDYKEYLYSQQI
HWLFSVTSIEQCEKSKQPLFKLLNFRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGMLLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFDMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALFLTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILKMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFEKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=994802 AAVB98_RS16825 WP_041481655.1 3205325..3207676(+) (comEC) [Bacillus amyloliquefaciens strain Fad 77]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATGATAAAAACAAAGCAGCCTGCTCCGGTTGTTGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTATAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGTTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGATTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTGAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACTTCAGAAA
AAATTTGATTTCAATCATTCGGAATCACGTACCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGACTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGGATGTTGCTGCTGCT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCTCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCGGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCCTCATTCATTGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAAATTTCACTTGTCAGTTTTCCGATGAACATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGCTTCCTTCTTCTTTTGCTTTCAAGGCAGATGGGAGAATGTTTGTTTGATATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCGCTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGTTTTTAACCCATGCGGATCAGGATCACATCGGAGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAGGATCAGAACATATTAAAAATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCGGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCATCTGACGGAAAGAGTAAAAATGATTCATCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTGAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

56.218

98.595

0.554


Multiple sequence alignment