Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   IL989_RS12020 Genome accession   NZ_CP062984
Coordinates   2459404..2461755 (-) Length   783 a.a.
NCBI ID   WP_193350453.1    Uniprot ID   -
Organism   Bacillus amyloliquefaciens strain PP19 isolate PP19     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2454404..2466755
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  IL989_RS11985 (IL989_11985) - 2454767..2455105 (-) 339 WP_045510470.1 YqxA family protein -
  IL989_RS11990 (IL989_11990) spoIIP 2455121..2456314 (-) 1194 WP_045510467.1 stage II sporulation protein P -
  IL989_RS11995 (IL989_11995) gpr 2456382..2457488 (-) 1107 WP_115997660.1 GPR endopeptidase -
  IL989_RS12000 (IL989_12000) rpsT 2457691..2457957 (+) 267 WP_003152876.1 30S ribosomal protein S20 -
  IL989_RS12005 (IL989_12005) holA 2457975..2459016 (-) 1042 Protein_2334 DNA polymerase III subunit delta -
  IL989_RS12010 (IL989_12010) - 2459056..2459210 (-) 155 Protein_2335 hypothetical protein -
  IL989_RS12015 (IL989_12015) - 2459251..2459385 (+) 135 WP_003152870.1 YqzM family protein -
  IL989_RS12020 (IL989_12020) comEC 2459404..2461755 (-) 2352 WP_193350453.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  IL989_RS12025 (IL989_12025) - 2461756..2462325 (-) 570 WP_193350454.1 ComE operon protein 2 -
  IL989_RS12030 (IL989_12030) comEA 2462391..2463005 (-) 615 WP_193350455.1 helix-hairpin-helix domain-containing protein Machinery gene
  IL989_RS12035 (IL989_12035) comER 2463064..2463885 (+) 822 WP_045510457.1 late competence protein ComER -
  IL989_RS12040 (IL989_12040) - 2463954..2464691 (-) 738 WP_014470703.1 class I SAM-dependent methyltransferase -
  IL989_RS12045 (IL989_12045) rsfS 2464688..2465044 (-) 357 WP_045510454.1 ribosome silencing factor -
  IL989_RS12050 (IL989_12050) yqeK 2465061..2465621 (-) 561 WP_193350456.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  IL989_RS12055 (IL989_12055) - 2465611..2466180 (-) 570 WP_065522728.1 nicotinate-nucleotide adenylyltransferase -
  IL989_RS12060 (IL989_12060) yhbY 2466191..2466481 (-) 291 WP_014470707.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 87051.77 Da        Isoelectric Point: 8.7005

>NTDB_id=492261 IL989_RS12020 WP_193350453.1 2459404..2461755(-) (comEC) [Bacillus amyloliquefaciens strain PP19 isolate PP19]
MKYKCLLLPLAAVSATAGIAAANSFWVLFFFLLYLLFIIVKTKQHTPVVVCLVSFCLYFFLYTVCDAANVSRYQAGSYTE
QALIINIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPEMRCTFTGSLEQPGHATVPGGFDYKEYLYSQHI
HWIFSVTSIKQCEKSEQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDVLSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGMLLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLLKRSIHSSAALSMSYLLLLLFDP
YLLWQAGFQFSFAVSAALILSSSILKKAGKNRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPF
SVIGFLLLLLSRQAGEFLFYLFDLVMKPVHDFITYAASVDLFTMIASKPDFLSLLMLAVSVFTLFAALEKGGFLKLRKSA
LFFFAVLAYIMFRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYSEESWKEKHHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKKHRVKRLIVPVGFVKEPKDRDILNLAKENNIPVAEAKQGDTITAGDLQFQVLSP
ESPDGRSKNDSSLVLWTVLGGVSWILTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPETALISAGK
DNRYHHPHEEVLDRLKAYSVNVLRTDVSGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=492261 IL989_RS12020 WP_193350453.1 2459404..2461755(-) (comEC) [Bacillus amyloliquefaciens strain PP19 isolate PP19]
ATGAAATATAAATGCCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACCGCGGGAATTGCCGCCGCTAATTCCTTCTGGGT
CCTATTCTTTTTCCTTCTGTATCTTCTCTTTATTATCGTAAAAACAAAGCAGCACACACCGGTTGTCGTCTGCCTCGTAT
CTTTTTGTCTTTATTTCTTTCTGTATACGGTTTGTGACGCTGCGAATGTATCCCGGTATCAGGCCGGCAGCTATACTGAA
CAGGCTCTCATCATAAACATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACCCATGACAAAGAAAAATG
GGCCGCTTCGTACAAAATCCGGTCCCTTGAAGAAAAAAGACTCATTGAACAGCTGGAACCGGAGATGCGCTGTACGTTTA
CAGGCTCACTGGAACAGCCCGGACATGCGACGGTTCCCGGGGGTTTTGATTATAAGGAATATCTTTACTCACAGCATATT
CACTGGATTTTTTCCGTTACTTCCATTAAGCAGTGCGAAAAATCCGAACAGCCGCTGTTTAAGCTTCTGAACATCAGAAA
GAATTTGATTTCGATCATTCGGAATCATGTACCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATGTACTGAGCGCTTATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGGTTATTTTATGCCCTTATCAGAATTGGACTGACAAGAGAAAAGGCGGGGATGTTGCTGCTGCT
TTTTTTGCCGGTGTATACGTTGTTAAGCGGTGCCGCCCCATCTGTGCTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTTTGAAACGCAGCATTCATTCCTCAGCTGCATTGTCCATGTCTTATCTGCTGCTTCTGCTGTTTGATCCT
TACCTCCTTTGGCAGGCTGGCTTCCAGTTTTCCTTTGCGGTAAGCGCCGCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAACAGACTCGCAGGGCTTGCGATGGCTTCATTTATTGCGGAGCTCAGTTCGCTGCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCGCTCGTCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGTTT
TCTGTCATCGGTTTTCTTCTTCTCTTACTTTCCAGGCAGGCGGGGGAATTTTTGTTTTACCTGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACAATGATTGCGTCAAAGCCTGACTTTCTGTCGC
TTCTTATGCTTGCGGTTTCCGTTTTTACGCTTTTTGCAGCTTTGGAAAAGGGAGGTTTTTTAAAGCTCAGGAAATCGGCT
CTTTTTTTCTTTGCGGTTTTGGCTTATATAATGTTCCGTCCTTATTTCAGTCCTTGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGTGCCCCGCACCGTAAAGGGACTGTAATGGTTGATACAGGAGGAGTGATTGCTT
ATTCCGAAGAATCGTGGAAAGAAAAACACCATCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTGAACGGAAAG
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAGGATCATATCGGGGAAGCCGGAGTATTAATCAAAAA
ACATAGAGTGAAACGGTTAATTGTCCCCGTGGGATTCGTGAAGGAACCGAAGGATCGGGATATATTGAATTTGGCGAAAG
AAAACAACATTCCCGTTGCTGAAGCAAAACAGGGCGATACCATAACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCACCTGACGGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGGGGAGTGAGCTGGATATTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAGGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCCGGTC
ATCACGGCAGCAAAAGCTCTACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAGACAGCGCTGATTTCAGCAGGAAAA
GATAATCGTTATCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAAGCTTACTCTGTCAATGTGCTTCGCACCGATGT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCTGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAGGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

56.347

98.595

0.556