Detailed information    

In silico (bioinformatically predicted)

Overview


Name: addA
Type: Machinery gene
Locus tag: FOC89_RS19525
Genome accession: NZ_CP053980
Coordinates: 3172972..3176694 (+)
Length: 1240 a.a.
NCBI ID: WP_000572265.1
UniProt ID: -
Organism: Bacillus thuringiensis strain FDAARGOS_795
Function: homologous recombination; plasmid transformation (predicted from homology)
Homologous recombination
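
The coordinates and the protein length in this record can be cross-checked with simple arithmetic: the span 3172972..3176694 is inclusive on both ends, and the coding sequence ends with a stop codon that is not counted in the protein length. The sketch below is only an illustration; the helper names `span_length` and `protein_length` are hypothetical and not part of any database tooling.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate span."""
    return end - start + 1


def protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Amino-acid count for a CDS of cds_bp bases (stop codon excluded)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the record for addA (FOC89_RS19525).
adda_bp = span_length(3172972, 3176694)
adda_aa = protein_length(adda_bp)
print(adda_bp, adda_aa)
```

With the listed coordinates this gives 3723 bp and 1240 a.a., matching the lengths reported for the nucleotide and protein sequences.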

Genomic Context


Location: 3167972..3181694
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
FOC89_RS19510 (FOC89_19505) | - | 3168130..3168723 (+) | 594 | WP_000347516.1 | TVP38/TMEM64 family protein | -
FOC89_RS19515 (FOC89_19510) | lepB | 3168780..3169343 (+) | 564 | WP_000751894.1 | signal peptidase I | -
FOC89_RS19520 (FOC89_19515) | addB | 3169460..3172975 (+) | 3516 | WP_000058572.1 | helicase-exonuclease AddAB subunit AddB | Machinery gene
FOC89_RS19525 (FOC89_19520) | addA | 3172972..3176694 (+) | 3723 | WP_000572265.1 | helicase-exonuclease AddAB subunit AddA | Machinery gene
FOC89_RS19530 (FOC89_19525) | - | 3176707..3176994 (+) | 288 | WP_000718626.1 | RNA polymerase alpha subunit C-terminal domain-containing protein | -
FOC89_RS19535 (FOC89_19530) | gerPF | 3177032..3177247 (-) | 216 | WP_001141566.1 | spore germination protein GerPF | -
FOC89_RS19540 (FOC89_19535) | - | 3177290..3177676 (-) | 387 | WP_000902341.1 | spore germination protein GerPE | -
FOC89_RS19545 (FOC89_19540) | gerPD | 3177692..3177886 (-) | 195 | WP_001052802.1 | spore germination protein GerPD | -
FOC89_RS19550 (FOC89_19545) | gerPC | 3177893..3178507 (-) | 615 | WP_001070767.1 | spore germination protein GerPC | -
FOC89_RS19555 (FOC89_19550) | gerPB | 3178575..3178781 (-) | 207 | WP_001012512.1 | spore germination protein GerPB | -
FOC89_RS19560 (FOC89_19555) | gerPA | 3178796..3179017 (-) | 222 | WP_001111188.1 | spore germination protein GerPA | -
FOC89_RS19565 (FOC89_19560) | - | 3179114..3179293 (-) | 180 | WP_000462851.1 | aspartyl-phosphate phosphatase Spo0E family protein | -
FOC89_RS19570 (FOC89_19565) | - | 3179541..3180440 (+) | 900 | WP_001241727.1 | fumarylacetoacetate hydrolase family protein | -
FOC89_RS19575 (FOC89_19570) | - | 3180481..3180765 (-) | 285 | WP_000925338.1 | hypothetical protein | -
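
Two relationships in the genomic context can be read off the coordinates: the displayed location appears to be the addA span extended by 5000 bp on each side, and addB and addA share a few bases at their junction. The following sketch checks both; the 5000 bp flank is inferred from the numbers in this record rather than stated by the page, and the function names are hypothetical.

```python
def flank_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend an inclusive span by `flank` bp on each side (flank size is an assumption)."""
    return start - flank, end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two inclusive (start, end) spans."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Coordinates copied from the genomic context table.
add_b = (3169460, 3172975)
add_a = (3172972, 3176694)

window = flank_window(*add_a)
shared = overlap_bp(add_b, add_a)
print(window, shared)
```

For these coordinates the window is 3167972..3181694, matching the listed location, and addB overlaps the start of addA by 4 bp.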

Sequence


Protein


Length: 1240 a.a.   Molecular weight: 142603.02 Da   Isoelectric point: 4.8661

>NTDB_id=449630 FOC89_RS19525 WP_000572265.1 3172972..3176694(+) (addA) [Bacillus thuringiensis strain FDAARGOS_795]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNMIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLALLGTLSAAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDIVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEVIDGEEGAEVEKAQLEARLMA
QRIKAMVDSGYEVYDRKTDSMRPVKYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDNP
MQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSTFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDLI
WKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTIH
KSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTRA
KEKLILIGTVKDATKEMEKWLDAREHSEWLLPDHVRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWKV
EVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKDEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNAF
IKKLRAPIQTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAVEKVISFFDSD
LGKRILAAKSVEREVPFTMMLAAEEAYQDWQGKSGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPILE
ERYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE

Nucleotide


Length: 3723 bp

>NTDB_id=449630 FOC89_RS19525 WP_000572265.1 3172972..3176694(+) (addA) [Bacillus thuringiensis strain FDAARGOS_795]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCGGCAGCAGCTGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACAGAATT
GGAGAGGCTTTAGAAAAAGTATTAATTGATGAGCCTGGCTCTCAGCACATCCGAAAGCAACTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATACTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACCGAAAATGAATTATTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATATGATATTCTTTGAACTCGTTGATCGTTATACGAGCGACCGTAGTGATGATGATTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATATGACGTCGAAGGAA
AGACAATTGAAGATTTAGTGTACGCCTCTTACTTATTAGAAGATGTAAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAAGCAACTGAGCTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCGTTGAAACGCTGCAAGCAGATTTAGCTTTACT
TGGAACGTTATCAGCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGCGATTATAACGAGGATATTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCAAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGGCCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTCGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTTGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAGATGGTGAAATGAAGCCATCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTATTCATGGTTGGTGACGTGAAGCAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGATTATTCTTAGGAAAGTATAAACGTTTCACACAAGAAGGATTAGGCGGCGGAATGAAAATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCGGGTACAAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCAGCTGAACTATTGTGCA
TTCAACAAACAGAAGAAGTAATAGACGGTGAAGAAGGTGCGGAAGTAGAAAAGGCACAGCTTGAAGCACGTCTTATGGCG
CAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTAAAATACCG
TGACTTCGTTATTTTACTTCGCTCGATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAGTAT
ACGCTGACCTTGCCACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAATCCG
ATGCAAGATATTCCGCTTGCAGCAGTACTTCGTTCCCCAATCGTTGGATTAAATGATGAAGAACTTGCAACGCTTCGTGC
TCACGGGAAGAAAGGCTCGTTTTATGAAGTAATGAGCACATTCTTAAAAGGAGCACCGCTTGAAGAAGAAAAAGAACTAC
ATGATAAATTAGAATGGTTCTATAACTTACTGCAAGGATGGCGTGAATTCGCACGCCAACAGTCTCTTTCTGATTTAATT
TGGAAAGTGTACGGTGAGACAGGTTATTATGACTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACCTGCG
TGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGACTATTCCGCTTCTTACGCTTTATTGAGCGTA
TTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATTCAT
AAAAGTAAAGGACTTGAGTTCCCAGTCGTATTCGTCGCTGGACTTGGTCGTCGTTTTAATACGCAAGACTTAATGAAACG
TTTCTTACTTCATAAAGACTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACGACATTATCAC
AACTTGCAATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTCTTATACGTAGCGTTAACACGGGCA
AAAGAGAAGTTAATTTTAATTGGAACGGTTAAGGATGCAACTAAGGAAATGGAAAAATGGCTGGATGCGAGGGAGCATAG
TGAATGGTTATTACCAGATCACGTACGTGCCGGAGCATCTTGTTATTTAGACTGGATTGCACCTTCCTTATATAGACACC
GTGATAGTGAAATGCTTCTTGAATTAGGGCAAGGAAGTATTCCAGATGAAATTTATGGGTATGACACTAGCTGGAAAGTA
GAAGTTGTTGACGGGAACACGTTACTTGCGCCAGAACCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCGTGA
GAAAAAAGCTGTTCCCCTGCAAAGTGAACGAAAAGATGAAGTGTACGACAGGTTAATGTGGAAGTACGGATATGAGGAAG
CGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCTTTT
ATTAAAAAATTACGTGCACCAATTCAAACACGTCCTCGTTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGAGGAAC
AGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACAGTTGAAGTTCTTCAAGAGCAAATTGCTGGAA
TGGTAAATAAAGAATTATTAACATTTGAACAAGCAGAAGAAATAGCAGTTGAAAAAGTGATTTCATTCTTTGACAGTGAC
CTAGGTAAAAGGATATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGCGTA
TCAAGATTGGCAAGGGAAGAGCGGGGAATCAATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATGGTA
TTACTTTAATCGACTTTAAAACGGATACGATTGAAGGGAAGTTCCCGGGAGGATTCGAACAAGCGAAACCAATTTTAGAA
GAGCGATATAAAGTGCAGCTTTCGTTATATGCAAAGGCACTTGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATGTTT
ATACTTCTTTGATGGTAATCATGTTATAAAAGTTGAGGAATAG
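
As a quick consistency check between the nucleotide and protein records, the first and last codons can be translated by hand. The sketch below builds the standard genetic code (whose codon-to-amino-acid assignments are shared with the bacterial table 11) and translates two short excerpts copied from the sequence above; `translate` is a hypothetical helper written for this illustration, not a tool used by the database.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases and last 12 bases of the addA nucleotide sequence.
head = "ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGG"
tail = "GTTGAGGAATAG"
print(translate(head), translate(tail))
```

The first excerpt translates to MIENWPKKPEGSQW and the second to VEE followed by a TAG stop, in agreement with the start and end of the protein sequence.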


Secondary structure


Protein secondary structure was predicted with S4PRED and visualized with seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
addA | Bacillus subtilis subsp. subtilis str. 168 | 53.629 | 100 | 0.536
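
The page does not define the Ha-value. The listed numbers are consistent with one simple reading, in which the identity and coverage percentages are converted to fractions and multiplied. The sketch below shows that calculation under this assumption only; `ha_value` is a hypothetical name and the formula should be checked against the database documentation before being relied on.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction, rounded to 3 decimals."""
    return round((identity_pct / 100.0) * (coverage_pct / 100.0), 3)


# Values from the similar-proteins row for B. subtilis 168 addA.
ha = ha_value(53.629, 100)
print(ha)
```

Under this assumption the B. subtilis 168 addA hit gives 0.536, the value shown in the table.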