Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   FOC88_RS20415 Genome accession   NZ_CP053934
Coordinates   3846189..3849914 (-) Length   1241 a.a.
NCBI ID   WP_042511618.1    Uniprot ID   -
Organism   Bacillus thuringiensis strain FDAARGOS_794     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 3841189..3854914
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FOC88_RS20365 (FOC88_20360) - 3842123..3842407 (+) 285 WP_000925338.1 hypothetical protein -
  FOC88_RS20370 (FOC88_20365) - 3842443..3843342 (-) 900 WP_042511615.1 fumarylacetoacetate hydrolase family protein -
  FOC88_RS20375 (FOC88_20370) - 3843590..3843769 (+) 180 WP_000462851.1 aspartyl-phosphate phosphatase Spo0E family protein -
  FOC88_RS20380 (FOC88_20375) gerPA 3843866..3844087 (+) 222 WP_001111188.1 spore germination protein GerPA -
  FOC88_RS20385 (FOC88_20380) gerPB 3844102..3844308 (+) 207 WP_001012512.1 spore germination protein GerPB -
  FOC88_RS20390 (FOC88_20385) gerPC 3844376..3844990 (+) 615 WP_001070767.1 spore germination protein GerPC -
  FOC88_RS20395 (FOC88_20390) gerPD 3844997..3845191 (+) 195 WP_001052802.1 spore germination protein GerPD -
  FOC88_RS20400 (FOC88_20395) - 3845207..3845593 (+) 387 WP_042511616.1 spore germination protein GerPE -
  FOC88_RS20405 (FOC88_20400) gerPF 3845636..3845851 (+) 216 WP_001141566.1 spore germination protein GerPF -
  FOC88_RS20410 (FOC88_20405) - 3845889..3846176 (-) 288 WP_042511617.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  FOC88_RS20415 (FOC88_20410) addA 3846189..3849914 (-) 3726 WP_042511618.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  FOC88_RS20420 (FOC88_20415) addB 3849911..3853426 (-) 3516 WP_042511619.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  FOC88_RS20425 (FOC88_20420) lepB 3853543..3854106 (-) 564 WP_000751894.1 signal peptidase I -
  FOC88_RS20430 (FOC88_20425) - 3854163..3854756 (-) 594 WP_000347516.1 TVP38/TMEM64 family protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142723.10 Da        Isoelectric Point: 4.8339

>NTDB_id=449170 FOC88_RS20415 WP_042511618.1 3846189..3849914(-) (addA) [Bacillus thuringiensis strain FDAARGOS_794]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNMIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLALLGTLSAAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDIVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVKYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLCRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDATKEMEKWLDAREHSEWLLPDHVRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKDEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIQTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAVEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
ETRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=449170 FOC88_RS20415 WP_042511618.1 3846189..3849914(-) (addA) [Bacillus thuringiensis strain FDAARGOS_794]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCGGCAGCAGCTGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACAGAATT
GGAGAGGCTTTAGAAAAAGTATTAATTGATGAGCCTGGCTCTCAGCACATCCGAAAGCAACTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATACTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACCGAAAATGAATTATTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATATGATATTCTTTGAACTCGTTGATCGTTATACGAGCGACCGTAGTGATGATGATTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATATGACGTCGAAGGAA
AGACAATTGAAGATTTAGTGTACGCCTCTTACTTATTAGAAGATGTAAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAAGCAACTGAGCTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCGTTGAAACGCTGCAAGCAGATTTAGCTTTACT
TGGAACGTTATCAGCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGCGATTATAACGAGGATATTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCAAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGGCCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTCGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTTGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAGATGGTGAAATGAAGCCATCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTATTCATGGTTGGTGACGTGAAGCAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGATTATTCTTAGGAAAGTATAAACGTTTCACACAAGAAGGATTAGGCGGCGGAATGAAAATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCGGGTACAAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCAGCTGAACTATTGTGCA
TTCAACAAACAGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCGGAAGTAGAAAAGGCACAGCTTGAAGCACGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTAAAATA
CCGTGACTTCGTTATTTTACTTCGCTCGATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATACGCTGACCTTGCCACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCAGTACTTCGTTCCCCAATCGTTGGATTAAATGATGAAGAACTTGCAACGCTTCG
TGCTCACGGGAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGAGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTATAACTTACTGCAAGGATGGCGTGAATTCGCACGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTATGACTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACCT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGACTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTCGTCGCTGGACTTTGTCGTCGTTTTAATACGCAAGACTTAATGAA
ACGTTTCTTACTTCATAAAGACTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACGACATTAT
CACAACTTGCAATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTCTTATACGTAGCGTTAACACGG
GCAAAAGAGAAGTTAATTTTAATTGGAACGGTTAAGGATGCAACTAAGGAAATGGAAAAATGGCTGGATGCGAGGGAGCA
TAGTGAATGGTTATTACCAGATCACGTACGTGCCGGAGCATCTTGTTATTTAGACTGGATTGCACCTTCCTTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGGCAAGGAAGTATTCCAGATGAAATTTATGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTTGACGGGAACACGTTACTTGCGCCAGAACCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAAGCTGTTCCCCTGCAAAGTGAACGAAAAGATGAAGTGTACGACAGGTTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCT
TTTATTAAAAAATTACGTGCACCAATTCAAACACGTCCTCGTTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGAGG
AACAGCAGTCCATGCGGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACAGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAAGAATTATTAACATTTGAACAAGCAGAAGAAATAGCAGTTGAAAAAGTGATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGGAGAGCGGGGAATCAATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GTATTACTTTAATCGACTTTAAAACGGATACGATTGAAGGGAAGTTCCCGGGAGGATTCGAACAAGCGAAACCAATTTTA
GAAACTCGTTACAAAGTGCAGCTTTCGTTATATGCAAAGGCACTTGAGAAAAGTTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTATAAAAGTTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.586

100

0.536