Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   CAB88_RS05315 Genome accession   NZ_CP021061
Coordinates   1001199..1004924 (+) Length   1241 a.a.
NCBI ID   WP_000572301.1    Uniprot ID   A0AAN4HLW6
Organism   Bacillus thuringiensis strain ATCC 10792     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 996199..1009924
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CAB88_RS05300 (CAB88_05280) - 996357..996950 (+) 594 WP_000347515.1 TVP38/TMEM64 family protein -
  CAB88_RS05305 (CAB88_05285) lepB 997007..997570 (+) 564 WP_000751897.1 signal peptidase I -
  CAB88_RS05310 (CAB88_05290) addB 997687..1001202 (+) 3516 WP_000058585.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  CAB88_RS05315 (CAB88_05295) addA 1001199..1004924 (+) 3726 WP_000572301.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  CAB88_RS05320 (CAB88_05300) - 1004937..1005224 (+) 288 WP_000255723.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  CAB88_RS05325 (CAB88_05305) gerPF 1005362..1005577 (-) 216 WP_001141570.1 spore germination protein GerPF -
  CAB88_RS05330 (CAB88_05310) - 1005620..1006006 (-) 387 WP_000902329.1 spore germination protein GerPE -
  CAB88_RS05335 (CAB88_05315) gerPD 1006022..1006216 (-) 195 WP_001052807.1 spore germination protein GerPD -
  CAB88_RS05340 (CAB88_05320) gerPC 1006223..1006837 (-) 615 WP_001070761.1 spore germination protein GerPC -
  CAB88_RS05345 (CAB88_05325) gerPB 1006905..1007111 (-) 207 WP_001012508.1 spore germination protein GerPB -
  CAB88_RS05350 (CAB88_05330) gerPA 1007126..1007347 (-) 222 WP_001111187.1 spore germination protein GerPA -
  CAB88_RS05355 (CAB88_05335) - 1007444..1007623 (-) 180 WP_000462857.1 aspartyl-phosphate phosphatase Spo0E family protein -
  CAB88_RS05360 (CAB88_05340) - 1007871..1008770 (+) 900 WP_001213080.1 fumarylacetoacetate hydrolase family protein -
  CAB88_RS05365 (CAB88_05345) - 1008805..1009086 (-) 282 WP_000926859.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142703.79 Da        Isoelectric Point: 4.8002

>NTDB_id=228057 CAB88_RS05315 WP_000572301.1 1001199..1004924(+) (addA) [Bacillus thuringiensis strain ATCC 10792]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADVVLLGTLSSAARESWTSLYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGSGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEDLATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLLWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRVPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITAETLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSEETILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=228057 CAB88_RS05315 WP_000572301.1 1001199..1004924(+) (addA) [Bacillus thuringiensis strain ATCC 10792]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAACGGACG
TGATATTTTAGTTGCGGCTGCAGCTGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCGCAAGAGATGAAAAACCGAATC
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTTATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGCATCGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAGGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCGATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACTCTGCAAGCAGATGTAGTTTTACT
TGGAACGCTATCATCAGCTGCTCGTGAGTCGTGGACAAGTTTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCTTCAGCAGTAGCGCTTCAA
TATCGTAATAAATTTGCTGAAGTGTTAGTCGATGAATATCAAGATACGAACTTTGTACAGGAATCCATTATTAAATTTGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGCGACGTAAAGCAGTCAATTTATCGTTTTCGACTAGCCG
AACCAGGCTTATTCCTAGGAAAGTATAAACGCTTTACGCAAGAAGGATCGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AATTTCCGTAGTCGTCATGAAGTGCTAGCAGGAACGAACTTCATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCGGCAGAGCTATTATGTA
TTCAGCAAACGGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTAGAAGCTCGTCTTATG
GCGCAGCGTATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGTGATTTCGTTATTTTACTCCGCTCGATGCCGTGGGCGCCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATATGCTGACCTTGCGACGGGTTATTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCTGTACTTCGTTCACCGATTGTGGGACTAAATGATGAAGACCTTGCGACGCTTCG
TGCTCATGGGAAGAAAGGGTCATTTTATGAAGTAATGAGCTCGTTCTTAAAAGGAGCACCGCTTGAAGAAGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAATTTACTGCAAGGATGGCGTGAATTTGCCCGTCAACAATCCCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCAGGAAAGCAAAGACAAGCAAACTT
ACGCGTACTATATGACCGTGCAAGACAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGGGCTCTCGGTGAACAAGAAGACGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGTTAGAGTTCCCAGTTGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTAT
CACAGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCATTGACGCGT
GCGAAAGAGAAGTTAATTTTAATCGGCACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCACATACGTGCTGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCGGATGAAATTTACGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAACACGCTTCTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAAAAGGCTGTTCCCCTGCAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTACTATGGAAGTACGGATATGAGG
ATGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTACGTGTACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGTGG
AACAGCAGTCCATGCCGTTATGCAGCATGTAGATTTGAAGAAACCGATTACGGCCGAAACGCTACAAGAACAAATCGCGG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCAATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCAAAAAGTGTAGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGTGAAGAAACGATTCTTGTTCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCAAAACCAATTTTA
GAAGATCGATATAAAGTGCAACTTTCGTTATATGCAAAAGCACTCGAAAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCACGTTGTAAATATTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.505

100

0.535


Multiple sequence alignment