Detailed information    

in silico   Bioinformatically predicted

Overview


Name: addA
Type: Machinery gene
Locus tag: AC241_RS05790
Genome accession: NZ_CP012099
Coordinates: 1123676..1127401 (+)
Length: 1241 a.a.
NCBI ID: WP_029442119.1
UniProt ID: A0A9X0EZ15
Organism: Bacillus thuringiensis strain HS18-1
Function: homologous recombination; plasmid transformation (predicted from homology)
Homologous recombination
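
The coordinates and the protein length in this overview can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function name cds_lengths is hypothetical and not part of the database.

```python
def cds_lengths(start, end):
    """Return (length in bp, length in amino acids) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted as an amino acid.
    """
    bp = end - start + 1
    if bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return bp, bp // 3 - 1


bp, aa = cds_lengths(1123676, 1127401)
print(bp, aa)  # 3726 1241
```

Under these assumptions the result agrees with the 3726 bp and 1241 a.a. reported for addA.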

Genomic Context


Location: 1118676..1132401
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AC241_RS05775 (AC241_05770) - 1118834..1119427 (+) 594 WP_043936619.1 TVP38/TMEM64 family protein -
  AC241_RS05780 (AC241_05775) lepB 1119484..1120047 (+) 564 WP_016082503.1 signal peptidase I -
  AC241_RS05785 (AC241_05780) addB 1120164..1123679 (+) 3516 WP_043936620.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  AC241_RS05790 (AC241_05785) addA 1123676..1127401 (+) 3726 WP_029442119.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  AC241_RS05795 (AC241_05790) - 1127426..1127704 (+) 279 WP_016082500.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  AC241_RS05800 (AC241_05795) gerPF 1127841..1128056 (-) 216 WP_001141570.1 spore germination protein GerPF -
  AC241_RS05805 (AC241_05800) - 1128099..1128485 (-) 387 WP_016082499.1 spore germination protein GerPE -
  AC241_RS05810 (AC241_05805) gerPD 1128504..1128698 (-) 195 WP_001052807.1 spore germination protein GerPD -
  AC241_RS05815 (AC241_05810) gerPC 1128705..1129319 (-) 615 WP_001070761.1 spore germination protein GerPC -
  AC241_RS05820 (AC241_05815) gerPB 1129387..1129593 (-) 207 WP_001055456.1 spore germination protein GerPB -
  AC241_RS05825 (AC241_05820) gerPA 1129608..1129829 (-) 222 WP_001111187.1 spore germination protein GerPA -
  AC241_RS05830 (AC241_05825) - 1129926..1130105 (-) 180 WP_016082498.1 aspartyl-phosphate phosphatase Spo0E family protein -
  AC241_RS05835 (AC241_05830) - 1130353..1131252 (+) 900 WP_029442121.1 fumarylacetoacetate hydrolase family protein -
  AC241_RS05840 (AC241_05835) - 1131288..1131569 (-) 282 WP_029442122.1 hypothetical protein -
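
The Location line is consistent with a window of 5000 bp on each side of addA, and the coordinates of addB and addA show that the two genes share a few bases. The sketch below reproduces both observations. The 5000 bp flank is an assumption inferred from the numbers above, not a value stated by the source, and the helper names are hypothetical.

```python
FLANK = 5000  # assumption: inferred from the Location line, not documented


def context_window(start, end, flank=FLANK):
    """Extend an inclusive interval by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def overlap_bp(a, b):
    """Number of positions shared by two inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


add_b = (1120164, 1123679)  # AC241_RS05785
add_a = (1123676, 1127401)  # AC241_RS05790

print(context_window(*add_a))    # (1118676, 1132401)
print(overlap_bp(add_b, add_a))  # 4
```

The 4 bp overlap means that the start of addA lies inside the last codons of addB on the same strand.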

Sequence


Protein


Length: 1241 a.a.        Molecular weight: 142631.86 Da        Isoelectric Point: 4.8960

>NTDB_id=151375 AC241_RS05790 WP_029442119.1 1123676..1127401(+) (addA) [Bacillus thuringiensis strain HS18-1]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPGKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADVALLGTLSSAARESWTRLYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGSGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVMDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEDLATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHNEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPSEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLESERKEEVYNRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAKEAYQDWQGKSEETILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEG
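
The FASTA headers in this record pack several fields into one line: a database identifier, locus tag, protein accession, coordinates with strand, gene name and organism. The sketch below pulls them apart with a regular expression. The field layout is inferred from the header shown above only, so other records may need a looser pattern; parse_header and the group names are hypothetical.

```python
import re

# Pattern inferred from the single header in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line):
    """Split a FASTA header of the form shown above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header: " + line)
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (">NTDB_id=151375 AC241_RS05790 WP_029442119.1 "
          "1123676..1127401(+) (addA) [Bacillus thuringiensis strain HS18-1]")
info = parse_header(header)
print(info["gene"], info["strand"], info["end"] - info["start"] + 1)
```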

Nucleotide


Length: 3726 bp

>NTDB_id=151375 AC241_RS05790 WP_029442119.1 1123676..1127401(+) (addA) [Bacillus thuringiensis strain HS18-1]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACGGATGACCAGTGGAAAGCTGTTGTAGCGAACGGACG
TGATATTTTAGTTGCGGCTGCAGCTGGGTCAGGTAAAACAGCAGTATTAGTTGAACGTATTATCAAGAAGATTATTAATG
AAGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACCTTTACGAATGCAGCAGCGCAAGAGATGAAAAACCGAATT
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGATCTCAGCACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCATTCATTTTGTTTACAAGTCATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATCGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTACGGAATCGAA
GATAATACAATCTTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAGTCAAGAGCGCATCCAAATCCGGGAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACTATTGAAGATTTAGTATATGCTTCTTACTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCTGCTCCTCGCGTTGAAACCCTGCAAGCAGATGTAGCTTTACT
TGGAACGCTATCATCAGCTGCTCGTGAGTCGTGGACAAGATTGTATGAAGCGATGCAAAACGTATCGTGGCAAACATTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTA
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCTGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCTTCAGCAGTAGCGCTTCAA
TATCGTAATAAATTTGCTGAAGTGTTAGTCGATGAATATCAAGATACGAACTTTGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGATGTAAAGCAGTCGATTTATCGTTTCCGACTAGCCG
AACCAGGACTATTCCTAGGAAAGTATAAACGCTTCACACAAGAAGGGTCGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATTTTCAAACAAATTATGGGCGAAGAAGTTGGGGAGAT
TGACTACGATGCTGATGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCGGCAGAGCTATTATGTA
TTCAGCAAACGGAAGAAGAAGTAATGGATGGAGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTAGAAGCTCGTCTTATG
GCGCAGCGTATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAAATGATAGTATGCGCCCTGTACAATA
CCGTGATTTCGTTATTTTACTCCGCTCGATGCCGTGGGCGCCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATATGCTGACCTTGCGACTGGTTATTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCTGTACTTCGTTCACCGATTGTGGGACTAAATGATGAAGACCTTGCGACGCTTCG
TGCTCATGGGAAGAAAGGGTCATTTTATGAAGTAATGAGCTCGTTCTTAAAAGGAGCACCGCTTGAAGAAGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAATTTACTGCAAGGGTGGCGTGAATTTGCACGTCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCAGGAAAGCAAAGACAGGCAAACTT
ACGCGTACTATATGACCGTGCAAGACAATATGAAGCAACATCATTTAGAGGGTTATTCCGCTTCTTGCGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCCCTCGGTGAACAAGAAGACGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGCTAGAGTTCCCGGTCGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
GCGTTTCTTACTGCATAAAGATTTCGGTTTCGGTTCACAATTTATCGATCCGCGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCAATTAAACGCAAAATGAAAATGGAATTAATCGCGGAAGAAATGCGTGTATTATATGTAGCGTTAACGCGT
GCGAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAATGAATGGTTATTACCAGATCACATACGTGCCGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ATCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAAGTGAAATTTACGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAATACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAAAAGGCTGTTCCATTAGAAAGTGAACGGAAAGAAGAAGTGTATAACAGATTAATGTGGAAGTACGGATATGAGG
ATGCGACATCTCATCGTGCGAAGCAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGCCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGTGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCAAAAAGTGTAGAGCGCGAAGTACCATTTACGATGATGCTTGCAGCAAAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGTGAAGAAACGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCAAAACCAATTTTA
GAAGATCGATATAAAGTGCAACTTTCGTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGGATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168 53.248 100 0.535

