Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   EXW43_RS06465 Genome accession   NZ_CP036117
Coordinates   1178077..1181802 (+) Length   1241 a.a.
NCBI ID   WP_002125881.1    Uniprot ID   -
Organism   Bacillus mycoides strain JAS481     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1173077..1186802
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EXW43_RS06450 (EXW43_06635) - 1173079..1173672 (+) 594 WP_002011278.1 TVP38/TMEM64 family protein -
  EXW43_RS06455 (EXW43_06640) lepB 1173883..1174446 (+) 564 WP_002011277.1 signal peptidase I -
  EXW43_RS06460 (EXW43_06645) addB 1174565..1178080 (+) 3516 WP_002125879.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  EXW43_RS06465 (EXW43_06650) addA 1178077..1181802 (+) 3726 WP_002125881.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  EXW43_RS06470 (EXW43_06655) - 1181826..1182104 (+) 279 WP_002011271.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  EXW43_RS06475 (EXW43_06660) - 1182297..1183004 (+) 708 WP_002011269.1 DNA alkylation repair protein -
  EXW43_RS06480 (EXW43_06665) gerPF 1183060..1183275 (-) 216 WP_001141570.1 spore germination protein GerPF -
  EXW43_RS06485 (EXW43_06670) - 1183318..1183704 (-) 387 WP_002125894.1 spore germination protein GerPE -
  EXW43_RS06490 (EXW43_06675) gerPD 1183719..1183913 (-) 195 WP_001052805.1 spore germination protein GerPD -
  EXW43_RS06495 (EXW43_06680) gerPC 1183920..1184534 (-) 615 WP_215576125.1 spore germination protein GerPC -
  EXW43_RS06500 (EXW43_06685) gerPB 1184602..1184808 (-) 207 WP_001012505.1 spore germination protein GerPB -
  EXW43_RS06505 (EXW43_06690) gerPA 1184823..1185044 (-) 222 WP_001111190.1 spore germination protein GerPA -
  EXW43_RS06510 (EXW43_06695) - 1185141..1185320 (-) 180 WP_002087032.1 aspartyl-phosphate phosphatase Spo0E family protein -
  EXW43_RS06515 (EXW43_06700) - 1185564..1186466 (+) 903 WP_002125896.1 fumarylacetoacetate hydrolase family protein -
  EXW43_RS06520 (EXW43_06705) - 1186501..1186785 (-) 285 WP_063226440.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142513.77 Da        Isoelectric Point: 4.9188

>NTDB_id=346917 EXW43_RS06465 WP_002125881.1 1178077..1181802(+) (addA) [Bacillus mycoides strain JAS481]
MIENWPQKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIISEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDGPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNSIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEKHI
RKATELAMLPDGPAPRVETLQADLALLGTLSSAARGSWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVKLVKVFTERFQAIKRDKGMVDFTDLEHFCLQILSEQSEGGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGSGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVQDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGNIPGEIYEYSASWK
VEVVEGKDLLAPEPVQEEKQELLEALRDKKAVPLESERKEEVYDRLMWEYGYADATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITIEVLQEQIARMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLSAEEAYQDWQGKKGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFDQAKPIL
EDRYKVQLSLYAKALEKSLKHPVKEKCLYFFDGNHVVNIDE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=346917 EXW43_RS06465 WP_002125881.1 1178077..1181802(+) (addA) [Bacillus mycoides strain JAS481]
ATGATAGAAAATTGGCCTCAGAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCAGCGGCAGCTGGATCAGGGAAAACAGCGGTATTAGTTGAACGTATTATTAAAAAGATTATAAGTG
AGGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCGCAAGAAATGAAAAATCGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGGGCCAGGTTCACAGCATATAAGAAAGCAGCTTAGCTTATTAAATAAAGC
TTCCATTTCTACAATTCATTCATTTTGTTTACAAGTTATTAGAGGATATTATTACATGCTTGATGTCGATCCTCGTTTTC
GTATTGCGAACCAAACAGAAAATGAGTTATTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAATATGGAATCGAA
GATAATAGTATTTTCTTTGAACTAGTTGATCGTTATACGAGTGACCGTAGTGACGATGACTTACAACGAATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAACCCGGAGAAGTGGCTTGATAAATTAGTAGAAGCATACGACGTTGAAGGAA
AGACGATTGAAGATTTAGTGTACGCTTCTTACTTATTAGAAGATGTGAAGTTTCAGCTTGAAACAGCGGAAAAGCATATT
CGTAAGGCGACTGAACTCGCAATGCTTCCTGATGGTCCAGCGCCTCGCGTTGAAACTTTACAAGCGGATTTAGCTTTACT
TGGAACGTTATCCTCAGCAGCTCGTGGATCGTGGACAAGCGTTTATGAAGCGATGCAAAATGTATCGTGGCAAACGTTAA
AGCGTATTAAAAAAAGTGATTACAACGAAGATGTTGTAAAACAAGTAGATTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGACCCGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAACTCGTGAAGCTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATTAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTGCAAATTTTAAGCGAGCAAAGTGAAGGCGGTGAAATGAAGCCGTCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTACTAGTTGATGAATATCAAGATACGAACTTCGTACAGGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAACAGTCAATCTATCGTTTCCGACTAGCAG
AGCCTGGTTTATTCTTAGGAAAATATAAACGTTTCACACAAGAAGGATCGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGCCGTCATGAAGTGCTAGCAGGTACGAACTTTATTTTCAAACAAATTATGGGCGAAGAAGTCGGCGAAAT
CGATTATGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCGGCAGAGCTATTATGTA
TTCAGCAAACTGAGGAAGAAGTGCAAGACGGTGAAGAAGGTGCCGAAGTAGAAAAAGCGCAGCTTGAAGCTCGTCTTATG
GCGCAGCGTATTAAAGCAATGGTCGATTCAGGCTATGAAGTGTATGATCGTAAAACTGATAGTATGCGCCCAGTGCAATA
TCGTGATTTCGTTATTTTACTTCGCTCGATGCCATGGGCGCCGCAAATTATGGAAGAGTTAAAACTACAAGGAATTCCGG
TATATGCTGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTCATTGATAAC
CCGATGCAAGATATTCCGCTTGCAGCTGTACTTCGTTCACCGATCGTTGGACTAAATGACGAAGAACTTGCGACGCTTCG
TGCTCATGGAAAGAAAGGCTCGTTTTATGAAGTGATGAGCTCATTCTTAAAGGGAGCACCGCTTGAAGAAGAACAAGAAC
TGCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGGTGGCGTGAATTTGCGCGTCAACAATCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGCTATTACGACTTTGTCGGCGGTTTACCAGCCGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGTGCAAGACAATATGAAGCAACATCGTTTAGAGGATTATTCCGCTTCTTGCGCTTTATTGAGC
GTATTTTAGAACGCGGAGATGATATGGGAACGGCGAGAGCCCTCGGAGAACAAGAAGATGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGCTAGAGTTCCCAGTCGTATTTGTAGCTGGACTTGGTCGCCGTTTTAATACACAAGATTTAATGAA
GCGTTTCTTACTTCATAAAGATTTCGGTTTCGGTTCACAATTTATCGATCCTCGTAAACGAATTAAATATACGACATTAT
CGCAACTAGCGATTAAGCGTAAAATGAAGATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCGTTAACACGT
GCAAAAGAGAAGTTAATTTTAATTGGAACAGTTAAAGATGCCAATAAAGAAATGGAAAAATGGCTCGATGCAAGAGAGCA
TAGTGAATGGTTATTACCAGACCATATTCGTGCCGGAGCGTCTTGCTACTTAGACTGGATTGCACCTTCCTTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAATATTCCAGGCGAAATTTATGAATATAGTGCTAGCTGGAAA
GTGGAAGTTGTTGAAGGAAAAGATTTACTTGCACCAGAACCGGTTCAAGAAGAGAAACAAGAGTTATTAGAAGCGCTTCG
CGATAAAAAAGCTGTTCCGTTAGAAAGTGAACGGAAAGAAGAAGTGTATGACAGATTGATGTGGGAATACGGGTATGCGG
ACGCGACATCTCACCGTGCGAAACAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTTCGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGCTAACATACGCAGAGCGAGG
GACAGCAGTACATGCCGTTATGCAGCATGTAGATTTGAAGAAACCAATTACGATTGAAGTTCTTCAAGAACAAATTGCAA
GAATGGTAAATAAAGAACTATTAACATTTGAACAAGCTGAAGAAATAGCGATTGAAAAGGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCAGCAAAAAGTGTTGAGCGCGAAGTACCATTTACGATGATGCTTTCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGAAGAAAGGCGAATCAATACTTGTCCAAGGGGTTATCGACTGCATGATCGAAGAGGAAGACG
GAATTACTTTAATCGACTTTAAAACAGATACGATTGAAGGAAAGTTCCCAGGCGGATTTGATCAAGCGAAACCAATTTTA
GAAGACCGATACAAAGTACAGCTTTCACTATATGCAAAAGCACTCGAGAAAAGCTTAAAACATCCTGTGAAAGAGAAATG
TTTATATTTCTTTGATGGAAATCATGTTGTAAATATTGACGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.569

100

0.538


Multiple sequence alignment