Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   EXW38_RS06020 Genome accession   NZ_CP036026
Coordinates   1148622..1152347 (+) Length   1241 a.a.
NCBI ID   WP_215572532.1    Uniprot ID   -
Organism   Bacillus mycoides strain BPN54/2     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1143622..1157347
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EXW38_RS06005 (EXW38_06095) - 1143916..1144623 (+) 708 WP_002158921.1 hypothetical protein -
  EXW38_RS06010 (EXW38_06100) - 1144620..1144886 (+) 267 WP_002158920.1 DUF4176 domain-containing protein -
  EXW38_RS06015 (EXW38_06105) addB 1145110..1148625 (+) 3516 WP_078179998.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  EXW38_RS06020 (EXW38_06110) addA 1148622..1152347 (+) 3726 WP_215572532.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  EXW38_RS06025 (EXW38_06115) - 1152371..1152649 (+) 279 WP_078180000.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  EXW38_RS06030 (EXW38_06120) - 1152865..1153572 (+) 708 WP_078180001.1 DNA alkylation repair protein -
  EXW38_RS06035 (EXW38_06125) gerPF 1153623..1153838 (-) 216 WP_001141570.1 spore germination protein GerPF -
  EXW38_RS06040 (EXW38_06130) - 1153881..1154267 (-) 387 WP_002140807.1 spore germination protein GerPE -
  EXW38_RS06045 (EXW38_06135) gerPD 1154282..1154476 (-) 195 WP_001052805.1 spore germination protein GerPD -
  EXW38_RS06050 (EXW38_06140) - 1154483..1155097 (-) 615 WP_002158914.1 spore germination protein GerPC -
  EXW38_RS06055 (EXW38_06145) gerPB 1155162..1155368 (-) 207 WP_002158913.1 spore germination protein GerPB -
  EXW38_RS06060 (EXW38_06150) gerPA 1155383..1155604 (-) 222 WP_002158912.1 spore germination protein GerPA -
  EXW38_RS06065 (EXW38_06155) - 1155701..1155880 (-) 180 WP_002087032.1 aspartyl-phosphate phosphatase Spo0E family protein -
  EXW38_RS06070 (EXW38_06160) - 1156124..1157026 (+) 903 WP_002158910.1 fumarylacetoacetate hydrolase family protein -
  EXW38_RS06075 (EXW38_06165) - 1157062..1157346 (-) 285 WP_002158909.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142717.11 Da        Isoelectric Point: 4.9414

>NTDB_id=346020 EXW38_RS06020 WP_215572532.1 1148622..1152347(+) (addA) [Bacillus mycoides strain BPN54/2]
MIENWPQKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIISKENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDGPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIGNQTENELLKEEVLDDILEEEYGIE
DNSIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEEHI
RKATELAMLPDGPAPRVETLQADVALLGMLSSAARGSWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVKLVKVFTERFQAIKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLDGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVQEGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYKLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGNIPGEIYEYSASWK
VEVVDGKTLLAPEPVQEEKQELLEALRDKKAVPLESERKEEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITIEVLQEQIARMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLSAEEAYQDWQGKKGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFDQAKPTL
EDRYKVQLSLYAKALEKSLKHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=346020 EXW38_RS06020 WP_215572532.1 1148622..1152347(+) (addA) [Bacillus mycoides strain BPN54/2]
ATGATAGAAAATTGGCCTCAGAAACCAGAAGGTAGTCAGTGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCAGCGGCAGCTGGATCAGGGAAAACAGCGGTATTAGTTGAACGTATTATTAAAAAGATTATAAGTA
AGGAAAACCCGGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAAATGAAAAATCGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGGGCCAGGTTCACAGCATATAAGAAAGCAGCTTAGCTTATTAAATAAAGC
TTCCATTTCTACCATTCATTCATTTTGTTTACAAGTTATTAGAGGATATTATTACATGCTTGATGTCGATCCTCGTTTTC
GTATTGGGAACCAAACAGAAAATGAGTTATTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAATATGGAATCGAA
GATAATAGTATTTTCTTTGAATTAGTTGATCGTTATACGAGTGACCGTAGTGACGATGACTTACAACGAATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAGAAGTGGCTTGATAAATTAGTAGAAGCATACGACGTTGAAGGAA
AGACGATTGAAGATTTAGTGTACGCTTCTTACTTATTAGAAGATGTGAAGTTTCAGCTTGAAACAGCGGAAGAGCATATT
CGTAAAGCGACTGAACTCGCAATGCTTCCTGATGGTCCAGCGCCTCGCGTTGAAACGTTACAAGCGGATGTAGCTTTACT
TGGAATGTTATCCTCAGCAGCTCGCGGGTCGTGGACAAGCGTTTATGAAGCGATGCAAAATGTATCGTGGCAAACGCTAA
AGCGTATTAAGAAAAGTGATTACAACGAAGATGTTGTAAAACAAGTAGATTCTCTTCGTAATAAAGCGAAAGATGAAGTA
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTCTTAGA
AAAACTCGTGAAGCTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATTAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTGCAAATATTAAGCGAGCAAAGTGAAGACGGTGAAATGAAGCCGTCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTACTAGTCGATGAATATCAAGATACGAACTTCGTACAGGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAACAGTCAATCTATCGTTTCCGACTAGCAG
AGCCTGGTTTATTCTTAGGAAAATATAAACGTTTCACACAAGAAGGATTAGACGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGCCGTCATGAAGTGCTAGCAGGTACGAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTCGGAGAAAT
CGATTATGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCGGCAGAACTACTATGCA
TTCAGCAAACTGAGGAAGAAGTGCAAGAAGGTGAAGAAGGTGCAGAAGTAGAAAAAGCGCAGCTTGAAGCTCGTCTTATG
GCGCAGCGTATTAAAGCGATGGTTGATTCAGGCTATGAAGTGTATGATCGTAAAACTGATAGTATGCGCCCTGTACAATA
CCGTGATTTCGTTATTTTACTTCGCTCCATGCCATGGGCGCCGCAAATTATGGAAGAGTTAAAATTACAAGGAATTCCAG
TATATGCTGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCTGTACTTCGTTCACCGATCGTTGGACTAAATGACGAAGAACTTGCGACGCTTCG
TGCTCACGGAAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGAGCGCCGCTTGAAGAAGAACAAGAAC
TGCATGATAAATTAGAGTGGTTTTATAAATTACTGCAAGGATGGCGTGAATTTGCGCGTCAACAATCTCTTTCCGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGCTATTACGACTTTGTCGGCGGTTTACCAGCCGGAAAGCAAAGACAGGCAAACTT
ACGCGTATTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGATTATTCCGATTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGATGACATGGGAACGGCGAGAGCTCTTGGAGAACAAGAAGACGTCGTTCGCATTATGACGATT
CATAAAAGTAAAGGGCTAGAGTTCCCAGTCGTATTTGTAGCTGGACTTGGTCGCCGTTTTAATACACAAGATTTAATGAA
GCGTTTCTTACTTCATAAAGATTTCGGTTTCGGTTCACAATTTATCGATCCGCGTAAACGAATTAAATATACGACATTAT
CGCAACTAGCGATTAAGCGTAAAATGAAGATGGAATTAATTGCGGAGGAAATGCGCGTATTATACGTAGCGTTAACACGT
GCGAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCCAATAAAGAAATGGAAAAATGGCTCGATGCAAGAGAGCA
TAGTGAATGGTTATTACCAGATCATATACGTGCCGGAGCGTCATGTTATTTAGATTGGATTGCACCTTCATTATATAGAC
ATCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAATATTCCAGGCGAAATTTATGAGTATAGTGCTAGCTGGAAA
GTAGAAGTTGTTGACGGCAAAACGTTACTTGCACCAGAACCAGTTCAAGAAGAGAAACAAGAGTTATTAGAAGCGCTTCG
CGATAAAAAAGCTGTTCCGTTAGAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAATACGGGTATGAGG
ACGCAACATCTCACCGTGCGAAACAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAGAAACTCCGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGCTAACATACGCAGAGCGAGG
GACAGCAGTCCATGCCGTTATGCAGCATGTAGATTTGAAGAAACCAATTACGATTGAAGTTCTTCAAGAGCAAATTGCAA
GAATGGTAAATAAAGAACTATTAACATTTGAACAAGCTGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCAGCAAAAAGTGTTGAACGTGAAGTACCATTTACGATGATGCTTTCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAAAGGCGAATCAATACTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGACG
GAATTACTTTAATCGACTTTAAGACAGATACGATTGAAGGGAAGTTCCCAGGCGGATTTGATCAAGCGAAACCAACTTTA
GAAGACCGATACAAAGTACAGCTTTCACTATATGCAAAAGCACTCGAGAAAAGCTTAAAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCATGTTGTAAATATTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.168

100

0.534


Multiple sequence alignment