Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   MWG54_RS05785 Genome accession   NZ_CP095377
Coordinates   1117566..1121291 (+) Length   1241 a.a.
NCBI ID   WP_246962301.1    Uniprot ID   -
Organism   Bacillus cereus strain SEM-15     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1112566..1126291
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MWG54_RS05770 (MWG54_05770) - 1112724..1113317 (+) 594 WP_000347518.1 TVP38/TMEM64 family protein -
  MWG54_RS05775 (MWG54_05775) lepB 1113374..1113937 (+) 564 WP_000751910.1 signal peptidase I -
  MWG54_RS05780 (MWG54_05780) addB 1114054..1117569 (+) 3516 WP_098382044.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  MWG54_RS05785 (MWG54_05785) addA 1117566..1121291 (+) 3726 WP_246962301.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  MWG54_RS05790 (MWG54_05790) - 1121304..1121591 (+) 288 WP_000255725.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  MWG54_RS05795 (MWG54_05795) gerPF 1121729..1121944 (-) 216 WP_001141566.1 spore germination protein GerPF -
  MWG54_RS05800 (MWG54_05800) - 1121987..1122373 (-) 387 WP_000902319.1 spore germination protein GerPE -
  MWG54_RS05805 (MWG54_05805) gerPD 1122389..1122583 (-) 195 WP_001052807.1 spore germination protein GerPD -
  MWG54_RS05810 (MWG54_05810) gerPC 1122590..1123204 (-) 615 WP_001070760.1 spore germination protein GerPC -
  MWG54_RS05815 (MWG54_05815) gerPB 1123272..1123478 (-) 207 WP_001012508.1 spore germination protein GerPB -
  MWG54_RS05820 (MWG54_05820) gerPA 1123493..1123714 (-) 222 WP_001111187.1 spore germination protein GerPA -
  MWG54_RS05825 (MWG54_05825) - 1123809..1123988 (-) 180 WP_000462845.1 aspartyl-phosphate phosphatase Spo0E family protein -
  MWG54_RS05830 (MWG54_05830) - 1124235..1125134 (+) 900 WP_246962310.1 fumarylacetoacetate hydrolase family protein -
  MWG54_RS05835 (MWG54_05835) - 1125170..1125454 (-) 285 WP_000926863.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142626.68 Da        Isoelectric Point: 4.7561

>NTDB_id=676572 MWG54_RS05785 WP_246962301.1 1117566..1121291(+) (addA) [Bacillus cereus strain SEM-15]
MIENWPEKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADLALLGTLSSAAHESWTSLYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFIERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLSDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLESERKDEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEDGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGNSGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=676572 MWG54_RS05785 WP_246962301.1 1117566..1121291(+) (addA) [Bacillus cereus strain SEM-15]
ATGATAGAAAATTGGCCTGAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAATGGACG
TGATATTTTAGTTGCGGCTGCAGCTGGGTCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACCGAATC
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTACGGAATCGAA
GATAATACAATCTTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAGTCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAAGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACCCTGCAAGCAGATCTAGCTTTACT
TGGAACGCTATCATCAGCTGCTCATGAGTCGTGGACAAGTTTGTATGAAGCGATGCAAAATGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGAAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTATAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTGGCGCTCCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTTGTACAGGAATCTATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAGCAGTCGATTTATCGTTTCCGACTAGCCG
AACCAGGACTATTCCTAGGAAAATATAAACGCTTCACACAAGAAGGATTGGGCGGTGGAATGAAGATCGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCAGGTACGAACTTTATCTTCAAACAAATTATGGGTGAAGAAGTCGGAGAAAT
CGACTACGATGCAGACGCTGAATTAAAGTTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCGGCAGAACTACTGTGCA
TTCAGCAAACGGAAGAAGAAGTAATAGACGGAGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGACCGTAAAAATGATAGTATGCGCCCAGTACAATA
CCGCGACTTCGTTATTTTACTTCGCTCCATGCCGTGGGCCCCGCAAATTATGGAAGAGTTAAAATTACAAGGAATTCCAG
TGTACGCTGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCTGTACTTCGTTCTCCAATCGTTGGATTGAGCGATGAAGAACTTGCGACGCTTCG
TGCTCATGGAAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCACTTGAAGAGGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTTGCACGTCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGCGCAAGACAATACGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGACATGGGTACGGCGAGGGCCCTCGGTGAACAAGAAGACGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGTTAGAGTTCCCGGTTGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTAT
CACAGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCGTTGACGCGC
GCGAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCACATACGTGCCGGGGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAATACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAAAAGGCTGTGCCGTTAGAAAGTGAACGAAAAGATGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGG
AAGCAACATCTCACCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCAGAAGATGGTAGTGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTTCCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAATAGCGGGGAATCGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTTCCAGGCGGATTCGAACAAGCGAAACCAATTTTA
GAAGATCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTAAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.488

100

0.537