Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   BCG9842_RS05500 Genome accession   NC_011772
Coordinates   1106450..1110175 (+) Length   1241 a.a.
NCBI ID   WP_000572302.1    Uniprot ID   A0AB35PIF6
Organism   Bacillus cereus G9842     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1101450..1115175
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BCG9842_RS05485 (BCG9842_B4151) - 1101609..1102202 (+) 594 WP_000347517.1 TVP38/TMEM64 family protein -
  BCG9842_RS05490 (BCG9842_B4150) lepB 1102259..1102822 (+) 564 WP_000751897.1 signal peptidase I -
  BCG9842_RS05495 (BCG9842_B4149) addB 1102938..1106453 (+) 3516 WP_000058604.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  BCG9842_RS05500 (BCG9842_B4148) addA 1106450..1110175 (+) 3726 WP_000572302.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  BCG9842_RS05505 (BCG9842_B4147) - 1110188..1110475 (+) 288 WP_000255717.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  BCG9842_RS05510 (BCG9842_B4146) gerPF 1110613..1110828 (-) 216 WP_001141566.1 spore germination protein GerPF -
  BCG9842_RS05515 (BCG9842_B4145) - 1110871..1111257 (-) 387 WP_000902327.1 spore germination protein GerPE -
  BCG9842_RS05520 (BCG9842_B4144) gerPD 1111273..1111467 (-) 195 WP_001052807.1 spore germination protein GerPD -
  BCG9842_RS05525 (BCG9842_B4143) gerPC 1111474..1112088 (-) 615 WP_001070761.1 spore germination protein GerPC -
  BCG9842_RS05530 (BCG9842_B4142) gerPB 1112156..1112362 (-) 207 WP_001012509.1 spore germination protein GerPB -
  BCG9842_RS05535 (BCG9842_B4141) gerPA 1112377..1112598 (-) 222 WP_001111188.1 spore germination protein GerPA -
  BCG9842_RS05540 (BCG9842_B4140) - 1112695..1112874 (-) 180 WP_000462848.1 aspartyl-phosphate phosphatase Spo0E family protein -
  BCG9842_RS05545 (BCG9842_B4139) - 1113121..1114020 (+) 900 WP_001213079.1 fumarylacetoacetate hydrolase family protein -
  BCG9842_RS05550 - 1114056..1114340 (-) 285 WP_000926870.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142865.01 Da        Isoelectric Point: 4.7855

>NTDB_id=32394 BCG9842_RS05500 WP_000572302.1 1106450..1110175(+) (addA) [Bacillus cereus G9842]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADVALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVAFQ
YRNKFTEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEDLATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHEKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTNWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEDATSYRAKQSVTEIKRNYQSEEGSDNA
FIKKLRTPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSEETILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=32394 BCG9842_RS05500 WP_000572302.1 1106450..1110175(+) (addA) [Bacillus cereus G9842]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAACGGACG
TGATATTTTAGTTGCGGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACAAATGCAGCAGCGCAAGAGATGAAAAATCGAATT
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAGCATGTAAGAAAGCAACTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTAATTAGAGGATATTACTACATGCTGGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTACGGAATCGAA
GATAATACAATCTTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAGTCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAAGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCGTTGAAACCCTGCAAGCAGATGTAGCATTACT
TGGAACGTTATCATCAGCTGCTCGTGAGTCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTA
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTGGCGTTCCAA
TATCGTAATAAATTTACTGAAGTATTAGTCGATGAATATCAAGATACGAACTTTGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAGCAGTCGATTTATCGTTTCCGACTAGCTG
AACCAGGACTATTCCTAGGAAAGTATAAACGCTTCACACAAGAAGGATTGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTCATTTTCAAACAAATTATGGGCGAAGAAGTTGGAGAGAT
TGACTACGATGCTGATGCTGAATTAAAACTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCGGCAGAATTACTCTGTA
TTCAGCAAACGGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGCCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCCGGTTATGAAGTGTATGACCGTAAAACGGATAGTATGCGTCCTGTACAATA
CCGCGACTTCGTTATTTTACTTCGCTCCATGCCGTGGGCGCCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCGG
TATATGCTGACCTTGCGACTGGTTATTTTGAAGCAACGGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATCCCGCTTGCAGCTGTACTTCGTTCCCCAATCGTTGGATTAAACGATGAAGACCTTGCGACGCTTCG
TGCTCACGGAAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGTGCACCGCTTGAAGAGGAACAAGAAC
TTCATGAAAAACTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTTGCACGTCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGCGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGTGCAAGACAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGGACGGCGAGGGCCCTCGGTGAACAAGAAGACGTTGTTCGCATCATGACGATC
CATAAAAGTAAAGGGCTAGAGTTCCCGGTTGTATTTGTAGCTGGACTCGGCCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTACTCCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATACACGACATTAT
CACAGCTTGCGATTAAACGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCGTTGACGCGT
GCGAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAGATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCGGATCACATACGTGCCGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGGTATGACACGAACTGGAAA
GTAGAAGTTGTGGACGGTAACACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAAAAGGCTGTTCCCCTGCAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGG
ATGCAACATCTTATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAATGCC
TTTATTAAAAAACTACGTACGCCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGATTAACGTACGCAGAGCGTGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTGCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GATTTAGGTAAAAGGGTACTAGCGGCGAAAAGTGTTGAGCGTGAAGTGCCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGTGAAGAAACGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGACG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCAAAACCAATTTTA
GAAGATCGATATAAAGTACAGCTTTCGTTATATGCAAAAGCGCTGGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCATGTTGTAAATATTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.505

100

0.535


Multiple sequence alignment