Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   AW22_RS18880 Genome accession   NZ_CP009300
Coordinates   3378198..3381923 (-) Length   1241 a.a.
NCBI ID   WP_042513100.1    Uniprot ID   -
Organism   Bacillus cereus D17     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 3373198..3386923
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AW22_RS18830 (AW22_3521) - 3374127..3374411 (+) 285 WP_000925338.1 hypothetical protein -
  AW22_RS18835 (AW22_3522) - 3374452..3375351 (-) 900 WP_001241731.1 fumarylacetoacetate hydrolase family protein -
  AW22_RS18840 (AW22_3523) - 3375599..3375778 (+) 180 WP_000462851.1 aspartyl-phosphate phosphatase Spo0E family protein -
  AW22_RS18845 (AW22_3524) gerPA 3375875..3376096 (+) 222 WP_001111188.1 spore germination protein GerPA -
  AW22_RS18850 (AW22_3525) gerPB 3376111..3376317 (+) 207 WP_001012512.1 spore germination protein GerPB -
  AW22_RS18855 (AW22_3526) gerPC 3376385..3376999 (+) 615 WP_042513099.1 spore germination protein GerPC -
  AW22_RS18860 (AW22_3527) gerPD 3377006..3377200 (+) 195 WP_001052802.1 spore germination protein GerPD -
  AW22_RS18865 (AW22_3528) - 3377216..3377602 (+) 387 WP_000902341.1 spore germination protein GerPE -
  AW22_RS18870 (AW22_3529) gerPF 3377645..3377860 (+) 216 WP_001141566.1 spore germination protein GerPF -
  AW22_RS18875 (AW22_3530) - 3377898..3378185 (-) 288 WP_000718623.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  AW22_RS18880 (AW22_3531) addA 3378198..3381923 (-) 3726 WP_042513100.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  AW22_RS18885 (AW22_3532) addB 3381920..3385435 (-) 3516 WP_000058562.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  AW22_RS18890 (AW22_3533) lepB 3385552..3386115 (-) 564 WP_000751894.1 signal peptidase I -
  AW22_RS18895 (AW22_3534) - 3386172..3386765 (-) 594 WP_000347516.1 TVP38/TMEM64 family protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142659.97 Da        Isoelectric Point: 4.8494

>NTDB_id=128369 AW22_RS18880 WP_042513100.1 3378198..3381923(-) (addA) [Bacillus cereus D17]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADLALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDIVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPGEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKVVPLQSERKDEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAVEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
ETRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=128369 AW22_RS18880 WP_042513100.1 3378198..3381923(-) (addA) [Bacillus cereus D17]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCGGCAGCAGCTGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACAAATGCAGCAGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGAACCAGGATCTCAGCACGTAAGAAAGCAACTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCACTCCTTTTGTTTACAAGTTATTAGAGGATATTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACAGAAAATGAATTATTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGCGACCGTAGTGATGATGATTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATATGACGTCGAAGGAA
AGACAATTGAAGATTTAGTGTACGCCTCTTACTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAAGCAACTGAGCTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACGCTGCAAGCAGATTTAGCTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGCGATTATAACGAGGATATTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCAAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGGCCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTCGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAGATGGTGAAATGAAGCCATCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTATTCATGGTTGGTGACGTGAAGCAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGATTATTCTTAGGAAAGTATAAACGTTTCACACAAGAAGGATTAGGCGGCGGAATGAAAATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCGGGTACAAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCAGCTGAACTATTGTGCA
TTCAGCAAACAGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCGGAAGTAGAAAAGGCACAGCTTGAAGCACGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGTGACTTCGTTATTTTACTTCGCTCGATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATACGCTGACCTTGCCACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCAGTACTTCGTTCCCCAATCGTTGGATTAAATGATGAAGAACTTGCAACGCTTCG
TGCTCACGGGAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGAGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTATAACTTACTGCAAGGATGGCGTGAATTCGCACGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTATGACTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACCT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGACTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCGGTCGTATTCGTCGCTGGACTTGGTCGTCGTTTTAATACGCAAGACTTAATGAA
ACGTTTCTTACTTCATAAAGACTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACGACATTAT
CACAACTTGCAATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTCTTATACGTAGCGTTAACACGG
GCAAAAGAGAAGTTAATTTTAATCGGAACAGTGAAAGATGCAAATAAGGAAATGGAAAAATGGCTGGATGCGAGGGAGCA
TAGTGAATGGTTATTACCGGATCATATACGTGCCGGAGCATCTTGTTATTTAGACTGGATTGCACCTTCCTTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGGTGAAATTTATGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTTGACGGTAACACGTTACTTGCACCAGAACCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCTCTTCG
TGAGAAAAAAGTTGTTCCCCTGCAAAGTGAACGAAAAGATGAAGTGTACGACAGGTTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCT
TTTATTAAAAAATTACGTGCACCAATTAAAACACGTCCTCGTTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGAGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACAGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAAGAATTATTAACATTTGAACAAGCAGAAGAAATAGCAGTTGAAAAAGTGATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGGAGAGCGGGGAATCAATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GTATTACTTTAATCGACTTTAAAACGGATACGATTGAAGGGAAGTTCCCGGGAGGATTCGAACAAGCGAAACCAATTTTA
GAAACTCGTTACAAAGTGCAGCTTTCGTTATATGCAAAGGCACTTGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTATAAAAGTTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.328

100

0.536


Multiple sequence alignment