Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   I6I41_RS15095 Genome accession   NZ_CP068135
Coordinates   3014078..3017803 (-) Length   1241 a.a.
NCBI ID   WP_074619143.1    Uniprot ID   -
Organism   Bacillus cereus strain FDAARGOS_1084     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 3009078..3022803
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6I41_RS15045 (I6I41_15045) - 3009919..3010200 (+) 282 WP_000926857.1 hypothetical protein -
  I6I41_RS15050 (I6I41_15050) - 3010236..3011135 (-) 900 WP_001213071.1 fumarylacetoacetate hydrolase family protein -
  I6I41_RS15055 (I6I41_15055) - 3011383..3011562 (+) 180 WP_000462841.1 aspartyl-phosphate phosphatase Spo0E family protein -
  I6I41_RS15060 (I6I41_15060) gerPA 3011659..3011880 (+) 222 WP_001111187.1 spore germination protein GerPA -
  I6I41_RS15065 (I6I41_15065) gerPB 3011895..3012101 (+) 207 WP_001012508.1 spore germination protein GerPB -
  I6I41_RS15070 (I6I41_15070) gerPC 3012169..3012783 (+) 615 WP_025709865.1 spore germination protein GerPC -
  I6I41_RS15075 (I6I41_15075) gerPD 3012790..3012984 (+) 195 WP_001052807.1 spore germination protein GerPD -
  I6I41_RS15080 (I6I41_15080) - 3013000..3013386 (+) 387 WP_000902333.1 spore germination protein GerPE -
  I6I41_RS15085 (I6I41_15085) gerPF 3013430..3013645 (+) 216 WP_001141566.1 spore germination protein GerPF -
  I6I41_RS15090 (I6I41_15090) - 3013778..3014065 (-) 288 WP_000255728.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  I6I41_RS15095 (I6I41_15095) addA 3014078..3017803 (-) 3726 WP_074619143.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  I6I41_RS15100 (I6I41_15100) addB 3017800..3021315 (-) 3516 WP_074619144.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  I6I41_RS15105 (I6I41_15105) lepB 3021432..3021995 (-) 564 WP_000751919.1 signal peptidase I -
  I6I41_RS15110 (I6I41_15110) - 3022052..3022645 (-) 594 WP_000347517.1 TVP38/TMEM64 family protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142664.85 Da        Isoelectric Point: 4.7852

>NTDB_id=527395 I6I41_RS15095 WP_074619143.1 3014078..3017803(-) (addA) [Bacillus cereus strain FDAARGOS_1084]
MIENCPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADVALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVLDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLSDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYEYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKQAVPLQSERKEEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSGEMILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=527395 I6I41_RS15095 WP_074619143.1 3014078..3017803(-) (addA) [Bacillus cereus strain FDAARGOS_1084]
ATGATAGAAAATTGTCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAACGGGCG
TGATATTTTAGTTGCGGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCGCAAGAGATGAAAAACCGAATC
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGTTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTTTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTACGGAATCGAA
GATAATACAATCTTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAGTCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAAGATTTAGTGTATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACCCTGCAAGCAGATGTAGCATTACT
TGGAACGCTATCATCAGCTGCTCGTGAGTCATGGACAAGTGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGAGAGATAAAGGGATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTGGCGCTCCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTAGGTGACGTGAAGCAGTCGATCTATCGTTTCCGACTAGCCG
AGCCAGGACTATTCCTAGGAAAATATAAACGCTTCACACAAGAAGGATTAGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATTTTTAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGACTACGATGCTGACGCCGAACTAAAGCTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCAGCAGAATTATTATGTA
TTCAGCAAACGGAAGAAGAAGTACTAGACGGTGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCCGGTTATGAAGTGTATGACCGTAAAAATGATAGTATGCGCCCAGTACAATA
CCGCGACTTCGTTATTTTACTTCGCTCCATGCCGTGGGCCCCGCAAATTATGGAAGAGTTAAAATTACAAGGAATTCCAG
TGTACGCTGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCTGTACTTCGTTCTCCAATCGTTGGATTGAGCGATGAAGAACTTGCAACGCTTCG
TGCTCATGGAAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAAGAGGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTCGCCCGTCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGATTTCGTTGGCGGTTTACCGGCTGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGCGCAAGACAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGGGCTCTCGGTGAACAAGAAGACGTTGTTCGCATTATGACAATT
CATAAAAGTAAAGGGTTAGAGTTCCCAGTTGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTAT
CACAGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCATTGACGCGT
GCGAAAGAGAAGTTAATTTTAATCGGCACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCACATACGTGCTGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGAGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAACACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAACAGGCTGTTCCCCTGCAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGG
ATGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGCCCGCGATTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGCGGGGAAATGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCGAAACCAATTTTA
GAAGATCGATATAAAGTGCAGCTTTCTTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTAAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.168

100

0.534