Detailed information    

In silico (bioinformatically predicted)

Overview


Name: addA
Type: Machinery gene
Locus tag: AB3U43_RS10175
Genome accession: NZ_CP162629
Coordinates: 1134774..1138499 (+)
Length: 1241 a.a.
NCBI ID: WP_044796120.1
UniProt ID: A0A9X6JM84
Organism: Bacillus cereus strain L6
Function: homologous recombination; plasmid transformation (predicted from homology)
Homologous recombination
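The coordinates and the protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the helper name `parse_coordinates` is illustrative and not part of the database.

```python
def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_coordinates("1134774..1138499 (+)")

# Assumption: 1-based, inclusive coordinates.
gene_length_bp = end - start + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 3726 1241
```

Both values agree with the lengths reported for the nucleotide and protein sequences.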

Genomic Context


Location: 1129774..1143499
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
AB3U43_RS10160 (AB3U43_10160) | - | 1129932..1130525 (+) | 594 | WP_000347520.1 | TVP38/TMEM64 family protein | -
AB3U43_RS10165 (AB3U43_10165) | lepB | 1130582..1131145 (+) | 564 | WP_000751910.1 | signal peptidase I | -
AB3U43_RS10170 (AB3U43_10170) | addB | 1131262..1134777 (+) | 3516 | WP_000058603.1 | helicase-exonuclease AddAB subunit AddB | Machinery gene
AB3U43_RS10175 (AB3U43_10175) | addA | 1134774..1138499 (+) | 3726 | WP_044796120.1 | helicase-exonuclease AddAB subunit AddA | Machinery gene
AB3U43_RS10180 (AB3U43_10180) | - | 1138512..1138799 (+) | 288 | WP_000255724.1 | RNA polymerase alpha subunit C-terminal domain-containing protein | -
AB3U43_RS10185 (AB3U43_10185) | gerPF | 1138937..1139152 (-) | 216 | WP_001141566.1 | spore germination protein GerPF | -
AB3U43_RS10190 (AB3U43_10190) | - | 1139195..1139581 (-) | 387 | WP_000902319.1 | spore germination protein GerPE | -
AB3U43_RS10195 (AB3U43_10195) | gerPD | 1139597..1139791 (-) | 195 | WP_001052807.1 | spore germination protein GerPD | -
AB3U43_RS10200 (AB3U43_10200) | gerPC | 1139798..1140412 (-) | 615 | WP_001070762.1 | spore germination protein GerPC | -
AB3U43_RS10205 (AB3U43_10205) | gerPB | 1140480..1140686 (-) | 207 | WP_001012508.1 | spore germination protein GerPB | -
AB3U43_RS10210 (AB3U43_10210) | gerPA | 1140701..1140922 (-) | 222 | WP_001111188.1 | spore germination protein GerPA | -
AB3U43_RS10215 (AB3U43_10215) | - | 1141019..1141198 (-) | 180 | WP_000462841.1 | aspartyl-phosphate phosphatase Spo0E family protein | -
AB3U43_RS10220 (AB3U43_10220) | - | 1141445..1142344 (+) | 900 | WP_001213075.1 | fumarylacetoacetate hydrolase family protein | -
AB3U43_RS10225 (AB3U43_10225) | - | 1142380..1142664 (-) | 285 | WP_000926864.1 | hypothetical protein | -
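Two details of the genomic context can be reproduced from the coordinates alone. The Location window matches the addA coordinates extended by 5,000 bp on each side (an inference from the numbers, not something the record states), and addB and addA overlap by a few bases. The sketch below uses hypothetical helper names and assumes 1-based, inclusive coordinates.

```python
FLANK_BP = 5000  # assumed flank size, inferred from the Location line


def context_window(start, end, flank=FLANK_BP):
    """Extend an inclusive interval by `flank` bases on each side."""
    return max(1, start - flank), end + flank


def overlap_bp(a, b):
    """Number of bases shared by two inclusive (start, end) intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


add_b = (1131262, 1134777)
add_a = (1134774, 1138499)

print(context_window(*add_a))    # (1129774, 1143499)
print(overlap_bp(add_b, add_a))  # 4
```

The 4 bp overlap means the end of the addB reading frame and the start of addA share bases 1134774 to 1134777.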

Sequence


Protein


Length: 1241 a.a.
Molecular weight: 142711.83 Da
Isoelectric point: 4.7998

>NTDB_id=1027610 AB3U43_RS10175 WP_044796120.1 1134774..1138499(+) (addA) [Bacillus cereus strain L6]
MIENWPKKPEDSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADLALLGTLSSAAHESWTSLYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWKGNSGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE
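The FASTA header of this record packs several fields into one line. A minimal parsing sketch follows; the field layout is inferred from this single header, so the regular expression and the function name `parse_header` are assumptions rather than a documented format.

```python
import re

# Layout assumed from the one header shown in this record:
# >NTDB_id=<id> <locus tag> <protein id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, or raise ValueError if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=1027610 AB3U43_RS10175 WP_044796120.1 "
    "1134774..1138499(+) (addA) [Bacillus cereus strain L6]"
)
record = parse_header(header)
print(record["gene"], record["locus_tag"], record["end"] - record["start"] + 1)
```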

Nucleotide


Length: 3726 bp

>NTDB_id=1027610 AB3U43_RS10175 WP_044796120.1 1134774..1138499(+) (addA) [Bacillus cereus strain L6]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGATAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAACGGGCG
TGATATTTTAGTTGCGGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACCGAATC
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTACGGAATCGAA
GATAATACAATCTTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAGTCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAAGACTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACCCTGCAAGCAGATCTAGCTTTACT
TGGAACGCTATCATCAGCTGCTCATGAGTCGTGGACAAGTTTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAGGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGAAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTTTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTGGCGCTCCAA
TATCGTAACAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTTGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAGCAATCGATTTATCGTTTCCGACTAGCCG
AACCAGGACTATTCCTAGGAAAATATAAACGCTTCACACAAGAAGGATTGGGCGGCGGAATGAAGATCGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATCTTCAAACAAATTATGGGTGAAGAAGTCGGAGAAAT
CGACTACGATGCAGACGCTGAATTAAAGTTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCGGCAGAACTACTGTGCA
TTCAGCAAACGGAAGAAGAAGTAATAGACGGAGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGACCGTAAAAATGATAGTATGCGCCCAGTACAATA
CCGCGACTTCGTTATTTTACTTCGCTCCATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAATTACAAGGAATTCCAG
TATATGCAGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATCCCGCTTGCAGCAGTACTTCGTTCACCGATCGTTGGATTAAATGACGAAGAACTTGCGACGCTTCG
TGCTCACGGAAAGAAAGGGTCATTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCACTTGAAGAGGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTCGCGCGCCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGCGCAAGACAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGGGCCCTCGGTGAACAAGAAGACGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGTTAGAGTTCCCAGTTGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTAT
CACAGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCATTGACGCGT
GCGAAAGAGAAGTTAATTTTAATTGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCATATACGTGCCGGGGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCGGATGAAATTTACGGGTATGACACGAGCTGGAAA
GTAGAAGTTGTGGACGGTAATACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTATTAGAAGCGCTTCG
TGAGAAAAAGGCTGTGCCATTACAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGG
ATGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGCAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGAAAGGGAATAGCGGGGAATCGATCCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCGAAACCAATTTTA
GAAGATCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATATTTCTTTGATGGGAATCACGTTGTAAATATTGAAGAATAG
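The nucleotide and protein sequences can be checked against each other by translating the coding sequence. The sketch below builds the standard codon table (the bacterial table 11 assigns the same amino acids and differs only in permitted start codons) and translates the first 36 and last 15 bases of the sequence above. The function name `translate` is illustrative, and the input is assumed to be uppercase DNA with no ambiguity codes.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate uppercase DNA in frame 1; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_36 = "ATGATAGAAAATTGGCCTAAAAAACCAGAAGATAGT"
last_15 = "AATATTGAAGAATAG"

print(translate(first_36))  # MIENWPKKPEDS
print(translate(last_15))   # NIEE*
```

Both fragments match the start and end of the protein sequence listed above, with `*` marking the TAG stop codon.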


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
addA | Bacillus subtilis subsp. subtilis str. 168 | 53.488 | 100 | 0.537


Multiple sequence alignment