Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   FORC47_RS05855 Genome accession   NZ_CP017060
Coordinates   1123320..1127045 (+) Length   1241 a.a.
NCBI ID   WP_089170590.1    Uniprot ID   -
Organism   Bacillus cereus strain FORC_047     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1118320..1132045
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FORC47_RS05840 (FORC47_1018) - 1118478..1119071 (+) 594 WP_089170588.1 TVP38/TMEM64 family protein -
  FORC47_RS05845 (FORC47_1019) lepB 1119128..1119691 (+) 564 WP_000751897.1 signal peptidase I -
  FORC47_RS05850 (FORC47_1020) addB 1119808..1123323 (+) 3516 WP_089170589.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  FORC47_RS05855 (FORC47_1021) addA 1123320..1127045 (+) 3726 WP_089170590.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  FORC47_RS05860 (FORC47_1022) - 1127069..1127347 (+) 279 WP_089170591.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  FORC47_RS05865 (FORC47_1023) gerPF 1127484..1127699 (-) 216 WP_001141570.1 spore germination protein GerPF -
  FORC47_RS05870 (FORC47_1024) - 1127742..1128128 (-) 387 WP_000902329.1 spore germination protein GerPE -
  FORC47_RS05875 (FORC47_1025) gerPD 1128144..1128338 (-) 195 WP_001052807.1 spore germination protein GerPD -
  FORC47_RS05880 (FORC47_1026) gerPC 1128345..1128959 (-) 615 WP_089170592.1 spore germination protein GerPC -
  FORC47_RS05885 (FORC47_1027) gerPB 1129027..1129233 (-) 207 WP_001012508.1 spore germination protein GerPB -
  FORC47_RS05890 (FORC47_1028) gerPA 1129248..1129469 (-) 222 WP_001111187.1 spore germination protein GerPA -
  FORC47_RS05895 (FORC47_1029) - 1129566..1129745 (-) 180 WP_000462857.1 aspartyl-phosphate phosphatase Spo0E family protein -
  FORC47_RS05900 (FORC47_1030) - 1129994..1130893 (+) 900 WP_053565035.1 fumarylacetoacetate hydrolase family protein -
  FORC47_RS05905 (FORC47_1031) - 1130928..1131209 (-) 282 WP_000926859.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142652.74 Da        Isoelectric Point: 4.8146

>NTDB_id=194857 FORC47_RS05855 WP_089170590.1 1123320..1127045(+) (addA) [Bacillus cereus strain FORC_047]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPEGPAPRIETLQADVALLGTLSSAARESWTSLYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGSGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVMDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEDLATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHNEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITAETLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSEETILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEG

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=194857 FORC47_RS05855 WP_089170590.1 1123320..1127045(+) (addA) [Bacillus cereus strain FORC_047]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAACGGACG
TGATATTTTAGTTGCGGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAACCCAGTCGATGTTGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCACAAGAGATGAAAAACCGAATC
GGAGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGCATCGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAGTCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAGGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGAAGGTCCAGCGCCTCGCATTGAAACCCTGCAAGCAGATGTAGCTTTACT
TGGAACGCTATCATCAGCTGCTCGTGAGTCGTGGACAAGTTTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAGGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGACAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCTTCAGCAGTAGCGCTTCAA
TATCGTAATAAATTTGCTGAAGTGTTAGTCGATGAATATCAAGATACGAACTTTGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGCGACGTAAAGCAGTCAATTTATCGTTTCCGACTAGCCG
AACCAGGCTTATTCCTAGGAAAGTATAAACGCTTTACGCAAGAAGGATCGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AATTTTCGTAGTCGTCATGAAGTGCTAGCAGGAACGAACTTCATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCGGCAGAGCTATTATGTA
TTCAGCAAACGGAAGAAGAAGTAATGGATGGAGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTAGAAGCTCGTCTTATG
GCGCAGCGTATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGTGATTTCGTTATTTTACTCCGCTCGATGCCGTGGGCGCCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATATGCTGACCTTGCGACTGGTTATTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATCCCGCTTGCAGCTGTACTTCGTTCACCGATTGTGGGACTAAATGATGAAGACCTTGCGACGCTTCG
TGCTCATGGGAAGAAAGGGTCATTTTATGAAGTAATGAGCTCGTTCTTAAAAGGAGCACCGCTTGAAGAAGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAATTTACTGCAAGGATGGCGTGAATTTGCCCGTCAACAATCCCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCAGGAAAGCAAAGACAAGCAAACTT
ACGCGTACTATATGACCGCGCAAGACAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTGCGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCAAGAGCTCTTGGTGAACAAGAAGACGTCGTTCGAATTATGACGATT
CATAAAAGTAAAGGGCTAGAGTTCCCGGTCGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
GCGTTTCTTACTGCATAAAGATTTCGGTTTCGGTTCACAATTTATCGATCCGCGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCAATTAAACGCAAAATGAAAATGGAATTAATCGCGGAAGAAATGCGTGTATTATATGTAGCGTTAACGCGT
GCGAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAATGAATGGTTATTACCAGATCACATACGTGCCGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ATCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGCATTCCGGATGAAATTTACGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAACACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAAAAGGCTGTTCCCCTGCAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGG
ATGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGTGG
AACAGCAGTCCATGCCGTTATGCAGCATGTAGATTTGAAGAAACCGATTACGGCCGAAACGCTACAAGAACAAATCGCAG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCAATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGTGAAGAAACGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCAAAACCAATTTTA
GAAGATCGATATAAAGTGCAACTTTCGTTATATGCAAAAGCACTCGAGAAAAGCTTACAACATCCTGTCAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGGATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.248

100

0.535


Multiple sequence alignment