Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   FORC48_RS06005 Genome accession   NZ_CP017234
Coordinates   1134124..1137849 (+) Length   1241 a.a.
NCBI ID   WP_088867688.1    Uniprot ID   -
Organism   Bacillus cereus strain FORC_048     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1129124..1142849
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FORC48_RS05990 (FORC48_1081) - 1129282..1129875 (+) 594 WP_000347517.1 TVP38/TMEM64 family protein -
  FORC48_RS05995 (FORC48_1082) lepB 1129932..1130495 (+) 564 WP_000751919.1 signal peptidase I -
  FORC48_RS06000 (FORC48_1083) addB 1130612..1134127 (+) 3516 WP_025709863.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  FORC48_RS06005 (FORC48_1084) addA 1134124..1137849 (+) 3726 WP_088867688.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  FORC48_RS06010 (FORC48_1085) - 1137862..1138149 (+) 288 WP_000255728.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  FORC48_RS06015 (FORC48_1086) gerPF 1138287..1138502 (-) 216 WP_001141566.1 spore germination protein GerPF -
  FORC48_RS06020 (FORC48_1087) - 1138545..1138931 (-) 387 WP_000902331.1 spore germination protein GerPE -
  FORC48_RS06025 (FORC48_1088) gerPD 1138947..1139141 (-) 195 WP_001052807.1 spore germination protein GerPD -
  FORC48_RS06030 (FORC48_1089) gerPC 1139148..1139762 (-) 615 WP_001070760.1 spore germination protein GerPC -
  FORC48_RS06035 (FORC48_1090) gerPB 1139830..1140036 (-) 207 WP_001012508.1 spore germination protein GerPB -
  FORC48_RS06040 (FORC48_1091) gerPA 1140051..1140272 (-) 222 WP_001111187.1 spore germination protein GerPA -
  FORC48_RS06045 (FORC48_1092) - 1140369..1140548 (-) 180 WP_000462841.1 aspartyl-phosphate phosphatase Spo0E family protein -
  FORC48_RS06050 (FORC48_1094) - 1140796..1141695 (+) 900 WP_001213071.1 fumarylacetoacetate hydrolase family protein -
  FORC48_RS06055 (FORC48_1095) - 1141732..1142013 (-) 282 WP_078186992.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142678.92 Da        Isoelectric Point: 4.8010

>NTDB_id=196721 FORC48_RS06005 WP_088867688.1 1134124..1137849(+) (addA) [Bacillus cereus strain FORC_048]
MIENCPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADVALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLSDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITIEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSEEMILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=196721 FORC48_RS06005 WP_088867688.1 1134124..1137849(+) (addA) [Bacillus cereus strain FORC_048]
ATGATAGAAAATTGTCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAACGGGCG
TGATATTTTAGTTGCGGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCGCAAGAGATGAAAAACCGAATC
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGTTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTTTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTACGGAATCGAA
GATAATACAATCTTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTGCAACGTATGATTTTAGC
GCTTCATACAGAATCAAGGGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAAGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGATGGTCCAGCGCCTCGCATTGAAACTCTGCAAGCAGATGTAGCATTACT
TGGAACGCTATCATCAGCTGCTCGTGAATCGTGGACAAGTGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTTGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTGGCGCTCCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAATTTCGTACAAGAATCGATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAGCAGTCGATTTATCGTTTCCGACTAGCCG
AGCCAGGACTATTCCTAGGAAAATATAAACGCTTCACACAAGAAGGATTAGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATTTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGACTACGATGCTGACGCCGAACTAAAGCTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCAGCAGAATTATTATGTA
TTCAGCAAACGGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGCCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCCGGTTATGAAGTGTATGACCGTAAAAATGATAGTATGCGTCCGGTGCAATA
CCGCGATTTCGTTATTTTACTTCGCTCCATGCCGTGGGCCCCGCAAATTATGGAAGAGTTAAAATTACAAGGAATTCCAG
TATACGCTGACCTCGCGACTGGTTATTTTGAAGCGACGGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCCGCTGTACTTCGTTCCCCAATCGTTGGATTGAGCGATGAAGAACTTGCAACGCTTCG
TGCTCATGGAAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAAGAGGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTTGCACGTCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGCGCAAGACAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGGGCTCTAGGTGAACAAGAAGACGTTGTTCGCATTATGACAATT
CATAAAAGTAAAGGGTTAGAGTTCCCAGTTGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTAT
CACAGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCATTGACGCGT
GCGAAAGAGAAGTTAATTTTAATCGGCACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCACATACGTGCCGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAACACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAAAAGGCTGTTCCCCTGCAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGG
ATGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGCCCGCGATTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGATTGAGGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGTGAAGAAATGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTTAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCGAAACCAATTTTA
GAAGATCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTAAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.328

100

0.536


Multiple sequence alignment