Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   FORC21_RS06190 Genome accession   NZ_CP014486
Coordinates   1152334..1156056 (+) Length   1240 a.a.
NCBI ID   WP_065381909.1    Uniprot ID   A0A9X5VF81
Organism   Bacillus cereus strain FORC021     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1147334..1161056
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FORC21_RS06175 (FORC21_1081) - 1147492..1148085 (+) 594 WP_000347517.1 TVP38/TMEM64 family protein -
  FORC21_RS06180 (FORC21_1082) lepB 1148142..1148705 (+) 564 WP_000751919.1 signal peptidase I -
  FORC21_RS06185 (FORC21_1083) addB 1148822..1152337 (+) 3516 WP_065381910.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  FORC21_RS06190 (FORC21_1084) addA 1152334..1156056 (+) 3723 WP_065381909.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  FORC21_RS06195 (FORC21_1085) - 1156069..1156356 (+) 288 WP_000255728.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  FORC21_RS06200 (FORC21_1086) gerPF 1156494..1156709 (-) 216 WP_001141566.1 spore germination protein GerPF -
  FORC21_RS06205 (FORC21_1087) - 1156752..1157138 (-) 387 WP_000902324.1 spore germination protein GerPE -
  FORC21_RS06210 (FORC21_1088) gerPD 1157154..1157348 (-) 195 WP_001052807.1 spore germination protein GerPD -
  FORC21_RS06215 (FORC21_1089) gerPC 1157355..1157969 (-) 615 WP_001070760.1 spore germination protein GerPC -
  FORC21_RS06220 (FORC21_1090) gerPB 1158037..1158243 (-) 207 WP_001012508.1 spore germination protein GerPB -
  FORC21_RS06225 (FORC21_1091) gerPA 1158258..1158479 (-) 222 WP_001111187.1 spore germination protein GerPA -
  FORC21_RS06230 (FORC21_1092) - 1158576..1158755 (-) 180 WP_000462841.1 aspartyl-phosphate phosphatase Spo0E family protein -
  FORC21_RS06235 (FORC21_1093) - 1159003..1159902 (+) 900 WP_001213071.1 fumarylacetoacetate hydrolase family protein -
  FORC21_RS06240 (FORC21_1094) - 1159939..1160220 (-) 282 WP_065382234.1 hypothetical protein -

Sequence


Protein


Download         Length: 1240 a.a.        Molecular weight: 142633.80 Da        Isoelectric Point: 4.8146

>NTDB_id=171883 FORC21_RS06190 WP_065381909.1 1152334..1156056(+) (addA) [Bacillus cereus strain FORC021]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADVALLGTLSSAARESWTSLYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEVIDGEEGAEVEKAQLEARLMA
QRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDNP
MQDIPLAAVLRSPIVGLSDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDLI
WKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTIH
KSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTRA
KEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWKV
EVVDGNTLLAQEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNAF
IKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDSD
LGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSEETILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPILE
DRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3723 bp        

>NTDB_id=171883 FORC21_RS06190 WP_065381909.1 1152334..1156056(+) (addA) [Bacillus cereus strain FORC021]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCTAACGGACG
TGATATTTTAGTTGCGGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCACAAGAGATGAAAAACCGAATC
GGAGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGCATCGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAGGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACCCTGCAAGCAGATGTAGCTTTACT
TGGAACGCTATCATCAGCTGCTCGTGAGTCGTGGACAAGTTTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTGGCGCTCCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGACACGAACTTTGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAGCAGTCGATTTATCGTTTCCGACTAGCCG
AGCCAGGCTTATTCCTAGGAAAGTATAAACGTTTCACACAAGAAGGATTGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATTTTTAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGACTACGATGCTGACGCTGAACTAAAGCTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCAGCAGAATTATTATGTA
TTCAGCAAACGGAAGAAGTAATAGACGGTGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGCCTTATGGCG
CAGCGCATTAAAGCGATGGTTGATTCCGGTTATGAAGTGTATGACCGTAAAAATGATAGTATGCGTCCGGTGCAATATCG
CGATTTCGTTATTTTACTTCGCTCCATGCCGTGGGCCCCGCAAATTATGGAAGAGTTAAAATTACAAGGAATTCCAGTAT
ACGCTGACCTCGCGACTGGTTATTTTGAAGCGACGGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAATCCG
ATGCAAGATATTCCGCTTGCCGCTGTACTTCGTTCCCCAATCGTTGGGTTGAGCGATGAAGAACTTGCAACGCTTCGTGC
TCATGGAAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAAGAGGAGCAAGAACTTC
ATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTTGCCCGTCAACAATCCCTTTCTGATTTAATT
TGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCAGGAAAGCAAAGACAAGCAAACTTACG
CGTACTATATGACCGTGCAAGACAATATGAAGCAACATCATTTAGAGGGTTATTCCGCTTCTTACGTTTTATTGAACGTA
TTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCCCTAGGTGAACAAGAAGACGTTGTTCGCATTATGACGATTCAT
AAAAGTAAAGGGCTAGAGTTCCCGGTCGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAAACG
TTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTATCAC
AGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCATTGACGCGTGCG
AAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACATAG
TGAATGGTTATTACCAGATCACATACGTGCCGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGACACC
GTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGATATGACACTAGCTGGAAAGTA
GAAGTTGTGGACGGTAACACACTACTTGCACAAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCGTGA
GAAAAAGGCTGTTCCCCTGCAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGGATG
CGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCCTTT
ATTAAAAAACTACGTGCACCAATTAAAACACGCCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGGAAC
AGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTGGAA
TGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGTGAC
TTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGCATA
TCAAGATTGGCAAGGGAAGAGTGAAGAAACGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATGGCA
TTACGTTAATCGACTTTAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCGAAACCAATTTTAGAA
GATCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTAAAAGAGAAATGTTT
ATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.371

100

0.536


Multiple sequence alignment