Detailed information    

In silico (bioinformatically predicted)

Overview


Name: addA
Type: Machinery gene
Locus tag: BCPR1_RS21550
Genome accession: NZ_CP040515
Coordinates: 4137157..4140882 (-)
Length: 1241 a.a.
NCBI ID: WP_138322324.1
Uniprot ID: -
Organism: Bacillus paranthracis strain PR1
Function: homologous recombination; plasmid transformation (predicted from homology)
Homologous recombination
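
The coordinates, the nucleotide length, and the protein length reported for this entry can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon; the helper name `span_length` is illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


# Coordinates of addA as listed in the Overview.
gene_bp = span_length(4137157, 4140882)

# Every codon is 3 bp; the final codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codons = gene_bp // 3
protein_aa = codons - 1

print(gene_bp, codons, protein_aa)
```

Under these assumptions the gene spans 3726 bp, which corresponds to 1242 codons and a 1241-residue protein, matching the lengths given in the Sequence section.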

Genomic Context


Location: 4132157..4145882
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
BCPR1_RS21505 (BCPR1_21505) | - | 4133416..4134315 (-) | 900 | WP_044792701.1 | fumarylacetoacetate hydrolase family protein | -
BCPR1_RS21510 (BCPR1_21510) | - | 4134563..4134742 (+) | 180 | WP_000462851.1 | aspartyl-phosphate phosphatase Spo0E family protein | -
BCPR1_RS21515 (BCPR1_21515) | gerPA | 4134839..4135060 (+) | 222 | WP_001111188.1 | spore germination protein GerPA | -
BCPR1_RS21520 (BCPR1_21520) | gerPB | 4135075..4135281 (+) | 207 | WP_001012508.1 | spore germination protein GerPB | -
BCPR1_RS21525 (BCPR1_21525) | gerPC | 4135349..4135963 (+) | 615 | WP_001070747.1 | spore germination protein GerPC | -
BCPR1_RS21530 (BCPR1_21530) | gerPD | 4135970..4136164 (+) | 195 | WP_001052802.1 | spore germination protein GerPD | -
BCPR1_RS21535 (BCPR1_21535) | - | 4136180..4136566 (+) | 387 | WP_000902334.1 | spore germination protein GerPE | -
BCPR1_RS21540 (BCPR1_21540) | gerPF | 4136609..4136824 (+) | 216 | WP_001141566.1 | spore germination protein GerPF | -
BCPR1_RS21545 (BCPR1_21545) | - | 4136857..4137144 (-) | 288 | WP_000845836.1 | RNA polymerase alpha subunit C-terminal domain-containing protein | -
BCPR1_RS21550 (BCPR1_21550) | addA | 4137157..4140882 (-) | 3726 | WP_138322324.1 | helicase-exonuclease AddAB subunit AddA | Machinery gene
BCPR1_RS21555 (BCPR1_21555) | addB | 4140879..4144394 (-) | 3516 | WP_138322325.1 | helicase-exonuclease AddAB subunit AddB | Machinery gene
BCPR1_RS21560 (BCPR1_21560) | lepB | 4144511..4145074 (-) | 564 | WP_000751898.1 | signal peptidase I | -
BCPR1_RS21565 (BCPR1_21565) | - | 4145131..4145724 (-) | 594 | WP_000347517.1 | TVP38/TMEM64 family protein | -
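
The genomic context coordinates lend themselves to two quick checks: how the displayed window relates to the addA coordinates, and whether addA and the adjacent addB share any positions. The sketch below is illustrative only. The 5000 bp flank is an assumption inferred from the listed numbers rather than something the page states, and `overlap_bp` is a hypothetical helper name.

```python
FLANK = 5000  # assumed flank size; inferred from the numbers, not stated on the page

# 1-based inclusive coordinates taken from the table (both genes on the minus strand).
add_a = (4137157, 4140882)
add_b = (4140879, 4144394)

# Window obtained by extending the addA range by the assumed flank on each side.
window = (add_a[0] - FLANK, add_a[1] + FLANK)


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two 1-based inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


shared = overlap_bp(add_a, add_b)
print(window, shared)
```

With these inputs the computed window equals the listed Location (4132157..4145882), and the addA and addB ranges share 4 positions.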

Sequence


Protein


Length: 1241 a.a.
Molecular weight: 142681.91 Da
Isoelectric point: 4.8184

>NTDB_id=364884 BCPR1_RS21550 WP_138322324.1 4137157..4140882(-) (addA) [Bacillus paranthracis strain PR1]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVRFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSAYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREYSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEILLELGQGSVPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIQTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITEEVIREQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIAGKFPGGFDQAKPIL
EERYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVKIEE

Nucleotide


Length: 3726 bp

>NTDB_id=364884 BCPR1_RS21550 WP_138322324.1 4137157..4140882(-) (addA) [Bacillus paranthracis strain PR1]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAGTGGACAGATGACCAGTGGAAAGCCGTTGTAGCGAACGGGCG
TGATATTTTAGTTGCGGCTGCAGCAGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCATTAGAAAAAGTATTAATTGATGAGCCTGGCTCTCAGCACATCCGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATACTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACCGAAAATGAATTGTTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGACGATGATTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAAGATTTAGTGTATGCTTCTTATTTATTAGAAGATGTAAGATTCCAGCTTGAAACAGCGGAGCAGCATATT
CGTAAAGCGACGGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCGTTGAAACGCTGCAAGCAGATTTAGCCTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGCGCCTATAACGAGGATGTTGTGAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGTCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTCGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAGCAAAGTGAAGATGGTGAAATGAAGCCGTCAGCTGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTATTCATGGTTGGTGACGTGAAGCAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGATTATTCTTAGGAAAGTATAAACGTTTCACACAAGAAGGATTAGGCGGCGGAATGAAAATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCAGGTACGAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCGGCTGAACTACTATGCA
TTCAGCAAACGGAAGAAGAGGTAATAGACGGAGAAGAAGGGGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGCCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGTCCTGTACAATA
CCGCGACTTCGTTATTTTGCTTCGCTCCATGCCGTGGGCGCCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATACGCTGATCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATCGATAAT
CCGATGCAAGATATTCCGCTTGCAGCAGTACTTCGTTCCCCAATCGTTGGATTAAATGATGAAGAACTTGCAACGCTTCG
TGCTCATGGGAAGAAAGGATCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAGTGGTTCTATAACTTATTGCAAGGATGGCGTGAATTTGCACGTCAACAGTCACTTTCTGATTTA
ATTTGGAAAGTATACGGTGAGACAGGTTATTACGATTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACTT
GCGTGTACTATATGACCGCGCAAGACAATACGAGGCAACGTCGTTTAGAGGATTATTCCGCTTCTTGCGTTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTTGTAGCTGGACTAGGTCGTCGCTTTAATACACAAGACTTAATGAA
ACGTTTCTTACTGCATAAAGACTTCGGTTTCGGTTCACAATTTATTGATCCACGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGAGTATTATACGTAGCGTTAACACGT
GCAAAAGAGAAGTTAATTTTAATTGGAACAGTTAAGGATGCAAATAAAGAAATGGAAAAATGGCTTGATGCGAGGGAGTA
TAGTGAATGGTTATTACCAGATCACATACGTGCCGGAGCGTCCTGCTACTTAGACTGGATTGCACCTTCATTATATAGGC
ACCGTGATAGTGAAATACTTCTTGAATTAGGACAAGGAAGTGTTCCAGATGAAATTTATGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTTGACGGTAACACGTTACTCGCGCCAGAACCGGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAGGCCGTTCCCCTGCAAAGTGAACGGAAAGAAGAGGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAATGCC
TTTATTAAAAAATTACGTGCACCAATTCAAACACGTCCTCGTTTTATGGAGAAAAAGGGATTAACATATGCAGAGCGCGG
AACAGCAGTACATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACAGAAGAAGTGATTCGGGAGCAAATTGCTG
GAATGGTAAATAAAGAATTATTAACATTCGAGCAGGCGGAAGAAATTGCGATTGAAAAAGTAATTTCATTCTTTGATAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGGAGAGCGGGGAATCCATTCTTGTCCAAGGGGTTATTGACTGCATGATCGAAGAGGAAGACG
GTATTACGTTAATCGACTTTAAAACGGATACGATTGCAGGAAAATTCCCAGGTGGATTTGATCAAGCGAAACCAATTTTA
GAAGAGCGATATAAAGTACAACTTTCGTTATATGCAAAAGCGCTCGAGAAAAGCTTACAACACCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGCAACCATGTTGTAAAAATTGAAGAATAG
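
The nucleotide record can be checked against the protein record by translating it codon by codon. The sketch below is a minimal, dependency-free translator. It assumes the sequence is already given in coding orientation (it starts with ATG even though the gene is on the minus strand) and that the standard sense-codon assignments apply. The names `CODON_TABLE` and `translate` are illustrative. Only the first 30 nt and the last 45 nt of the record are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to this genome).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, dropping one trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nt and last 45 nt (in frame) of the nucleotide record.
head = translate("ATGATAGAAAATTGGCCTAAAAAACCAGAA")
tail = translate("TTATACTTCTTTGATGGCAACCATGTTGTAAAAATTGAAGAATAG")
print(head, tail)
```

The two fragments translate to `MIENWPKKPE` and `LYFFDGNHVVKIEE`, which match the start and end of the protein record. Passing the full 3726 nt sequence to `translate` should give the full 1241-residue protein under the same assumptions.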


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
addA | Bacillus subtilis subsp. subtilis str. 168 | 53.747 | 100 | 0.537
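
The page does not define the Ha-value. The figures listed for the Bacillus subtilis addA hit are consistent with the identity fraction multiplied by the coverage fraction, so the sketch below uses that formula as an assumption and not as the database's documented definition. The function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Identity and coverage as listed for the B. subtilis 168 addA hit.
ha = ha_value(53.747, 100)
print(round(ha, 3))
```

Rounded to three decimals, this reproduces the listed value of 0.537.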


Multiple sequence alignment