Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   C3496_RS11090 Genome accession   NZ_CP026608
Coordinates   2167779..2171504 (+) Length   1241 a.a.
NCBI ID   WP_136444325.1    Uniprot ID   A0A4S4HWX4
Organism   Bacillus anthracis strain HDZK-BYSB7     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 2162779..2176504
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C3496_RS11075 (C3496_11575) - 2162937..2163530 (+) 594 WP_000347516.1 TVP38/TMEM64 family protein -
  C3496_RS11080 (C3496_11580) lepB 2163587..2164150 (+) 564 WP_071729527.1 signal peptidase I -
  C3496_RS11085 (C3496_11585) addB 2164267..2167782 (+) 3516 WP_136444326.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  C3496_RS11090 (C3496_11590) addA 2167779..2171504 (+) 3726 WP_136444325.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  C3496_RS11095 (C3496_11595) - 2171520..2171807 (+) 288 WP_136444324.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  C3496_RS11100 (C3496_11600) gerPF 2171844..2172059 (-) 216 WP_001141566.1 spore germination protein GerPF -
  C3496_RS11105 (C3496_11605) - 2172102..2172488 (-) 387 WP_000902315.1 spore germination protein GerPE -
  C3496_RS11110 (C3496_11610) gerPD 2172504..2172698 (-) 195 WP_001102340.1 spore germination protein GerPD -
  C3496_RS11115 (C3496_11615) gerPC 2172705..2173319 (-) 615 WP_001070766.1 spore germination protein GerPC -
  C3496_RS11120 (C3496_11620) gerPB 2173387..2173593 (-) 207 WP_136444323.1 spore germination protein GerPB -
  C3496_RS11125 (C3496_11625) gerPA 2173609..2173830 (-) 222 WP_001111188.1 spore germination protein GerPA -
  C3496_RS11130 (C3496_11630) - 2173927..2174106 (-) 180 WP_000462851.1 aspartyl-phosphate phosphatase Spo0E family protein -
  C3496_RS11135 (C3496_11635) - 2174354..2175253 (+) 900 WP_001241722.1 fumarylacetoacetate hydrolase family protein -
  C3496_RS11140 (C3496_11640) - 2175289..2175573 (-) 285 WP_000925336.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142723.06 Da        Isoelectric Point: 4.8199

>NTDB_id=270700 C3496_RS11090 WP_136444325.1 2167779..2171504(+) (addA) [Bacillus anthracis strain HDZK-BYSB7]
MMENWPKKPEGSQWTDDQWKAVVAKGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLALLGMLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDIVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLESERKEEVYDRLMWKYGYGEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIQTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEILQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFDQAKPIL
EERYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=270700 C3496_RS11090 WP_136444325.1 2167779..2171504(+) (addA) [Bacillus anthracis strain HDZK-BYSB7]
ATGATGGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAAGGGGCG
TGATATTTTAGTCGCAGCAGCAGCAGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACAAATGCAGCAGCGCAAGAGATGAAAAACAGAATT
GGAGAGGCTTTAGAAAAAGTATTAATTGATGAACCAGGATCTCAGCACGTAAGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTCATTAGAGGATATTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACAGAAAATGAATTATTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGACGATGATTTACAACGTATGATTTTAGC
GCTTCATACGGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATATGACGTCGAAGGAA
AGACAATTGAAGATTTAGTGTACGCCTCTTACTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAAGCAACGGAGCTCGCAATGCTTCCTGACGGTCCGGCGCCTCGCGTTGAAACGCTGCAAGCAGATTTAGCTTTACT
TGGAATGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCAATGCAAAACGTATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGCGATTATAACGAGGATATTGTCAAACAAGTAGACTCTCTTCGTAATAAAGCAAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCCGGCCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTCGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAGCAAAGTGAAGATGGTGAAATGAAGCCGTCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTATTCATGGTTGGTGACGTGAAGCAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGATTATTCTTAGGAAAGTATAAACGTTTCACACAAGAAGGATTAGGCGGCGGAATGAAAATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCGGGTACAAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCAGCTGAACTATTGTGCA
TTCAGCAAACAGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCGGAAGTAGAAAAAGCACAGCTTGAAGCACGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGTGACTTCGTTATTTTGCTCCGCTCCATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAGCTGCAAGGAATTCCAG
TATACGCTGACCTTGCCACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCAATGCAAGATATTCCGCTTGCAGCAGTGCTTCGCTCCCCAATCGTTGGATTAAATGATGAAGAGCTTGCAACGCTTCG
TGCTCACGGGAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGAGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTACAACTTACTGCAAGGATGGCGTGAATTCGCGCGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTATGACTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACCT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGACTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCGGTCGTATTCGTAGCTGGACTAGGTCGCCGCTTTAATACACAAGACTTAATGAA
GCGTTTCTTACTGCATAAAGACTTCGGTTTTGGTTCGCAATTTATCGATCCTCGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCGATTAAGCGTAAAATGAAAATGGAATTAATCGCGGAAGAAATGCGTGTATTATACGTAGCGTTAACACGG
GCAAAAGAGAAGTTAATTTTAATTGGAACGGTTAAGGATGCAAATAAAGAAATGGAAAAGTGGCTTGATGCGAGGGAGCA
TAGTGAATGGTTATTACCAGATCATATACGTGCTGGAGCGTCTTGTTATTTAGACTGGATTGCCCCTTCCTTATATAGAC
ATCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGCATTCCAGATGAAATTTATGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTTGATGGTAACACTTTACTTGCGCCAGAACCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAGGCTGTTCCGTTAGAGAGTGAACGGAAAGAAGAAGTGTACGATAGATTAATGTGGAAGTACGGATATGGAG
AAGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCT
TTTATTAAAAAATTACGTGCACCAATTCAAACACGTCCTCGTTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGAGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACAGTTGAAATTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAAGAATTATTAACATTTGAACAAGCAGAAGAAATAGCAATTGAAAAAGTGATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGGAGAGCGGGGAATCAATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GTATCACTTTAATCGACTTTAAAACGGATACGATTGAAGGGAAGTTCCCAGGGGGATTTGATCAAGCGAAACCAATTTTA
GAAGAGCGATATAAAGTGCAGCTTTCGTTATATGCAAAGGCACTTGAGAAAAGTTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTATAAAAGTTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4S4HWX4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.747

100

0.537


Multiple sequence alignment