Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   KM393_RS05850 Genome accession   NZ_CP076190
Coordinates   1116538..1120263 (+) Length   1241 a.a.
NCBI ID   WP_000970458.1    Uniprot ID   Q81TW1
Organism   Bacillus anthracis strain Pasteur     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1111538..1125263
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KM393_RS05835 (KM393_05855) - 1111696..1112289 (+) 594 WP_000347516.1 TVP38/TMEM64 family protein -
  KM393_RS05840 (KM393_05860) lepB 1112346..1112909 (+) 564 WP_000751894.1 signal peptidase I -
  KM393_RS05845 (KM393_05865) addB 1113026..1116541 (+) 3516 WP_000058556.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  KM393_RS05850 (KM393_05870) addA 1116538..1120263 (+) 3726 WP_000970458.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  KM393_RS05855 (KM393_05875) - 1120276..1120563 (+) 288 WP_000718626.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  KM393_RS05860 (KM393_05880) gerPF 1120601..1120816 (-) 216 WP_001141566.1 spore germination protein GerPF -
  KM393_RS05865 (KM393_05885) - 1120859..1121245 (-) 387 WP_000902341.1 spore germination protein GerPE -
  KM393_RS05870 (KM393_05890) gerPD 1121261..1121455 (-) 195 WP_001052802.1 spore germination protein GerPD -
  KM393_RS05875 (KM393_05895) gerPC 1121462..1122076 (-) 615 WP_001070765.1 spore germination protein GerPC -
  KM393_RS05880 (KM393_05900) gerPB 1122143..1122349 (-) 207 WP_001012512.1 spore germination protein GerPB -
  KM393_RS05885 (KM393_05905) gerPA 1122364..1122585 (-) 222 WP_001111188.1 spore germination protein GerPA -
  KM393_RS05890 (KM393_05910) - 1122682..1122860 (-) 179 Protein_1083 aspartyl-phosphate phosphatase Spo0E family protein -
  KM393_RS05895 (KM393_05915) - 1123108..1124007 (+) 900 WP_001241727.1 fumarylacetoacetate hydrolase family protein -
  KM393_RS05900 (KM393_05920) - 1124043..1124327 (-) 285 WP_000925338.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142622.96 Da        Isoelectric Point: 4.8348

>NTDB_id=573231 KM393_RS05850 WP_000970458.1 1116538..1120263(+) (addA) [Bacillus anthracis strain Pasteur]
MMENWPKKPEGSQWTDDQWKAVVATGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADLALLGTLSAAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDIVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVKYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDATKEMEKWLDAREHSEWLLPDHVRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLESERKEEVYDRLMWKYGYGEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIQTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEILQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
ETRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=573231 KM393_RS05850 WP_000970458.1 1116538..1120263(+) (addA) [Bacillus anthracis strain Pasteur]
ATGATGGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGATCAGTGGAAAGCGGTTGTAGCGACCGGACG
TGATATTTTAGTCGCAGCAGCAGCAGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACAAATGCAGCAGCGCAAGAGATGAAAAACAGAATT
GGAGAGGCTTTAGAAAAAGTATTAATTGATGAACCAGGATCTCAGCACGTAAGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATATTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACAGAAAATGAATTATTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGCGACCGTAGTGATGATGATTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATATGACGTCGAAGGAA
AGACAATTGAAGATTTAGTGTACGCCTCTTACTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAAGCAACTGAGCTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACGCTGCAAGCAGATTTAGCTTTACT
TGGAACGTTATCAGCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGCGATTATAACGAGGATATTGTCAAACAAGTAGACTCTCTTCGTAATAAAGCAAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGGCCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTCGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAGATGGTGAAATGAAGCCATCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCCGAGAGTGAAGGAAACTTATTCATGGTTGGTGACGTGAAGCAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGATTATTCTTAGGAAAGTATAAACGTTTCACACAAGAAGGATTAGGCGGCGGAATGAAAATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCGGGTACAAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCAGCTGAACTATTGTGCA
TTCAGCAAACAGAGGAAGAAGTAATAGACGGTGAAGAAGGTGCGGAAGTAGAAAAGGCACAGCTTGAAGCACGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTAAAATA
CCGTGACTTCGTTATTTTACTTCGCTCGATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATACGCTGACCTTGCCACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCAGTGCTTCGCTCCCCAATCGTTGGATTAAATGATGAAGAGCTTGCAACGCTTCG
TGCTCACGGGAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGAGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTATAACTTACTGCAAGGATGGCGTGAATTCGCACGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTATGACTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACCT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGACTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGCAAAGGACTTGAGTTCCCAGTCGTATTCGTCGCTGGACTTGGTCGTCGTTTTAATACGCAAGACTTAATGAA
ACGTTTCTTACTTCATAAAGACTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACGACATTAT
CACAACTTGCAATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTCTTATACGTAGCGTTAACACGG
GCAAAAGAGAAGTTAATTTTAATTGGAACGGTTAAGGATGCAACTAAGGAAATGGAAAAATGGCTGGATGCGAGGGAGCA
TAGTGAATGGTTATTACCAGATCACGTACGTGCCGGAGCATCTTGTTATTTAGACTGGATTGCACCTTCCTTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGGCAAGGAAGTATTCCAGATGAAATTTATGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTTGACGGGAACACGTTACTTGCGCCAGAACCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAGGCTGTTCCGTTAGAGAGTGAACGGAAAGAAGAAGTGTACGATAGATTAATGTGGAAGTACGGATATGGAG
AAGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCT
TTTATTAAAAAATTACGTGCACCAATTCAAACACGTCCTCGTTTTATGGAGAAAAAAGGGCTAACGTACGCAGAGCGAGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACAGTTGAAATTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAAGAATTATTAACATTTGAACAAGCAGAAGAAATAGCAATTGAAAAAGTGATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGGAGAGCGGGGAATCCATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GTATTACTTTAATCGACTTTAAAACGGATACGATTGAAGGGAAGTTCCCGGGAGGATTCGAACAAGCGAAACCAATTTTA
GAAACTCGTTACAAAGTGCAGCTTTCGTTATATGCAAAGGCACTTGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTATAAAAGTTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q81TW1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.828

100

0.538