Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   MQW34_RS05785 Genome accession   NZ_CP094455
Coordinates   1126216..1129941 (+) Length   1241 a.a.
NCBI ID   WP_243499602.1    Uniprot ID   -
Organism   Bacillus sp. ZJS3     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1121216..1134941
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MQW34_RS05770 (MQW34_05765) - 1121374..1121967 (+) 594 WP_000347517.1 TVP38/TMEM64 family protein -
  MQW34_RS05775 (MQW34_05770) lepB 1122024..1122587 (+) 564 WP_071723866.1 signal peptidase I -
  MQW34_RS05780 (MQW34_05775) addB 1122704..1126219 (+) 3516 WP_243499600.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  MQW34_RS05785 (MQW34_05780) addA 1126216..1129941 (+) 3726 WP_243499602.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  MQW34_RS05790 (MQW34_05785) - 1129954..1130241 (+) 288 WP_000179960.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  MQW34_RS05795 (MQW34_05790) gerPF 1130283..1130498 (-) 216 WP_001141566.1 spore germination protein GerPF -
  MQW34_RS05800 (MQW34_05795) - 1130541..1130927 (-) 387 WP_000902314.1 spore germination protein GerPE -
  MQW34_RS05805 (MQW34_05800) gerPD 1130943..1131137 (-) 195 WP_001052802.1 spore germination protein GerPD -
  MQW34_RS05810 (MQW34_05805) gerPC 1131144..1131758 (-) 615 WP_001070766.1 spore germination protein GerPC -
  MQW34_RS05815 (MQW34_05810) gerPB 1131826..1132032 (-) 207 WP_001012512.1 spore germination protein GerPB -
  MQW34_RS05820 (MQW34_05815) gerPA 1132047..1132268 (-) 222 WP_001111188.1 spore germination protein GerPA -
  MQW34_RS05825 (MQW34_05820) - 1132365..1132544 (-) 180 WP_000462851.1 aspartyl-phosphate phosphatase Spo0E family protein -
  MQW34_RS05830 (MQW34_05825) - 1132792..1133691 (+) 900 WP_170879213.1 fumarylacetoacetate hydrolase family protein -
  MQW34_RS05835 (MQW34_05830) - 1133732..1134016 (-) 285 WP_243499604.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142703.94 Da        Isoelectric Point: 4.8339

>NTDB_id=669630 MQW34_RS05785 WP_243499602.1 1126216..1129941(+) (addA) [Bacillus sp. ZJS3]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLALLGTLSSAARESWTSVYEAMQNISWQTLKRIKKSDYNEDIVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLDGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVPDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPGEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKDEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIATEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQARPIL
ETRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=669630 MQW34_RS05785 WP_243499602.1 1126216..1129941(+) (addA) [Bacillus sp. ZJS3]
ATGATTGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAGTGGACAGATGATCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCAGCAGCAGCTGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGAGCCAGGCTCTCAGCACATTAGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATACTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAACCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGAATTGAA
GATAATACGATATTTTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTGCAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATACGACGTAGAAGGAA
AGACAATTGAAGATTTAGTATACGCTTCTTACTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAACATATT
CGGAAAGCGACGGAACTTGCAATGCTTCCTGACGGTCCAGCACCTCGCGTTGAAACGCTGCAAGCAGATTTAGCTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACATATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGTGATTATAATGAGGATATTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTGTTAGA
AAAACTCGTGCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAGCAAAGTGAAGATGGTGAAATGAAGCCATCAGCAGTAGCGCTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCTATTATTAAATTCGT
AACGAAAGATTCGGAGAGTGAAGGAAACTTATTTATGGTTGGTGATGTGAAGCAGTCGATCTATCGTTTCCGATTAGCAG
AACCAGGCTTATTCCTAGGAAAGTATAAACGCTTCACGCAAGAAGGGTTAGACGGCGGAATGAAAATCGATTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCAGGTACGAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
CGATTACGATGCTGACGCTGAATTAAAGTTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCGGCTGAACTACTATGTA
TTCAGCAAACAGAAGAAGAAGTGCCAGACGGTGAAGAAGGTGCGGAAGTAGAAAAAGCGCAACTTGAAGCTCGACTTATG
GCGCAGCGTATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGACCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGCGACTTCGTTATTTTACTTCGCTCTATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAGTTGCAAGGAATTCCAG
TATACGCTGATCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATAATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCAGTGCTTCGTTCTCCAATCGTTGGATTAAACGATGAAGAACTTGCAACGCTTCG
TGCTCACGGAAAGAAAGGCTCGTTTTATGAAGTGATGAGCTCATTCTTAAAAGGGGCACCGCTGGAAGAAGAAAAAGAAC
TCCATGATAAATTAGAGTGGTTCTATAACTTACTACAAGGATGGCGTGAATTTGCGCGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTTGTCGGCGGTTTACCAGCTGGAAAGCAAAGACAAGCAAACTT
GCGTGTATTATATGACCGCGCAAGACAATATGAGGCAACATCGTTTAGAGGATTATTCCGTTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGACGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTTGTCGCTGGACTTGGTCGTCGTTTTAATACACAAGATTTAATGAA
ACGTTTCTTACTGCATAAAGACTTCGGTTTCGGTTCGCAATTTATCGATCCTCGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTCTTATACGTAGCGTTAACACGT
GCAAAAGAGAAGTTAATTTTAATCGGAACAGTGAAAGATGCAAATAAGGAAATGGAAAAATGGCTGGATGCGAGGGAGCA
TAGTGAATGGTTATTACCGGATCATATACGTGCCGGAGCATCTTGTTATTTAGACTGGATTGCACCTTCCTTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGGTGAAATTTATGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTTGACGGTAACACGTTACTTGCGCCAGAACCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCTCTTCG
TGAGAAAAAAGCTGTTCCCCTGCAAAGTGAACGAAAAGATGAAGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACGCGTCCTCGTTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGG
AACAGCAGTCCATGCTGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCGG
GAATGGTAAATAAGGAATTATTAACATTTGAACAAGCAGAAGAAATAGCAACTGAAAAAGTGATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGGAGAGCGGGGAATCAATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GTATCACTTTAATCGACTTTAAAACAGATACGATTGAAGGGAAGTTCCCAGGGGGATTCGAACAAGCGAGACCAATTTTA
GAAACTCGTTACAAAGTGCAGCTTTCGTTATATGCAAAGGCACTTGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTATAAAAGTTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.248

100

0.535


Multiple sequence alignment