Detailed information    

In silico (bioinformatically predicted)

Overview


Name: addA
Type: Machinery gene
Locus tag: ACHZK0_RS05835
Genome accession: NZ_CP172246
Coordinates: 1153471..1157196 (+)
Length: 1241 a.a.
NCBI ID: WP_395761325.1
UniProt ID: -
Organism: Bacillus sp. 3G2
Function: homologous recombination; plasmid transformation (predicted from homology)
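The gene coordinates and the protein length in this overview can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def span_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon that is not translated."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length_bp(1153471, 1157196)
print(cds_bp)                     # 3726
print(protein_length_aa(cds_bp))  # 1241
```

Both values agree with the lengths reported for the nucleotide and protein sequences of this entry.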

Genomic Context


Location: 1148471..1162196
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
ACHZK0_RS05820 (ACHZK0_05820) | - | 1148629..1149222 (+) | 594 | WP_043936619.1 | TVP38/TMEM64 family protein | -
ACHZK0_RS05825 (ACHZK0_05825) | lepB | 1149279..1149842 (+) | 564 | WP_061676503.1 | signal peptidase I | -
ACHZK0_RS05830 (ACHZK0_05830) | addB | 1149959..1153474 (+) | 3516 | WP_262742311.1 | helicase-exonuclease AddAB subunit AddB | Machinery gene
ACHZK0_RS05835 (ACHZK0_05835) | addA | 1153471..1157196 (+) | 3726 | WP_395761325.1 | helicase-exonuclease AddAB subunit AddA | Machinery gene
ACHZK0_RS05840 (ACHZK0_05840) | - | 1157209..1157496 (+) | 288 | WP_262742314.1 | RNA polymerase alpha subunit C-terminal domain-containing protein | -
ACHZK0_RS05845 (ACHZK0_05845) | gerPF | 1157577..1157792 (-) | 216 | WP_001141570.1 | spore germination protein GerPF | -
ACHZK0_RS05850 (ACHZK0_05850) | - | 1157835..1158221 (-) | 387 | WP_262742315.1 | spore germination protein GerPE | -
ACHZK0_RS05855 (ACHZK0_05855) | gerPD | 1158237..1158431 (-) | 195 | WP_001052807.1 | spore germination protein GerPD | -
ACHZK0_RS05860 (ACHZK0_05860) | gerPC | 1158438..1159052 (-) | 615 | WP_063263235.1 | spore germination protein GerPC | -
ACHZK0_RS05865 (ACHZK0_05865) | gerPB | 1159120..1159326 (-) | 207 | WP_001012508.1 | spore germination protein GerPB | -
ACHZK0_RS05870 (ACHZK0_05870) | gerPA | 1159341..1159562 (-) | 222 | WP_001111187.1 | spore germination protein GerPA | -
ACHZK0_RS05875 (ACHZK0_05875) | - | 1159659..1159838 (-) | 180 | WP_000462840.1 | aspartyl-phosphate phosphatase Spo0E family protein | -
ACHZK0_RS05880 (ACHZK0_05880) | - | 1160085..1160984 (+) | 900 | WP_262742316.1 | fumarylacetoacetate hydrolase family protein | -
ACHZK0_RS05885 (ACHZK0_05885) | - | 1161018..1161302 (-) | 285 | WP_262742317.1 | hypothetical protein | -
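The coordinates in this table are enough to work out how addA sits relative to its neighbours. The sketch below copies three rows from the table and assumes 1-based, inclusive coordinates. The `Feature` type and `gap_bp` helper are hypothetical names introduced for illustration. The 5000 bp flank is an inference from the listed Location (it matches the addA coordinates extended by 5000 bp on each side), not a parameter stated on the page.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    locus_tag: str
    start: int
    end: int
    strand: str
    size_bp: int


# Rows copied from the genomic context table (addB, addA, downstream neighbour).
FEATURES = [
    Feature("ACHZK0_RS05830", 1149959, 1153474, "+", 3516),  # addB
    Feature("ACHZK0_RS05835", 1153471, 1157196, "+", 3726),  # addA
    Feature("ACHZK0_RS05840", 1157209, 1157496, "+", 288),
]


def gap_bp(upstream: Feature, downstream: Feature) -> int:
    """Intergenic distance in bp; a negative value means the features overlap."""
    return downstream.start - upstream.end - 1


# Each listed size should equal end - start + 1 under the assumed convention.
for f in FEATURES:
    assert f.end - f.start + 1 == f.size_bp

add_b, add_a, downstream = FEATURES
print(gap_bp(add_b, add_a))       # -4, i.e. addB and addA overlap by 4 bp
print(gap_bp(add_a, downstream))  # 12

# Assumed flank size: 5000 bp on either side of addA.
window = (add_a.start - 5000, add_a.end + 5000)
print(window)  # (1148471, 1162196)
```

A short overlap between same-strand neighbours such as addB and addA is common for genes in an operon, although the table alone does not establish co-transcription.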

Sequence


Protein


Length: 1241 a.a.; Molecular weight: 142600.79 Da; Isoelectric point: 4.8978

>NTDB_id=1065889 ACHZK0_RS05835 WP_395761325.1 1153471..1157196(+) (addA) [Bacillus sp. 3G2]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIISEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEAEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADAVLLGTLSSAARESWTSVYEAMQKVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGSGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVQDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSGWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPGEIYDYDTSWK
VQVVDGSTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKMRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITEEIIREQISGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGQSGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFDQAKQIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVKIEE

Nucleotide


Length: 3726 bp

>NTDB_id=1065889 ACHZK0_RS05835 WP_395761325.1 1153471..1157196(+) (addA) [Bacillus sp. 3G2]
ATGATTGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAGTGGACAGATGACCAATGGAAAGCAGTTGTAGCGAACGGGCG
TGATATTTTAGTCGCAGCAGCAGCTGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAGTG
AAGAAAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAATAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGAGCCAGGCTCTCAGCACGTAAGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCTATTTCGACGATCCATTCCTTTTGTTTACAAGTAATTAGAGGATACTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTATTAAAAGAAGAAGTGTTAGATGACATATTAGAAGCAGAGTATGGAATTGAA
GACAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGATCGTAGTGATGATGACTTACAGCGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAGTGGCTCGATAAATTAGTAGAAGCATACGATGTAGAAGGAA
AGACAATAGAAGATTTAGTGTATGCTTCTTACTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAAGCAACCGAACTCGCAATGCTTCCTGACGGCCCGGCGCCTCGCGTTGAAACCTTGCAAGCTGATGCGGTTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGTGTGTATGAAGCGATGCAAAAAGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAACGAAGATGTTGTAAAACAAGTAGATTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTTTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTCACGGAGCGCTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACCG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAGATGAAGCCATCAGCAGTAGCGCTTCAA
TATCGTAATAAATTTGCTGAAGTACTAGTCGATGAATATCAAGATACGAACTTCGTACAGGAATCCATTATTAAATTTGT
AACGAAGGATTCTGAGAGTGAAGGAAACTTGTTCATGGTAGGTGACGTAAAACAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGTTTATTCCTAGGAAAGTATAAACGCTTCACACAAGAAGGATCGGGCGGCGGAATGAAGATAGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCGGGTACGAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTCGGAGAAAT
CGACTACGATGCAGACGCTGAATTAAAGTTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCGGCAGAACTACTATGCA
TTCAGCAAACGGAAGAAGAAGTGCAAGACGGTGAAGAAGGTGCAGAAGTAGAAAAAGCGCAACTTGAAGCTCGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGCGATTTCGTTATTTTACTTCGCTCTATGCCGTGGGCGCCGCAAATTATGGAAGAGTTAAAGCTGCAAGGAATTCCAG
TATACGCTGATCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTGATTGATAAT
CCAATGCAAGATATTCCACTTGCCGCAGTGCTTCGTTCACCAATCGTTGGATTAAATGATGAAGAACTTGCGACGCTTCG
TGCTCACGGAAAGAAAGGTTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAGGAAGAGCAAGAAC
TACATGATAAATTAGAGTGGTTCTATAATTTACTGCAAGGGTGGCGTGAATTCGCGCGTCAACAATCTCTTTCTGATTTG
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTTGTTGGTGGTTTACCAGCCGGAAAGCAAAGGCAAGCAAACCT
GCGTGTACTTTATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGATTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGACGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGTATTATGACGATT
CATAAAAGTAAAGGATTAGAGTTCCCAGTCGTGTTTGTTGCGGGGCTTGGTCGCCGTTTTAATACACAAGATTTAATGAA
ACGTTTCTTACTGCATAAAGATTTCGGTTTCGGTTCACAATTTATCGATCCTCGTAAACGTATTAAATATACGACATTAT
CGCAACTAGCAATTAAGCGTAAAATGAAAATGGAATTAATCGCCGAAGAAATGCGCGTATTATACGTAGCGTTAACACGT
GCAAAAGAAAAGTTAATTTTAATCGGAACGGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAGCA
TAGTGGATGGTTATTACCAGATCATATACGTGCCGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ATCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGTAGTATTCCGGGTGAAATTTATGACTATGACACGAGCTGGAAA
GTCCAAGTTGTAGACGGTAGCACGTTACTTGCGCCAGAACCGGTTCAAGAAGAGAAACAAGAATTACTAGAAGCACTTCG
TGAGAAAAAAGCTGTTCCCCTGCAAAGTGAACGAAAAGAAGAAGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCAACATCTCACCGCGCGAAACAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAAATGCGTGCGCCAATTAAAACACGTCCTCGTTTTATGGAGAAAAAAGGATTAACATACGCTGAAAGAGG
TACGGCAGTTCATGCTGTCATGCAACATGTTGATTTGAAGAAGCCAATTACAGAAGAAATAATTCGGGAGCAAATTAGTG
GCATGGTTAATAAAGAGCTATTAACATTCGAACAAGCCGAAGAAATAGCAATCGAAAAAGTCATTTCATTCTTTGACAGT
GATCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGCAGAGCGGGGAATCAATTCTCGTCCAAGGGGTTATCGACTGTATGATTGAAGAGGAAGACG
GCATTACGTTAATCGACTTTAAAACAGATACGATTGAAGGGAAGTTCCCAGGCGGATTCGATCAAGCGAAACAAATTTTA
GAAGATCGATACAAAGTACAGCTTTCGTTATATGCAAAGGCACTTGAAAAAAGCTTGCAGCACCCAGTGAAAGAGAAATG
CTTATACTTCTTTGATGGCAATCATGTTGTGAAAATTGAAGAATAG
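The nucleotide record should translate to the protein record shown earlier. The following is a minimal sketch of that check, assuming the standard codon-to-amino-acid assignments (which the bacterial translation table shares). To keep the example short, only the first 27 and last 18 nucleotides are copied from the sequence above. The `translate` helper is illustrative and not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first_27_nt = "ATGATTGAAAATTGGCCTAAAAAACCA"  # start of the nucleotide record
last_18_nt = "GTGAAAATTGAAGAATAG"            # end of the nucleotide record

print(translate(first_27_nt))  # MIENWPKKP
print(translate(last_18_nt))   # VKIEE*
```

The two fragments match the first nine and last five residues of the protein sequence, followed by the stop codon.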


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
addA | Bacillus subtilis subsp. subtilis str. 168 | 53.425 | 100 | 0.534
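The Ha-value listed here is numerically consistent with the product of the fractional identity and the fractional coverage. That reading is an assumption inferred from the three numbers shown, since the page itself does not define the score. The `ha_value` function below is a hypothetical helper written only to illustrate the relationship.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity multiplied by fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values taken from the addA hit against Bacillus subtilis subsp. subtilis str. 168.
print(round(ha_value(53.425, 100), 3))  # 0.534
```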

