Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   CY96_RS04835 Genome accession   NZ_CP007512
Coordinates   960154..963879 (+) Length   1241 a.a.
NCBI ID   WP_044584221.1    Uniprot ID   A0A9W3PPV4
Organism   Bacillus bombysepticus str. Wang     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 955154..968879
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CY96_RS04820 (CY96_04895) - 955312..955905 (+) 594 WP_000347517.1 TVP38/TMEM64 family protein -
  CY96_RS04825 (CY96_04900) lepB 955962..956525 (+) 564 WP_000751910.1 signal peptidase I -
  CY96_RS04830 (CY96_04905) addB 956642..960157 (+) 3516 WP_044584220.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  CY96_RS04835 (CY96_04910) addA 960154..963879 (+) 3726 WP_044584221.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  CY96_RS04840 (CY96_04915) - 963892..964179 (+) 288 WP_000255727.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  CY96_RS04845 (CY96_04920) gerPF 964317..964532 (-) 216 WP_001141566.1 spore germination protein GerPF -
  CY96_RS04850 (CY96_04925) - 964575..964961 (-) 387 WP_000902319.1 spore germination protein GerPE -
  CY96_RS04855 (CY96_04930) gerPD 964977..965171 (-) 195 WP_001052807.1 spore germination protein GerPD -
  CY96_RS04860 (CY96_04935) gerPC 965178..965792 (-) 615 WP_001070759.1 spore germination protein GerPC -
  CY96_RS04865 (CY96_04940) gerPB 965860..966066 (-) 207 WP_001012508.1 spore germination protein GerPB -
  CY96_RS04870 (CY96_04945) gerPA 966081..966302 (-) 222 WP_001111188.1 spore germination protein GerPA -
  CY96_RS04875 (CY96_04950) - 966399..966578 (-) 180 WP_000462841.1 aspartyl-phosphate phosphatase Spo0E family protein -
  CY96_RS04880 (CY96_04955) - 966825..967724 (+) 900 WP_001213065.1 fumarylacetoacetate hydrolase family protein -
  CY96_RS04885 (CY96_04960) - 967760..968053 (-) 294 WP_000926854.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142754.85 Da        Isoelectric Point: 4.7703

>NTDB_id=120196 CY96_RS04835 WP_044584221.1 960154..963879(+) (addA) [Bacillus bombysepticus str. Wang]
MIENWPKKPEDSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADLVLLGTLSSAAHESWTSLYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLESERKDEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEDGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLVAEEAYQDWQGNSGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=120196 CY96_RS04835 WP_044584221.1 960154..963879(+) (addA) [Bacillus bombysepticus str. Wang]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGATAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAACGGGCG
TGATATTTTAGTTGCGGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACCGAATC
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCTAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTACGGAATCGAA
GATAATACAATCTTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAGTCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAAGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACCCTGCAAGCAGATCTAGTTTTACT
TGGAACGCTATCATCAGCTGCTCATGAGTCGTGGACAAGTTTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGAAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTTTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTTGCGCTCCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTTGTACAGGAATCTATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTTATGGTTGGTGACGTAAAACAGTCGATTTATCGTTTCCGACTAGCCG
AACCAGGACTATTCCTAGGAAAATATAAACGCTTCACACAAGAAGGATTGGGCGGTGGAATGAAGATCGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATCTTCAAACAAATTATGGGTGAAGAAGTCGGAGAAAT
CGACTACGATGCAGACGCTGAATTAAAGTTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCGGCAGAACTACTGTGCA
TTCAGCAAACGGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGACCGTAAAAATGATAGTATGCGCCCAGTACAATA
CCGCGACTTCGTTATTTTACTTCGCTCCATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAACTACAAGGAATTCCAG
TATATGCAGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATCCCGCTTGCAGCAGTACTTCGTTCACCGATCGTTGGATTAAATGACGAAGAACTTGCGACGCTTCG
TGCTCACGGAAAGAAAGGGTCATTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCACTTGAAGAGGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTCGCGCGCCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGCGCAAGACAATACGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGACATGGGTACGGCGAGGGCCCTCGGTGAACAAGAAGACGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGTTAGAGTTCCCGGTTGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTAT
CACAGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCGTTGACGCGC
GCGAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCACATACGTGCCGGGGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAATACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAAAAGGCTGTGCCGTTAGAAAGTGAACGAAAAGATGAAGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCAACATCTCACCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCAGAAGATGGTAGTGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCGAAAAGCGTTGAGCGTGAAGTACCATTTACGATGATGCTTGTAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAATAGCGGGGAATCGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAAGAAGATG
GTATTACGCTAATCGATTTCAAAACAGATACGATTGAAGGGAAATTTCCAGGCGGATTCGAACAAGCGAAACCAATTTTA
GAAGATCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTAAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.488

100

0.537


Multiple sequence alignment