Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   F0362_RS05785 Genome accession   NZ_CP043830
Coordinates   1105047..1108772 (+) Length   1241 a.a.
NCBI ID   WP_000572318.1    Uniprot ID   -
Organism   Bacillus sp. BS98     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1100047..1113772
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  F0362_RS05770 (F0362_05770) - 1100079..1100672 (+) 594 WP_000347508.1 TVP38/TMEM64 family protein -
  F0362_RS05775 (F0362_05775) lepB 1100855..1101418 (+) 564 WP_000751914.1 signal peptidase I -
  F0362_RS05780 (F0362_05780) addB 1101535..1105050 (+) 3516 WP_061529373.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  F0362_RS05785 (F0362_05785) addA 1105047..1108772 (+) 3726 WP_000572318.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  F0362_RS05790 (F0362_05790) - 1108785..1109072 (+) 288 WP_000179959.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  F0362_RS05795 (F0362_05795) gerPF 1109161..1109376 (-) 216 WP_001141566.1 spore germination protein GerPF -
  F0362_RS05800 (F0362_05800) - 1109419..1109805 (-) 387 WP_000902321.1 spore germination protein GerPE -
  F0362_RS05805 (F0362_05805) gerPD 1109821..1110015 (-) 195 WP_001052804.1 spore germination protein GerPD -
  F0362_RS05810 (F0362_05810) gerPC 1110022..1110636 (-) 615 WP_001070753.1 spore germination protein GerPC -
  F0362_RS05815 (F0362_05815) gerPB 1110705..1110911 (-) 207 WP_001012514.1 spore germination protein GerPB -
  F0362_RS05820 (F0362_05820) gerPA 1110926..1111147 (-) 222 WP_001111187.1 spore germination protein GerPA -
  F0362_RS05825 (F0362_05825) - 1111249..1111425 (-) 177 WP_000462849.1 aspartyl-phosphate phosphatase Spo0E family protein -
  F0362_RS05830 (F0362_05830) - 1111672..1112571 (+) 900 WP_001213067.1 fumarylacetoacetate hydrolase family protein -
  F0362_RS05835 (F0362_05835) - 1112607..1112888 (-) 282 WP_000926856.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142775.96 Da        Isoelectric Point: 4.8167

>NTDB_id=386059 F0362_RS05785 WP_000572318.1 1105047..1108772(+) (addA) [Bacillus sp. BS98]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERMIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNSIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVRFQLETAEQHI
RKATELAMLPDGPAPRVETLQADAALLGMLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVKLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENDEVKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGSGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPIGEDVAAELLCIQQTEEEVLEGEEGTEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHEKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEILLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHTVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKIISFFDS
DLGKRVLAAKSVEREVPFTMMLSAEEAYQDWQGNSGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNYVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=386059 F0362_RS05785 WP_000572318.1 1105047..1108772(+) (addA) [Bacillus sp. BS98]
ATGATAGAGAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCCGTTGTAGCGAATGGACG
TGACATTTTAGTCGCAGCAGCAGCTGGATCGGGAAAAACAGCAGTATTAGTTGAACGTATGATTAAAAAGATTATTAATG
AGGAAAATCCAGTCGATGTCGATCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAATCGAATT
GGAGAAGCGTTAGAAAAAGTACTAATTGATGAGCCAGGATCTCAACATGTAAGAAAACAGCTGAGCCTATTAAATAAAGC
ATCTATTTCAACGATCCATTCATTTTGTTTACAAGTAATTAGAGGGTATTACTACATGCTGGATGTTGATCCTCGTTTTC
GTATTGCAAATCAAACAGAAAATGAGTTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGAATCGAA
GATAATAGTATTTTCTTTGAATTAGTTGATCGCTATACGAGTGACCGTAGTGATGATGACTTACAAAGAATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATACGACGTGGAAGGAA
AGACAATTGAAGATTTAGTGTATGCTTCTTACCTATTAGAAGATGTGAGATTCCAGCTTGAAACAGCGGAACAACATATT
CGTAAAGCAACCGAACTCGCAATGCTTCCTGACGGCCCGGCGCCTCGCGTTGAAACCCTGCAAGCGGATGCAGCTTTACT
TGGAATGTTATCATCAGCAGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAACGAGGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAACTCGTGAAGCTCGTAAAAGTCTTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTTGATTTCACAG
ATTTAGAGCATTTCTGTTTGCAAATTTTAAGTGAACAAAGTGAAAATGATGAAGTGAAGCCATCAGCAGTAGCGCTTCAA
TATCGTAATAAATTTGCAGAAGTACTAGTCGATGAATATCAAGATACGAACTTCGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGGAACTTGTTCATGGTTGGTGACGTAAAACAGTCAATCTACCGTTTCCGACTAGCAG
AACCAGGCTTATTCTTAGGAAAGTATAAACGCTTCACGCAAGAAGGATCGGGCGGCGGAATGAAGATTGATTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATTTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAATAGGTGAAGATGTAGCAGCTGAATTATTATGCA
TTCAGCAAACGGAAGAAGAAGTACTAGAGGGTGAAGAAGGTACGGAAGTCGAAAAGGCGCAACTGGAAGCTCGTCTTATG
GCGCAGCGCATTAAAGCGATGGTCGATTCAGGTTATGAAGTGTATGACCGAAAAACGGATAGTATGCGACCAGTGCAATA
TCGTGATTTCGTTATTTTACTTCGCTCTATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAACTACAAGGAATTCCAG
TATATGCAGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCACTTGCAGCAGTACTTCGTTCACCGATCGTTGGATTAAATGATGAAGAACTTGCGACGCTTCG
TGCTCACGGAAAGAAAGGGTCATTTTATGAAGTAATGAGCTCATTCTTAAAAGGTGCACCGCTTGAAGAAGAACAAGAAC
TTCATGAAAAACTAGAATGGTTTTATAACTTACTACAAGGATGGCGTGAATTCGCGCGCCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGGTATTATGACTTTGTCGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
GCGTGTATTATATGACCGCGCAAGGCAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGACGATATGGGGACGGCGAGAGCTCTTGGTGAACAAGAAGATGTTGTTCGCATTATGACGATT
CATAAAAGTAAGGGACTTGAGTTCCCGGTCGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGATTTAATGAA
ACGTTTCTTACTGCATAAAGACTTCGGTTTCGGTTCACAGTTCATTGATCCGCGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCAATTAAACGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTATTATACGTAGCTTTAACGCGT
GCAAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTCGATGCGAGGGAACA
TAGTGAATGGTTATTACCGGATCACATACGTGCTGGAGCGTCTTGCTATTTAGACTGGATTGCACCTTCATTATATAGAC
ATCGTGATAGTGAAATACTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAACACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAGTTGTTAGAAGCACTTCG
TGAGAAAAAGGCTGTGCCATTACAAAGTGAACGAAAAGAAGAAGTGTACGATAGGTTAATGTGGAAGTACGGGTATGAGG
AAGCAACCTCTCATCGTGCGAAGCAATCCGTTACAGAAATAAAGAGAAATTATCAGTCTGAAGAAGGCAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGATTAACGTACGCAGAGCGTGG
GACGGCAGTCCATACCGTTATGCAACATGTAGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAACAAATTGCTG
GAATGGTAAATAAGGAATTGTTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAATAATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTTCCATTTACGATGATGCTATCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAATAGCGGGGAATCGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAAGAAGATG
GTATTACGCTAATCGATTTCAAAACAGATACGATTGAAGGTAAATTCCCAGGCGGATTTGAACAAGCAAAACCAATTTTA
GAAGATCGCTATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTCGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGGAATTATGTTGTGAATATTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.328

100

0.536


Multiple sequence alignment