Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   LCF45_RS05855 Genome accession   NZ_CP084035
Coordinates   1155573..1159298 (+) Length   1241 a.a.
NCBI ID   WP_002125881.1    Uniprot ID   -
Organism   Bacillus sp. 41-22     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1150573..1164298
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LCF45_RS05840 (LCF45_05830) - 1150575..1151168 (+) 594 WP_002011278.1 TVP38/TMEM64 family protein -
  LCF45_RS05845 (LCF45_05835) lepB 1151379..1151942 (+) 564 WP_002011277.1 signal peptidase I -
  LCF45_RS05850 (LCF45_05840) addB 1152061..1155576 (+) 3516 WP_002125879.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  LCF45_RS05855 (LCF45_05845) addA 1155573..1159298 (+) 3726 WP_002125881.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  LCF45_RS05860 (LCF45_05850) - 1159322..1159600 (+) 279 WP_061675287.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  LCF45_RS05865 (LCF45_05855) - 1159793..1160500 (+) 708 WP_002011269.1 DNA alkylation repair protein -
  LCF45_RS05870 (LCF45_05860) gerPF 1160556..1160771 (-) 216 WP_001141570.1 spore germination protein GerPF -
  LCF45_RS05875 (LCF45_05865) - 1160814..1161200 (-) 387 WP_002125894.1 spore germination protein GerPE -
  LCF45_RS05880 (LCF45_05870) gerPD 1161215..1161409 (-) 195 WP_001052805.1 spore germination protein GerPD -
  LCF45_RS05885 (LCF45_05875) gerPC 1161416..1162030 (-) 615 WP_001070757.1 spore germination protein GerPC -
  LCF45_RS05890 (LCF45_05880) gerPB 1162098..1162304 (-) 207 WP_001012505.1 spore germination protein GerPB -
  LCF45_RS05895 (LCF45_05885) gerPA 1162319..1162540 (-) 222 WP_001111190.1 spore germination protein GerPA -
  LCF45_RS05900 (LCF45_05890) - 1162637..1162816 (-) 180 WP_002087032.1 aspartyl-phosphate phosphatase Spo0E family protein -
  LCF45_RS05905 (LCF45_05895) - 1163060..1163962 (+) 903 WP_078206356.1 fumarylacetoacetate hydrolase family protein -
  LCF45_RS05910 (LCF45_05900) - 1163997..1164281 (-) 285 WP_000925332.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142513.77 Da        Isoelectric Point: 4.9188

>NTDB_id=609161 LCF45_RS05855 WP_002125881.1 1155573..1159298(+) (addA) [Bacillus sp. 41-22]
MIENWPQKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIISEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDGPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNSIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEKHI
RKATELAMLPDGPAPRVETLQADLALLGTLSSAARGSWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVKLVKVFTERFQAIKRDKGMVDFTDLEHFCLQILSEQSEGGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGSGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVQDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGNIPGEIYEYSASWK
VEVVEGKDLLAPEPVQEEKQELLEALRDKKAVPLESERKEEVYDRLMWEYGYADATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITIEVLQEQIARMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLSAEEAYQDWQGKKGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFDQAKPIL
EDRYKVQLSLYAKALEKSLKHPVKEKCLYFFDGNHVVNIDE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=609161 LCF45_RS05855 WP_002125881.1 1155573..1159298(+) (addA) [Bacillus sp. 41-22]
ATGATAGAAAATTGGCCTCAGAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCAGCGGCAGCTGGATCAGGGAAAACAGCGGTATTAGTTGAACGTATTATTAAAAAGATTATAAGTG
AGGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCGCAAGAAATGAAAAATCGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGGGCCAGGTTCACAGCATATAAGAAAGCAGCTTAGCTTATTAAATAAAGC
TTCCATTTCTACAATTCATTCATTTTGTTTACAAGTTATTAGAGGATATTATTACATGCTTGATGTCGATCCTCGTTTTC
GTATTGCGAACCAAACAGAAAATGAGTTATTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAATATGGAATCGAA
GATAATAGTATTTTCTTTGAACTAGTTGATCGTTATACGAGTGACCGTAGTGACGATGACTTACAACGAATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAACCCGGAGAAGTGGCTTGATAAATTAGTAGAAGCATACGACGTTGAAGGAA
AGACGATTGAAGATTTAGTGTACGCTTCTTACTTATTAGAAGATGTGAAGTTTCAGCTTGAAACAGCGGAAAAGCATATT
CGTAAGGCGACTGAACTCGCAATGCTTCCTGATGGTCCAGCGCCTCGCGTTGAAACTTTACAAGCGGATTTAGCTTTACT
TGGAACGTTATCCTCAGCAGCTCGTGGATCGTGGACAAGCGTTTATGAAGCGATGCAAAATGTATCGTGGCAAACGTTAA
AGCGTATTAAAAAAAGTGATTACAACGAAGATGTTGTAAAACAAGTAGATTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGACCCGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAACTCGTGAAGCTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATTAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTGCAAATTTTAAGCGAGCAAAGTGAAGGCGGTGAAATGAAGCCGTCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTACTAGTTGATGAATATCAAGATACGAACTTCGTACAGGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAACAGTCAATCTATCGTTTCCGACTAGCAG
AGCCTGGTTTATTCTTAGGAAAATATAAACGTTTCACACAAGAAGGATCGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGCCGTCATGAAGTGCTAGCAGGTACGAACTTTATTTTCAAACAAATTATGGGCGAAGAAGTCGGCGAAAT
CGATTATGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCGGCAGAGCTATTATGTA
TTCAGCAAACTGAGGAAGAAGTGCAAGACGGTGAAGAAGGTGCCGAAGTAGAAAAAGCGCAGCTTGAAGCTCGTCTTATG
GCGCAGCGTATTAAAGCAATGGTCGATTCAGGCTATGAAGTGTATGATCGTAAAACTGATAGTATGCGCCCAGTGCAATA
TCGTGATTTCGTTATTTTACTTCGCTCGATGCCATGGGCGCCGCAAATTATGGAAGAGTTAAAACTACAAGGAATTCCGG
TATATGCTGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTCATTGATAAC
CCGATGCAAGATATTCCGCTTGCAGCTGTACTTCGTTCACCGATCGTTGGACTAAATGACGAAGAACTTGCGACGCTTCG
TGCTCATGGAAAGAAAGGCTCGTTTTATGAAGTGATGAGCTCATTCTTAAAGGGAGCGCCGCTTGAAGAAGAACAAGAAC
TGCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGGTGGCGTGAATTTGCGCGTCAACAATCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGCTATTACGACTTTGTCGGCGGTTTACCAGCCGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGTGCAAGACAATATGAAGCAACATCGTTTAGAGGATTATTCCGCTTCTTGCGCTTTATTGAGC
GTATTTTAGAACGCGGAGATGATATGGGAACGGCGAGAGCCCTCGGAGAACAAGAAGATGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGCTAGAGTTCCCAGTCGTATTTGTAGCTGGACTTGGTCGCCGTTTTAATACACAAGATTTAATGAA
GCGTTTCTTACTTCATAAAGATTTCGGTTTCGGTTCACAATTTATCGATCCTCGTAAACGAATTAAATATACGACATTAT
CGCAACTAGCGATTAAGCGTAAAATGAAGATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCGTTAACACGT
GCAAAAGAGAAGTTAATTTTAATTGGAACAGTTAAAGATGCCAATAAAGAAATGGAAAAATGGCTCGATGCAAGAGAGCA
TAGTGAATGGTTATTACCAGACCATATTCGTGCCGGAGCGTCTTGCTACTTAGACTGGATTGCACCTTCCTTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAATATTCCAGGCGAAATTTATGAATATAGTGCTAGCTGGAAA
GTGGAAGTTGTTGAAGGAAAAGATTTACTTGCACCAGAACCGGTTCAAGAAGAGAAACAAGAGTTATTAGAAGCGCTTCG
CGATAAAAAAGCTGTTCCGTTAGAAAGTGAACGGAAAGAAGAAGTGTATGACAGATTGATGTGGGAATACGGGTATGCGG
ACGCGACATCTCACCGTGCGAAACAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTTCGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGCTAACATACGCAGAGCGAGG
GACAGCAGTCCATGCCGTTATGCAGCATGTAGATTTGAAGAAACCAATTACGATTGAAGTTCTTCAAGAACAAATTGCAA
GAATGGTAAATAAAGAACTATTAACATTTGAACAAGCTGAAGAAATAGCGATTGAAAAGGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCAGCAAAAAGTGTTGAGCGCGAAGTACCATTTACGATGATGCTTTCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGAAGAAAGGCGAATCAATACTTGTCCAAGGGGTTATCGACTGCATGATCGAAGAGGAAGACG
GAATTACTTTAATCGACTTTAAAACAGATACGATTGAAGGAAAGTTCCCAGGCGGATTTGATCAAGCGAAACCAATTTTA
GAAGACCGATACAAAGTACAGCTTTCACTATATGCAAAAGCACTCGAGAAAAGCTTAAAACATCCTGTGAAAGAGAAATG
TTTATATTTCTTTGATGGAAATCATGTTGTAAATATTGACGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.569

100

0.538