Detailed information    

in silico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   CUC43_RS15545 Genome accession   NZ_CP024771
Coordinates   2889966..2893691 (+) Length   1241 a.a.
NCBI ID   WP_029437815.1    Uniprot ID   -
Organism   Bacillus thuringiensis LM1212     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 2884966..2898691
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CUC43_RS15530 (CUC43_16025) - 2885124..2885717 (+) 594 WP_000347516.1 TVP38/TMEM64 family protein -
  CUC43_RS15535 (CUC43_16030) lepB 2885774..2886337 (+) 564 WP_000751904.1 signal peptidase I -
  CUC43_RS15540 (CUC43_16035) addB 2886454..2889969 (+) 3516 WP_029437814.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  CUC43_RS15545 (CUC43_16040) addA 2889966..2893691 (+) 3726 WP_029437815.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  CUC43_RS15550 (CUC43_16045) - 2893704..2893991 (+) 288 WP_029437816.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  CUC43_RS15555 (CUC43_16050) gerPF 2894097..2894312 (-) 216 WP_001141566.1 spore germination protein GerPF -
  CUC43_RS15560 (CUC43_16055) - 2894355..2894741 (-) 387 WP_029437817.1 spore germination protein GerPE -
  CUC43_RS15565 (CUC43_16060) gerPD 2894757..2894951 (-) 195 WP_001102341.1 spore germination protein GerPD -
  CUC43_RS15570 (CUC43_16065) gerPC 2894958..2895572 (-) 615 WP_029437818.1 spore germination protein GerPC -
  CUC43_RS15575 (CUC43_16070) gerPB 2895641..2895847 (-) 207 WP_001012508.1 spore germination protein GerPB -
  CUC43_RS15580 (CUC43_16075) gerPA 2895862..2896083 (-) 222 WP_001111187.1 spore germination protein GerPA -
  CUC43_RS15585 (CUC43_16080) - 2896180..2896359 (-) 180 WP_029437819.1 aspartyl-phosphate phosphatase Spo0E family protein -
  CUC43_RS15590 (CUC43_16085) - 2896607..2897506 (+) 900 WP_029437820.1 fumarylacetoacetate hydrolase family protein -
  CUC43_RS15595 (CUC43_16090) - 2897542..2897826 (-) 285 WP_000925339.1 hypothetical protein -
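
The sizes in the table above follow from the 1-based, inclusive coordinates. The sketch below is a minimal check, with values copied from the table and hypothetical helper names (`span_length`, `overlap`). It recomputes the addA and addB sizes, the overlap between the two genes, and the flanking distance implied by the "Location" line. The equal-flank reading of that window is an inference from the numbers, not something the record states.

```python
# Coordinates copied from the Genomic Context table (1-based, inclusive).
ADD_B = (2886454, 2889969)
ADD_A = (2889966, 2893691)
WINDOW = (2884966, 2898691)  # the "Location" line of the table


def span_length(start, end):
    """Length in bp of a 1-based inclusive interval."""
    return end - start + 1


def overlap(a, b):
    """Number of bp shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


add_a_bp = span_length(*ADD_A)
add_b_bp = span_length(*ADD_B)
shared_bp = overlap(ADD_B, ADD_A)

# Assumption: the displayed window is the target gene plus a flank on each side.
flanks = (ADD_A[0] - WINDOW[0], WINDOW[1] - ADD_A[1])

print(add_a_bp, add_b_bp, shared_bp, flanks)
```

With these coordinates the computed sizes match the "Size (bp)" column, and the end of addB overlaps the start of addA by a few bases on the same strand.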

Sequence


Protein


Length: 1241 a.a.        Molecular weight: 142861.27 Da        Isoelectric Point: 4.8691

>NTDB_id=255371 CUC43_RS15545 WP_029437815.1 2889966..2893691(+) (addA) [Bacillus thuringiensis LM1212]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLVLLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQTMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPDGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKVMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSLFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEIHLELGQGSIPDEIYGYSASWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEEATAHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIRTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSEETILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EERYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVKIEE
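
The FASTA header of this record packs several fields into one line. The sketch below shows one way to split it into named fields. The regular expression and the `parse_header` helper are hypothetical and inferred from this single header, so other records may need a looser pattern.

```python
import re

# Header copied verbatim from the record above.
HEADER = (
    ">NTDB_id=255371 CUC43_RS15545 WP_029437815.1 "
    "2889966..2893691(+) (addA) [Bacillus thuringiensis LM1212]"
)

# Assumed layout: id, locus tag, protein accession, coordinates with strand,
# gene name in parentheses, organism in square brackets.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line):
    """Return the header fields as a dict; raise ValueError if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    # Convert the numeric fields so they can be used in arithmetic.
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


record = parse_header(HEADER)
print(record)
```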

Nucleotide


Length: 3726 bp

>NTDB_id=255371 CUC43_RS15545 WP_029437815.1 2889966..2893691(+) (addA) [Bacillus thuringiensis LM1212]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCCGTTGTAGCGAATGGGCG
TGATATTTTAGTCGCAGCAGCAGCTGGATCAGGGAAAACAGCGGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGAGCCAGGTTCTCAGCACATTCGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCCTTTTGTTTACAAGTTATTAGAGGATACTATTATATGCTTGATGTTGATCCTCGTTTCC
GCATAGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGACGATGATTTACAACGTATGATTTTAGC
ACTTCATACAGAATCAAGAGCACATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATACGATGTGGAAGGAA
AGACAATTGAAGATTTAGTGTACGCTTCCTACTTATTAGAAGATGTGAAATTCCAACTTGAAACAGCGGAACAGCATATT
CGTAAAGCGACAGAGCTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCGTTGAAACCCTGCAAGCAGATTTAGTTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAACGAGGATGTTGTAAAGCAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAAGCCTGAAAGTTTCTTGCGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAACTTGTACAGCTTGTAAAAGTATTTACAGAACGTTTCCAAACGATGAAGCGAGATAAAGGAATGGTCGATTTCACGG
ATTTAGAGCATTTTTGTTTACAAATTTTAAGTGAACAAAGTGAAGATGGTGAAATGAAGCCATCAGCAGTAGCGCTTCAA
TATCGCAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCGATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGCGACGTGAAGCAGTCAATCTATCGTTTCCGACTAGCAG
AACCAGGATTATTCCTAGGAAAGTATAAACGCTTCACGCAAGAAGGATTAGGCGGCGGAATGAAGATCGATTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCAGGCACGAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTACCTATCCAGATGGTGAAGATGTAGCAGCTGAGCTACTATGCA
TTCAGCAAACGGAAGAAGAGGTAATAGACGGTGAAGAAGGTGCGGAAGTAGAAAAGGCGCAGCTTGAAGCTCGTCTTATG
GCGCAACGCATTAAAGTGATGGTCGATTCTGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGCGACTTCGTTATTTTACTCCGCTCCATGCCGTGGGCACCGCAAATTATGGAAGAGTTGAAATTACAAGGAATTCCAG
TATACGCTGATCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCAGTACTTCGTTCACCAATCGTTGGATTAAACGATGAAGAACTTGCAACGCTTCG
TGCTCACGGGAAGAAAGGTTCGTTTTATGAAGTAATGAGCTTATTCTTAAAAGGGGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTACAACTTACTGCAAGGATGGCGTGAATTCGCGCGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTATGATTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACCT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGCCTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAGCGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTCGTAGCTGGACTAGGTCGCCGCTTTAATACACAAGACTTAATGAA
GCGTTTCTTACTGCATAAAGACTTCGGTTTCGGTTCGCAATTTATTGATCCACGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCGATTAAGCGTAAAATGAAAATGGAATTAATCGCGGAAGAAATGCGCGTATTATATGTTGCGTTAACACGT
GCAAAAGAGAAGTTAATTTTAATTGGAACGGTTAAGGATGCAAATAAAGAAATGGAAAAATGGCTTGATGCGAGGGAGCA
TAGTGAATGGTTATTACCAGATCATATACGTGCCGGAGCGTCTTGCTACTTAGACTGGATTGCACCTTCCTTATATAGAC
ATCGTGATAGTGAAATACATCTTGAATTAGGGCAAGGAAGCATTCCAGATGAAATTTATGGATATAGTGCGAGCTGGAAA
GTAGAAGTTGTTGACGGTAACACGTTACTTGCGCCAGAACCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAAGCTGTTCCGTTGCAAAGTGAACGAAAAGAAGAGGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCGACAGCTCATCGTGCGAAGCAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAATGCC
TTTATTAAAAAATTACGTGCACCAATTAGAACCCGTCCTCGTTTTATGGAGAAAAAAGGTTTAACGTACGCAGAGCGTGG
AACTGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGTGAAGAAACGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCGAAACCAATTTTA
GAAGAGCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCGCTCGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTGTAAAAATCGAAGAATAG
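
The nucleotide and protein records can be cross-checked against each other: 3726 bp is 1242 codons, and the last codon is a stop, which leaves 1241 residues. The sketch below translates the first line and the last few codons of the nucleotide sequence. It assumes the standard codon assignments, and `translate` is a hypothetical helper, not a tool used by the database.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate in frame 1, ignoring a trailing incomplete codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the nucleotide record and its last four codons, copied verbatim.
FIRST_LINE = (
    "ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCCGTTGTAGCGAATGGGCG"
)
LAST_CODONS = "ATCGAAGAATAG"

prefix = translate(FIRST_LINE)
suffix = translate(LAST_CODONS)
expected_residues = 3726 // 3 - 1  # subtract the stop codon

print(prefix, suffix, expected_residues)
```

The translated prefix can be compared with the start of the protein record, and the suffix with its final residues followed by the stop (`*`).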


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168 53.778 100 0.539


Multiple sequence alignment