Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   BGI23_RS05770 Genome accession   NZ_CP017016
Coordinates   1113125..1116850 (+) Length   1241 a.a.
NCBI ID   WP_070805470.1    Uniprot ID   -
Organism   Bacillus sp. ABP14     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1108125..1121850
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BGI23_RS05755 (BGI23_05755) - 1108283..1108876 (+) 594 WP_000347516.1 TVP38/TMEM64 family protein -
  BGI23_RS05760 (BGI23_05760) lepB 1108933..1109496 (+) 564 WP_000751898.1 signal peptidase I -
  BGI23_RS05765 (BGI23_05765) addB 1109613..1113128 (+) 3516 WP_070805469.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  BGI23_RS05770 (BGI23_05770) addA 1113125..1116850 (+) 3726 WP_070805470.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  BGI23_RS05775 (BGI23_05775) - 1116863..1117150 (+) 288 WP_070805471.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  BGI23_RS05780 (BGI23_05780) gerPF 1117184..1117399 (-) 216 WP_001141566.1 spore germination protein GerPF -
  BGI23_RS05785 (BGI23_05785) - 1117442..1117828 (-) 387 WP_000902334.1 spore germination protein GerPE -
  BGI23_RS05790 (BGI23_05790) gerPD 1117844..1118038 (-) 195 WP_001052802.1 spore germination protein GerPD -
  BGI23_RS05795 (BGI23_05795) gerPC 1118045..1118659 (-) 615 WP_001070747.1 spore germination protein GerPC -
  BGI23_RS05800 (BGI23_05800) gerPB 1118727..1118933 (-) 207 WP_001012508.1 spore germination protein GerPB -
  BGI23_RS05805 (BGI23_05805) gerPA 1118948..1119169 (-) 222 WP_001111188.1 spore germination protein GerPA -
  BGI23_RS05810 (BGI23_05810) - 1119266..1119445 (-) 180 WP_000462851.1 aspartyl-phosphate phosphatase Spo0E family protein -
  BGI23_RS05815 (BGI23_05815) - 1119693..1120592 (+) 900 WP_172801679.1 fumarylacetoacetate hydrolase family protein -
  BGI23_RS05820 (BGI23_05820) - 1120628..1120912 (-) 285 WP_070805472.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142684.87 Da        Isoelectric Point: 4.8343

>NTDB_id=194479 BGI23_RS05770 WP_070805470.1 1113125..1116850(+) (addA) [Bacillus sp. ABP14]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVRFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVSDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREYSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEILLELGQGSVPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIQTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITEEVIREQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSGESILVQGVIDCMIEEEDGITLIDFKTDTIAGKFPGGFDQAKPIL
EERYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVKIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=194479 BGI23_RS05770 WP_070805470.1 1113125..1116850(+) (addA) [Bacillus sp. ABP14]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAGTGGACAGATGACCAGTGGAAAGCCGTTGTAGCGAACGGGCG
TGATATTTTAGTTGCGGCTGCAGCAGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGAGCCTGGCTCTCAGCACATCCGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCCTTTTGTTTACAAGTTATTAGAGGATACTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACCGAAAATGAATTGTTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGACGATGATTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAAGATTTAGTGTATGCTTCTTATTTATTAGAAGATGTAAGATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAAGCGACGGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCGTTGAAACGCTGCAAGCAGATTTAGCTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGCGATTATAACGAGGATGTTGTGAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGTCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTCGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAGCAAAGTGAAGATGGTGAAATGAAACCGTCAGCTGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTATTCATGGTTGGTGACGTGAAGCAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGATTATTCTTAGGAAAGTATAAACGTTTCACACAAGAAGGATTAGGCGGCGGAATGAAAATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCGGGTACGAACTTTATTTTCAAACAAATTATGGGCGAAGAAGTTGGAGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCAGCTGAACTATTGTGCA
TTCAGCAAACAGAAGAAGAAGTAAGCGATGGTGAAGAAGGTGCGGAAGTAGAAAAAGCACAGCTTGAAGCACGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGTGACTTCGTTATTTTACTTCGATCCATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATACGCTGATCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCCGTACTTCGTTCCCCAATCGTTGGATTAAATGATGAAGAACTTGCAACGCTTCG
TGCTCATGGGAAGAAAGGATCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAGTGGTTCTATAACTTACTGCAAGGATGGCGTGAATTTGCACGCCAACAGTCACTTTCTGATTTA
ATTTGGAAAGTATACGGTGAGACAGGTTATTACGATTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACTT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACGTCGTTTAGAGGATTATTCCGCTTCTTGCGTTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTTGTAGCTGGACTAGGTCGTCGCTTTAATACACAAGACTTAATGAA
ACGTTTCTTACTGCATAAAGACTTCGGTTTCGGTTCACAATTTATTGATCCACGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTATTATACGTAGCGTTAACACGT
GCAAAAGAGAAGTTAATTTTAATTGGAACAGTTAAGGATGCAAATAAAGAAATGGAAAAATGGCTTGATGCGAGGGAGTA
TAGTGAATGGTTATTACCAGATCACATACGTGCCGGAGCGTCCTGCTACTTAGACTGGATTGCACCTTCATTATATAGGC
ACCGTGATAGTGAAATACTTCTTGAATTAGGACAAGGAAGTGTTCCAGATGAAATTTATGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTTGACGGTAACACGTTACTCGCGCCAGAACCGGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAGGCCGTTCCCCTGCAAAGTGAACGGAAAGAAGAGGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAATGCC
TTTATTAAAAAATTACGTGCACCAATTCAAACACGTCCTCGTTTTATGGAGAAAAAGGGATTAACATATGCAGAGCGCGG
AACAGCAGTACATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACAGAAGAAGTGATTCGGGAGCAAATTGCTG
GAATGGTAAATAAAGAATTATTAACATTCGAGCAGGCGGAAGAAATTGCGATTGAAAAAGTAATTTCATTCTTTGATAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGCGGGGAATCCATTCTTGTTCAAGGGGTTATCGACTGTATGATCGAAGAGGAAGACG
GTATTACTTTAATCGACTTCAAAACGGATACGATTGCAGGAAAATTCCCAGGTGGATTTGATCAAGCGAAACCAATTTTA
GAAGAGCGATATAAAGTACAACTTTCGTTATATGCAAAAGCGCTCGAGAAAAGCTTACAACACCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGCAATCATGTTGTAAAAATCGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.666

100

0.537


Multiple sequence alignment