Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   EKQ63_RS07375 Genome accession   NZ_CP034686
Coordinates   1063117..1066842 (+) Length   1241 a.a.
NCBI ID   WP_143880924.1    Uniprot ID   -
Organism   Bacillus sp. BD59S     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1058117..1071842
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EKQ63_RS07360 (EKQ63_07340) - 1058275..1058868 (+) 594 WP_048553998.1 TVP38/TMEM64 family protein -
  EKQ63_RS07365 (EKQ63_07345) lepB 1058925..1059488 (+) 564 WP_000751900.1 signal peptidase I -
  EKQ63_RS07370 (EKQ63_07350) addB 1059605..1063120 (+) 3516 WP_143880923.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  EKQ63_RS07375 (EKQ63_07355) addA 1063117..1066842 (+) 3726 WP_143880924.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  EKQ63_RS07380 (EKQ63_07360) - 1066855..1067142 (+) 288 WP_087951643.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  EKQ63_RS07385 (EKQ63_07365) gerPF 1067247..1067462 (-) 216 WP_001141566.1 spore germination protein GerPF -
  EKQ63_RS07390 (EKQ63_07370) - 1067505..1067891 (-) 387 WP_087951644.1 spore germination protein GerPE -
  EKQ63_RS07395 (EKQ63_07375) gerPD 1067907..1068101 (-) 195 WP_001102341.1 spore germination protein GerPD -
  EKQ63_RS07400 (EKQ63_07380) gerPC 1068108..1068722 (-) 615 WP_087951645.1 spore germination protein GerPC -
  EKQ63_RS07405 (EKQ63_07385) gerPB 1068790..1068996 (-) 207 WP_001012508.1 spore germination protein GerPB -
  EKQ63_RS07410 (EKQ63_07390) gerPA 1069011..1069232 (-) 222 WP_001111187.1 spore germination protein GerPA -
  EKQ63_RS07415 (EKQ63_07395) - 1069329..1069508 (-) 180 WP_000462845.1 aspartyl-phosphate phosphatase Spo0E family protein -
  EKQ63_RS07420 (EKQ63_07400) - 1069757..1070656 (+) 900 WP_173601724.1 fumarylacetoacetate hydrolase family protein -
  EKQ63_RS07425 (EKQ63_07405) - 1070692..1070976 (-) 285 WP_143880925.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142561.91 Da        Isoelectric Point: 4.8494

>NTDB_id=332479 EKQ63_RS07375 WP_143880924.1 1063117..1066842(+) (addA) [Bacillus sp. BD59S]
MMENWPEKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDIVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELICIQQTEEEVLDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWRAFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDATKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEILLELGQGSIPDEIYGYSASWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLESERKEEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSGESILVQGVIDCMIEEEDGITLIDFKTDTIAGKFPSGFDQAKPIL
EERYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVKIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=332479 EKQ63_RS07375 WP_143880924.1 1063117..1066842(+) (addA) [Bacillus sp. BD59S]
ATGATGGAAAATTGGCCTGAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCCGTTGTAGCGAATGGGCG
TGATATTTTAGTCGCAGCAGCAGCAGGATCAGGGAAAACAGCGGTATTAGTTGAACGTATTATCAAAAAGATAATAAATG
AAGAAAACCCAGTCGATGTGGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGAGCCTGGCTCTCAGCACATCCGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATACTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACTGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGAATTGAA
GATAATACGATATTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGACGATGATTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCACATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATACGATGTGGAAGGAA
AGACAATTGAAGATTTAGTGTACGCTTCCTACTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCAGAACAGCATATT
CGTAAAGCGACAGAACTCGCAATGCTCCCTGACGGTCCAGCGCCTCGCGTTGAAACACTGCAAGCGGATTTAGCTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAGTCATGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTATAACGAAGACATTGTCAAACAAGTAGACTCTCTTCGTAATAAGGCGAAAGATGAAGTG
AAGAAACTACAAGAAGAGCTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTTGTTCAGCTCGTAAAAGTATTTACAGAACGTTTCCAAGCGATGAAACGAGATAAAGGAATGGTTGATTTTACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAGACGGTGAAATGAAGCCATCAGCTGTAGCGCTTCAA
TATCGCAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCGATTATTAAGTTTGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTAGGTGACGTAAAGCAGTCGATCTATCGTTTCCGACTAGCAG
AACCAGGATTATTCCTAGGAAAGTATAAACGCTTCACGCAAGAAGGATTAGGCGGCGGAATGAAGATTGATTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCGGGTACGAACTTTATTTTTAAACAAATTATGGGTGAAGAAGTTGGGGAAAT
CGATTACGATGCTGACGCTGAATTAAAGTTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCAGCTGAACTAATATGCA
TTCAGCAAACGGAAGAAGAAGTATTAGACGGTGAAGAAGGTGCGGAAGTAGAAAAGGCGCAGCTTGAAGCTCGTCTTATG
GCGCAGCGCATTAAAGCGATGGTCGATTCGGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGCGACTTCGTTATCTTGCTTCGCTCCATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAGCTGCAAGGAATTCCAG
TATACGCTGATCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTCTTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCAGTGCTTCGTTCACCAATTGTTGGACTAAATGATGAAGAACTTGCGACGCTTCG
TGCTCATGGGAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTACAACTTACTGCAAGGATGGCGTGCATTCGCGCGCCAACAGTCTCTTTCTGACTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGATTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGACAAGCAAACCT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACGTCGTTTAGAGGATTATTCCGCTTCTTGCGTTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGACGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTTGTCGCTGGACTAGGTCGCCGCTTTAATACACAAGACTTAATGAA
ACGTTTCTTACTGCATAAAGACTTCGGTTTTGGTTCGCAATTTATCGATCCTCGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTATTATACGTAGCATTAACACGT
GCAAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAACTAAGGAAATGGAAAAATGGCTGGATGCTAGGGAGCA
TAGTGAATGGTTATTACCAGATCATATACGTGCCGGAGCGTCTTGCTACTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATACTTCTTGAATTAGGACAAGGGAGCATTCCAGATGAAATTTATGGGTATAGTGCGAGCTGGAAA
GTAGAAGTTGTTGACGGAAACACGTTACTTGCGCCAGAGCCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAGGCCGTTCCGTTAGAAAGTGAACGGAAAGAAGAGGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAATGCC
TTTATTAAAAAATTACGTGCACCAATTAAAACACGTCCTCGTTTTATGGAGAAAAAGGGATTAACATACGCTGAAAGAGG
TACGGCAGTTCACGCTGTTATGCAGCATGTTGACTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCGG
GAATGGTAAATAAGGAATTATTAACATTTGAACAAGCAGAAGAAATAGCAATTGAAAAAGTAATTTCATTCTTTGATAGT
GACCTAGGTAAACGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTGCCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGCGGGGAATCCATTCTTGTCCAAGGGGTTATTGACTGCATGATCGAAGAGGAAGACG
GTATTACGTTAATCGACTTCAAAACGGATACAATTGCAGGAAAATTCCCAAGCGGATTTGATCAGGCGAAGCCAATTTTA
GAAGAGCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCGCTCGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGCAATCATGTTGTAAAAATCGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.408

100

0.537


Multiple sequence alignment