Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   HQG80_RS05720 Genome accession   NZ_CP085506
Coordinates   1110880..1114605 (+) Length   1241 a.a.
NCBI ID   WP_173602431.1    Uniprot ID   -
Organism   Bacillus cereus strain HD2.4     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1105880..1119605
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HQG80_RS05705 (HQG80_005705) - 1106038..1106631 (+) 594 WP_000347517.1 TVP38/TMEM64 family protein -
  HQG80_RS05710 (HQG80_005710) lepB 1106688..1107251 (+) 564 WP_000751919.1 signal peptidase I -
  HQG80_RS05715 (HQG80_005715) addB 1107368..1110883 (+) 3516 WP_078385726.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  HQG80_RS05720 (HQG80_005720) addA 1110880..1114605 (+) 3726 WP_173602431.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  HQG80_RS05725 (HQG80_005725) - 1114618..1114905 (+) 288 WP_000255727.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  HQG80_RS05730 (HQG80_005730) gerPF 1115043..1115258 (-) 216 WP_001141566.1 spore germination protein GerPF -
  HQG80_RS05735 (HQG80_005735) - 1115301..1115687 (-) 387 WP_000902319.1 spore germination protein GerPE -
  HQG80_RS05740 (HQG80_005740) gerPD 1115703..1115897 (-) 195 WP_001052807.1 spore germination protein GerPD -
  HQG80_RS05745 (HQG80_005745) gerPC 1115904..1116518 (-) 615 WP_001070760.1 spore germination protein GerPC -
  HQG80_RS05750 (HQG80_005750) gerPB 1116586..1116792 (-) 207 WP_001012508.1 spore germination protein GerPB -
  HQG80_RS05755 (HQG80_005755) gerPA 1116807..1117028 (-) 222 WP_001111187.1 spore germination protein GerPA -
  HQG80_RS05760 (HQG80_005760) - 1117125..1117304 (-) 180 WP_000462841.1 aspartyl-phosphate phosphatase Spo0E family protein -
  HQG80_RS05765 (HQG80_005765) - 1117552..1118451 (+) 900 WP_173602432.1 fumarylacetoacetate hydrolase family protein -
  HQG80_RS05770 (HQG80_005770) - 1118487..1118768 (-) 282 WP_000926857.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142615.74 Da        Isoelectric Point: 4.7975

>NTDB_id=618664 HQG80_RS05720 WP_173602431.1 1110880..1114605(+) (addA) [Bacillus cereus strain HD2.4]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADVALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLSDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDISWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEDGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGNSGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=618664 HQG80_RS05720 WP_173602431.1 1110880..1114605(+) (addA) [Bacillus cereus strain HD2.4]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCTAACGGACG
TGATATTTTAGTTGCTGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCGCAAGAGATGAAAAACCGAATC
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAACACGTAAGAAAGCAGCTGAGCCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGCATCGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAGGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACCCTGCAAGCAGATGTAGCTTTACT
TGGAACGCTATCATCAGCTGCTCGTGAGTCGTGGACAAGTGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTCCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTGGCGCTCCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGACACGAACTTTGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAGCAGTCGATTTATCGTTTCCGACTAGCCG
AGCCAGGCTTATTCCTAGGAAAGTATAAACGTTTCACACAAGAAGGATTGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATTTTTAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGACTACGATGCTGACGCTGAACTAAAGCTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCAGCAGAATTATTATGTA
TTCAGCAAACGGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGCCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCCGGTTATGAAGTGTATGACCGTAAAAATGATAGTATGCGTCCGGTGCAATA
TCGCGATTTCGTTATTTTACTTCGCTCCATGCCGTGGGCCCCGCAAATTATGGAAGAGTTAAAATTACAAGGAATTCCAG
TATACGCTGACCTCGCGACTGGTTATTTTGAAGCGACGGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCCGCTGTACTTCGTTCCCCAATCGTTGGATTGAGCGATGAAGAACTTGCAACGCTTCG
TGCTCATGGAAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAAGAGGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTCGCCCGTCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
ACGCGTACTATATGACCGCGCAAGACAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCCCTAGGTGAACAAGAAGACGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGCTAGAGTTCCCGGTCGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTAT
CACAGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCATTGACGCGT
GCGAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCACATACGTGCCGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGATATGACATTAGCTGGAAA
GTAGAAGTTGTGGACGGTAACACACTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCGCTTCG
TGAGAAAAAGGCTGTTCCCCTGCAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGG
ATGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCAGAAGATGGTAGTGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTTCCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAATAGCGGGGAATCGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTTCCAGGCGGATTTGAACAAGCGAAACCAATTTTA
GAAGATCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTAAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.328

100

0.536