Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   CU648_RS20590 Genome accession   NZ_CP025122
Coordinates   3808110..3811835 (+) Length   1241 a.a.
NCBI ID   WP_101195509.1    Uniprot ID   -
Organism   Bacillus sp. HBCD-sjtu     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 3803110..3816835
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CU648_RS20575 (CU648_20575) - 3803268..3803861 (+) 594 WP_000347516.1 TVP38/TMEM64 family protein -
  CU648_RS20580 (CU648_20580) lepB 3803918..3804481 (+) 564 WP_000751894.1 signal peptidase I -
  CU648_RS20585 (CU648_20585) addB 3804598..3808113 (+) 3516 WP_046946929.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  CU648_RS20590 (CU648_20590) addA 3808110..3811835 (+) 3726 WP_101195509.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  CU648_RS20595 (CU648_20595) - 3811848..3812135 (+) 288 WP_000718623.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  CU648_RS20600 (CU648_20600) gerPF 3812173..3812388 (-) 216 WP_001141566.1 spore germination protein GerPF -
  CU648_RS20605 (CU648_20605) - 3812431..3812817 (-) 387 WP_000902341.1 spore germination protein GerPE -
  CU648_RS20610 (CU648_20610) gerPD 3812833..3813027 (-) 195 WP_001052802.1 spore germination protein GerPD -
  CU648_RS20615 (CU648_20615) gerPC 3813034..3813648 (-) 615 WP_001070767.1 spore germination protein GerPC -
  CU648_RS20620 (CU648_20620) gerPB 3813716..3813922 (-) 207 WP_001012512.1 spore germination protein GerPB -
  CU648_RS20625 (CU648_20625) gerPA 3813937..3814158 (-) 222 WP_001111188.1 spore germination protein GerPA -
  CU648_RS20630 (CU648_20630) - 3814259..3814432 (-) 174 Protein_3845 aspartyl-phosphate phosphatase Spo0E family protein -
  CU648_RS20635 (CU648_20635) - 3814680..3815579 (+) 900 WP_179950258.1 fumarylacetoacetate hydrolase family protein -
  CU648_RS20640 (CU648_20640) - 3815615..3815899 (-) 285 WP_000925338.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142590.82 Da        Isoelectric Point: 4.8322

>NTDB_id=258511 CU648_RS20590 WP_101195509.1 3808110..3811835(+) (addA) [Bacillus sp. HBCD-sjtu]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLALLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDIVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSEDGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDATKEMEKWLDAREHSEWLLPDHVRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPGEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKDEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIQTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAVEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
ETRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVIKVEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=258511 CU648_RS20590 WP_101195509.1 3808110..3811835(+) (addA) [Bacillus sp. HBCD-sjtu]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCGGCAGCAGCTGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACAAATGCAGCAGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGAACCAGGATCTCAGCACGTAAGAAAGCAACTGAGTTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATATTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATTGCGAATCAAACAGAAAATGAATTATTAAAAGAAGAAGTGTTAGATGACATATTAGAAGAAGAGTATGGAATAGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGCGACCGTAGTGATGATGATTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATATGACGTCGAAGGAA
AGACAATTGAAGATTTAGTGTACGCCTCTTACTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAAGCAACTGAGCTCGCAATGCTTCCTGACGGCCCAGCGCCTCGCGTTGAAACGCTGCAAGCAGATTTAGCTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGCATTAAGAAAAGCGATTATAACGAGGATATTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCAAAAGATGAAGTG
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGGCCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTCGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGCATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAGATGGTGAAATGAAGCCATCAGCAGTAGCACTTCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTATTCATGGTTGGTGACGTGAAGCAGTCGATTTATCGTTTCCGACTAGCAG
AACCAGGATTATTCTTAGGAAAGTATAAACGTTTCACACAAGAAGGATTAGGCGGCGGAATGAAAATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCAGGTACAAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAGAAGGTGAAGATGTAGCAGCTGAACTATTGTGCA
TTCAGCAAACAGAAGAAGAAGTAATAGACGGTGAAGAAGGTGCGGAAGTAGAAAAGGCACAGCTTGAAGCACGTCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCAGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGCCCTGTACAATA
CCGTGACTTCGTTATTTTACTTCGCTCGATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAATTGCAAGGAATTCCAG
TATACGCTGACCTTGCCACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCAGCAGTACTTCGTTCCCCAATCGTTGGATTAAATGATGAAGAACTTGCAACGCTTCG
TGCTCACGGGAAGAAAGGATCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGAGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTATAATTTACTGCAAGGATGGCGTGAATTCGCACGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTATGACTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACCT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGACTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCTTTAGGTGAACAAGAAGATGTCGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTCGTCGCTGGACTTGGTCGTCGTTTTAATACGCAAGACTTAATGAA
ACGTTTCTTACTTCATAAAGACTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACGACATTAT
CACAACTTGCAATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTCTTATACGTAGCGTTAACACGG
GCAAAAGAGAAGTTAATTTTAATTGGAACGGTTAAGGATGCAACTAAGGAAATGGAAAAATGGCTGGATGCGAGGGAGCA
TAGTGAATGGTTATTACCAGATCACGTACGTGCCGGAGCATCTTGTTATTTAGACTGGATTGCACCTTCCTTATATAGAC
ATCGTGATAGTGAAATGCTTCTTGAATTAGGGCAAGGAAGTATTCCAGGTGAAATTTATGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTTGACGGGAACACGTTACTTGCGCCAGAACCCGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAAGCTGTTCCCCTGCAAAGTGAACGAAAAGATGAAGTGTACGACAGGTTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCT
TTTATTAAAAAATTACGTGCACCGATTCAAACACGTCCTCGTTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGAGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACAGTTGAAGTTCTTCAAGAGCAAATTGCGG
GAATGGTAAATAAAGAATTATTAACATTTGAACAAGCAGAAGAAATAGCAGTTGAAAAAGTGATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGGAGAGCGGGGAATCAATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GTATCACTTTAATCGACTTTAAAACGGATACGATTGAAGGGAAGTTCCCGGGAGGATTCGAACAAGCGAAACCAATTTTA
GAAACTCGTTACAAAGTGCAGCTTTCGTTATATGCAAAGGCACTTGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTATAAAAGTTGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.666

100

0.537


Multiple sequence alignment