Detailed information    

In silico: bioinformatically predicted

Overview


Name: addB
Type: Machinery gene
Locus tag: CU648_RS20585
Genome accession: NZ_CP025122
Coordinates: 3804598..3808113 (+)
Length: 1171 a.a.
NCBI ID: WP_046946929.1
UniProt ID: A0A150AY47
Organism: Bacillus sp. HBCD-sjtu
Function: homologous recombination; plasmid transformation (predicted from homology)
Homologous recombination
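The coordinates and the protein length listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the CDS ends in a single stop codon; the function names are illustrative only and are not part of any database API.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


cds_bp = feature_length_bp(3804598, 3808113)
print(cds_bp, protein_length_aa(cds_bp))  # 3516 1171
```

The result matches the 3516 bp nucleotide length and the 1171 a.a. protein length reported on this page.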

Genomic Context


Location: 3799598..3813113
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CU648_RS20550 (CU648_20550) - 3799920..3800708 (-) 789 WP_101195506.1 phosphotransferase -
  CU648_RS20555 (CU648_20555) cspA 3801060..3801263 (+) 204 WP_000301519.1 RNA chaperone/antiterminator CspA -
  CU648_RS28885 - 3801450..3801578 (-) 129 WP_014300173.1 hypothetical protein -
  CU648_RS20565 (CU648_20565) - 3801947..3802132 (-) 186 WP_000391382.1 hypothetical protein -
  CU648_RS20570 (CU648_20570) - 3802414..3802995 (+) 582 WP_101195508.1 competence protein ComK -
  CU648_RS20575 (CU648_20575) - 3803268..3803861 (+) 594 WP_000347516.1 TVP38/TMEM64 family protein -
  CU648_RS20580 (CU648_20580) lepB 3803918..3804481 (+) 564 WP_000751894.1 signal peptidase I -
  CU648_RS20585 (CU648_20585) addB 3804598..3808113 (+) 3516 WP_046946929.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  CU648_RS20590 (CU648_20590) addA 3808110..3811835 (+) 3726 WP_101195509.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  CU648_RS20595 (CU648_20595) - 3811848..3812135 (+) 288 WP_000718623.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  CU648_RS20600 (CU648_20600) gerPF 3812173..3812388 (-) 216 WP_001141566.1 spore germination protein GerPF -
  CU648_RS20605 (CU648_20605) - 3812431..3812817 (-) 387 WP_000902341.1 spore germination protein GerPE -
  CU648_RS20610 (CU648_20610) gerPD 3812833..3813027 (-) 195 WP_001052802.1 spore germination protein GerPD -
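Two details of the table above can be checked numerically. The Location range appears to be the addB coordinates extended by 5,000 bp on each side (an assumption inferred from the numbers, not a documented rule), and addA starts before addB ends, so the two genes overlap by a few bases. A minimal sketch, with hypothetical helper names:

```python
FLANK_BP = 5000  # assumed flank size, inferred from the Location line


def context_window(start, end, flank=FLANK_BP):
    """Extend a 1-based inclusive interval by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def overlap_bp(a, b):
    """Bases shared by two 1-based inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


add_b = (3804598, 3808113)
add_a = (3808110, 3811835)

print(context_window(*add_b))    # (3799598, 3813113)
print(overlap_bp(add_b, add_a))  # 4
```

The window is clipped at position 1 but not at the end of the replicon, because the genome length is not given on this page.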

Sequence


Protein


Length: 1171 a.a.        Molecular weight: 134355.59 Da        Isoelectric Point: 5.3742

>NTDB_id=258510 CU648_RS20585 WP_046946929.1 3804598..3808113(+) (addB) [Bacillus sp. HBCD-sjtu]
MSLRFVIGRAGSGKSTLCLHEVQEELKQRPRGETILYLVPEQMTFQTQQALIGSEDVRGSIRAQVFSFSRLAWKVLQEVG
GASRLHIDEAGVHMLLRKIVESRKDGLSVFQKAAEQNGFFEHLGSMIAEFKRYNVTPSNVYEMWQQLDAHSSSAEQKLLA
NKVYDLQLLYDDFERALIGKYLDSEDYLQLLVEKLPQSEYVKGAEIYIDGFHSFSPQELEIVRQLMICGARVTITLTIDE
KTLAQPVNELDLFYETTLTYEKIKQVAREEKIEIEKTIPLMEQPRFHSPALAHLEMHYEARPNEKFHGEASVTIHTAANL
RAEIEGVAREIRRLVAEENYRYRDIAVLLRNGESYYDVMRTLFTDYNIPHFIDEKRPMSHHPLVECIRSALEIISGNWRY
DAVFRCVKTELLYPLDVRKETMREEMDEFENYCLAYGVQGKRWTSEDPWMYRRYRSLDDTNGMITDSEREMEEKINRLRD
VVRTPVIRMQKRLKRAGTVMQMCEAVYLFLEELDVPKKLEALRIRAEESGDFLFATDHEQVWEEVMSLLDTFVEMLGEEK
MSLSMFTDVMSTGLEALQFANIPPSLDQVLIANIDRSRLSNVKATFVIGVNEGVIPAAPMDEGMLSDEERDVLSAAGIEL
APTTRQTLLEEQFVMYQMVTRATEKLYISCPLADEEGKTLLASSFIKKIKRMFPDVKDTFITNDVNDLSRSEQISYVATP
EVTLSYVMQQLQTWKRYGFEGNLDFWWDVYNFYVTSDEWKQKSSRVLSSLFYRNRAQKLSTAVSRDLYGDKIKGSVSRME
LFNRCAYAHFAQHGLSLRERDIFKLDAPDIGELFHAALKRIADRLLRENRTWADLSIKECEHLSTVVIEEIAPLLQRQIL
LSSNRHFYLKQKLQQIIFRTSIILREHAKSSGFVPVDLEVPFGMGGTGSLPPMEFSLPNGVKMEVVGRIDRVDKAEDENG
TFLRIIDYKSSSKSLDLTEVYYGLALQMLTYLDVVTSNAQTWMKKGHAASPAGVLYFHIHNPIVEVKGDASEAEIEKEIL
KKFKMKGLVLGDADVVRLMDNKLSTGSSDIISAGLKKDGSFSARSSIASEQEFNVLQKYVHHTFKNIGKDITEGVIDIAP
YKKGNKAACTFCNFKSVCQFDESLEDNQFRTLKDMKDSEAMEKIREEVGGE
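The FASTA header of this record packs several fields into one line. The sketch below splits it with a regular expression; the field layout is an assumption inferred from the headers shown on this page, and the group names are illustrative.

```python
import re

# Layout assumed from this page:
# >NTDB_id=<id> <locus_tag> <protein_id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line):
    """Return the header fields as a dict, with coordinates as integers."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=258510 CU648_RS20585 WP_046946929.1 "
          "3804598..3808113(+) (addB) [Bacillus sp. HBCD-sjtu]")
print(parse_header(header))
```

Headers from other records may deviate from this layout, so the function raises an error rather than guessing.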

Nucleotide


Length: 3516 bp

>NTDB_id=258510 CU648_RS20585 WP_046946929.1 3804598..3808113(+) (addB) [Bacillus sp. HBCD-sjtu]
ATGTCACTTCGATTTGTGATTGGTAGAGCGGGAAGTGGAAAAAGTACACTTTGTTTACACGAAGTGCAAGAAGAATTAAA
ACAGCGCCCAAGAGGGGAAACAATATTATATCTTGTGCCAGAACAGATGACATTCCAGACGCAGCAGGCGTTAATTGGCA
GTGAGGATGTTAGAGGTTCTATTCGGGCACAAGTTTTTAGTTTTTCACGATTAGCGTGGAAGGTATTGCAAGAAGTTGGG
GGAGCGAGTCGTCTTCACATTGATGAAGCGGGCGTACATATGTTACTTCGTAAAATTGTAGAGTCTCGTAAAGATGGATT
ATCGGTGTTCCAAAAAGCAGCGGAGCAAAACGGTTTCTTTGAACATCTGGGCAGTATGATTGCGGAGTTTAAACGATATA
ACGTGACGCCATCAAACGTATATGAAATGTGGCAACAATTAGATGCGCATAGTAGCAGTGCAGAACAAAAGCTACTGGCG
AATAAAGTGTATGATTTACAACTACTATATGATGATTTTGAGCGTGCTTTAATCGGAAAATATTTAGATTCAGAAGACTA
CCTGCAATTGCTAGTCGAAAAGCTTCCGCAATCTGAATATGTAAAGGGAGCTGAAATTTATATAGATGGATTTCACTCAT
TTTCACCGCAAGAGTTAGAAATTGTAAGACAGCTTATGATTTGCGGAGCGAGAGTTACGATCACGTTAACGATAGATGAA
AAAACGTTAGCGCAGCCAGTAAATGAATTAGATTTATTTTATGAAACGACATTAACGTATGAAAAAATAAAACAAGTAGC
GCGTGAAGAGAAAATAGAAATTGAAAAAACGATTCCACTTATGGAACAGCCGCGTTTTCATTCTCCGGCATTAGCGCATT
TAGAAATGCATTACGAAGCGCGTCCAAATGAAAAGTTTCACGGTGAAGCAAGTGTAACGATTCATACAGCAGCTAATTTA
CGAGCGGAAATAGAAGGCGTTGCTCGTGAAATTCGTAGACTTGTGGCTGAAGAAAACTATCGTTACCGAGATATTGCGGT
TCTTCTTCGTAACGGGGAAAGTTATTACGATGTAATGCGGACACTATTTACAGATTATAATATCCCGCACTTCATCGATG
AAAAACGCCCGATGTCACATCATCCGTTAGTAGAATGCATTCGTTCTGCACTCGAGATTATTAGCGGGAATTGGCGTTAT
GATGCAGTCTTTCGCTGCGTGAAAACAGAGCTTTTATATCCACTAGACGTAAGAAAAGAAACGATGCGCGAAGAGATGGA
TGAGTTTGAAAACTATTGCTTAGCATACGGTGTACAAGGGAAGAGATGGACTTCTGAGGATCCGTGGATGTATCGTCGCT
ATCGTTCTCTTGACGATACGAACGGGATGATCACAGACAGTGAACGTGAAATGGAAGAGAAAATAAATCGATTGCGTGAC
GTTGTAAGAACGCCAGTTATTCGTATGCAAAAAAGACTGAAGCGCGCGGGAACAGTTATGCAAATGTGCGAAGCTGTTTA
CTTATTTTTAGAAGAGCTGGACGTTCCAAAAAAATTAGAAGCATTACGTATTCGTGCAGAAGAGAGTGGGGATTTCTTAT
TTGCGACAGATCATGAACAAGTATGGGAAGAAGTTATGAGTCTTCTTGATACGTTCGTTGAGATGCTTGGCGAAGAGAAA
ATGTCACTTTCTATGTTCACAGACGTTATGTCGACAGGTCTTGAGGCGCTTCAATTCGCAAACATTCCGCCGTCATTAGA
CCAAGTGTTGATTGCTAATATTGATCGTTCAAGATTATCAAATGTGAAAGCAACATTTGTTATTGGCGTGAATGAAGGTG
TCATTCCAGCAGCACCGATGGATGAAGGTATGCTTTCTGATGAGGAAAGAGATGTTTTGAGCGCTGCAGGTATTGAACTC
GCACCAACGACGAGACAAACTTTATTAGAAGAACAGTTCGTTATGTACCAAATGGTAACGAGAGCAACTGAGAAATTATA
TATTTCATGCCCGCTTGCAGATGAGGAAGGGAAGACGTTACTTGCGTCTAGCTTTATTAAGAAAATAAAAAGAATGTTCC
CTGATGTGAAAGATACATTTATTACGAATGACGTAAATGATTTATCACGTTCGGAACAAATTTCATACGTAGCAACGCCG
GAAGTAACACTGTCATATGTTATGCAGCAACTTCAAACGTGGAAGCGATATGGATTTGAAGGGAATTTAGACTTTTGGTG
GGATGTATATAATTTCTATGTAACTTCGGATGAATGGAAGCAAAAAAGTAGCCGCGTATTATCAAGTTTATTCTACCGAA
ATCGTGCGCAAAAGCTAAGTACAGCTGTAAGTAGAGATTTATACGGAGACAAAATAAAAGGAAGCGTTTCTCGTATGGAA
CTATTTAATCGTTGTGCGTACGCTCATTTCGCACAGCACGGTTTATCGTTAAGAGAGCGTGATATTTTTAAACTTGATGC
GCCAGATATCGGGGAGCTTTTCCATGCAGCGCTGAAGAGAATTGCAGACAGGCTATTACGTGAAAATCGTACTTGGGCAG
ATTTATCAATAAAAGAGTGTGAGCATCTTTCTACTGTAGTAATAGAAGAGATTGCACCGTTATTACAAAGGCAAATTTTA
TTAAGTTCAAACCGTCATTTCTATTTAAAACAAAAACTACAACAAATCATTTTCCGTACGTCCATTATTCTACGTGAACA
TGCGAAGTCGAGCGGTTTCGTACCAGTTGATTTAGAAGTGCCATTTGGTATGGGCGGTACAGGATCACTTCCGCCGATGG
AATTCTCGTTACCAAATGGTGTAAAGATGGAAGTAGTCGGCCGTATTGACCGCGTTGATAAGGCGGAAGATGAAAACGGA
ACATTCCTCCGTATTATTGACTATAAATCAAGCTCAAAATCGTTAGACTTAACGGAAGTGTATTACGGATTGGCACTTCA
AATGTTAACGTATTTAGATGTTGTTACTTCAAATGCACAGACGTGGATGAAAAAAGGCCACGCAGCATCACCAGCTGGTG
TACTGTACTTCCACATTCATAACCCAATTGTTGAGGTGAAAGGTGACGCATCTGAAGCAGAAATTGAAAAGGAAATTTTA
AAGAAATTCAAAATGAAAGGGCTCGTACTAGGAGATGCTGACGTTGTTCGTTTAATGGATAACAAACTTTCAACAGGAAG
TTCAGATATTATTTCTGCTGGTCTGAAAAAAGACGGTAGTTTTAGTGCGCGTTCAAGTATTGCCAGTGAACAAGAGTTTA
ACGTCCTGCAAAAATACGTACATCATACGTTTAAAAATATCGGAAAAGACATTACAGAGGGTGTTATCGATATTGCTCCA
TACAAAAAGGGGAATAAAGCAGCGTGTACGTTCTGTAACTTCAAATCAGTTTGTCAGTTCGATGAATCACTTGAAGATAA
TCAATTCCGTACGCTAAAAGATATGAAAGATAGTGAAGCGATGGAGAAAATTAGAGAGGAGGTTGGCGGAGAATGA
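The nucleotide sequence can be checked against the protein sequence by translating it. The sketch below builds the standard codon table (the amino-acid assignments are the same in NCBI translation tables 1 and 11) and translates the first 30 and the last 21 nucleotides of the CDS shown above; the function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_30 = "ATGTCACTTCGATTTGTGATTGGTAGAGCG"
last_21 = "GAGGAGGTTGGCGGAGAATGA"
print(translate(first_30))  # MSLRFVIGRA
print(translate(last_21))   # EEVGGE*
```

Both fragments agree with the start (MSLRFVIGRA) and end (EEVGGE) of the protein sequence, and the CDS terminates in a TGA stop codon. Ambiguity codes such as N are not handled and would raise a KeyError.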


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A150AY47

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addB Bacillus subtilis subsp. subtilis str. 168 49.232 100 0.493


Multiple sequence alignment