Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   IRJ20_RS15090 Genome accession   NZ_CP065137
Coordinates   2984294..2987998 (+) Length   1234 a.a.
NCBI ID   WP_040238975.1    Uniprot ID   -
Organism   Bacillus sp. A1(2020)     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 2979294..2992998
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  IRJ20_RS15075 - 2979376..2979885 (-) 510 WP_007409173.1 ferritin-like domain-containing protein -
  IRJ20_RS15080 - 2979954..2980667 (-) 714 WP_079891285.1 DUF421 domain-containing protein -
  IRJ20_RS15085 addB 2980807..2984307 (+) 3501 WP_039252063.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  IRJ20_RS15090 addA 2984294..2987998 (+) 3705 WP_040238975.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  IRJ20_RS15095 sbcD 2988063..2989235 (+) 1173 WP_040238308.1 exonuclease subunit SbcD -
  IRJ20_RS15100 sbcC 2989232..2992624 (+) 3393 WP_040238306.1 exonuclease subunit SbcC -
  IRJ20_RS15105 - 2992640..2992933 (+) 294 WP_007409167.1 hypothetical protein -

Sequence


Protein


Download         Length: 1234 a.a.        Molecular weight: 141162.53 Da        Isoelectric Point: 5.1545

>NTDB_id=507250 IRJ20_RS15090 WP_040238975.1 2984294..2987998(+) (addA) [Bacillus sp. A1(2020)]
MQIPKPKDSIWTDDQWSAIVSSGRDILVAAAAGSGKTAVLVERMIRKITAEEDPVDVDRLLVVTFTNASAAEMKHRIAEA
LEKELAKNPGSLHIRRQLSLLNRASISTLHSFCLQVLKKYYYMIDLDPGFRMADQTEGELLGDEVLDELFEDEYAKGNQA
FFELADRYTTDRHDLDLQDLVKRVYEYSRSHPDPEAWLQSFVRLYDVTEESKMEELPFYQYVKEDAEMALFGAKQKLEKA
LELTKAPGGPAPRADNFLDDLQQIEELISCRHDFDALYERVPAVSFKRAKAVKGDEFDKALLDEATDLRNGAKKLIEKVK
TDYFTRSPQDHLKSLTDMKPVIETLVQLVISYGKRFEAAKQEKSIIDFSDLEHYCLAILTAVDEEGRRVPSEAAVYYQDQ
FHEVLVDEYQDTNLVQESILQLVKSGNEETGNLFMVGDVKQSIYRFRLAEPLLFLGKYKRFTESGEGAGQKIDLNQNFRS
RSDILDSTNFLFKQLMGGKIGEVDYDEQAALKLGASYPPNDAAKTELLLIDSADGADSSEDAEDFETVHWEAKAIAGEIR
KLVSSPFKVYDGKTKTHRNIQYRDIVILLRSMPWAPQLMEELKNQGIPVYANLTSGYFEAVEVAAALSVLKVIDNPYQDI
PLASVLRSPIVGCDENELALIRLEKKKAPFYEALKAYIANADRHDDLYQKLRTFYDSLQKWRSFSTNHSVSELIWEVYRD
TGYFDYAGGMPGGKQRQANLRVLYDRARSYEATAFRGLFRFLRFIERMQERGDDLGTARALSEQEDVVRLMTIHSSKGLE
FPVVFTAGLGRSFNMMDLNKSYLLDKELGFGTKYIHPELRISYPTLPLVAMKKKMRRELLSEELRVLYVALTRAKEKLFL
VGSCKNREKQLAKWQAQADRPDWLLSEFDRYQASSYLDFIGPALIRHRDMEAHRTPGLSSSEDIARDPSRFHIRMLQQSE
LLEENPKERAEEKSKRLKAIQQGEPIPDSFSFDDQARRLLVWEYPYRELTAIRTKQSVSELKRKQEYEDEYSGRSLIKPS
GDTLLYRRPGFMMKKGLTAAEKGTAMHTVMQHIPLTHVPTAEEAERTVRMLYEKELLTEEQQEAIDIEEIVQFFGTEIGK
DLLGALRIDREVPFSMALPAGEVYKDAETAGEPLLVQGIIDCLYETADGLYLLDYKTDRIEGKFRNGFEGAAPILQKRYE
TQIELYTKAVEQITKTKVKGRALYFFDGGHVLTL

Nucleotide


Download         Length: 3705 bp        

>NTDB_id=507250 IRJ20_RS15090 WP_040238975.1 2984294..2987998(+) (addA) [Bacillus sp. A1(2020)]
ATGCAAATTCCTAAACCGAAAGACAGCATATGGACGGATGACCAATGGAGCGCCATCGTTTCTTCCGGCCGCGATATTTT
GGTCGCTGCCGCAGCCGGATCAGGTAAAACGGCCGTGCTCGTGGAACGGATGATCAGAAAGATAACCGCGGAGGAAGATC
CGGTTGATGTTGACCGGCTCCTTGTGGTGACCTTTACAAACGCCTCAGCAGCGGAGATGAAGCATCGGATCGCGGAAGCA
TTAGAAAAAGAGCTCGCCAAAAATCCGGGATCTCTGCATATCAGACGGCAGCTCTCTCTATTAAATCGGGCGAGCATTTC
GACGCTTCACTCTTTCTGTCTGCAAGTGCTTAAAAAATATTATTATATGATTGATCTTGACCCCGGTTTTCGGATGGCTG
ACCAGACGGAAGGCGAATTGCTCGGAGACGAAGTGCTTGATGAGCTGTTTGAAGATGAATACGCAAAAGGGAATCAGGCG
TTTTTTGAGCTTGCGGACAGATATACGACAGACAGGCATGATCTTGATCTTCAAGACCTTGTCAAACGGGTGTACGAGTA
TTCCCGGTCGCATCCTGACCCGGAAGCGTGGCTGCAAAGCTTTGTCCGCCTGTATGACGTGACGGAAGAAAGCAAAATGG
AAGAGCTTCCGTTTTACCAGTATGTGAAAGAAGATGCTGAAATGGCGCTCTTCGGGGCAAAACAAAAGCTTGAGAAGGCG
CTTGAACTGACAAAAGCGCCCGGTGGGCCGGCCCCGCGCGCTGATAATTTTCTGGATGACCTTCAGCAGATAGAAGAGCT
GATCAGCTGCCGGCATGACTTCGATGCGCTGTATGAACGGGTTCCCGCAGTCTCCTTTAAGCGGGCGAAGGCGGTAAAGG
GCGATGAATTTGACAAAGCGCTTTTGGACGAAGCGACCGATTTGCGAAACGGCGCGAAGAAACTGATCGAAAAGGTAAAG
ACCGATTACTTTACAAGAAGTCCGCAAGATCACTTAAAGAGTTTAACAGATATGAAGCCTGTCATCGAAACGCTCGTTCA
GCTCGTGATATCGTACGGAAAACGATTTGAAGCGGCGAAACAAGAAAAATCAATTATTGATTTCTCAGATTTGGAACATT
ACTGCCTGGCGATTTTAACTGCGGTTGATGAAGAAGGCCGGCGGGTGCCGAGCGAAGCGGCCGTTTATTATCAAGACCAG
TTCCATGAAGTTCTGGTTGATGAATATCAAGATACGAATCTTGTGCAAGAGTCTATCCTGCAGCTTGTTAAAAGCGGGAA
TGAAGAGACGGGAAATCTGTTTATGGTAGGCGACGTTAAACAATCGATCTACCGTTTCAGACTTGCTGAGCCGCTGCTGT
TTCTCGGGAAATATAAAAGGTTTACAGAAAGCGGGGAAGGCGCGGGGCAGAAAATTGACCTGAACCAAAATTTCCGCAGC
CGCTCCGATATTTTGGACAGTACGAATTTCTTGTTTAAGCAGCTGATGGGCGGAAAAATCGGCGAAGTGGATTACGATGA
ACAGGCGGCATTGAAGCTCGGCGCATCGTATCCGCCGAATGACGCGGCAAAAACGGAACTGCTGCTCATTGACAGCGCTG
ATGGCGCAGACAGCTCCGAGGATGCCGAAGACTTTGAAACTGTGCATTGGGAAGCGAAAGCGATAGCCGGGGAGATCCGA
AAGCTTGTCTCATCTCCGTTTAAGGTATATGACGGCAAAACAAAAACACACCGGAATATCCAGTACCGGGATATCGTAAT
TTTGCTGCGCTCCATGCCGTGGGCGCCGCAATTAATGGAAGAGCTGAAAAACCAGGGCATTCCGGTGTACGCCAATTTGA
CGTCGGGTTACTTTGAAGCCGTGGAGGTGGCTGCGGCGCTTTCAGTCTTGAAGGTGATTGATAATCCGTATCAGGATATT
CCGCTTGCTTCCGTGCTGCGATCGCCCATCGTCGGCTGTGACGAAAATGAACTTGCGCTGATACGCCTTGAAAAGAAAAA
AGCGCCGTTTTATGAAGCGCTGAAAGCATATATAGCGAACGCTGACAGGCATGATGACCTGTATCAAAAACTGAGAACCT
TTTATGACAGTCTGCAAAAGTGGCGGTCATTTTCGACGAATCATTCGGTGTCTGAGCTGATTTGGGAAGTATACCGCGAT
ACCGGATATTTTGACTATGCAGGCGGCATGCCTGGCGGAAAGCAGCGCCAGGCCAATCTGCGCGTCTTGTATGACCGGGC
GCGCTCGTACGAAGCGACGGCGTTCCGCGGGTTATTCCGCTTTTTGCGGTTCATTGAACGGATGCAGGAGCGCGGCGATG
ATCTGGGGACGGCCCGGGCGCTCAGCGAGCAGGAGGATGTCGTTCGGCTGATGACGATCCACAGCAGCAAAGGACTTGAA
TTTCCCGTCGTTTTCACCGCCGGACTCGGCAGAAGTTTTAATATGATGGATCTCAACAAATCATACCTGCTTGATAAAGA
GCTCGGTTTCGGCACGAAGTATATTCATCCGGAACTCAGAATCAGCTATCCGACTCTTCCGCTTGTGGCGATGAAGAAAA
AGATGCGCAGAGAGCTCTTATCCGAAGAATTGCGCGTATTGTATGTGGCGCTGACCAGAGCGAAGGAAAAGCTGTTTCTC
GTCGGTTCCTGCAAAAACCGGGAAAAACAGCTGGCCAAATGGCAGGCGCAAGCAGACCGGCCCGATTGGCTTCTTTCTGA
ATTCGACCGCTATCAGGCGTCATCTTATTTGGACTTTATCGGGCCGGCGCTCATCCGCCACCGTGACATGGAAGCGCACA
GGACGCCGGGACTTTCCTCATCTGAGGATATCGCGCGCGACCCGTCGCGTTTTCACATCCGGATGCTGCAGCAAAGTGAA
TTGCTGGAAGAGAATCCGAAAGAACGTGCTGAAGAAAAGAGCAAACGTCTCAAAGCGATTCAGCAGGGCGAGCCGATTCC
TGATTCGTTTTCATTTGATGATCAGGCGCGCCGTCTGCTTGTATGGGAATATCCGTACCGAGAGCTGACGGCTATCAGAA
CGAAGCAGTCAGTTTCTGAGCTGAAGAGAAAGCAAGAATACGAGGATGAATACAGCGGCCGCTCTCTCATCAAACCGTCA
GGAGACACGCTTTTATACAGGCGTCCCGGTTTTATGATGAAAAAAGGGCTGACAGCCGCCGAAAAAGGGACGGCGATGCA
TACGGTGATGCAGCACATACCGCTGACGCACGTTCCGACTGCCGAAGAAGCGGAGCGGACGGTCCGGATGCTTTACGAAA
AAGAGCTGCTGACAGAAGAACAGCAGGAAGCTATTGATATAGAAGAAATTGTCCAGTTTTTCGGAACTGAAATCGGGAAA
GACCTTCTCGGGGCGCTGCGGATTGACCGGGAAGTGCCGTTCAGTATGGCGCTTCCGGCCGGCGAGGTGTACAAGGATGC
CGAAACGGCCGGTGAACCGCTGCTCGTGCAGGGGATTATTGACTGCCTCTACGAAACTGCGGACGGTCTCTATTTGCTGG
ACTATAAAACTGATCGGATAGAGGGGAAATTCCGGAACGGATTTGAAGGCGCGGCTCCGATTCTGCAAAAAAGGTATGAA
ACACAAATCGAATTGTATACAAAAGCGGTTGAACAAATTACAAAAACGAAAGTGAAAGGGCGTGCGCTCTATTTCTTCGA
TGGAGGTCACGTTCTTACACTGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

82.577

100

0.826