Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   I6G54_RS11695 Genome accession   NZ_CP065743
Coordinates   2029066..2032791 (+) Length   1241 a.a.
NCBI ID   WP_002194449.1    Uniprot ID   -
Organism   Bacillus tropicus strain FDAARGOS_897     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 2024066..2037791
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I6G54_RS11680 (I6G54_11680) - 2024224..2024817 (+) 594 WP_000347516.1 TVP38/TMEM64 family protein -
  I6G54_RS11685 (I6G54_11685) lepB 2024874..2025437 (+) 564 WP_002194447.1 signal peptidase I -
  I6G54_RS11690 (I6G54_11690) addB 2025554..2029069 (+) 3516 WP_002194448.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  I6G54_RS11695 (I6G54_11695) addA 2029066..2032791 (+) 3726 WP_002194449.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  I6G54_RS11700 (I6G54_11700) - 2032804..2033091 (+) 288 WP_002194451.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  I6G54_RS11705 (I6G54_11705) gerPF 2033229..2033444 (-) 216 WP_001141566.1 spore germination protein GerPF -
  I6G54_RS11710 (I6G54_11710) - 2033487..2033873 (-) 387 WP_002194453.1 spore germination protein GerPE -
  I6G54_RS11715 (I6G54_11715) gerPD 2033889..2034083 (-) 195 WP_001102341.1 spore germination protein GerPD -
  I6G54_RS11720 (I6G54_11720) gerPC 2034090..2034704 (-) 615 WP_002194454.1 spore germination protein GerPC -
  I6G54_RS11725 (I6G54_11725) gerPB 2034773..2034979 (-) 207 WP_001012508.1 spore germination protein GerPB -
  I6G54_RS11730 (I6G54_11730) gerPA 2034994..2035215 (-) 222 WP_001111187.1 spore germination protein GerPA -
  I6G54_RS11735 (I6G54_11735) - 2035312..2035491 (-) 180 WP_000462845.1 aspartyl-phosphate phosphatase Spo0E family protein -
  I6G54_RS11740 (I6G54_11740) - 2035739..2036638 (+) 900 WP_038356988.1 fumarylacetoacetate hydrolase family protein -
  I6G54_RS11745 (I6G54_11745) - 2036674..2036958 (-) 285 WP_002194464.1 hypothetical protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142703.02 Da        Isoelectric Point: 4.8669

>NTDB_id=513829 I6G54_RS11695 WP_002194449.1 2029066..2032791(+) (addA) [Bacillus tropicus strain FDAARGOS_897]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLINEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLVLLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQTMKRDKGMVDFTDLEHFCLQILSEQSESGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEILLELGQGSIPDEIYGYSASWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIRTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITIEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFSGGFEQAKPIL
EERYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVKIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=513829 I6G54_RS11695 WP_002194449.1 2029066..2032791(+) (addA) [Bacillus tropicus strain FDAARGOS_897]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAGTGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCGGCAGCAGCTGGTTCAGGGAAAACAGCGGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTAATGAGCCGGGCTCTCAGCACATTAGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATACTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATAGCGAATCAAACAGAAAATGAACTATTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGAATTGAA
GATAATACGATATTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGACGATGATTTACAACGTATGATTTTAGC
ACTTCATACAGAATCAAGAGCACATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATACGATGTGGAAGGAA
AGACAATTGAAGATTTAGTGTACGCTTCCTACTTATTAGAAGATGTGAAATTCCAACTTGAAACAGCGGAACAGCATATT
CGTAAAGCGACAGAGCTCGCAATGCTTCCTGATGGTCCAGCGCCTCGCGTTGAAACCCTGCAAGCAGATTTAGTTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAACGAGGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAAAAATTACAAGAAGAGCTATTTAGCCGTAAGCCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTTGTTCAGCTCGTAAAAGTATTTACCGAGCGTTTCCAAACGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAGTGGTGAAATGAAGCCATCAGCAGTAGCGCTTCAA
TATCGCAATAAATTTGCTGAAGTACTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCGATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGCGACGTGAAGCAGTCGATCTATCGTTTCCGACTAGCAG
AACCAGGATTATTCCTAGGAAAGTATAAACGCTTCACGCAAGAAGGATTAGGCGGCGGAATGAAGATCGATTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCAGGCACGAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTCGGAGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCGGCTGAACTACTATGCA
TTCAGCAAACGGAAGAAGAGGTAATAGACGGTGAAGAAGGCGCAGAAGTAGAAAAGGCACAGCTTGAAGCTCGTCTTATG
GCGCAGCGTATTAAAGCGATGGTTGATTCTGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGTCCTGTACAATA
CCGCGACTTCGTTATTTTGCTCCGCTCCATGCCGTGGGCACCGCAAATTATGGAAGAGTTGAAATTACAAGGAATTCCAG
TATACGCTGATCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATCGATAAT
CCGATGCAAGATATTCCGCTTGCAGCCGTGCTTCGTTCCCCAATCGTTGGATTAAATGATGAAGAACTTGCAACACTTCG
TGCTCATGGGAAGAAAGGTTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGCGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTACAACTTACTGCAAGGATGGCGTGAATTCGCGCGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGGTATTACGACTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACTT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGACTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGGACGGCAAGAGCTTTAGGTGAACAAGAAGATGTTGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTCGTAGCTGGACTAGGTCGTCGCTTTAATACACAAGACTTAATGAA
ACGTTTCTTACTGCATAAAGACTTCGGCTTCGGTTCGCAATTTATTGATCCACGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTATTATATGTTGCGTTAACACGT
GCAAAAGAGAAGTTAATTTTAATTGGAACGGTTAAGGATGCAAATAAAGAAATGGAAAAATGGCTTGATGCGAGGGAGCA
TAGTGAATGGTTATTACCAGATCATATACGTGCCGGAGCGTCTTGCTACTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATACTTCTTGAATTGGGACAAGGAAGCATTCCAGATGAAATTTATGGGTATAGTGCAAGCTGGAAA
GTAGAAGTTGTTGACGGAAACACGTTACTTGCGCCAGAACCAGTTCAAGAAGAGAAGCAAGAGTTGTTAGAAGCACTTCG
TGAGAAAAAGGCCGTTCCTCTGCAAAGTGAACGGAAAGAAGAGGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAATGCC
TTTATTAAAAAATTACGTGCACCAATTAGAACACGTCCTCGTTTTATGGAGAAAAAAGGTTTAACGTACGCAGAGCGTGG
AACCGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGATTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTTGAACAAGCAGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGGAGAGCGGGGAATCCATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCTCAGGTGGATTCGAACAAGCGAAACCAATTTTA
GAAGAGCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCGCTCGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTGTAAAAATCGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.908

100

0.539