Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   P3F89_RS21350 Genome accession   NZ_CP119875
Coordinates   4111772..4115497 (-) Length   1241 a.a.
NCBI ID   WP_309573708.1    Uniprot ID   -
Organism   Bacillus tropicus strain T36S-23     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 4106772..4120497
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  P3F89_RS21300 (P3F89_21270) - 4107605..4107889 (+) 285 WP_098364062.1 hypothetical protein -
  P3F89_RS21305 (P3F89_21275) - 4107925..4108824 (-) 900 WP_038356988.1 fumarylacetoacetate hydrolase family protein -
  P3F89_RS21310 (P3F89_21280) - 4109072..4109251 (+) 180 WP_000462845.1 aspartyl-phosphate phosphatase Spo0E family protein -
  P3F89_RS21315 (P3F89_21285) gerPA 4109348..4109569 (+) 222 WP_001111187.1 spore germination protein GerPA -
  P3F89_RS21320 (P3F89_21290) gerPB 4109584..4109790 (+) 207 WP_001012508.1 spore germination protein GerPB -
  P3F89_RS21325 (P3F89_21295) gerPC 4109859..4110473 (+) 615 WP_002194454.1 spore germination protein GerPC -
  P3F89_RS21330 (P3F89_21300) gerPD 4110480..4110674 (+) 195 WP_001102341.1 spore germination protein GerPD -
  P3F89_RS21335 (P3F89_21305) - 4110690..4111076 (+) 387 WP_002194453.1 spore germination protein GerPE -
  P3F89_RS21340 (P3F89_21310) gerPF 4111119..4111334 (+) 216 WP_001141566.1 spore germination protein GerPF -
  P3F89_RS21345 (P3F89_21315) - 4111472..4111759 (-) 288 WP_002194451.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  P3F89_RS21350 (P3F89_21320) addA 4111772..4115497 (-) 3726 WP_309573708.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  P3F89_RS21355 (P3F89_21325) addB 4115494..4119009 (-) 3516 WP_098367907.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  P3F89_RS21360 (P3F89_21330) lepB 4119126..4119689 (-) 564 WP_002194447.1 signal peptidase I -
  P3F89_RS21365 (P3F89_21335) - 4119746..4120339 (-) 594 WP_000347516.1 TVP38/TMEM64 family protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142745.10 Da        Isoelectric Point: 4.8851

>NTDB_id=801227 P3F89_RS21350 WP_309573708.1 4111772..4115497(-) (addA) [Bacillus tropicus strain T36S-23]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEEKPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLINEPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRVETLQADLVLLGTLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVRQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQTMKRDKGMVDFTDLEHFCLQILSEQSESGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEKELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEILLELGQGSIPDEIYGYSASWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIRTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITIEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGESGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFSGGFEQAKPIL
EERYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVKIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=801227 P3F89_RS21350 WP_309573708.1 4111772..4115497(-) (addA) [Bacillus tropicus strain T36S-23]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAGTGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCGGCAGCAGCTGGTTCAGGGAAAACAGCGGTATTAGTTGAACGTATTATTAAAAAGATTATAAATG
AAGAAAAACCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAACAGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTAATGAGCCGGGCTCTCAGCACATTAGAAAGCAGCTGAGCTTATTAAATAAAGC
TTCCATTTCAACGATCCATTCATTTTGTTTACAAGTTATTAGAGGATACTATTACATGCTTGATGTTGATCCTCGTTTCC
GCATAGCGAATCAAACAGAAAATGAACTATTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGAATTGAA
GATAATACGATATTCTTTGAACTTGTTGATCGTTATACGAGTGACCGTAGTGACGATGATTTACAACGTATGATTTTAGC
ACTTCATACAGAATCAAGAGCACATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATACGATGTGGAAGGAA
AGACAATTGAAGATTTAGTGTACGCTTCCTACTTATTAGAAGATGTGAAATTCCAACTTGAAACAGCGGAACAGCATATT
CGTAAAGCGACAGAGCTCGCAATGCTTCCTGATGGTCCAGCGCCTCGCGTTGAAACCCTGCAAGCAGATTTAGTTTTACT
TGGAACGTTATCATCAGCTGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAACGAGGATGTTGTAAGACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAAAAATTACAAGAAGAGCTATTTAGCCGTAAGCCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAGCTTGTTCAGCTCGTAAAAGTATTTACCGAGCGTTTCCAAACGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAGTGGTGAAATGAAGCCATCAGCAGTAGCGCTTCAA
TATCGCAATAAATTTGCTGAAGTACTAGTCGATGAATATCAAGATACGAACTTCGTACAAGAATCGATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGCGACGTGAAGCAGTCGATCTATCGTTTCCGACTAGCAG
AACCAGGATTATTCCTAGGAAAGTATAAACGCTTCACGCAAGAAGGATTAGGCGGCGGAATGAAGATCGATTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTGTTAGCAGGCACGAACTTTATCTTCAAACAAATTATGGGCGAAGAAGTCGGAGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCGGCTGAACTACTATGCA
TTCAGCAAACGGAAGAAGAGGTAATAGACGGTGAAGAAGGCGCAGAAGTAGAAAAGGCACAGCTTGAAGCTCGTCTTATG
GCGCAGCGTATTAAAGCGATGGTTGATTCTGGTTATGAAGTGTATGATCGTAAAACGGATAGTATGCGTCCTGTACAATA
CCGCGACTTCGTTATTTTGCTCCGCTCCATGCCGTGGGCACCGCAAATTATGGAAGAGTTGAAATTACAAGGAATTCCAG
TATACGCTGATCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATCGATAAT
CCGATGCAAGATATTCCGCTTGCAGCCGTGCTTCGTTCCCCAATCGTTGGATTAAATGATGAAGAACTTGCAACACTTCG
TGCTCATGGGAAGAAAGGTTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGCGCACCGCTTGAAGAAGAAAAAGAAC
TACATGATAAATTAGAATGGTTCTACAACTTACTGCAAGGATGGCGTGAATTCGCGCGCCAACAGTCTCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGGTATTACGACTTTGTCGGTGGTTTACCAGCTGGAAAGCAAAGGCAAGCAAACTT
GCGTGTACTATATGACCGCGCAAGACAATATGAAGCAACATCGTTTAGAGGACTATTCCGCTTCTTACGCTTTATTGAGC
GTATTTTAGAACGCGGTGATGATATGGGGACGGCAAGAGCTTTAGGTGAACAAGAAGATGTTGTTCGCATTATGACAATT
CATAAAAGTAAAGGACTTGAGTTCCCAGTCGTATTCGTAGCTGGACTAGGTCGTCGCTTTAATACACAAGACTTAATGAA
ACGTTTCTTACTGCATAAAGACTTCGGCTTCGGTTCGCAATTTATTGATCCACGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTATTATATGTTGCGTTAACACGT
GCAAAAGAGAAGTTAATTTTAATTGGAACGGTTAAGGATGCAAATAAAGAAATGGAAAAATGGCTTGATGCGAGGGAGCA
TAGTGAATGGTTATTACCAGATCATATACGTGCCGGAGCGTCTTGCTACTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATACTTCTTGAATTGGGACAAGGAAGCATTCCAGATGAAATTTATGGGTATAGTGCAAGCTGGAAA
GTAGAAGTTGTTGACGGAAACACGTTACTTGCGCCAGAACCAGTTCAAGAAGAGAAGCAAGAGTTGTTAGAAGCACTTCG
TGAGAAAAAGGCCGTTCCTCTGCAAAGTGAACGGAAAGAAGAGGTGTACGATAGATTAATGTGGAAGTACGGATATGAGG
AAGCGACATCTCATCGTGCGAAGCAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAATGCC
TTTATTAAAAAATTACGTGCACCAATTAGAACACGTCCTCGTTTTATGGAGAAAAAAGGTTTAACGTACGCAGAGCGTGG
AACCGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGATTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTTGAACAAGCAGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGGAGAGCGGGGAATCCATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTCAAAACAGATACGATTGAAGGGAAATTCTCAGGTGGATTCGAACAAGCGAAACCAATTTTA
GAAGAGCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCGCTCGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGTAATCATGTTGTAAAAATCGAAGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.828

100

0.538