Detailed information    

insolico Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   A6J74_RS11200 Genome accession   NZ_CP020437
Coordinates   1830878..1834603 (-) Length   1241 a.a.
NCBI ID   WP_000572319.1    Uniprot ID   A0AAP8JW19
Organism   Bacillus sp. FDAARGOS_235     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination

Genomic Context


Location: 1825878..1839603
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A6J74_RS11150 (A6J74_11160) - 1826762..1827043 (+) 282 WP_080609500.1 hypothetical protein -
  A6J74_RS11155 (A6J74_11165) - 1827079..1827978 (-) 900 WP_001213067.1 fumarylacetoacetate hydrolase family protein -
  A6J74_RS11160 (A6J74_11170) - 1828225..1828401 (+) 177 WP_000462849.1 aspartyl-phosphate phosphatase Spo0E family protein -
  A6J74_RS11165 (A6J74_11175) gerPA 1828501..1828722 (+) 222 WP_001111187.1 spore germination protein GerPA -
  A6J74_RS11170 (A6J74_11180) gerPB 1828737..1828943 (+) 207 WP_001012514.1 spore germination protein GerPB -
  A6J74_RS11175 (A6J74_11185) gerPC 1829012..1829626 (+) 615 WP_001070753.1 spore germination protein GerPC -
  A6J74_RS11180 (A6J74_11190) gerPD 1829633..1829827 (+) 195 WP_001052804.1 spore germination protein GerPD -
  A6J74_RS11185 (A6J74_11195) - 1829843..1830229 (+) 387 WP_000902321.1 spore germination protein GerPE -
  A6J74_RS11190 (A6J74_11200) gerPF 1830272..1830487 (+) 216 WP_001141566.1 spore germination protein GerPF -
  A6J74_RS11195 (A6J74_11205) - 1830576..1830854 (-) 279 WP_000224035.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  A6J74_RS11200 (A6J74_11210) addA 1830878..1834603 (-) 3726 WP_000572319.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  A6J74_RS11205 (A6J74_11215) addB 1834600..1838115 (-) 3516 WP_000058641.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  A6J74_RS11210 (A6J74_11220) lepB 1838232..1838795 (-) 564 WP_000751915.1 signal peptidase I -
  A6J74_RS11215 (A6J74_11225) - 1838978..1839571 (-) 594 WP_000347508.1 TVP38/TMEM64 family protein -

Sequence


Protein


Download         Length: 1241 a.a.        Molecular weight: 142735.89 Da        Isoelectric Point: 4.8330

>NTDB_id=223046 A6J74_RS11200 WP_000572319.1 1830878..1834603(-) (addA) [Bacillus sp. FDAARGOS_235]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERMIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNSIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVRFQLETAEQHI
RKATELAMLPDGPAPRVETLQADAALLGMLSSAARESWTSVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVKLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENDEVKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGSGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPIGEDVAAELLCIQQTEEEVLEGEEGTEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHEKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEILLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEEATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHTVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLSAEEAYQDWQGNSGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE

Nucleotide


Download         Length: 3726 bp        

>NTDB_id=223046 A6J74_RS11200 WP_000572319.1 1830878..1834603(-) (addA) [Bacillus sp. FDAARGOS_235]
ATGATAGAGAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCCGTTGTAGCGAATGGACG
TGACATTTTAGTCGCAGCAGCAGCTGGATCGGGAAAAACAGCAGTATTAGTTGAACGTATGATTAAAAAGATTATTAATG
AGGAAAATCCAGTCGATGTCGATCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAGATGAAAAATCGAATT
GGAGAAGCGTTAGAAAAAGTACTAATTGATGAGCCAGGATCTCAACATGTAAGAAAACAGCTGAGCCTATTAAATAAAGC
ATCTATTTCAACGATCCATTCATTTTGTTTACAAGTAATTAGAGGGTATTACTACATGCTGGATGTTGATCCTCGTTTTC
GTATTGCAAATCAAACAGAAAATGAGTTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGAATCGAA
GATAATAGTATTTTCTTTGAATTAGTTGATCGCTATACGAGTGACCGTAGTGATGATGACTTACAAAGAATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTCGATAAATTAGTAGAAGCATACGACGTGGAAGGAA
AGACAATTGAAGATTTAGTGTATGCTTCTTACCTATTAGAAGATGTGAGATTCCAGCTTGAAACAGCGGAACAACATATT
CGTAAAGCAACCGAACTCGCAATGCTTCCTGACGGCCCGGCGCCTCGCGTTGAAACCCTGCAAGCGGATGCAGCTTTACT
TGGAATGTTATCATCAGCAGCTCGTGAATCGTGGACAAGCGTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAACGAGGATGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTG
AAGAAATTACAAGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAACTCGTGAAGCTCGTAAAAGTCTTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTTGATTTCACAG
ATTTAGAGCATTTCTGTTTGCAAATTTTAAGTGAACAAAGTGAAAATGATGAAGTGAAGCCATCAGCAGTAGCGCTTCAA
TATCGTAATAAATTTGCAGAAGTACTAGTCGATGAATATCAAGATACGAACTTCGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGGAACTTGTTCATGGTTGGTGACGTAAAACAGTCAATCTACCGTTTCCGACTAGCAG
AACCAGGCTTATTCTTAGGAAAGTATAAACGCTTCACGCAAGAAGGATCGGGCGGCGGAATGAAGATTGATTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATTTTCAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
CGATTACGATGCTGACGCTGAATTAAAGCTAGGTGCTAGCTATCCAATAGGTGAAGATGTAGCAGCTGAATTATTATGCA
TTCAGCAAACGGAAGAAGAAGTACTAGAGGGTGAAGAAGGTACGGAAGTCGAAAAGGCGCAACTGGAAGCTCGTCTTATG
GCGCAGCGCATTAAAGCGATGGTCGATTCAGGTTATGAAGTGTATGACCGAAAAACGGATAGTATGCGACCAGTGCAATA
TCGTGATTTCGTTATTTTACTTCGCTCTATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAACTACAAGGAATTCCAG
TATATGCAGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCACTTGCAGCAGTACTTCGTTCACCGATCGTTGGATTAAATGATGAAGAACTTGCGACGCTTCG
TGCTCACGGAAAGAAAGGGTCATTTTATGAAGTAATGAGCTCATTCTTAAAAGGTGCACCGCTTGAAGAAGAACAAGAAC
TTCATGAAAAACTAGAATGGTTTTATAACTTACTACAAGGATGGCGTGAATTCGCGCGCCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGGTATTATGACTTTGTCGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
GCGTGTATTATATGACCGCGCAAGGCAATATGAAGCAACATCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGACGATATGGGGACGGCGAGAGCTCTTGGTGAACAAGAAGATGTTGTTCGCATTATGACGATT
CATAAAAGTAAGGGACTTGAGTTCCCGGTCGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGATTTAATGAA
ACGTTTCTTACTGCATAAAGACTTCGGTTTCGGTTCACAGTTCATTGATCCGCGTAAACGAATTAAATATACGACATTAT
CGCAACTTGCAATTAAACGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTATTATACGTAGCTTTAACGCGT
GCAAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTCGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCACATACGTGCTGGAGCGTCTTGCTATTTAGACTGGATTGCACCTTCATTATATAGAC
ATCGTGATAGTGAAATACTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGGTATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAACACGCTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAGTTGTTAGAAGCACTTCG
TGAGAAAAAGGCTGTGCCATTACAAAGTGAACGAAAAGAAGAAGTGTACGATAGGTTAATGTGGAAGTACGGGTATGAGG
AAGCAACCTCTCATCGTGCGAAGCAATCCGTTACAGAAATAAAGAGAAATTATCAGTCTGAAGAAGGCAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGATTAACGTACGCAGAGCGTGG
GACGGCAGTCCATACCGTTATGCAACATGTAGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAACAAATTGCTG
GAATGGTAAATAAGGAATTGTTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACCTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTTCCATTTACGATGATGCTATCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAATAGCGGGGAATCGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAAGAAGATG
GTATTACGCTAATCGATTTCAAAACAGATACGATTGAAGGTAAATTCCCAGGCGGATTTGAACAAGCAAAACCAATTTTA
GAAGATCGCTATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTCGAGAAAAGCTTACAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCATGTTGTGAATATTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168

53.328

100

0.536


Multiple sequence alignment