Detailed information    

In silico (bioinformatically predicted)

Overview


Name: addA
Type: Machinery gene
Locus tag: IQ781_RS09835
Genome accession: NZ_CP065509
Coordinates: 1965466..1969191 (-)
Length: 1241 a.a.
NCBI ID: WP_139846981.1
UniProt ID: -
Organism: Bacillus sp. N447-1
Function: homologous recombination; plasmid transformation (predicted from homology)
Homologous recombination
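
The coordinates and the protein length listed above are mutually consistent, and the arithmetic can be checked in a few lines. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the annotated span includes the stop codon; the function names are hypothetical and are not part of the database.

```python
import re


def parse_coordinates(text):
    """Parse a string such as '1965466..1969191 (-)' into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def span_length(start, end):
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def protein_length(nt_length):
    """Residue count for a CDS of nt_length bp, assuming one terminal stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


start, end, strand = parse_coordinates("1965466..1969191 (-)")
gene_bp = span_length(start, end)
protein_aa = protein_length(gene_bp)
print(gene_bp, protein_aa)
```

With the values from this entry, the span is 3726 bp, which corresponds to 1242 codons, or 1241 residues once the stop codon is excluded.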

Genomic Context


Location: 1960466..1974191
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
IQ781_RS09790 (IQ781_09760) | - | 1961620..1962519 (-) | 900 | WP_001213071.1 | fumarylacetoacetate hydrolase family protein | -
IQ781_RS09795 (IQ781_09765) | - | 1962767..1962946 (+) | 180 | WP_000462841.1 | aspartyl-phosphate phosphatase Spo0E family protein | -
IQ781_RS09800 (IQ781_09770) | gerPA | 1963043..1963264 (+) | 222 | WP_001111187.1 | spore germination protein GerPA | -
IQ781_RS09805 (IQ781_09775) | gerPB | 1963279..1963485 (+) | 207 | WP_001012508.1 | spore germination protein GerPB | -
IQ781_RS09810 (IQ781_09780) | gerPC | 1963553..1964167 (+) | 615 | WP_001070760.1 | spore germination protein GerPC | -
IQ781_RS09815 (IQ781_09785) | gerPD | 1964174..1964368 (+) | 195 | WP_001052807.1 | spore germination protein GerPD | -
IQ781_RS09820 (IQ781_09790) | - | 1964384..1964770 (+) | 387 | WP_000902324.1 | spore germination protein GerPE | -
IQ781_RS09825 (IQ781_09795) | gerPF | 1964813..1965028 (+) | 216 | WP_001141566.1 | spore germination protein GerPF | -
IQ781_RS09830 (IQ781_09800) | - | 1965166..1965453 (-) | 288 | WP_000255728.1 | RNA polymerase alpha subunit C-terminal domain-containing protein | -
IQ781_RS09835 (IQ781_09805) | addA | 1965466..1969191 (-) | 3726 | WP_139846981.1 | helicase-exonuclease AddAB subunit AddA | Machinery gene
IQ781_RS09840 (IQ781_09810) | addB | 1969188..1972703 (-) | 3516 | WP_139846980.1 | helicase-exonuclease AddAB subunit AddB | Machinery gene
IQ781_RS09845 (IQ781_09815) | lepB | 1972820..1973383 (-) | 564 | WP_000751919.1 | signal peptidase I | -
IQ781_RS09850 (IQ781_09820) | - | 1973440..1974033 (-) | 594 | WP_000347517.1 | TVP38/TMEM64 family protein | -
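
The Location window above matches the addA coordinates extended by 5,000 bp on each side (1965466 - 5000 = 1960466; 1969191 + 5000 = 1974191). The sketch below recomputes that window and the overlap between the adjacent addA and addB intervals. The 5,000 bp flank is an assumption inferred from these numbers, not something the record states, and the helper names are hypothetical.

```python
FLANK = 5000  # assumed flank size, inferred from the Location line


def context_window(start, end, flank=FLANK):
    """Extend a 1-based inclusive interval by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def overlap_bp(a, b):
    """Number of bases shared by two inclusive (start, end) intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


add_a = (1965466, 1969191)  # addA coordinates from the table
add_b = (1969188, 1972703)  # addB coordinates from the table

window = context_window(*add_a)
shared = overlap_bp(add_a, add_b)
print(window, shared)
```

Under these assumptions the window reproduces 1960466..1974191, and the two annotated intervals share 4 bp (1969188..1969191).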

Sequence


Protein


Length: 1241 a.a.; Molecular weight: 142745.93 Da; Isoelectric point: 4.8018
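
A molecular weight such as the one above is commonly estimated by summing average residue masses and adding one water molecule for the free termini. The following is a minimal sketch of that general approach. How this record's value was actually computed is not stated, so results for the full sequence may differ slightly from the figure shown; the function name is hypothetical.

```python
# Average residue masses in Da (amino acid minus one water), standard values.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # average mass of H2O in Da


def molecular_weight(protein):
    """Estimate the average molecular weight (Da) of an unmodified peptide."""
    protein = protein.upper()
    unknown = set(protein) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Small check on a dipeptide (glycyl-alanine) rather than the full protein.
print(round(molecular_weight("GA"), 2))
```

Passing the full 1241-residue sequence to `molecular_weight` gives an estimate that can be compared with the listed value.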

>NTDB_id=511424 IQ781_RS09835 WP_139846981.1 1965466..1969191(-) (addA) [Bacillus sp. N447-1]
MIENWPKKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIINEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDEPGSQHVRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNTIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEQHI
RKATELAMLPDGPAPRIETLQADVALLGTLSSAARESWTSLYEAMQNVSWQTLKRIKKSDYNEEVVKQVDSLRNKAKDEV
KKLQEELFSRKPESFLRDFQDMHPVLEKLVQLVKVFTERFQAMKRDKGMVDFTDLEHFCLQILSEQSENGEMNPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLGGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGATYPEGEDVAAELLCIQQTEEEVLDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKNDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLSDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGSIPDEIYGYDTSWK
VEVVDGNTLLAPEPVQEEKQELLEALREKKAVPLQSERKEEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITVEVLQEQIAGMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLAAEEAYQDWQGKSEETILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFEQAKPIL
EDRYKVQLSLYAKALEKSLQHPVKEKCLYFFDGNHVVNIEE
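
The FASTA header used in this record packs several fields into one line: database ID, locus tag, protein accession, coordinates with strand, gene name, and organism. The sketch below splits such a header with a regular expression and joins wrapped sequence lines. The pattern is an assumption based only on the header shown here and may not cover other records; the function names are hypothetical.

```python
import re

# Pattern inferred from the single header shown in this record (assumption).
HEADER_PATTERN = re.compile(
    r">NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]"
)


def parse_header(header):
    """Split a header line into named fields; coordinates become integers."""
    match = HEADER_PATTERN.fullmatch(header.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {header!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


def read_fasta_record(text):
    """Return (header, sequence) for one FASTA record, joining wrapped lines."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("text does not start with a FASTA header")
    return lines[0], "".join(lines[1:])


# Truncated excerpt (first 40 residues only), re-wrapped for illustration.
record = """>NTDB_id=511424 IQ781_RS09835 WP_139846981.1 1965466..1969191(-) (addA) [Bacillus sp. N447-1]
MIENWPKKPEGSQWTDDQWK
AVVANGRDILVAAAAGSGKT
"""
header, sequence = read_fasta_record(record)
fields = parse_header(header)
print(fields["gene"], fields["strand"], len(sequence))
```

Applied to the complete record, `len(sequence)` can be compared with the stated length of 1241 a.a.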

Nucleotide


Length: 3726 bp

>NTDB_id=511424 IQ781_RS09835 WP_139846981.1 1965466..1969191(-) (addA) [Bacillus sp. N447-1]
ATGATAGAAAATTGGCCTAAAAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCTGTTGTAGCGAACGGACG
TGATATTTTAGTTGCTGCTGCAGCTGGATCAGGTAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATTAATG
AAGAGAACCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCAGCGCAAGAGATGAAAAACCGAATC
GGGGAAGCGTTAGAAAAGGTATTAATTGATGAACCAGGCTCTCAACACGTAAGAAAGCAGCTGAGTCTATTAAATAAAGC
TTCCATTTCAACAATTCACTCATTCTGTTTACAAGTAATTAGAGGATATTACTACATGCTTGATGTTGATCCTCGTTTCC
GTATTGCGAATCAAACAGAAAATGAATTGTTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGCATCGAA
GATAATACGATATTCTTTGAACTCGTTGATCGTTATACGAGTGACCGTAGTGATGATGACTTACAACGTATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCAAATCCGGAAAAATGGCTTGATAAATTAGTAGAAGCATATGACGTAGAAGGAA
AGACAATTGAGGATTTAGTATATGCTTCTTATTTATTAGAAGATGTGAAATTCCAGCTTGAAACAGCGGAACAGCATATT
CGTAAGGCAACTGAACTCGCAATGCTTCCTGACGGTCCAGCGCCTCGCATTGAAACCCTGCAAGCAGATGTAGCTTTACT
TGGAACGCTATCATCAGCTGCTCGTGAGTCGTGGACAAGTTTGTATGAAGCGATGCAAAACGTATCGTGGCAAACGTTAA
AGCGTATTAAGAAAAGTGATTACAATGAAGAGGTTGTAAAACAAGTAGACTCTCTTCGTAATAAAGCGAAAGATGAAGTA
AAGAAATTACAGGAAGAGTTATTTAGCCGCAAACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCACCCAGTATTAGA
AAAGCTTGTTCAACTTGTAAAAGTATTTACAGAGCGTTTCCAAGCGATGAAGCGAGATAAAGGAATGGTCGATTTCACAG
ATTTAGAGCATTTCTGTTTACAAATTTTAAGTGAACAAAGTGAAAATGGTGAAATGAATCCGTCAGCAGTGGCGCTCCAA
TATCGTAATAAATTTGCTGAAGTATTAGTCGATGAATATCAAGACACGAACTTTGTACAGGAATCCATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAGCAGTCGATTTATCGTTTCCGACTAGCCG
AGCCAGGCTTATTCCTAGGAAAGTATAAACGTTTCACACAAGAAGGATTGGGCGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGTCGTCATGAAGTACTAGCAGGTACGAACTTTATTTTTAAACAAATTATGGGCGAAGAAGTTGGGGAAAT
TGACTACGATGCTGACGCTGAACTAAAGCTAGGTGCTACCTATCCAGAAGGTGAAGATGTAGCAGCAGAATTATTATGTA
TTCAGCAAACGGAAGAAGAAGTACTAGACGGTGAAGAAGGTGCAGAAGTCGAAAAAGCACAGCTTGAAGCTCGCCTTATG
GCGCAGCGCATTAAAGCGATGGTTGATTCCGGTTATGAAGTGTATGACCGTAAAAATGATAGTATGCGTCCGGTGCAATA
TCGCGATTTCGTTATTTTACTTCGCTCCATGCCGTGGGCCCCGCAAATTATGGAAGAGTTAAAATTACAAGGAATTCCAG
TATACGCTGACCTCGCGACTGGTTATTTTGAAGCGACGGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCGCTTGCCGCTGTACTTCGTTCCCCAATCGTTGGATTGAGCGATGAAGAACTTGCAACGCTTCG
TGCTCATGGAAAGAAAGGCTCGTTTTATGAAGTAATGAGCTCATTCTTAAAAGGGGCACCGCTTGAAGAGGAGCAAGAAC
TTCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTTGCCCGTCAACAATCCCTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGTTATTACGACTTCGTTGGCGGTTTACCAGCAGGAAAGCAAAGACAAGCAAACTT
ACGCGTACTATATGACCGTGCAAGACAATATGAAGCAACATCATTTAGAGGGTTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGATGATATGGGTACGGCGAGAGCCCTAGGTGAACAAGAAGACGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGCTAGAGTTCCCGGTCGTATTTGTAGCTGGACTCGGTCGTCGTTTTAATACACAAGACTTAATGAA
ACGTTTCTTATTGCATAAAGATTTCGGTTTCGGTTCGCAATTTATCGATCCGCGTAAACGAATTAAATATACAACATTAT
CACAGCTTGCGATTAAGCGTAAAATGAAAATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCATTGACGCGT
GCGAAAGAGAAGTTAATTTTAATCGGAACAGTTAAGGATGCAAATAAGGAAATGGAAAAATGGCTTGATGCGAGGGAACA
TAGTGAATGGTTATTACCAGATCACATACGTGCCGGAGCGTCTTGTTATTTAGACTGGATTGCACCTTCATTATATAGAC
ACCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAGTATTCCAGATGAAATTTACGGATATGACACTAGCTGGAAA
GTAGAAGTTGTGGACGGTAACACACTACTTGCACCAGAGCCAGTTCAAGAAGAGAAACAAGAATTGTTAGAAGCACTTCG
TGAGAAAAAGGCTGTTCCCCTGCAAAGTGAACGGAAAGAAGAAGTGTACGACAGATTAATGTGGAAGTACGGATATGAGG
ATGCGACATCTCATCGTGCGAAGCAATCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGTAGCGATAACGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGCCCGCGCTTTATGGAGAAAAAAGGGTTAACGTACGCAGAGCGCGG
AACAGCAGTCCATGCCGTTATGCAGCATGTTGATTTGAAGAAGCCGATTACGGTTGAAGTTCTTCAAGAGCAAATTGCTG
GAATGGTAAATAAGGAATTATTAACATTCGAGCAGGCGGAAGAAATAGCGATTGAAAAAGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGGGTATTAGCGGCGAAAAGTGTTGAGCGTGAAGTACCATTTACGATGATGCTTGCAGCAGAAGAAGC
ATATCAAGATTGGCAAGGGAAGAGTGAAGAAACGATTCTTGTCCAAGGGGTTATCGACTGCATGATTGAAGAGGAAGATG
GCATTACGTTAATCGACTTTAAAACAGATACGATTGAAGGGAAATTCCCAGGCGGATTCGAACAAGCGAAACCAATTTTA
GAAGATCGATATAAAGTGCAGCTTTCGTTATATGCAAAAGCACTGGAGAAAAGCTTACAACATCCTGTAAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCACGTTGTAAATATTGAAGAATAG
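
The nucleotide sequence above begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below translates short excerpts from its start and end with the standard codon assignments and includes a reverse-complement helper for converting between strands. The orientation is an assumption based on those start and stop codons, and the function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments, listed in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an unambiguous DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna, stop_at_stop=True):
    """Translate in frame 1; trailing bases that do not fill a codon are ignored."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGATAGAAAATTGGCCTAAAAAACCA"  # first 27 nt of the sequence above
last_codons = "AATATTGAAGAATAG"               # final 15 nt, including the stop
print(translate(first_codons), translate(last_codons))
```

The two excerpts translate to MIENWPKKP and NIEE, which match the first and last residues of the protein sequence listed earlier.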


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
addA | Bacillus subtilis subsp. subtilis str. 168 | 53.328 | 100 | 0.536