Detailed information    

in silico   Bioinformatically predicted

Overview


Name   addA   Type   Machinery gene
Locus tag   KPL75_RS19765 Genome accession   NZ_CP076653
Coordinates   3797406..3801131 (+) Length   1241 a.a.
NCBI ID   WP_219917449.1    UniProt ID   -
Organism   Bacillus sp. NP247     
Function   homologous recombination; plasmid transformation (predicted from homology)   
Homologous recombination
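
The coordinates and lengths reported above can be cross-checked with simple arithmetic. The Python sketch below assumes that the coordinates are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are illustrative and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# addA coordinates as listed in the Overview: 3797406..3801131 (+)
cds_bp = span_length(3797406, 3801131)
print(cds_bp)                  # 3726
print(protein_length(cds_bp))  # 1241
```

Under these assumptions the result agrees with the 3726 bp nucleotide length and the 1241 a.a. protein length given in the record.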

Genomic Context


Location: 3792406..3806131
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KPL75_RS19750 (KPL75_19745) - 3792564..3793157 (+) 594 WP_078179996.1 TVP38/TMEM64 family protein -
  KPL75_RS19755 (KPL75_19750) lepB 3793214..3793777 (+) 564 WP_002149908.1 signal peptidase I -
  KPL75_RS19760 (KPL75_19755) addB 3793894..3797409 (+) 3516 WP_219917448.1 helicase-exonuclease AddAB subunit AddB Machinery gene
  KPL75_RS19765 (KPL75_19760) addA 3797406..3801131 (+) 3726 WP_219917449.1 helicase-exonuclease AddAB subunit AddA Machinery gene
  KPL75_RS19770 (KPL75_19765) - 3801155..3801433 (+) 279 WP_219917450.1 RNA polymerase alpha subunit C-terminal domain-containing protein -
  KPL75_RS19775 (KPL75_19770) gerPF 3801559..3801774 (-) 216 WP_098385040.1 spore germination protein GerPF -
  KPL75_RS19780 (KPL75_19775) - 3801817..3802203 (-) 387 WP_219917451.1 spore germination protein GerPE -
  KPL75_RS19785 (KPL75_19780) gerPD 3802218..3802412 (-) 195 WP_002149893.1 spore germination protein GerPD -
  KPL75_RS19790 (KPL75_19785) - 3802419..3803033 (-) 615 WP_002087034.1 spore germination protein GerPC -
  KPL75_RS19795 (KPL75_19790) gerPB 3803098..3803304 (-) 207 WP_002149891.1 spore germination protein GerPB -
  KPL75_RS19800 (KPL75_19795) gerPA 3803319..3803540 (-) 222 WP_002149890.1 spore germination protein GerPA -
  KPL75_RS19805 (KPL75_19800) - 3803637..3803816 (-) 180 WP_002149889.1 aspartyl-phosphate phosphatase Spo0E family protein -
  KPL75_RS19815 (KPL75_19810) - 3804233..3805363 (+) 1131 Protein_3850 RNA-guided endonuclease InsQ/TnpB family protein -
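
Two relationships in the table above can be checked numerically. The reported Location appears to be the addA coordinates extended by 5,000 bp on each side; this is an inference from the numbers, not a documented rule of the database. The addB and addA coordinates also overlap slightly. The Python sketch below assumes 1-based, inclusive coordinates, and the helper names `flanking_window` and `overlap_bp` are hypothetical.

```python
def flanking_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on each side.

    The 5000 bp default is an assumption inferred from the record.
    """
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two 1-based inclusive ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


add_b = (3793894, 3797409)  # addB coordinates from the table
add_a = (3797406, 3801131)  # addA coordinates from the table

print(flanking_window(*add_a))   # (3792406, 3806131)
print(overlap_bp(add_b, add_a))  # 4
```

The computed window matches the Location line, and the 4 bp shared by the end of addB and the start of addA follows directly from the listed coordinates.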

Sequence


Protein


Length: 1241 a.a.        Molecular weight: 142644.11 Da        Isoelectric Point: 4.9370

>NTDB_id=577015 KPL75_RS19765 WP_219917449.1 3797406..3801131(+) (addA) [Bacillus sp. NP247]
MIENWPQKPEGSQWTDDQWKAVVANGRDILVAAAAGSGKTAVLVERIIKKIISEENPVDVDRLLVVTFTNAAAQEMKNRI
GEALEKVLIDGPGSQHIRKQLSLLNKASISTIHSFCLQVIRGYYYMLDVDPRFRIANQTENELLKEEVLDDILEEEYGIE
DNSIFFELVDRYTSDRSDDDLQRMILALHTESRAHPNPEKWLDKLVEAYDVEGKTIEDLVYASYLLEDVKFQLETAEEHI
RKATELAMLPDGPAPRVETLQADLALLGMLSSAARGSWTGVYEAMQNVSWQTLKRIKKSDYNEDVVKQVDSLRNKAKDEV
KKLQEELFSRRPESFLRDFQDMHPVLEKLVKLVKVFTERFQAIKRDKGMVDFTDLEHFCLQILSEQSEAGEMKPSAVALQ
YRNKFAEVLVDEYQDTNFVQESIIKFVTKDSESEGNLFMVGDVKQSIYRFRLAEPGLFLGKYKRFTQEGLDGGMKIDLAK
NFRSRHEVLAGTNFIFKQIMGEEVGEIDYDADAELKLGASYPEGEDVAAELLCIQQTEEEVIDGEEGAEVEKAQLEARLM
AQRIKAMVDSGYEVYDRKTDSMRPVQYRDFVILLRSMPWAPQIMEELKLQGIPVYADLATGYFEATEVNIMMNVFRVIDN
PMQDIPLAAVLRSPIVGLNDEELATLRAHGKKGSFYEVMSSFLKGAPLEEEQELHDKLEWFYNLLQGWREFARQQSLSDL
IWKVYGETGYYDFVGGLPAGKQRQANLRVLYDRARQYEATSFRGLFRFLRFIERILERGDDMGTARALGEQEDVVRIMTI
HKSKGLEFPVVFVAGLGRRFNTQDLMKRFLLHKDFGFGSQFIDPRKRIKYTTLSQLAIKRKMKMELIAEEMRVLYVALTR
AKEKLILIGTVKDANKEMEKWLDAREHSEWLLPDHIRAGASCYLDWIAPSLYRHRDSEMLLELGQGNIPSDIYGYDTSWK
VEVVDGNTLLAPEPVQEEKKELLEALREKKAVLLQSERKDEVYDRLMWKYGYEDATSHRAKQSVTEIKRNYQSEEGSDNA
FIKKLRAPIKTRPRFMEKKGLTYAERGTAVHAVMQHVDLKKPITIEVLQEQIARMVNKELLTFEQAEEIAIEKVISFFDS
DLGKRVLAAKSVEREVPFTMMLSAEEAYQDWQGKKGESILVQGVIDCMIEEEDGITLIDFKTDTIEGKFPGGFDQAKPIL
EDRYKVQLSLYAKALEKSLKHPVKEKCLYFFDGNHVVKVEE

Nucleotide


Length: 3726 bp

>NTDB_id=577015 KPL75_RS19765 WP_219917449.1 3797406..3801131(+) (addA) [Bacillus sp. NP247]
ATGATAGAAAATTGGCCTCAGAAACCAGAAGGTAGTCAATGGACAGATGACCAGTGGAAAGCGGTTGTAGCGAACGGACG
TGATATTTTAGTCGCAGCAGCAGCTGGATCAGGGAAAACAGCAGTATTAGTTGAACGTATTATTAAAAAGATTATAAGTG
AGGAAAATCCAGTCGATGTCGACCGCCTGCTCGTTGTAACATTTACGAATGCAGCGGCGCAAGAAATGAAAAATCGAATT
GGGGAAGCGTTAGAAAAAGTATTAATTGATGGGCCAGGTTCACAGCATATAAGAAAACAGCTTAGCTTATTAAATAAAGC
TTCCATTTCTACCATTCATTCATTTTGTTTACAAGTTATTAGAGGATATTATTACATGCTTGATGTCGATCCTCGTTTTC
GTATTGCGAACCAAACAGAAAATGAGTTATTAAAAGAAGAAGTGCTAGATGACATATTAGAAGAAGAGTATGGAATCGAA
GATAATAGTATTTTCTTTGAATTAGTTGATCGTTATACGAGTGACCGTAGTGACGATGACTTACAACGAATGATTTTAGC
GCTTCATACAGAATCAAGAGCGCATCCGAACCCGGAGAAGTGGCTCGATAAATTAGTAGAAGCATACGATGTTGAAGGCA
AGACGATTGAAGATTTAGTGTACGCTTCTTACTTATTAGAAGATGTGAAATTCCAGCTGGAAACAGCGGAAGAGCATATT
CGTAAAGCGACTGAACTCGCAATGCTTCCTGATGGTCCGGCGCCTCGCGTTGAAACCCTGCAAGCGGATTTAGCTTTACT
TGGAATGTTATCCTCAGCAGCTCGTGGATCGTGGACAGGCGTTTATGAAGCGATGCAAAATGTATCGTGGCAAACGCTAA
AGCGTATTAAGAAAAGTGATTATAACGAAGATGTTGTAAAACAAGTAGATTCTCTTCGTAATAAAGCGAAAGATGAAGTA
AAGAAATTACAAGAAGAGCTATTTAGCCGCAGACCTGAAAGTTTCTTACGAGATTTTCAAGATATGCATCCTGTATTAGA
AAAACTCGTGAAGCTTGTAAAAGTATTTACAGAGCGTTTCCAAGCAATTAAACGAGATAAAGGAATGGTTGATTTCACAG
ATTTAGAGCATTTCTGTTTGCAAATTTTAAGTGAGCAAAGTGAAGCCGGTGAAATGAAGCCGTCAGCAGTAGCGCTTCAA
TATCGTAATAAATTTGCTGAAGTACTAGTTGATGAATATCAAGATACGAACTTCGTACAGGAATCAATTATTAAATTCGT
AACGAAAGATTCTGAGAGTGAAGGAAACTTGTTCATGGTTGGTGACGTAAAACAGTCAATCTATCGTTTCCGACTAGCAG
AGCCTGGTTTATTCTTAGGAAAATATAAACGTTTCACACAAGAAGGATTGGACGGCGGAATGAAGATTGACTTAGCGAAA
AACTTCCGTAGCCGTCATGAAGTGCTAGCAGGTACGAACTTTATTTTCAAACAAATTATGGGCGAAGAAGTTGGAGAGAT
TGACTACGACGCTGACGCTGAATTAAAACTAGGCGCTAGTTATCCAGAAGGTGAAGATGTAGCGGCTGAATTATTATGTA
TTCAGCAAACGGAAGAAGAAGTAATAGATGGTGAAGAAGGTGCAGAAGTAGAAAAAGCGCAGCTTGAAGCTCGTCTTATG
GCACAGCGCATTAAAGCGATGGTCGATTCAGGTTATGAAGTGTATGACCGAAAAACGGATAGTATGCGACCAGTACAATA
CCGTGATTTCGTTATTTTACTTCGCTCTATGCCGTGGGCACCGCAAATTATGGAAGAGTTAAAACTACAAGGAATTCCAG
TATATGCAGACCTTGCGACTGGTTACTTTGAAGCGACAGAAGTAAATATTATGATGAACGTATTCCGCGTTATTGATAAT
CCGATGCAAGATATTCCACTTGCAGCAGTACTTCGTTCACCGATCGTTGGACTAAATGACGAAGAACTTGCGACGCTTCG
TGCTCATGGAAAGAAAGGGTCATTTTATGAAGTAATGAGTTCATTCTTAAAAGGAGCACCGCTTGAAGAAGAACAAGAAC
TGCATGATAAATTAGAGTGGTTTTATAACTTACTGCAAGGATGGCGTGAATTTGCGCGCCAACAATCACTTTCTGATTTA
ATTTGGAAAGTGTACGGTGAGACAGGGTATTATGACTTTGTCGGCGGTTTACCAGCTGGAAAGCAAAGGCAGGCAAACTT
GCGCGTATTATATGACCGCGCAAGGCAATATGAAGCAACTTCATTTAGAGGATTATTCCGCTTCTTACGTTTTATTGAAC
GTATTTTAGAACGCGGTGACGATATGGGTACGGCGAGAGCTCTCGGTGAACAAGAAGATGTTGTTCGCATTATGACGATT
CATAAAAGTAAAGGGCTAGAGTTCCCGGTCGTATTTGTAGCTGGACTTGGTCGCCGTTTTAATACACAAGATTTAATGAA
GCGTTTCTTACTTCATAAAGATTTCGGTTTCGGTTCACAATTTATCGATCCTCGTAAACGAATTAAATATACGACATTAT
CGCAACTAGCGATTAAGCGTAAAATGAAGATGGAATTAATTGCGGAAGAAATGCGCGTACTATACGTAGCGCTAACGCGT
GCAAAAGAGAAGTTAATTTTAATTGGAACGGTTAAAGATGCAAATAAAGAAATGGAAAAATGGCTCGATGCAAGAGAGCA
TAGTGAATGGTTATTACCAGACCATATACGTGCCGGAGCGTCATGTTATTTAGATTGGATTGCACCTTCATTATATAGAC
ATCGTGATAGTGAAATGCTTCTTGAATTAGGACAAGGAAATATTCCAAGTGACATTTATGGATATGACACTAGCTGGAAA
GTAGAAGTTGTTGACGGCAACACGTTACTCGCACCAGAACCGGTTCAAGAAGAGAAAAAAGAATTACTAGAAGCACTTCG
TGAGAAAAAAGCTGTTCTGCTACAAAGTGAACGAAAAGATGAAGTATACGACAGATTAATGTGGAAGTACGGATATGAGG
ACGCGACATCTCACCGTGCGAAACAGTCTGTTACAGAAATAAAGAGAAATTATCAATCTGAAGAAGGCAGCGATAATGCC
TTTATTAAAAAACTACGTGCACCAATTAAAACACGTCCGCGCTTTATGGAGAAAAAAGGGCTAACATACGCAGAGCGAGG
GACAGCAGTCCATGCCGTTATGCAGCATGTAGATTTGAAGAAACCAATTACGATTGAAGTTCTTCAAGAACAAATTGCAA
GAATGGTAAATAAAGAACTATTAACATTTGAACAAGCTGAAGAAATAGCGATTGAAAAGGTAATTTCATTCTTTGACAGT
GACTTAGGTAAAAGAGTATTAGCAGCAAAAAGTGTTGAGCGCGAAGTACCATTTACGATGATGCTTTCAGCAGAAGAAGC
GTATCAAGATTGGCAAGGGAAGAAAGGCGAATCAATACTTGTCCAAGGGGTTATCGACTGCATGATCGAAGAGGAAGACG
GAATTACTTTAATCGACTTTAAAACAGATACGATTGAAGGAAAGTTTCCAGGCGGATTTGATCAAGCGAAACCAATTTTA
GAAGACCGATACAAAGTACAGCTTTCACTATATGCAAAAGCACTCGAGAAAAGCTTAAAACATCCTGTGAAAGAGAAATG
TTTATACTTCTTTGATGGGAATCATGTTGTAAAGGTTGAGGAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  addA Bacillus subtilis subsp. subtilis str. 168 53.408 100 0.537