Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   ACFXAA_RS18750 Genome accession   NZ_CP170717
Coordinates   3826332..3828683 (+) Length   783 a.a.
NCBI ID   WP_317859499.1    Uniprot ID   -
Organism   Bacillus amyloliquefaciens strain JP5014     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3821332..3833683
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACFXAA_RS18710 (ACFXAA_18710) yhbY 3821604..3821894 (+) 291 WP_007408257.1 ribosome assembly RNA-binding protein YhbY -
  ACFXAA_RS18715 (ACFXAA_18715) - 3821905..3822474 (+) 570 WP_007408258.1 nicotinate-nucleotide adenylyltransferase -
  ACFXAA_RS18720 (ACFXAA_18720) yqeK 3822464..3823024 (+) 561 WP_014418422.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  ACFXAA_RS18725 (ACFXAA_18725) rsfS 3823042..3823398 (+) 357 WP_007408260.1 ribosome silencing factor -
  ACFXAA_RS18730 (ACFXAA_18730) - 3823395..3824132 (+) 738 WP_015417856.1 class I SAM-dependent methyltransferase -
  ACFXAA_RS18735 (ACFXAA_18735) comER 3824201..3825022 (-) 822 WP_039254409.1 late competence protein ComER -
  ACFXAA_RS18740 (ACFXAA_18740) comEA 3825081..3825695 (+) 615 WP_015240245.1 helix-hairpin-helix domain-containing protein Machinery gene
  ACFXAA_RS18745 (ACFXAA_18745) - 3825762..3826331 (+) 570 WP_003152868.1 ComE operon protein 2 -
  ACFXAA_RS18750 (ACFXAA_18750) comEC 3826332..3828683 (+) 2352 WP_317859499.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  ACFXAA_RS18755 (ACFXAA_18755) - 3828702..3828836 (-) 135 WP_003152870.1 YqzM family protein -
  ACFXAA_RS18760 (ACFXAA_18760) - 3828877..3829028 (+) 152 Protein_3631 hypothetical protein -
  ACFXAA_RS18765 (ACFXAA_18765) holA 3829068..3830109 (+) 1042 Protein_3632 DNA polymerase III subunit delta -
  ACFXAA_RS18770 (ACFXAA_18770) rpsT 3830126..3830392 (-) 267 WP_003152876.1 30S ribosomal protein S20 -
  ACFXAA_RS18775 (ACFXAA_18775) gpr 3830595..3831701 (+) 1107 WP_007408268.1 GPR endopeptidase -
  ACFXAA_RS18780 (ACFXAA_18780) spoIIP 3831769..3832962 (+) 1194 WP_007612673.1 stage II sporulation protein P -
  ACFXAA_RS18785 (ACFXAA_18785) - 3832979..3833317 (+) 339 WP_007408270.1 YqxA family protein -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 86638.38 Da        Isoelectric Point: 8.7881

>NTDB_id=1058691 ACFXAA_RS18750 WP_317859499.1 3826332..3828683(+) (comEC) [Bacillus amyloliquefaciens strain JP5014]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIIVKTKQYAPVIVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYSQQI
HWLFSVTSIEQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGESRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVIVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFGMFDLVMKPVHDFITYAASVDLFTMIVSKPDFVSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGRSKNDSSLVLWTVFGGVSWLLTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDVSGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=1058691 ACFXAA_RS18750 WP_317859499.1 3826332..3828683(+) (comEC) [Bacillus amyloliquefaciens strain JP5014]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATTGTAAAAACAAAGCAGTATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGCTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATTACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAGTG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTGAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGGAAAGCAGACTTGCCGGCCTTGCGATGGCTTCATTCATTGCGGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATTGTGCCTTTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTCTCAAGGCAGATGGGAGAATGTTTGTTTGGTATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTGTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCATATTCTATCGGCGAGAAGGTTTTGATTCCATTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAAGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTAGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGGGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTTGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAGCAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATGT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

56.865

98.595

0.561


Multiple sequence alignment