Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   E3U39_RS17105 Genome accession   NZ_CP038028
Coordinates   3393502..3395853 (+) Length   783 a.a.
NCBI ID   WP_053573181.1    Uniprot ID   -
Organism   Bacillus amyloliquefaciens strain FS1092     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3388502..3400853
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E3U39_RS17065 (E3U39_17065) yhbY 3388774..3389064 (+) 291 WP_003152858.1 ribosome assembly RNA-binding protein YhbY -
  E3U39_RS17070 (E3U39_17070) - 3389075..3389644 (+) 570 WP_007408258.1 nicotinate-nucleotide adenylyltransferase -
  E3U39_RS17075 (E3U39_17075) yqeK 3389634..3390194 (+) 561 WP_014418422.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  E3U39_RS17080 (E3U39_17080) rsfS 3390212..3390568 (+) 357 WP_007408260.1 ribosome silencing factor -
  E3U39_RS17085 (E3U39_17085) - 3390565..3391302 (+) 738 WP_015417856.1 class I SAM-dependent methyltransferase -
  E3U39_RS17090 (E3U39_17090) comER 3391371..3392192 (-) 822 WP_015417855.1 late competence protein ComER -
  E3U39_RS17095 (E3U39_17095) comEA 3392251..3392865 (+) 615 WP_053573180.1 helix-hairpin-helix domain-containing protein Machinery gene
  E3U39_RS17100 (E3U39_17100) - 3392932..3393501 (+) 570 WP_003152868.1 ComE operon protein 2 -
  E3U39_RS17105 (E3U39_17105) comEC 3393502..3395853 (+) 2352 WP_053573181.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  E3U39_RS17110 (E3U39_17110) - 3395872..3396006 (-) 135 WP_003152870.1 YqzM family protein -
  E3U39_RS17115 (E3U39_17115) - 3396047..3396201 (+) 155 Protein_3275 hypothetical protein -
  E3U39_RS17120 (E3U39_17120) holA 3396241..3397282 (+) 1042 Protein_3276 DNA polymerase III subunit delta -
  E3U39_RS17125 (E3U39_17125) rpsT 3397299..3397565 (-) 267 WP_003152876.1 30S ribosomal protein S20 -
  E3U39_RS17130 (E3U39_17130) gpr 3397768..3398874 (+) 1107 WP_053573183.1 GPR endopeptidase -
  E3U39_RS17135 (E3U39_17135) - 3398942..3400135 (+) 1194 WP_015417852.1 stage II sporulation protein P -
  E3U39_RS17140 (E3U39_17140) - 3400152..3400490 (+) 339 WP_007408270.1 YqxA family protein -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 86668.43 Da        Isoelectric Point: 8.6744

>NTDB_id=351617 E3U39_RS17105 WP_053573181.1 3393502..3395853(+) (comEC) [Bacillus amyloliquefaciens strain FS1092]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIIVKTKHHAPVIVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYSQQI
HWLFSVTSIEQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGESRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFGMFDLVMKPVHDFITYAASVDLFTMIVSKPDFVSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGRSKNDSSLVLWTVFGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFEKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=351617 E3U39_RS17105 WP_053573181.1 3393502..3395853(+) (comEC) [Bacillus amyloliquefaciens strain FS1092]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCTGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATTGTAAAAACAAAGCACCATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGCTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATTACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAGTG
GGCGGCTTCGTACAAAATCCGTTCTCTTGAGGAAAAAAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTGAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCTTGTCTTATCTGCTGCTTCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGGAAAGCAGACTTGCCGGGCTTGCGATGGCTTCATTCATTGCGGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCTTTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTCTCAAGGCAGATGGGAGAATGTTTGTTTGGTATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTGTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCATATTCTATCGGCGAGAAGGTTTTGATTCCCTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAAGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTAGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGGGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTTGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTGAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

56.865

98.595

0.561


Multiple sequence alignment