Detailed information    

In silico   Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   CWD84_RS09010 Genome accession   NZ_CP025001
Coordinates   1657312..1659663 (+) Length   783 a.a.
NCBI ID   WP_060964715.1    Uniprot ID   A0AAI8HMV4
Organism   Bacillus siamensis strain SCSIO 05746     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake
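
The coordinates and protein length listed above are related by simple arithmetic: coordinates are 1-based and inclusive, and the coding sequence ends with one stop codon that is not counted in the protein. A minimal sketch of that check follows; the function names are illustrative and not part of the database.

```python
def cds_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count for a complete CDS: codons minus the single stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


bp = cds_length_bp(1657312, 1659663)
aa = protein_length_aa(bp)
print(bp, aa)  # 2352 783
```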

Genomic Context


Location: 1652312..1664663
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CWD84_RS08970 (CWD84_08970) yhbY 1652586..1652876 (+) 291 WP_003152858.1 ribosome assembly RNA-binding protein YhbY -
  CWD84_RS08975 (CWD84_08975) - 1652887..1653456 (+) 570 WP_016938889.1 nicotinate-nucleotide adenylyltransferase -
  CWD84_RS08980 (CWD84_08980) yqeK 1653446..1654006 (+) 561 WP_060964711.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  CWD84_RS08985 (CWD84_08985) rsfS 1654023..1654379 (+) 357 WP_016938891.1 ribosome silencing factor -
  CWD84_RS08990 (CWD84_08990) - 1654376..1655113 (+) 738 WP_060964712.1 class I SAM-dependent DNA methyltransferase -
  CWD84_RS08995 (CWD84_08995) comER 1655182..1656003 (-) 822 WP_060964713.1 late competence protein ComER -
  CWD84_RS09000 (CWD84_09000) comEA 1656062..1656676 (+) 615 WP_060964714.1 helix-hairpin-helix domain-containing protein Machinery gene
  CWD84_RS09005 (CWD84_09005) - 1656742..1657311 (+) 570 WP_016938895.1 ComE operon protein 2 -
  CWD84_RS09010 (CWD84_09010) comEC 1657312..1659663 (+) 2352 WP_060964715.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  CWD84_RS09015 (CWD84_09015) - 1659682..1659816 (-) 135 WP_003152870.1 YqzM family protein -
  CWD84_RS22095 - 1659857..1660011 (+) 155 Protein_1777 hypothetical protein -
  CWD84_RS09020 (CWD84_09020) holA 1660051..1661092 (+) 1042 Protein_1778 DNA polymerase III subunit delta -
  CWD84_RS09025 (CWD84_09025) rpsT 1661110..1661376 (-) 267 WP_003152876.1 30S ribosomal protein S20 -
  CWD84_RS09030 (CWD84_09030) gpr 1661579..1662685 (+) 1107 WP_060964717.1 GPR endopeptidase -
  CWD84_RS09035 (CWD84_09035) spoIIP 1662753..1663946 (+) 1194 WP_060964718.1 stage II sporulation protein P -
  CWD84_RS09040 (CWD84_09040) - 1663963..1664301 (+) 339 WP_016938900.1 YqxA family protein -
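
The Location range above appears to be the comEC coordinates extended by 5000 bp on each side; that flank size is inferred from the numbers and is not stated on the page. The sketch below reproduces the window and also flags rows whose listed size is not a multiple of 3, which may indicate a partial or frameshifted feature (the page itself does not say). The helper names and the three-row subset are illustrative only.

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a feature by `flank` bp on both sides (flank size is an assumption)."""
    return max(1, start - flank), end + flank


# (locus tag, start, end, listed size in bp), copied from three rows of the table
rows = [
    ("CWD84_RS09010", 1657312, 1659663, 2352),
    ("CWD84_RS22095", 1659857, 1660011, 155),
    ("CWD84_RS09020", 1660051, 1661092, 1042),
]

# Listed sizes should equal end - start + 1 for 1-based inclusive coordinates.
sizes_consistent = all(end - start + 1 == size for _, start, end, size in rows)

# Features whose size cannot be a whole number of codons.
out_of_frame = [locus for locus, _, _, size in rows if size % 3 != 0]

print(context_window(1657312, 1659663))  # (1652312, 1664663)
print(sizes_consistent, out_of_frame)
```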

Sequence


Protein


Length: 783 a.a.        Molecular weight: 87150.01 Da        Isoelectric Point: 9.1532

>NTDB_id=257079 CWD84_RS09010 WP_060964715.1 1657312..1659663(+) (comEC) [Bacillus siamensis strain SCSIO 05746]
MKYKYLLLPLAAVSATAGIAAANFFWVLFFFLLYLLFLFVKTKQYTPFIVSLVSFCLYFFLYTVCDAANVSRYQGGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKSLIEQLEPGMRCVFTGSLEQPGHATIPGGFDYKEHLNSQHI
HWLFSVTSIQQCEKSGMPLYKLLNIRKNLISIIQNHVPESSAGIVEALTLGERFSIEDDILNAYQNLGVVHLMAISGMHV
GLITAGIFYVLIRIGLTREKAGVLLLFFLPVYTLLSGAAPSVLRASLMLGFYIAGTLFKRRIHSSAALSVSYLLLLMFNP
YLLWQAGFQLSFAVSAALILSSSILKKAGKRRFAALAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPF
SVIGFLLLLLSRQVGECLFYMFDLVMKPVHDFMTYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSP
LFFCAVLVYLMFRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVVAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKKHRVKRLIVPVGFVKEPKDQDILKMAKENNIPVAEAKRGDTITAGHLQFQVLSP
ESSDGRSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKAYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGE
DNRYHHPHEEVLDRLKAYSVNVLRTDLSGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD
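
A molecular weight such as the one reported above is typically obtained by summing average residue masses and adding one water for the free termini. The sketch below shows that calculation; it is not the database's own method, and its mass table may differ slightly from the one used to produce 87150.01 Da. The example input is only the first ten residues of the sequence.

```python
# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight(protein: str) -> float:
    """Average molecular weight in Da of an unmodified linear peptide."""
    seq = "".join(protein.split()).upper()  # tolerate FASTA line breaks
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


# First ten residues of the comEC protein, used only as a demonstration input.
print(round(molecular_weight("MKYKYLLLPL"), 2))
```

To check the full protein, paste the whole FASTA body (without the header line) into `molecular_weight`.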

Nucleotide


Length: 2352 bp

>NTDB_id=257079 CWD84_RS09010 WP_060964715.1 1657312..1659663(+) (comEC) [Bacillus siamensis strain SCSIO 05746]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACCGCGGGAATTGCCGCCGCAAATTTCTTCTGGGT
TCTATTCTTTTTTCTTCTGTATCTTCTCTTTCTTTTTGTAAAAACAAAGCAGTACACACCGTTTATCGTCAGCCTCGTTT
CTTTTTGTCTTTATTTCTTTCTGTATACGGTTTGTGACGCTGCGAATGTATCCCGGTATCAGGGCGGGAGCTATACTGAA
CAGGCCGTCATCACAAATATTCCGAAGGTTGACGGGGCGAAAATGTCTGCCGTTATCCGAACGCATGACAAAGAAAAATG
GGCCGCTTCATACAAAATCCGGTCTCTTGAAGAAAAAAGTCTCATTGAACAGCTGGAACCGGGGATGCGCTGCGTGTTTA
CCGGCTCTTTGGAACAGCCCGGACATGCGACGATTCCCGGAGGTTTTGATTATAAGGAACATCTTAATTCTCAGCATATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCGGCATGCCGCTGTATAAATTGCTCAACATCAGAAA
GAATTTGATTTCGATCATTCAAAATCACGTGCCTGAATCCTCTGCCGGAATTGTAGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAACGCTTATCAAAATCTGGGAGTCGTTCATTTAATGGCCATTTCCGGAATGCATGTC
GGTCTGATTACCGCGGGAATATTTTATGTTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGGGTTTTGCTGCTGTT
TTTTTTACCGGTGTATACGCTGTTAAGCGGTGCGGCCCCGTCCGTATTGCGCGCATCTCTCATGCTGGGATTTTATATCG
CCGGAACTCTTTTTAAACGCAGAATTCATTCCTCGGCTGCATTGTCTGTGTCTTATCTGCTGCTCCTCATGTTTAATCCT
TACCTCCTTTGGCAGGCGGGCTTCCAGCTTTCTTTTGCGGTAAGCGCCGCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAGAAGATTTGCCGCGCTAGCGATGGCCTCATTCATTGCGGAGCTCAGTTCGCTTCCGTTTCTGCTCTATC
ATTTTCAGCAGATTTCGCTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCTTTTTATACGTTATTTGTCATTCCGTTT
TCGGTCATCGGTTTCCTTCTTCTCTTACTTTCAAGGCAGGTCGGGGAATGTTTGTTTTATATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATGACGTATGCGGCATCCGTTGATTTATTTACAATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTGTTTACGCTATTTGCAGCTTTGGAAAAGGGCGGTTTTTTGAAACTCAGGAAATCGCCT
CTTTTTTTCTGCGCGGTTTTGGTTTATTTAATGTTCCGTCCTTATTTCAGTCCCTGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCGCTGTTTATAAGTGCGCCGCACCGCAAAGGAACCGTAATGGTTGATACAGGGGGAGTGGTGGCTT
ATCCAGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGTGAGAAGGTTTTGATTCCGTTTTTGAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAGGATCATATCGGGGAAGCCGGAGTATTAATCAAAAA
ACATAGAGTCAAACGGCTGATTGTTCCCGTGGGATTCGTGAAGGAACCGAAGGATCAGGACATATTGAAGATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTCATCTTCAATTTCAGGTGCTGTCCCCG
GAGTCGTCTGACGGAAGAAGTAAAAACGATTCGTCACTGGTGCTTTGGACAGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAGCGTATCCGAATCTGAAGGCTGATATATTGAAGGCCGGTC
ATCACGGCAGCAAAAGCTCTACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCGGCACTGATTTCAGCAGGTGAA
GATAATCGTTACCATCATCCGCATGAAGAGGTGCTGGATCGTTTGAAAGCGTACTCTGTCAATGTGCTTCGCACCGATCT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCTGTCTTTCCTCCATATGATATAGAAGAAACCA
GGGCGCAGGAAGTAAAAAAGACTGCCGATTGA
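
The nucleotide record can be checked against the protein record by translating it codon by codon. The sketch below builds the standard codon table (the bacterial table 11 assigns the same amino acids and differs only in permitted start codons) and translates the first and last codons of the sequence above. The `translate` helper is illustrative and not a database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    seq = "".join(dna.split()).upper()  # tolerate FASTA line breaks
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


print(translate("ATGAAATATAAATACCTT"))  # MKYKYL, the start of the protein
print(translate("GCCGATTGA"))           # AD*, the last two residues and the stop
```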


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168 55.57 98.595 0.548


Multiple sequence alignment