Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   CDO84_RS11835 Genome accession   NZ_CP021911
Coordinates   2365067..2367397 (-) Length   776 a.a.
NCBI ID   WP_040082574.1    Uniprot ID   -
Organism   Bacillus sp. MD-5     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2360067..2372397
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CDO84_RS11805 (CDO84_11805) - 2360405..2360743 (-) 339 WP_040082578.1 YqxA family protein -
  CDO84_RS11810 (CDO84_11810) spoIIP 2360760..2361965 (-) 1206 WP_040082576.1 spore autolysin SpoIIP -
  CDO84_RS11815 (CDO84_11815) gpr 2362028..2363134 (-) 1107 WP_040082575.1 GPR endopeptidase -
  CDO84_RS11820 (CDO84_11820) rpsT 2363338..2363604 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  CDO84_RS11825 (CDO84_11825) holA 2363619..2364662 (-) 1044 WP_014664666.1 DNA polymerase III subunit delta -
  CDO84_RS21405 - 2364702..2364815 (-) 114 WP_134975484.1 hypothetical protein -
  CDO84_RS11830 (CDO84_11830) - 2364892..2365026 (+) 135 WP_003229983.1 YqzM family protein -
  CDO84_RS11835 (CDO84_11835) comEC 2365067..2367397 (-) 2331 WP_040082574.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  CDO84_RS11840 (CDO84_11840) - 2367401..2367970 (-) 570 WP_014664668.1 ComE operon protein 2 -
  CDO84_RS11845 (CDO84_11845) comEA 2368037..2368654 (-) 618 WP_014664669.1 helix-hairpin-helix domain-containing protein Machinery gene
  CDO84_RS11850 (CDO84_11850) comER 2368738..2369559 (+) 822 WP_004398597.1 late competence protein ComER -
  CDO84_RS11855 (CDO84_11855) - 2369625..2370368 (-) 744 WP_014664670.1 class I SAM-dependent methyltransferase -
  CDO84_RS11860 (CDO84_11860) rsfS 2370365..2370721 (-) 357 WP_014664671.1 ribosome silencing factor -
  CDO84_RS11865 (CDO84_11865) yqeK 2370739..2371299 (-) 561 WP_014664672.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  CDO84_RS11870 (CDO84_11870) - 2371289..2371858 (-) 570 WP_014664673.1 nicotinate-nucleotide adenylyltransferase -
  CDO84_RS11875 (CDO84_11875) yhbY 2371870..2372160 (-) 291 WP_014664674.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86673.24 Da        Isoelectric Point: 7.6657

>NTDB_id=234810 CDO84_RS11835 WP_040082574.1 2365067..2367397(-) (comEC) [Bacillus sp. MD-5]
MRNSRLLLPLAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYHF
KAVIDTIPKIDGDRMSMMVKTPDKEKWAAAYRIQSADEKEKLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCGEPENFKYKVPSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFFIMIRVGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVRSATAICLSYITLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLAIVSLIAQLGSLPILLYHFHQFSIISIPMNMLLVPFYTFCILPGA
VAGVLLLSLSSSFGRLFFSWFDFLISWTNKLITKIADIDVFTLIIARPVPVLLFLFTVTIILLLMSIEKRSFSQLMITSG
ICCSVLFLLFASPRLSPEGEVDMIDIGQGDSMYVGAPHQRGHVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
INQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQAAKEEGVAIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMEMGGMSWILTGDLEKEGEQEVMKVFPNIKADVLKVGHHGSKGSTGEEFIKQLQPKTAIISAGKN
NRYHHPHQEVLQILQRHSIRVLRTDQNGTIQYRYQNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=234810 CDO84_RS11835 WP_040082574.1 2365067..2367397(-) (comEC) [Bacillus sp. MD-5]
ATGCGTAATTCGCGTTTGTTATTGCCTTTGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTATTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTCATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCATTTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGATGAAAAAGAAAAGCTGTTATACATAGAACCAGGAATGTCATGTGAATTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAATGAGTATCTTTATCGACAGCATATT
CATTGGAACTACTCTGTCACGTCTATCCAAAACTGCGGCGAACCTGAAAATTTCAAGTATAAGGTGCCCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTCTGCCTCCTGATTCGGCAGGAATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGACGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCGATTTCAGGGCTCCACGTT
GGGATTTTGACAGCGGGTTTGTTTTTTATCATGATCCGTGTCGGTATAACAAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTTCGCGCCGCTTTAATGTCAGGTGTTTACTTAG
CTGGAAGTCTCGTCAAATGGCGTGTCCGCTCGGCAACGGCAATCTGTCTTTCATATATCACCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCTGTCAGTTTTTCTTTAATTCTTTCCTCTTCTATTTTTCAGCA
GGTAAAAACCTCCTTGGGTCAGCTGGCAATTGTATCACTCATCGCCCAGCTCGGATCGCTTCCGATTCTCCTGTATCATT
TTCATCAGTTTTCAATCATCAGCATACCGATGAATATGCTGTTGGTTCCATTTTATACGTTCTGTATTTTGCCGGGGGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCTCTTCATTTGGCAGATTGTTTTTCAGTTGGTTTGATTTCTTGATAAGCTG
GACCAATAAGCTGATCACAAAAATCGCAGATATTGATGTGTTCACGCTTATCATCGCACGTCCCGTGCCTGTTCTTCTCT
TTTTATTCACAGTGACGATCATCCTATTGCTTATGTCGATTGAAAAACGCTCCTTTTCTCAGTTGATGATAACCAGCGGC
ATTTGCTGCTCGGTGCTGTTTCTTCTCTTTGCATCTCCGCGTCTTAGTCCCGAAGGAGAAGTGGATATGATTGATATTGG
GCAGGGCGACAGCATGTATGTAGGTGCTCCGCATCAGCGGGGACATGTCTTAATCGATACTGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAACATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAACCAGCTTGATGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACCTTGCTGAAGCACCA
CAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAAAAAGTGCTGCAGGCAGCTAAAGAAG
AGGGAGTGGCAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCGGAA
GCGCCTGATCCGGCAAGCAAAAATAATTCATCTCTCGTTCTATGGATGGAAATGGGCGGTATGAGCTGGATCTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAAGAGGTGATGAAGGTGTTTCCGAATATAAAAGCAGATGTGTTAAAGGTGGGGCATC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCAAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTATCACCATCCTCATCAAGAAGTTCTGCAGATATTACAGAGACATTCTATCCGTGTGCTGCGTACAGATCAAAA
CGGAACGATCCAATATAGATATCAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

94.201

100

0.942


Multiple sequence alignment