Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   C2H94_RS02180 Genome accession   NZ_CP026034
Coordinates   371027..373357 (-) Length   776 a.a.
NCBI ID   WP_167568605.1    Uniprot ID   -
Organism   Bacillus subtilis strain PK5_52     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 366027..378357
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2H94_RS02145 (C2H94_02135) yqxA 366365..366703 (-) 339 WP_021480060.1 YqxA family protein -
  C2H94_RS02150 (C2H94_02140) spoIIP 366720..367925 (-) 1206 WP_014477383.1 spore autolysin SpoIIP -
  C2H94_RS02155 (C2H94_02145) gpr 367988..369094 (-) 1107 WP_032726190.1 GPR endopeptidase -
  C2H94_RS02160 (C2H94_02150) rpsT 369298..369564 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  C2H94_RS02165 (C2H94_02155) holA 369579..370622 (-) 1044 WP_167568607.1 DNA polymerase III subunit delta -
  C2H94_RS02170 - 370662..370775 (-) 114 WP_122060479.1 hypothetical protein -
  C2H94_RS02175 (C2H94_02160) yqzM 370852..370986 (+) 135 WP_003229983.1 YqzM family protein -
  C2H94_RS02180 (C2H94_02165) comEC 371027..373357 (-) 2331 WP_167568605.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  C2H94_RS02185 (C2H94_02170) comEB 373361..373930 (-) 570 WP_003229978.1 ComE operon protein 2 -
  C2H94_RS02190 (C2H94_02175) comEA 373997..374614 (-) 618 WP_032726193.1 competence protein ComEA Machinery gene
  C2H94_RS02195 (C2H94_02180) comER 374698..375519 (+) 822 WP_032726195.1 late competence protein ComER -
  C2H94_RS02200 (C2H94_02185) yqeM 375585..376328 (-) 744 WP_032726197.1 class I SAM-dependent methyltransferase -
  C2H94_RS02205 (C2H94_02190) rsfS 376325..376681 (-) 357 WP_014477391.1 ribosome silencing factor -
  C2H94_RS02210 (C2H94_02195) yqeK 376699..377259 (-) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  C2H94_RS02215 (C2H94_02200) nadD 377249..377818 (-) 570 WP_021480066.1 nicotinate-nucleotide adenylyltransferase -
  C2H94_RS02220 (C2H94_02205) yhbY 377830..378120 (-) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86677.25 Da        Isoelectric Point: 6.9842

>NTDB_id=265438 C2H94_RS02180 WP_167568605.1 371027..373357(-) (comEC) [Bacillus subtilis strain PK5_52]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIMLIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYDEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIIIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMAGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSVSFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAEILLKHHKVKRLVIPKGFVSEPKDEKVLQAAREEGVAIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMNVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=265438 C2H94_RS02180 WP_167568605.1 371027..373357(-) (comEC) [Bacillus subtilis strain PK5_52]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATGTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTGAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATGACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATCCAAAACTGCAGTGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTCTGCCTCCTGATTCGGCAGGAATTGTACAGGCACTTACAGTCGGTGACAGAT
TTTATGTGGAGGACGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATAATCCGTCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGGCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGTTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGACGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAGCAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGATTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGGCAGCCAGGGAAG
AGGGAGTGGCAATTGAAGAGGTGAAGCGAGGCGATGTGTTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCGGAA
GCACCTGATCCGGCAAGCAAAAATAATTCATCTCTCGTTCTTTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAAGAGGTGATGAACGTGTTTCCGAATATAAAAGCAGATGTGTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGACCAAAA
CGGAACGATCCAATATAGATATAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

98.582

100

0.986


Multiple sequence alignment