Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEC
Type: Machinery gene
Locus tag: C2H95_RS16025
Genome accession: NZ_CP026037
Coordinates: 3117608..3119938 (-)
Length: 776 a.a.
NCBI ID: WP_167568605.1
Uniprot ID: -
Organism: Bacillus subtilis strain PK5_17
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
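The coordinates and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that includes its stop codon (the nucleotide sequence in this record ends in TAA); the function names are illustrative only and do not belong to any database API.

```python
def cds_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
cds_bp = cds_length_bp(3117608, 3119938)
aa = protein_length_aa(cds_bp)
print(cds_bp, aa)
```

With the values in this record, the result should agree with the listed 2331 bp and 776 a.a.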

Genomic Context


Location: 3112608..3124938
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2H95_RS15990 (C2H95_15930) yqxA 3112946..3113284 (-) 339 WP_021480060.1 YqxA family protein -
  C2H95_RS15995 (C2H95_15935) spoIIP 3113301..3114506 (-) 1206 WP_014477383.1 spore autolysin SpoIIP -
  C2H95_RS16000 (C2H95_15940) gpr 3114569..3115675 (-) 1107 WP_032726190.1 GPR endopeptidase -
  C2H95_RS16005 (C2H95_15945) rpsT 3115879..3116145 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  C2H95_RS16010 (C2H95_15950) holA 3116160..3117203 (-) 1044 WP_167568607.1 DNA polymerase III subunit delta -
  C2H95_RS16015 - 3117243..3117356 (-) 114 WP_122060479.1 hypothetical protein -
  C2H95_RS16020 (C2H95_15955) yqzM 3117433..3117567 (+) 135 WP_003229983.1 YqzM family protein -
  C2H95_RS16025 (C2H95_15960) comEC 3117608..3119938 (-) 2331 WP_167568605.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  C2H95_RS16030 (C2H95_15965) comEB 3119942..3120511 (-) 570 WP_003229978.1 ComE operon protein 2 -
  C2H95_RS16035 (C2H95_15970) comEA 3120578..3121195 (-) 618 WP_032726193.1 competence protein ComEA Machinery gene
  C2H95_RS16040 (C2H95_15975) comER 3121279..3122100 (+) 822 WP_032726195.1 late competence protein ComER -
  C2H95_RS16045 (C2H95_15980) yqeM 3122166..3122909 (-) 744 WP_032726197.1 class I SAM-dependent methyltransferase -
  C2H95_RS16050 (C2H95_15985) rsfS 3122906..3123262 (-) 357 WP_014477391.1 ribosome silencing factor -
  C2H95_RS16055 (C2H95_15990) yqeK 3123280..3123840 (-) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  C2H95_RS16060 (C2H95_15995) nadD 3123830..3124399 (-) 570 WP_021480066.1 nicotinate-nucleotide adenylyltransferase -
  C2H95_RS16065 (C2H95_16000) yhbY 3124411..3124701 (-) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
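The Location range above matches the comEC coordinates extended by 5000 bp on each side. That flank size is inferred from the numbers and is not stated in the record, so the sketch below treats it as an assumption; the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the record, not documented


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a gene's 1-based coordinates by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def inside(feature: tuple[int, int], window: tuple[int, int]) -> bool:
    """True if the feature lies entirely within the window."""
    return window[0] <= feature[0] and feature[1] <= window[1]


window = context_window(3117608, 3119938)  # comEC coordinates
print(window)

# First and last genes listed in the table (yqxA and yhbY).
print(inside((3112946, 3113284), window), inside((3124411, 3124701), window))
```

Under this assumption, every gene listed in the table falls entirely within the window.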

Sequence


Protein


Length: 776 a.a.        Molecular weight: 86677.25 Da        Isoelectric Point: 6.9842

>NTDB_id=265723 C2H95_RS16025 WP_167568605.1 3117608..3119938(-) (comEC) [Bacillus subtilis strain PK5_17]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIMLIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYDEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIIIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMAGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSVSFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAEILLKHHKVKRLVIPKGFVSEPKDEKVLQAAREEGVAIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMNVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Length: 2331 bp

>NTDB_id=265723 C2H95_RS16025 WP_167568605.1 3117608..3119938(-) (comEC) [Bacillus subtilis strain PK5_17]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATGTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTGAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATGACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATCCAAAACTGCAGTGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTCTGCCTCCTGATTCGGCAGGAATTGTACAGGCACTTACAGTCGGTGACAGAT
TTTATGTGGAGGACGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATAATCCGTCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGGCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGTTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGACGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAGCAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGATTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGGCAGCCAGGGAAG
AGGGAGTGGCAATTGAAGAGGTGAAGCGAGGCGATGTGTTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCGGAA
GCACCTGATCCGGCAAGCAAAAATAATTCATCTCTCGTTCTTTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAAGAGGTGATGAACGTGTTTCCGAATATAAAAGCAGATGTGTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGACCAAAA
CGGAACGATCCAATATAGATATAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168 98.582 100 0.986
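The record does not define the Ha-value. One reading that reproduces the listed number is the product of identity and coverage, both expressed as fractions. The sketch below encodes that assumption only; it is not a confirmed definition of the database's metric, and the function name is illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the comEC hit against B. subtilis subsp. subtilis str. 168.
ha = round(ha_value(98.582, 100.0), 3)
print(ha)
```

Under this assumption, the rounded result matches the 0.986 shown in the table.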


Multiple sequence alignment