Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEC
Type: Machinery gene
Locus tag: C2H91_RS18050
Genome accession: NZ_CP026030
Coordinates: 3604330..3606660 (+)
Length: 776 a.a.
NCBI ID: WP_163136229.1
UniProt ID: -
Organism: Bacillus subtilis strain PK3_9
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
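
The coordinates, nucleotide length, and protein length reported for this entry can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(3604330, 3606660)
print(cds_bp)                  # 2331
print(protein_length(cds_bp))  # 776
```

Both values agree with the lengths listed for the nucleotide and protein sequences of this entry.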

Genomic Context


Location: 3599330..3611660
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2H91_RS18010 (C2H91_18030) yhbY 3599567..3599857 (+) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
  C2H91_RS18015 (C2H91_18035) nadD 3599869..3600438 (+) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  C2H91_RS18020 (C2H91_18040) yqeK 3600428..3600988 (+) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  C2H91_RS18025 (C2H91_18045) rsfS 3601006..3601362 (+) 357 WP_014477391.1 ribosome silencing factor -
  C2H91_RS18030 (C2H91_18050) yqeM 3601359..3602102 (+) 744 WP_124072952.1 class I SAM-dependent methyltransferase -
  C2H91_RS18035 (C2H91_18055) comER 3602168..3602989 (-) 822 WP_014480313.1 late competence protein ComER -
  C2H91_RS18040 (C2H91_18060) comEA 3603073..3603690 (+) 618 WP_032726193.1 competence protein ComEA Machinery gene
  C2H91_RS18045 (C2H91_18065) comEB 3603757..3604326 (+) 570 WP_003229978.1 ComE operon protein 2 -
  C2H91_RS18050 (C2H91_18070) comEC 3604330..3606660 (+) 2331 WP_163136229.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  C2H91_RS18055 (C2H91_18075) yqzM 3606701..3606835 (-) 135 WP_003229983.1 YqzM family protein -
  C2H91_RS18060 - 3606912..3607025 (+) 114 WP_134975484.1 hypothetical protein -
  C2H91_RS18065 (C2H91_18080) holA 3607065..3608108 (+) 1044 WP_021480061.1 DNA polymerase III subunit delta -
  C2H91_RS18070 (C2H91_18085) rpsT 3608123..3608389 (-) 267 WP_003229989.1 30S ribosomal protein S20 -
  C2H91_RS18075 (C2H91_18090) gpr 3608593..3609699 (+) 1107 WP_014480309.1 GPR endopeptidase -
  C2H91_RS18080 (C2H91_18095) spoIIP 3609762..3610967 (+) 1206 WP_014477383.1 spore autolysin SpoIIP -
  C2H91_RS18085 (C2H91_18100) yqxA 3610984..3611322 (+) 339 WP_032726189.1 YqxA family protein -
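
Each row of the table above carries coordinates, strand, and size in a fixed order, so a small parser can check that the listed size matches the coordinates. The sketch below is one possible approach, not the database's own tooling: the regular expression and the 5,000 bp flank are assumptions inferred from the numbers shown here (the Location window equals the comEC coordinates padded by 5,000 bp on each side).

```python
import re

# Matches "start..end (strand) size" as it appears in each table row.
ROW = re.compile(r"(\d+)\.\.(\d+) \(([+-])\) (\d+)")

rows = [
    "C2H91_RS18035 (C2H91_18055) comER 3602168..3602989 (-) 822 WP_014480313.1 late competence protein ComER -",
    "C2H91_RS18045 (C2H91_18065) comEB 3603757..3604326 (+) 570 WP_003229978.1 ComE operon protein 2 -",
    "C2H91_RS18050 (C2H91_18070) comEC 3604330..3606660 (+) 2331 WP_163136229.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene",
]


def parse_row(line: str) -> dict:
    """Extract start, end, strand, and size (bp) from one table row."""
    m = ROW.search(line)
    if m is None:
        raise ValueError(f"no coordinates found in: {line!r}")
    return {
        "start": int(m.group(1)),
        "end": int(m.group(2)),
        "strand": m.group(3),
        "size": int(m.group(4)),
    }


parsed = [parse_row(r) for r in rows]

# The listed size should equal the inclusive coordinate span.
sizes_ok = all(p["end"] - p["start"] + 1 == p["size"] for p in parsed)
print(sizes_ok)  # True

FLANK = 5000  # assumed padding, inferred from the Location line
target = parsed[-1]  # comEC
window = (target["start"] - FLANK, target["end"] + FLANK)
print(window)  # (3599330, 3611660)
```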

Sequence


Protein


Length: 776 a.a.        Molecular weight: 86610.16 Da        Isoelectric Point: 7.2478

>NTDB_id=265302 C2H91_RS18050 WP_163136229.1 3604330..3606660(+) (comEC) [Bacillus subtilis strain PK3_9]
MRNSRLLLPMAAASATVGITAAAYFPAIFLFILLLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVKTPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLVSWTNRLITNIADVDVFTIMIAHPAPALLFLFTVTIILLLMAIEKRSLSQSMVTGG
ICCAVLFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKMLIPFLTAKG
IKQLDALILTHADQDHIGEAEILLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMNVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQILQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN
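
The FASTA header of this record packs several fields into one line. A minimal sketch of how it could be split into named fields follows; the pattern is written against the single header shown here, so treating it as the general layout for other records is an assumption.

```python
import re

# Field layout assumed from the one header shown in this entry.
HEADER = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a FASTA header of the form used above into named fields."""
    m = HEADER.match(line.strip())
    if m is None:
        raise ValueError("header does not match the expected layout")
    fields = m.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=265302 C2H91_RS18050 WP_163136229.1 "
    "3604330..3606660(+) (comEC) [Bacillus subtilis strain PK3_9]"
)
info = parse_header(header)
print(info["gene"], info["start"], info["end"], info["strand"])
# comEC 3604330 3606660 +
```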

Nucleotide


Length: 2331 bp

>NTDB_id=265302 C2H91_RS18050 WP_163136229.1 3604330..3606660(+) (comEC) [Bacillus subtilis strain PK3_9]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGTTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCCTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAGCCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATCCAAAACTGCAGTGAACCTGAAAATTTCAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCGATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATCCGTCTCGGTATAACAAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGTCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGCCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCAGCA
GGTTAAAACCTCCTTGGGACAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGGTAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGATGTGTTCACGATTATGATCGCACATCCTGCACCTGCTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTCGATGGTAACCGGAGGC
ATTTGCTGCGCGGTGCTGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGATGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGATTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATACCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGAA
GCACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAAGAGGTGATGAACGTGTTTCCGAATATAAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAAATATTACAAAGGCATTCTATCCGTGTGCTGCGAACAGACCAAAA
CGGAACGATCCAATATAGATATAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA
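
The nucleotide record above should translate to the protein record listed earlier. The sketch below checks this for the first 30 and last 21 nucleotides using the standard codon assignments (which the bacterial translation table shares for elongation); the helper name `translate` is illustrative, and only the two excerpts are embedded to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGCGTAATTCGCGTTTGTTATTGCCTATG"  # start of the nucleotide record
last_21 = "GATATAACAGAGACGAACTAA"            # end of the record, in frame

print(translate(first_30))  # MRNSRLLLPM
print(translate(last_21))   # DITETN
```

The two outputs match the first ten and last six residues of the protein sequence listed for this entry.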


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168 97.938 100 0.979
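
The page does not define the Ha-value. The three numbers in the row are consistent with the identity fraction multiplied by the coverage fraction, and the identity figure is consistent with 760 identical positions over an alignment of 776 residues. Both readings are assumptions made for this sketch, not documented definitions.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction times coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


print(round(ha_value(97.938, 100), 3))  # 0.979

# Assumed alignment length of 776 (the full protein) with 760 identities.
print(round(760 / 776 * 100, 3))  # 97.938
```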

