Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC
Type: Machinery gene
Locus tag: EQH88_RS13235
Genome accession: NZ_CP035161
Coordinates: 2536641..2538971 (-)
Length: 776 a.a.
NCBI ID: WP_080477815.1
UniProt ID: -
Organism: Bacillus subtilis strain SRCM103862
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
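
The coordinates and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


gene_bp = span_length(2536641, 2538971)
print(gene_bp, protein_length(gene_bp))  # 2331 776
```

Both values agree with the lengths listed for the nucleotide and protein sequences in this record.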

Genomic Context


Location: 2531641..2543971
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EQH88_RS13200 (EQH88_13200) yqxA 2531980..2532318 (-) 339 WP_004399162.1 YqxA family protein -
  EQH88_RS13205 (EQH88_13205) spoIIP 2532335..2533540 (-) 1206 WP_014477383.1 spore autolysin SpoIIP -
  EQH88_RS13210 (EQH88_13210) gpr 2533603..2534709 (-) 1107 WP_014480309.1 GPR endopeptidase -
  EQH88_RS13215 (EQH88_13215) rpsT 2534913..2535179 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  EQH88_RS13220 (EQH88_13220) holA 2535194..2536237 (-) 1044 WP_015714286.1 DNA polymerase III subunit delta -
  EQH88_RS13225 (EQH88_13225) - 2536277..2536426 (-) 150 WP_003229985.1 hypothetical protein -
  EQH88_RS13230 (EQH88_13230) yqzM 2536467..2536601 (+) 135 WP_003229983.1 YqzM family protein -
  EQH88_RS13235 (EQH88_13235) comEC 2536641..2538971 (-) 2331 WP_080477815.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  EQH88_RS13240 (EQH88_13240) comEB 2538968..2539537 (-) 570 WP_015714288.1 ComE operon protein 2 -
  EQH88_RS13245 (EQH88_13245) comEA 2539604..2540221 (-) 618 WP_004398514.1 competence protein ComEA Machinery gene
  EQH88_RS13250 (EQH88_13250) comER 2540305..2541126 (+) 822 WP_004398597.1 late competence protein ComER -
  EQH88_RS13255 (EQH88_13255) yqeM 2541192..2541935 (-) 744 WP_003229973.1 class I SAM-dependent methyltransferase -
  EQH88_RS13260 (EQH88_13260) rsfS 2541932..2542288 (-) 357 WP_003229971.1 ribosome silencing factor -
  EQH88_RS13265 (EQH88_13265) yqeK 2542306..2542866 (-) 561 WP_004399059.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  EQH88_RS13270 (EQH88_13270) nadD 2542856..2543425 (-) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  EQH88_RS13275 (EQH88_13275) yhbY 2543437..2543727 (-) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -
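
The listed location (2531641..2543971) matches the comEC coordinates extended by 5000 bp on each side. That flank size is inferred from the numbers and is not stated in the record. A minimal sketch of the window calculation, plus a check of how far comEC and the adjacent comEB overlap, using hypothetical helper names:

```python
FLANK = 5000  # assumption: inferred from the coordinates, not documented in the record


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend an inclusive range by `flank` bp on each side (not below position 1)."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of bases shared by two inclusive ranges (0 if they are disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


comEC = (2536641, 2538971)
comEB = (2538968, 2539537)

print(context_window(*comEC))    # (2531641, 2543971)
print(overlap_bp(comEC, comEB))  # 4
```

The 4 bp overlap means the annotated comEB and comEC coding regions share a few bases at their boundary on the minus strand.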

Sequence


Protein


Length: 776 a.a.; Molecular weight: 86623.22 Da; Isoelectric point: 7.1612

>NTDB_id=334970 EQH88_RS13235 WP_080477815.1 2536641..2538971(-) (comEC) [Bacillus subtilis strain SRCM103862]
MMNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMVVKTPDKEKWAAAYRILSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHTISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITNIADVDVFTIMIAHPAPVLLFLFTVAIILLLMAIEKRSLSQLMITGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWHEKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPE
TPDPASKNNSSLVLWMETGVLSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQKVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Length: 2331 bp

>NTDB_id=334970 EQH88_RS13235 WP_080477815.1 2536641..2538971(-) (comEC) [Bacillus subtilis strain SRCM103862]
GTGATGAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGGTGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCTGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGAGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATTCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATACCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGACGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTTTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGCCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCAGCA
GGTTAAAACCTCCTTGGGACAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTTCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGATGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTTGCGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGATAACCGGCGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCACGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGAA
ACACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGTTCTGAGCTGGATTTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAAAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA
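
The nucleotide record starts with GTG, while the protein starts with M. This is consistent with an alternative bacterial start codon being decoded as methionine at the initiation position. The sketch below checks that on the first and last codons of the sequence above. It uses the standard codon table and treats ATG, GTG, and TTG as start codons, which is an assumption of this example rather than something stated in the record; `translate_cds` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial start codons


def translate_cds(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # A start codon at the first position is read as Met, whatever it encodes elsewhere.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


print(translate_cds("GTGATGAATTCGCGTTTG"))  # MMNSRL
print(translate_cds("GAGACGAACTAA"))        # ETN
```

The two outputs match the beginning (MMNSRL) and the end (ETN) of the protein sequence listed above.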


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168 97.68 100 0.977


Multiple sequence alignment