Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEC
Type: Machinery gene
Locus tag: DIC78_RS18625
Genome accession: NZ_CP029364
Coordinates: 3618036..3620366 (+)
Length: 776 a.a.
NCBI ID: WP_127696944.1
UniProt ID: -
Organism: Bacillus halotolerans strain ZB201702
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
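
The coordinates and the protein length in this overview agree with the 2331 bp nucleotide length given in the Sequence section, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names (`parse_span`, `gene_length_bp`, `protein_length_aa`) are hypothetical and not part of the database, and it assumes 1-based inclusive coordinates and a single terminal stop codon.

```python
def parse_span(coords: str) -> tuple[int, int]:
    """Split a 'start..end' coordinate string into two integers."""
    start, end = coords.split("..")
    return int(start), int(end)


def gene_length_bp(coords: str) -> int:
    """Length of a 1-based, inclusive span (assumed convention)."""
    start, end = parse_span(coords)
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length_bp("3618036..3620366")
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under those assumptions the span gives 2331 bp and 776 a.a., matching the values in this record.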

Genomic Context


Location: 3613036..3625366
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DIC78_RS18585 (DIC78_18615) yhbY 3613275..3613565 (+) 291 WP_024122118.1 ribosome assembly RNA-binding protein YhbY -
  DIC78_RS18590 (DIC78_18620) - 3613576..3614145 (+) 570 WP_010335014.1 nicotinate-nucleotide adenylyltransferase -
  DIC78_RS18595 (DIC78_18625) yqeK 3614135..3614695 (+) 561 WP_024122116.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  DIC78_RS18600 (DIC78_18630) rsfS 3614713..3615069 (+) 357 WP_044154484.1 ribosome silencing factor -
  DIC78_RS18605 (DIC78_18635) - 3615066..3615809 (+) 744 WP_106020124.1 class I SAM-dependent DNA methyltransferase -
  DIC78_RS18610 (DIC78_18640) comER 3615874..3616695 (-) 822 WP_024122113.1 late competence protein ComER -
  DIC78_RS18615 (DIC78_18645) comEA 3616779..3617396 (+) 618 WP_127696752.1 helix-hairpin-helix domain-containing protein Machinery gene
  DIC78_RS18620 (DIC78_18650) - 3617463..3618032 (+) 570 WP_010335007.1 ComE operon protein 2 -
  DIC78_RS18625 (DIC78_18655) comEC 3618036..3620366 (+) 2331 WP_127696944.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  DIC78_RS18630 (DIC78_18660) - 3620405..3620539 (-) 135 WP_010335005.1 YqzM family protein -
  DIC78_RS18635 - 3620580..3620729 (+) 150 WP_003229985.1 hypothetical protein -
  DIC78_RS18640 (DIC78_18665) holA 3620769..3621812 (+) 1044 WP_103672292.1 DNA polymerase III subunit delta -
  DIC78_RS18645 (DIC78_18670) rpsT 3621827..3622093 (-) 267 WP_010335003.1 30S ribosomal protein S20 -
  DIC78_RS18650 (DIC78_18675) gpr 3622297..3623403 (+) 1107 WP_044154490.1 GPR endopeptidase -
  DIC78_RS18655 (DIC78_18680) spoIIP 3623466..3624671 (+) 1206 WP_095713315.1 spore autolysin SpoIIP -
  DIC78_RS18660 (DIC78_18685) - 3624688..3625026 (+) 339 WP_024122107.1 YqxA family protein -
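
The Location span above equals the comEC coordinates extended by 5000 bp on each side, and each Size (bp) value follows from its row's coordinates. The following is a minimal sketch of both checks; the 5000 bp flank is inferred from the numbers in this record rather than documented here, and the names `FLANK`, `context_window` and `genes` are hypothetical. Only four rows are copied from the table, for brevity.

```python
FLANK = 5000  # assumption: inferred from the Location span, not a documented setting


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a 1-based span by `flank` bp on each side, clamped at position 1."""
    return max(1, start - flank), end + flank


# (gene name, start, end, strand) copied from four rows of the table above
genes = [
    ("comER", 3615874, 3616695, "-"),
    ("comEA", 3616779, 3617396, "+"),
    ("comEC", 3618036, 3620366, "+"),
    ("holA", 3620769, 3621812, "+"),
]

window = context_window(3618036, 3620366)

# Size in bp of each gene, using inclusive coordinates
sizes = {name: end - start + 1 for name, start, end, _ in genes}

# Genes that lie entirely inside the context window
inside = [name for name, start, end, _ in genes
          if window[0] <= start and end <= window[1]]

print(window, sizes, inside)
```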

Sequence


Protein


Length: 776 a.a.; Molecular weight: 86213.48 Da; Isoelectric point: 7.8416

>NTDB_id=292001 DIC78_RS18625 WP_127696944.1 3618036..3620366(+) (comEC) [Bacillus halotolerans strain ZB201702]
MRNSRLLLPLAAASATAGIAAASYFSAVFLFFLFLLIVSIKTRHAPLIFVCLFSFVLNVALFKITDSQNTSSYQQGSYHF
KAVIHSIPKIDGDRMSMLIKTPDGEKWAASYRIQSLAEKDKLSHLEPGMACELTGTLEKPKQATVPGTFDYQQYLYRQHI
HWSYSVTSIGNCSEPSDVRYKLLSLRKYIVSFTNTQLPPDSAGIVQALTVGERSYLDDDVLNAYQSLGVVHLLAISGLHI
GILTAGLFYAMIRLGITRENASILLLLFLPIYVILTGAAPSVLRAALMSGIYLAGSLFQQRVNSAGVICLSYIVLLLFNP
YYLFEAGFQLSFAVSFSLILSLSIFQHIKTSLGQLTIVSVIAQLGSLPILLYHFQQFSIISIPMNMVLVPFYTFCILPAA
IIGVILLFFSASVGQFLFYWFDVMMTWINRLITKIADIEIFTIIISRAAPALLLLFTLTVIILLMSIEKPSFPRLTVSTG
LFCATLILLFVSPYVSSEGEVDMIDIGQGDSMFVSAPYQKGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAEVLLKHHKVKRLIIPKGFVSEPHDEKVLRTARQEGVAIEEVKRGDVLQIKDLQFHVLSPE
TPDPASKNNSSLVLWMKTGRISWLLTGDLEKEGEQEVIDVFPNIKADVLKVGHHGSKGSTGEEFIKQLQPKTAVISAGEN
NRYHHPHQEVLQLLKSRSIRVLRTDQSGTIQYTYKSGSGTFSVYPPYDTSDMTETN

Nucleotide


Length: 2331 bp

>NTDB_id=292001 DIC78_RS18625 WP_127696944.1 3618036..3620366(+) (comEC) [Bacillus halotolerans strain ZB201702]
ATGCGGAATTCGCGTCTATTATTGCCTTTGGCGGCAGCTTCAGCAACGGCTGGAATTGCTGCCGCCTCTTATTTCTCCGC
TGTCTTTCTCTTCTTCCTCTTTCTTCTTATTGTGTCAATCAAAACAAGGCACGCTCCACTCATTTTTGTTTGTTTATTCT
CTTTTGTACTCAATGTTGCCCTGTTTAAGATCACAGATTCTCAAAATACTTCTTCGTATCAGCAGGGCAGCTATCACTTC
AAGGCGGTTATTCATAGTATTCCTAAAATTGACGGGGATCGGATGTCTATGCTGATCAAGACGCCCGATGGCGAAAAATG
GGCTGCCTCTTATCGGATTCAGTCTTTAGCTGAAAAAGACAAACTGTCACATCTAGAGCCGGGTATGGCATGCGAATTGA
CTGGTACTTTGGAGAAGCCAAAACAAGCAACAGTGCCGGGAACATTTGATTATCAACAGTATCTTTACCGGCAGCATATT
CATTGGAGCTATTCTGTCACATCTATTGGAAATTGCAGCGAGCCTTCAGATGTCCGGTATAAGCTTCTCAGTTTGAGAAA
ATACATCGTATCATTTACGAACACGCAGCTTCCGCCAGATTCAGCAGGAATCGTACAGGCGCTTACAGTAGGTGAAAGAT
CTTATTTAGATGACGATGTGCTTAATGCTTACCAAAGTCTAGGTGTTGTCCATCTGTTAGCCATCTCAGGACTCCATATC
GGGATTTTGACAGCAGGCCTATTTTACGCAATGATCCGTCTGGGCATAACAAGAGAAAATGCGTCAATCCTATTGCTTTT
GTTTCTGCCGATCTATGTTATTTTGACAGGTGCAGCGCCTTCTGTTCTCCGAGCCGCTCTGATGTCCGGCATCTACTTAG
CGGGAAGCCTTTTTCAGCAGCGGGTGAATTCTGCCGGGGTAATCTGTCTTTCGTATATTGTCCTCTTGCTTTTTAATCCC
TACTACCTTTTTGAAGCTGGATTTCAGCTTTCATTTGCGGTCAGTTTTTCTTTAATTCTCTCATTGTCTATTTTTCAGCA
TATCAAAACGAGTCTGGGACAGCTGACGATTGTATCTGTTATTGCCCAGCTAGGCTCACTACCCATTCTTCTGTATCATT
TTCAGCAGTTTTCTATCATTAGCATTCCAATGAATATGGTGTTGGTGCCTTTTTATACATTCTGTATTTTACCGGCTGCT
ATTATAGGAGTTATTCTATTATTTTTTTCAGCGTCCGTCGGTCAATTTCTGTTCTATTGGTTTGATGTAATGATGACCTG
GATCAACAGACTGATCACTAAAATCGCTGATATAGAAATATTCACAATCATCATCTCTCGCGCTGCACCTGCTCTTCTTC
TCTTATTCACGCTCACTGTCATCATATTGCTTATGAGTATTGAGAAACCCTCATTTCCCCGGCTGACAGTGTCCACAGGC
CTTTTTTGTGCAACGCTTATCCTGCTCTTTGTTTCCCCTTATGTTAGTTCCGAGGGAGAAGTAGATATGATTGATATCGG
ACAAGGTGACAGTATGTTTGTCAGTGCACCGTATCAGAAAGGCCGTGTTTTAATTGATACGGGAGGCACTTTGTCTTATT
CGTCAGAGCCTTGGCGGGAAAAACAGCATCCTTTTTCACTGGGGGAAAAGGTGCTGATCCCTTTTTTAACGGCAAAGGGC
ATCAAGCAGCTTGATGCGCTGATACTGACGCACGCAGATCAAGACCATATCGGAGAGGCAGAGGTTTTGCTGAAGCATCA
TAAAGTGAAGCGTCTCATCATTCCGAAAGGGTTTGTTTCTGAACCTCATGATGAGAAAGTGCTGCGGACTGCCAGACAGG
AGGGTGTGGCAATTGAAGAGGTGAAGCGGGGTGATGTATTGCAAATAAAGGACTTACAGTTTCATGTGCTGTCACCGGAA
ACGCCTGATCCGGCAAGTAAAAATAATTCTTCTCTCGTTCTATGGATGAAAACAGGCAGGATCAGCTGGCTCCTGACGGG
TGATCTGGAAAAAGAGGGGGAACAGGAGGTTATCGATGTGTTCCCAAATATAAAAGCGGATGTGTTAAAGGTGGGGCATC
ATGGAAGCAAGGGCTCGACCGGTGAAGAATTCATCAAACAGCTTCAGCCCAAAACGGCGGTCATCTCAGCGGGTGAAAAC
AATAGGTACCATCATCCGCATCAAGAAGTTCTTCAATTGTTAAAGAGCCGATCCATTCGTGTGCTGCGCACTGATCAAAG
CGGCACGATCCAATACACGTACAAAAGCGGATCTGGAACCTTTTCTGTTTATCCTCCATATGATACATCAGATATGACAG
AGACGAATTAA
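
The nucleotide record above encodes the protein record, and this can be spot-checked by translating its first and last codons. The sketch below builds the standard genetic code by hand; the bacterial code (NCBI translation table 11) assigns the same amino acids to codons and differs only in permitted start codons. The names `translate`, `HEAD` and `TAIL` are hypothetical. `HEAD` is the first 30 nt and `TAIL` the last 21 nt of the sequence above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through T, C, A, G)
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


HEAD = "ATGCGGAATTCGCGTCTATTATTGCCTTTG"  # first 30 nt of the CDS
TAIL = "GATATGACAGAGACGAATTAA"           # last 21 nt, including the stop codon

print(translate(HEAD))
print(translate(TAIL))
```

The two translations can be compared against the start (MRNSRLLLPL) and end (DMTETN) of the protein sequence listed earlier.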


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168 79.639 100 0.796
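
This record does not define the Ha-value. The figures in the row above are numerically consistent with identity multiplied by coverage, both taken as fractions, and the sketch below reproduces the listed value under that assumption. The formula is an inference from this single row, not the database's stated definition, and the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the comEC / Bacillus subtilis 168 row above
value = ha_value(79.639, 100)
print(f"{value:.3f}")
```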


Multiple sequence alignment