Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   NSQ16_RS12825 Genome accession   NZ_CP150155
Coordinates   2489302..2491632 (-) Length   776 a.a.
NCBI ID   WP_339232215.1    Uniprot ID   -
Organism   Bacillus sp. PS108     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2484302..2496632
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NSQ16_RS12790 (NSQ16_12790) - 2484640..2484978 (-) 339 WP_014477382.1 YqxA family protein -
  NSQ16_RS12795 (NSQ16_12795) spoIIP 2484995..2486200 (-) 1206 WP_014477383.1 spore autolysin SpoIIP -
  NSQ16_RS12800 (NSQ16_12800) gpr 2486263..2487369 (-) 1107 WP_003229991.1 GPR endopeptidase -
  NSQ16_RS12805 (NSQ16_12805) rpsT 2487574..2487840 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  NSQ16_RS12810 (NSQ16_12810) holA 2487855..2488898 (-) 1044 WP_015714286.1 DNA polymerase III subunit delta -
  NSQ16_RS12815 (NSQ16_12815) - 2488938..2489087 (-) 150 WP_003229985.1 hypothetical protein -
  NSQ16_RS12820 (NSQ16_12820) - 2489128..2489262 (+) 135 WP_003229983.1 YqzM family protein -
  NSQ16_RS12825 (NSQ16_12825) comEC 2489302..2491632 (-) 2331 WP_339232215.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  NSQ16_RS12830 (NSQ16_12830) - 2491636..2492205 (-) 570 WP_003229978.1 ComE operon protein 2 -
  NSQ16_RS12835 (NSQ16_12835) comEA 2492272..2492889 (-) 618 WP_014480312.1 competence protein ComEA Machinery gene
  NSQ16_RS12840 (NSQ16_12840) comER 2492973..2493794 (+) 822 WP_014480313.1 late competence protein ComER -
  NSQ16_RS12845 (NSQ16_12845) - 2493860..2494603 (-) 744 WP_003229973.1 class I SAM-dependent methyltransferase -
  NSQ16_RS12850 (NSQ16_12850) rsfS 2494600..2494956 (-) 357 WP_041333759.1 ribosome silencing factor -
  NSQ16_RS12855 (NSQ16_12855) yqeK 2494974..2495534 (-) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  NSQ16_RS12860 (NSQ16_12860) - 2495524..2496093 (-) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  NSQ16_RS12865 (NSQ16_12865) yhbY 2496105..2496395 (-) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86651.15 Da        Isoelectric Point: 7.0687

>NTDB_id=963673 NSQ16_RS12825 WP_339232215.1 2489302..2491632(-) (comEC) [Bacillus sp. PS108]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFHQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSVSFGRLFFSWFDLLISWTNRLITNIADVEVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVLFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPG
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQEVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=963673 NSQ16_RS12825 WP_339232215.1 2489302..2491632(-) (comEC) [Bacillus sp. PS108]
ATGCGTAATTCGCGCTTATTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTCATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTGAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATTCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TCTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCCGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCATCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTATATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGTTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGCTG
GACCAATAGGCTAATCACAAACATTGCAGATGTTGAAGTGTTCACGATTATGATCGCACATCCTGCACCTGTTTTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGAGGC
ATTTGCTGCACGGTGCTGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGGA
GCACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGGTATGAGCTGGATCTTGACGGG
TGACCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAGAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

98.325

100

0.983