Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   NSQ83_RS12955 Genome accession   NZ_CP150157
Coordinates   2516287..2518617 (-) Length   776 a.a.
NCBI ID   WP_198878618.1    Uniprot ID   -
Organism   Bacillus sp. PS217     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2511287..2523617
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NSQ83_RS12920 (NSQ83_12920) - 2511626..2511964 (-) 339 WP_014477382.1 YqxA family protein -
  NSQ83_RS12925 (NSQ83_12925) spoIIP 2511981..2513186 (-) 1206 WP_014477383.1 spore autolysin SpoIIP -
  NSQ83_RS12930 (NSQ83_12930) gpr 2513249..2514355 (-) 1107 WP_003229991.1 GPR endopeptidase -
  NSQ83_RS12935 (NSQ83_12935) rpsT 2514559..2514825 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  NSQ83_RS12940 (NSQ83_12940) holA 2514840..2515883 (-) 1044 WP_003229987.1 DNA polymerase III subunit delta -
  NSQ83_RS12945 (NSQ83_12945) - 2515923..2516072 (-) 150 WP_003229985.1 hypothetical protein -
  NSQ83_RS12950 (NSQ83_12950) - 2516113..2516247 (+) 135 WP_003229983.1 YqzM family protein -
  NSQ83_RS12955 (NSQ83_12955) comEC 2516287..2518617 (-) 2331 WP_198878618.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  NSQ83_RS12960 (NSQ83_12960) - 2518621..2519190 (-) 570 WP_003229978.1 ComE operon protein 2 -
  NSQ83_RS12965 (NSQ83_12965) comEA 2519257..2519874 (-) 618 WP_198878617.1 competence protein ComEA Machinery gene
  NSQ83_RS12970 (NSQ83_12970) comER 2519958..2520779 (+) 822 WP_014480313.1 late competence protein ComER -
  NSQ83_RS12975 (NSQ83_12975) - 2520845..2521588 (-) 744 WP_198878616.1 class I SAM-dependent methyltransferase -
  NSQ83_RS12980 (NSQ83_12980) rsfS 2521585..2521941 (-) 357 WP_014480315.1 ribosome silencing factor -
  NSQ83_RS12985 (NSQ83_12985) yqeK 2521959..2522519 (-) 561 WP_014480316.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  NSQ83_RS12990 (NSQ83_12990) - 2522509..2523078 (-) 570 WP_004398676.1 nicotinate-nucleotide adenylyltransferase -
  NSQ83_RS12995 (NSQ83_12995) yhbY 2523090..2523380 (-) 291 WP_003226133.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86756.42 Da        Isoelectric Point: 7.4973

>NTDB_id=963837 NSQ83_RS12955 WP_198878618.1 2516287..2518617(-) (comEC) [Bacillus sp. PS217]
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVKTPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSAGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWTNRLITKIADVDVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLKHHKVKRLVIPKGFVSEPKDEKVLQTAREEGVTIEEVKRGDVLQIKDLQFHVLSPE
TPDPASKNNSSLVLWMETGVLSWILTGDLEKEGEQEVMDVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQKVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=963837 NSQ83_RS12955 WP_198878618.1 2516287..2518617(-) (comEC) [Bacillus sp. PS217]
ATGCGTAATTCGCGTTTGTTATTGCCTATGGCGGCAGCTTCGGCAACGGCTGGAATTACTGCCGCCGCTTATTTCCCCGC
TATTTTTCTTTTCATCCTCTTTCTCCTCATCATTTTAATCAAAACGAGGCACGCTTTTCTTATTATTGTTTGTTTCTTCT
CTTTTATATTGTTTTTTGTACTGTATGCAGTCACAGATTCTCAGAATGTCTCTTCCTATCGGCAGGGAACCTATCAATTC
AAGGCAGTGATTGACACTATTCCCAAAATTGACGGCGACCGTATGTCTATGATGGTTAAGACACCTGATAAGGAAAAATG
GGCTGCTGCGTATCGCATTCAGTCTGCTGGTGAAAAAGAACAGCTGTTATACATAGAACCAGGAATGTCATGTGAGTTGA
CTGGTACATTGGAAGAACCGAATCACGCAACTGTGCCGGGTGCATTTGATTATAACGAGTATCTTTATCGGCAGCATATT
CATTGGAACTACTCTGTCACGTCTATCCAAAACTGCAGCGAACCTGAAAATTTTAAGTACAAGGTGCTCAGCTTGAGAAA
ACATATCATATCATTCACAAACAGCCTTTTGCCTCCTGATTCGGCAGGGATTGTACAGGCACTTACAGTTGGTGACAGAT
TTTATGTGGAGGATGAAGTGCTTACCGCGTATCAAAAGCTTGGTGTTGTCCATCTCTTGGCAATATCAGGACTCCACGTG
GGGATTTTGACAGCAGGTTTGTTTTATATCATGATTCGCCTTGGTATAACTAGAGAAAAGGCGTCAATTCTGTTGCTGTT
ATTTCTGCCGCTCTATGTGATGTTGACCGGCGCTGCTCCTTCAGTGCTACGCGCCGCTCTCATGTCGGGTGTTTACTTAG
CTGGAAGCCTTGTCAAATGGCGTGTCCACTCTGCAACTGCAATTTGTCTTTCATACATCGTCCTTCTGCTCTTCAATCCT
TATCATCTCTTTGAAGCTGGTTTTCAGCTATCGTTCGCCGTCAGTTTTTCTTTAATTCTATCCTCTTCTATTTTTCAGCA
GGTTAAAACCTCCTTGGGGCAGCTGACAATTGTATCACTCATCGCTCAGCTGGGCTCGCTTCCGATTCTCCTGTATCATT
TTCATCAGTTTTCTATCATCAGCGTACCGATGAATATGTTGATGGTACCATTTTATACCTTCTGTATTTTGCCGGGAGCT
GTAGCAGGTGTTCTTCTATTAAGTCTTTCCGCTTCGTTTGGGAGATTGTTTTTCAGCTGGTTTGATTTATTGATAAGTTG
GACCAATAGGTTAATCACAAAAATTGCAGATGTTGATGTGTTCACGATTATGATCGCACATCCTGCACCTGTTCTGCTTT
TTTTATTCACGGTCACGATCATCCTATTGCTTATGGCGATTGAAAAACGCTCCTTGTCGCAGTTGATGGTAACCGGCGGC
ATTTGCTGCACGGTGATGTTTCTGCTCTTTATATATCCGTGTCTTAGTTCCGAAGGAGAAGTGGATATGATAGATATTGG
ACAGGGTGACAGCATGTTTGTAGGTGCTCCGCATCAGCGGGGTCGTGTCTTAATTGATACCGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAGCATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGACGCTTTAATTCTGACGCACGCTGACCAAGATCATATCGGAGAGGCGGAGACTCTGCTGAAGCATCA
TAAAGTAAAGCGCCTCGTGATTCCGAAGGGGTTCGTTTCTGAACCTAAAGATGAGAAAGTGCTGCAGACAGCCAGAGAAG
AGGGAGTGACAATTGAAGAGGTGAAGCGAGGCGATGTATTGCAAATAAAGGATTTGCAGTTCCATGTACTGTCACCTGAA
ACACCTGATCCGGCAAGCAAAAATAATTCCTCTCTCGTTCTGTGGATGGAGACGGGCGTTCTGAGCTGGATTTTGACGGG
TGATCTGGAGAAAGAAGGGGAACAGGAGGTGATGGACGTGTTTCCAAATATTAAAGCAGATGTCTTAAAGGTGGGGCACC
ATGGGAGCAAAGGCTCTACCGGTGAAGAATTCATCCAACAGCTTCAGCCGAAAACGGCCATTATCTCAGCCGGGAAAAAC
AATCGGTACCATCATCCTCATCAAAAAGTTCTGCAACTATTACAGAGACATTCTATCCGCGTGCTGCGAACAGATCAAAA
CGGAACGATCCAATATAGATACAAAAACAGAGTTGGAACCTTTTCTGTCTATCCTCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

98.454

100

0.985