Detailed information    

in silico   Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   GWK37_RS01955 Genome accession   NZ_CP048002
Coordinates   390924..393275 (-) Length   783 a.a.
NCBI ID   WP_115940649.1    Uniprot ID   -
Organism   Bacillus velezensis strain CACC 316     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake
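
The coordinates, nucleotide length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(390924, 393275)  # comEC coordinates from the Overview
print(cds_bp, protein_length(cds_bp))  # 2352 783
```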

Genomic Context


Location: 385924..398275
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
GWK37_RS01920 (GWK37_01925)   -   386290..386628 (-)   339   WP_007408270.1   YqxA family protein   -
GWK37_RS01925 (GWK37_01930)   spoIIP   386645..387838 (-)   1194   WP_007612673.1   stage II sporulation protein P   -
GWK37_RS01930 (GWK37_01935)   gpr   387906..389012 (-)   1107   WP_014418417.1   GPR endopeptidase   -
GWK37_RS01935 (GWK37_01940)   rpsT   389215..389481 (+)   267   WP_003152876.1   30S ribosomal protein S20   -
GWK37_RS01940 (GWK37_01945)   holA   389498..390539 (-)   1042   Protein_385   DNA polymerase III subunit delta   -
GWK37_RS01945 (GWK37_01950)   -   390579..390730 (-)   152   Protein_386   hypothetical protein   -
GWK37_RS01950 (GWK37_01955)   -   390771..390905 (+)   135   WP_003152870.1   YqzM family protein   -
GWK37_RS01955 (GWK37_01960)   comEC   390924..393275 (-)   2352   WP_115940649.1   DNA internalization-related competence protein ComEC/Rec2   Machinery gene
GWK37_RS01960 (GWK37_01965)   -   393276..393845 (-)   570   WP_003152868.1   ComE operon protein 2   -
GWK37_RS01965 (GWK37_01970)   comEA   393912..394526 (-)   615   WP_115940650.1   helix-hairpin-helix domain-containing protein   Machinery gene
GWK37_RS01970 (GWK37_01975)   comER   394585..395406 (+)   822   WP_012118027.1   late competence protein ComER   -
GWK37_RS01975 (GWK37_01980)   -   395475..396212 (-)   738   WP_015417856.1   class I SAM-dependent DNA methyltransferase   -
GWK37_RS01980 (GWK37_01985)   rsfS   396209..396565 (-)   357   WP_007408260.1   ribosome silencing factor   -
GWK37_RS01985 (GWK37_01990)   yqeK   396583..397143 (-)   561   WP_014418422.1   bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK   -
GWK37_RS01990 (GWK37_01995)   -   397133..397702 (-)   570   WP_007408258.1   nicotinate-nucleotide adenylyltransferase   -
GWK37_RS01995 (GWK37_02000)   yhbY   397713..398003 (-)   291   WP_003152858.1   ribosome assembly RNA-binding protein YhbY   -
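
The Location range appears to be the comEC coordinates padded by 5,000 bp on each side. That padding is inferred from the numbers and is not stated on the page, and the helper names below are hypothetical. A minimal sketch of the calculation:

```python
def context_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Pad a 1-based gene range by `flank` bp on both sides.

    The default flank of 5000 bp is an inferred value, not a documented one.
    """
    return max(1, start - flank), end + flank


def within(window: tuple[int, int], start: int, end: int) -> bool:
    """True if the range start..end lies entirely inside the window."""
    return window[0] <= start and end <= window[1]


window = context_window(390924, 393275)  # comEC coordinates
print(window)  # (385924, 398275)

# The first and last genes listed in the context table fall inside the window.
print(within(window, 386290, 386628), within(window, 397713, 398003))  # True True
```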

Sequence


Protein


Length: 783 a.a.        Molecular weight: 86694.51 Da        Isoelectric Point: 8.9059

>NTDB_id=419207 GWK37_RS01955 WP_115940649.1 390924..393275(-) (comEC) [Bacillus velezensis strain CACC 316]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIIVKTKQHAPVIVCLVSFCLYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYSQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTVFVIPV
SVIGFLLLLLSRQMGECLFDMFDLVMKPVHDFITYAASVDLFTMIVSKPDFVSLLLLAVSVFTLFAALEKGGFLKLRKTA
LFFCAVLAYLICHPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVISYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQTILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGRSKNDSSLVLWTVLSGVSWLLTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Length: 2352 bp

>NTDB_id=419207 GWK37_RS01955 WP_115940649.1 390924..393275(-) (comEC) [Bacillus velezensis strain CACC 316]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATTGTAAAAACAAAGCAGCATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTCTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAACATCCCGAAGGTTGACGGAGCAAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAGCAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCGGGAATTTTGCTGCTGTT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCACCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTTCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCGGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCTTCATTTATTGCGGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGGTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTTTCAAGGCAGATGGGAGAATGTTTGTTTGATATGTTCGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTGTTTCCC
TTCTGCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAAACGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCATCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTTCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCACTGATTTTAACCCATGCGGATCAAGATCACATCGGAGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAAGATCAGACCATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAGGAGTAAAAATGATTCGTCATTGGTGCTTTGGACGGTTTTAAGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCGGCACTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
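
Although the gene lies on the minus strand, the nucleotide record above is already written as the coding strand (it starts with ATG and ends with TGA), so it can be translated directly. The sketch below assumes the standard codon assignments, which the bacterial translation table shares; `translate` is an illustrative helper, not part of any tool named on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop FASTA line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the comEC record above.
print(translate("ATGAAATATAAATACCTTCTTCTGCCTCTG"))  # MKYKYLLLPL
print(translate("ACTGCCGATTGA"))                    # TAD
```

Both fragments match the start (MKYKYLLLPL) and end (TAD) of the protein sequence listed above.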


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comEC   Bacillus subtilis subsp. subtilis str. 168   57.383   98.595   0.566
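
The three values listed for the B. subtilis hit are numerically related: the Ha-value matches the product of identity and coverage when both are expressed as fractions. The page does not define Ha-value, so the formula below is an assumption that happens to fit this entry, and `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


print(round(ha_value(57.383, 98.595), 3))  # 0.566
```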


Multiple sequence alignment