Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEC
Type: Machinery gene
Locus tag: CMR26_RS04120
Genome accession: NZ_CP023414
Coordinates: 791907..794258 (+)
Length: 783 a.a.
NCBI ID: WP_108724812.1
UniProt ID: -
Organism: Bacillus velezensis strain BS-37
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
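The coordinates and the protein length listed in this overview are linked by simple arithmetic: the inclusive span 791907..794258 covers 2352 bp, which is 784 codons, the last of which is the stop codon. A minimal Python sketch of that consistency check follows; the helper name `parse_coordinates` is illustrative and is not part of the database.

```python
import re

def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)

start, end, strand = parse_coordinates("791907..794258 (+)")

# Coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end - start + 1

# Every codon encodes one residue except the final stop codon.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa, strand)
```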

Genomic Context


Location: 786907..799258
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
CMR26_RS04080 (CMR26_04075) | yhbY | 787180..787470 (+) | 291 | WP_007408257.1 | ribosome assembly RNA-binding protein YhbY | -
CMR26_RS04085 (CMR26_04080) | - | 787481..788050 (+) | 570 | WP_042635364.1 | nicotinate-nucleotide adenylyltransferase | -
CMR26_RS04090 (CMR26_04085) | yqeK | 788040..788600 (+) | 561 | WP_014418422.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | -
CMR26_RS04095 (CMR26_04090) | rsfS | 788617..788973 (+) | 357 | WP_007408260.1 | ribosome silencing factor | -
CMR26_RS04100 (CMR26_04095) | - | 788970..789707 (+) | 738 | WP_015240247.1 | class I SAM-dependent DNA methyltransferase | -
CMR26_RS04105 (CMR26_04100) | comER | 789776..790597 (-) | 822 | WP_053103971.1 | late competence protein ComER | -
CMR26_RS04110 (CMR26_04105) | comEA | 790656..791270 (+) | 615 | WP_108724811.1 | helix-hairpin-helix domain-containing protein | Machinery gene
CMR26_RS04115 (CMR26_04110) | - | 791337..791906 (+) | 570 | WP_003152868.1 | ComE operon protein 2 | -
CMR26_RS04120 (CMR26_04115) | comEC | 791907..794258 (+) | 2352 | WP_108724812.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
CMR26_RS04125 (CMR26_04120) | - | 794277..794411 (-) | 135 | WP_003152870.1 | YqzM family protein | -
CMR26_RS20050 | - | 794452..794608 (+) | 157 | Protein_786 | hypothetical protein | -
CMR26_RS04130 (CMR26_04125) | holA | 794643..795684 (+) | 1042 | Protein_787 | DNA polymerase III subunit delta | -
CMR26_RS04135 (CMR26_04130) | rpsT | 795701..795967 (-) | 267 | WP_003152876.1 | 30S ribosomal protein S20 | -
CMR26_RS04140 (CMR26_04135) | gpr | 796170..797276 (+) | 1107 | WP_007408268.1 | GPR endopeptidase | -
CMR26_RS04145 (CMR26_04140) | spoIIP | 797345..798538 (+) | 1194 | WP_007612673.1 | stage II sporulation protein P | -
CMR26_RS04150 (CMR26_04145) | - | 798555..798893 (+) | 339 | WP_007408270.1 | YqxA family protein | -
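The context location 786907..799258 matches the comEC coordinates extended by 5000 bp on each side. The sketch below reproduces that window and filters a few neighbouring genes from the table. The 5000 bp flank is inferred from the numbers on this page and is not a documented parameter, and the function names are illustrative.

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates, not documented

def context_window(gene_start, gene_end, flank=FLANK_BP):
    """Return the (start, end) window around a gene, clipped at position 1."""
    return max(1, gene_start - flank), gene_end + flank

def lies_within(window, start, end):
    """True if a feature is fully contained in the window."""
    return window[0] <= start and end <= window[1]

window = context_window(791907, 794258)

# A few rows copied from the genomic context table: (gene name, start, end).
neighbours = [
    ("yhbY", 787180, 787470),
    ("comEA", 790656, 791270),
    ("spoIIP", 797345, 798538),
]
inside = [name for name, start, end in neighbours
          if lies_within(window, start, end)]

print(window, inside)
```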

Sequence


Protein


Length: 783 a.a.        Molecular weight: 86788.65 Da        Isoelectric point: 9.0019

>NTDB_id=246866 CMR26_RS04120 WP_108724812.1 791907..794258(+) (comEC) [Bacillus velezensis strain BS-37]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIIVKTKQYAPVIVCLVSFCLYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYSQQI
HWLFSVTSIEQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASLIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQVGECLFDMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDRRSKNDSSLVLWTVFGGVSWLLTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Length: 2352 bp

>NTDB_id=246866 CMR26_RS04120 WP_108724812.1 791907..794258(+) (comEC) [Bacillus velezensis strain BS-37]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATTGTAAAAACAAAGCAGTATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTCTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGCTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCTGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGTTCTCTTGAGGAAAAAAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTGAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCGTATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGGTTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTTCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCTTCATTGATTGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTCTCAAGGCAGGTGGGAGAATGTTTGTTTGATATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTGCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAGTCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGTCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCATTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCACTGATTTTAACCCATGCGGATCAAGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTAGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAATAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACAGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTTGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAAGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAGCAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTAAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
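The nucleotide record can be checked against the protein record by translating it with the standard codon table, whose amino acid assignments are the same as in the bacterial table. The sketch below is a minimal plain-Python translator, not the tool the database uses. For brevity it is applied only to the first 30 and the last 12 nucleotides of the sequence above.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nt and last 12 nt of the comEC coding sequence listed above.
n_terminus = translate("ATGAAATATAAATACCTTCTTCTGCCTCTG")
c_terminus = translate("ACTGCCGATTGA")

print(n_terminus, c_terminus)
```

Passing the full 2352 bp sequence to `translate` in the same way should yield the 783-residue protein record.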


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEC | Bacillus subtilis subsp. subtilis str. 168 | 57.124 | 98.595 | 0.563
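The reported Ha-value is numerically consistent with the product of the identity and coverage fractions (0.57124 × 0.98595 ≈ 0.563). The sketch below makes that relation explicit. The formula is an assumption based on these three numbers and is not a definition taken from this page, and the function name `ha_value` is illustrative.

```python
def ha_value(identity_percent, coverage_percent):
    """Assumed relation: Ha-value = identity fraction * coverage fraction."""
    return (identity_percent / 100.0) * (coverage_percent / 100.0)

# Values from the B. subtilis 168 comEC row above.
score = ha_value(57.124, 98.595)

print(round(score, 3))
```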


Multiple sequence alignment