Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comEC   Type   Machinery gene
Locus tag   NHG71_RS03535 Genome accession   NZ_CP100040
Coordinates   662254..664605 (+) Length   783 a.a.
NCBI ID   WP_032873958.1    Uniprot ID   -
Organism   Bacillus velezensis strain B1     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake
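
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the original record; it assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the terminal stop codon


cds_bp = span_length(662254, 664605)
print(cds_bp, protein_length(cds_bp))  # 2352 783
```

Under these assumptions the result agrees with the 2352 bp and 783 a.a. values reported for comEC.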

Genomic Context


Location: 657254..669605
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NHG71_RS03495 (NHG71_03475) yhbY 657525..657815 (+) 291 WP_003152858.1 ribosome assembly RNA-binding protein YhbY -
  NHG71_RS03500 (NHG71_03480) - 657826..658395 (+) 570 WP_007408258.1 nicotinate-nucleotide adenylyltransferase -
  NHG71_RS03505 (NHG71_03485) yqeK 658385..658960 (+) 576 WP_032873949.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  NHG71_RS03510 (NHG71_03490) rsfS 658964..659320 (+) 357 WP_007408260.1 ribosome silencing factor -
  NHG71_RS03515 (NHG71_03495) - 659317..660054 (+) 738 WP_032873951.1 class I SAM-dependent methyltransferase -
  NHG71_RS03520 (NHG71_03500) comER 660123..660944 (-) 822 WP_012118027.1 late competence protein ComER -
  NHG71_RS03525 (NHG71_03505) comEA 661003..661617 (+) 615 WP_032873955.1 helix-hairpin-helix domain-containing protein Machinery gene
  NHG71_RS03530 (NHG71_03510) - 661684..662253 (+) 570 WP_003152868.1 ComE operon protein 2 -
  NHG71_RS03535 (NHG71_03515) comEC 662254..664605 (+) 2352 WP_032873958.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  NHG71_RS03540 (NHG71_03520) - 664624..664758 (-) 135 WP_003152870.1 YqzM family protein -
  NHG71_RS03545 (NHG71_03525) - 664799..664950 (+) 152 Protein_682 hypothetical protein -
  NHG71_RS03550 (NHG71_03530) holA 664990..666031 (+) 1042 Protein_683 DNA polymerase III subunit delta -
  NHG71_RS03555 (NHG71_03535) rpsT 666048..666314 (-) 267 WP_003152876.1 30S ribosomal protein S20 -
  NHG71_RS03560 (NHG71_03540) gpr 666517..667623 (+) 1107 WP_007408268.1 GPR endopeptidase -
  NHG71_RS03565 (NHG71_03545) spoIIP 667691..668884 (+) 1194 WP_032873961.1 stage II sporulation protein P -
  NHG71_RS03570 (NHG71_03550) - 668901..669239 (+) 339 WP_007408270.1 YqxA family protein -
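
The Location window above appears to be the comEC coordinates extended by 5,000 bp on each side. This is an inference from the numbers in this record, not something the record states. A minimal sketch of that calculation, with a hypothetical `FLANK` constant and helper names:

```python
FLANK = 5000  # assumed flank size, inferred from the numbers in this record


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a gene's coordinates by `flank` bp on both sides (floor at 1)."""
    return max(1, start - flank), end + flank


def within(window: tuple[int, int], start: int, end: int) -> bool:
    """True if a feature lies entirely inside the window."""
    return window[0] <= start and end <= window[1]


window = context_window(662254, 664605)
print(window)                          # (657254, 669605)
print(within(window, 657525, 657815))  # yhbY, first row of the table: True
print(within(window, 668901, 669239))  # last row of the table: True
```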

Sequence


Protein


Length: 783 a.a.        Molecular weight: 86681.60 Da        Isoelectric Point: 9.1106

>NTDB_id=703308 NHG71_RS03535 WP_032873958.1 662254..664605(+) (comEC) [Bacillus velezensis strain B1]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIIVKTKHHAPVIVCLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLHSQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGEKFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLMFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFLMNMVLVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFDMFDLVMKPVHDFITYAASVDLFTMIVSKPDFVSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVISYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGRSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKVYSVNVLRTDVSGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Length: 2352 bp

>NTDB_id=703308 NHG71_RS03535 WP_032873958.1 662254..664605(+) (comEC) [Bacillus velezensis strain B1]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATTGTAAAAACAAAGCACCATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGTTCTCTTGAGGAAAAAAGACTTATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTCACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAAAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCGGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGATGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCGGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCTTCATTCATTGCGGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCTGATGAATATGGTGTTGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTTATCGGTTTCCTTCTTCTTTTACTTTCAAGGCAGATGGGAGAATGTTTGTTTGATATGTTCGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGATTTTGTTTCCC
TTCTTCTGCTTGCGGTTTCAGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTTCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAAGATCACATCGGAGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCTACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGTGTACTCTGTCAATGTGCTTCGCACCGATGT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
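
The nucleotide and protein records can be checked against each other by translating the coding sequence. The sketch below is an illustration, not the database's own pipeline. It uses the standard genetic code, which assigns the same amino acids to sense codons as the bacterial table, and it only translates the first and last few codons copied from the sequence above. The `translate` helper is hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAAATATAAATACCTTCTTCTGCCTCTG"))  # first 30 nt: MKYKYLLLPL
print(translate("AAGACTGCCGATTGA"))                 # last 15 nt: KTAD
```

Both fragments match the start (MKYKYLLLPL) and end (KTAD) of the protein sequence listed above.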


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168 56.606 98.595 0.558