Detailed information    

in silico: Bioinformatically predicted

Overview


Name   comEC
Type   Machinery gene
Locus tag   CLI98_RS02125
Genome accession   NZ_CP023431
Coordinates   419729..422080 (-)
Length   783 a.a.
NCBI ID   WP_022552933.1
Uniprot ID   -
Organism   Bacillus velezensis strain SCGB 574
Function   ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
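
The coordinates and lengths reported in this overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(419729, 422080)
print(cds_bp)                  # 2352
print(protein_length(cds_bp))  # 783
```

Both values agree with the nucleotide length (2352 bp) and protein length (783 a.a.) given in this record.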

Genomic Context


Location: 414729..427080
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CLI98_RS02095 (CLI98_00413) - 415095..415433 (-) 339 WP_007408270.1 YqxA family protein -
  CLI98_RS02100 (CLI98_00414) - 415450..416643 (-) 1194 WP_007612673.1 stage II sporulation protein P -
  CLI98_RS02105 (CLI98_00415) gpr 416711..417817 (-) 1107 WP_022552935.1 GPR endopeptidase -
  CLI98_RS02110 (CLI98_00416) rpsT 418020..418286 (+) 267 WP_003152876.1 30S ribosomal protein S20 -
  CLI98_RS02115 (CLI98_00417) holA 418303..419344 (-) 1042 Protein_420 DNA polymerase III subunit delta -
  CLI98_RS19770 - 419384..419535 (-) 152 Protein_421 hypothetical protein -
  CLI98_RS02120 - 419576..419710 (+) 135 WP_003152870.1 YqzM family protein -
  CLI98_RS02125 (CLI98_00418) comEC 419729..422080 (-) 2352 WP_022552933.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  CLI98_RS02130 (CLI98_00419) - 422081..422650 (-) 570 WP_003152868.1 ComE operon protein 2 -
  CLI98_RS02135 (CLI98_00420) comEA 422717..423331 (-) 615 WP_014418419.1 helix-hairpin-helix domain-containing protein Machinery gene
  CLI98_RS02140 (CLI98_00421) comER 423390..424211 (+) 822 WP_014418420.1 late competence protein ComER -
  CLI98_RS02145 (CLI98_00422) - 424280..425017 (-) 738 WP_014418421.1 class I SAM-dependent methyltransferase -
  CLI98_RS02150 (CLI98_00423) rsfS 425014..425370 (-) 357 WP_007408260.1 ribosome silencing factor -
  CLI98_RS02155 (CLI98_00424) yqeK 425388..425948 (-) 561 WP_014418422.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  CLI98_RS02160 (CLI98_00425) - 425938..426507 (-) 570 WP_007408258.1 nicotinate-nucleotide adenylyltransferase -
  CLI98_RS02165 (CLI98_00426) yhbY 426518..426808 (-) 291 WP_003152858.1 ribosome assembly RNA-binding protein YhbY -
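
The displayed region appears to extend 5,000 bp on either side of comEC. That flank size is inferred from the coordinates and is not stated in the record. A minimal sketch, assuming 1-based inclusive coordinates and using hypothetical helper names, checks this and measures the spacing between comEC and its immediate neighbors:

```python
GENE = (419729, 422080)    # comEC coordinates from this record
REGION = (414729, 427080)  # "Location" shown for the genomic context
FLANK = 5000               # assumed flank size, inferred from the numbers above


def flanking_window(gene, flank):
    """Extend a (start, end) range by `flank` bp on both sides, clamped at 1."""
    start, end = gene
    return (max(1, start - flank), end + flank)


def gap_bp(left, right):
    """Number of bases between two ranges; a negative value means overlap."""
    return right[0] - left[1] - 1


print(flanking_window(GENE, FLANK) == REGION)  # True
print(gap_bp((419576, 419710), GENE))          # 18 (CLI98_RS02120 to comEC)
print(gap_bp(GENE, (422081, 422650)))          # 0  (comEC to CLI98_RS02130)
```

A gap of 0 means CLI98_RS02130 starts at the base immediately after comEC ends on the genome coordinates.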

Sequence


Protein


Length: 783 a.a.        Molecular weight: 86713.55 Da        Isoelectric Point: 8.7941

>NTDB_id=247047 CLI98_RS02125 WP_022552933.1 419729..422080(-) (comEC) [Bacillus velezensis strain SCGB 574]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLFFIIVKTKQHAPVIVCLVSFCLYFFLYTVCDAANVTRYQAGSYTE
QVVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYNEYLYSQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSGQMGECLFDMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSV
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGRSKNDSSLVLWTVLSGVSWLLTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFKKGAGTFSVFPPYDIEEIRAQEVKKTAD
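
The FASTA header of this record packs several fields into one line. The sketch below shows one way to split it with a regular expression. The field layout is inferred from this single header, so the pattern and the field names are assumptions rather than a documented format.

```python
import re

HEADER = (">NTDB_id=247047 CLI98_RS02125 WP_022552933.1 "
          "419729..422080(-) (comEC) [Bacillus velezensis strain SCGB 574]")

# Field order assumed from the header above: id, locus tag, protein ID,
# coordinates with strand, gene name in parentheses, organism in brackets.
PATTERN = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Return the header fields as a dict; raise ValueError if it does not match."""
    match = PATTERN.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


info = parse_header(HEADER)
print(info["gene"], info["strand"], info["end"] - info["start"] + 1)
```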

Nucleotide


Length: 2352 bp

>NTDB_id=247047 CLI98_RS02125 WP_022552933.1 419729..422080(-) (comEC) [Bacillus velezensis strain SCGB 574]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTTTCTTTATTATTGTAAAAACAAAGCAGCATGCTCCGGTTATTGTCTGCCTCGTTT
CTTTTTGTCTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGTCGTCATCACTAACATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGTTCTCTGGAGCAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAATGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCGGGAATTTTGCTACTGTT
GTTTTTGCCGGTGTATACGCTGCTGAGCGGTGCCGCCCCGTCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCCGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCTTCATTCATTGCGGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATTGGTTTCCTTCTTCTTTTACTTTCAGGGCAGATGGGAGAATGTTTGTTTGATATGTTCGACCTTGTGATGAA
GCCTGTACATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACTCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGTT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGATTTTAACCCATGCGGATCAAGATCACATCGGAGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAGGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAGGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAAGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCGGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAATCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
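
Although the gene lies on the minus strand, the nucleotide sequence above is given in coding orientation: it starts with ATG and ends with a stop codon. As a quick consistency check, the sketch below translates the first and last 30 nucleotides and compares them with the ends of the protein sequence. It assumes the standard genetic code, and the helper names are illustrative.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


five_prime = "ATGAAATATAAATACCTTCTTCTGCCTCTG"   # first 30 nt of the record
three_prime = "GCGCAAGAAGTAAAAAAGACTGCCGATTGA"  # last 30 nt of the record

print(translate(five_prime))   # MKYKYLLLPL
print(translate(three_prime))  # AQEVKKTAD*
```

The two translations match the N-terminal (MKYKYLLLPL) and C-terminal (AQEVKKTAD) residues of the protein sequence in this record.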


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168 56.921 98.723 0.562


Multiple sequence alignment