Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   CG798_RS18120 Genome accession   NZ_CP022531
Coordinates   3628921..3631272 (-) Length   783 a.a.
NCBI ID   WP_094031847.1    Uniprot ID   -
Organism   Bacillus velezensis strain TB1501     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3623921..3636272
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CG798_RS18090 (CG798_18090) - 3624287..3624625 (-) 339 WP_094031846.1 YqxA family protein -
  CG798_RS18095 (CG798_18095) - 3624642..3625835 (-) 1194 WP_007612673.1 stage II sporulation protein P -
  CG798_RS18100 (CG798_18100) gpr 3625903..3627009 (-) 1107 WP_007408268.1 GPR endopeptidase -
  CG798_RS18105 (CG798_18105) rpsT 3627212..3627478 (+) 267 WP_003152876.1 30S ribosomal protein S20 -
  CG798_RS18110 (CG798_18110) holA 3627495..3628536 (-) 1042 Protein_3473 DNA polymerase III subunit delta -
  CG798_RS19960 - 3628576..3628727 (-) 152 Protein_3474 hypothetical protein -
  CG798_RS18115 (CG798_18115) - 3628768..3628902 (+) 135 WP_003152870.1 YqzM family protein -
  CG798_RS18120 (CG798_18120) comEC 3628921..3631272 (-) 2352 WP_094031847.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  CG798_RS18125 (CG798_18125) - 3631273..3631842 (-) 570 WP_094031848.1 ComE operon protein 2 -
  CG798_RS18130 (CG798_18130) comEA 3631909..3632523 (-) 615 WP_043021637.1 helix-hairpin-helix domain-containing protein Machinery gene
  CG798_RS18135 (CG798_18135) comER 3632582..3633403 (+) 822 WP_012118027.1 late competence protein ComER -
  CG798_RS18140 (CG798_18140) - 3633472..3634209 (-) 738 WP_076424998.1 class I SAM-dependent methyltransferase -
  CG798_RS18145 (CG798_18145) rsfS 3634206..3634562 (-) 357 WP_007408260.1 ribosome silencing factor -
  CG798_RS18150 (CG798_18150) yqeK 3634580..3635140 (-) 561 WP_014418422.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  CG798_RS18155 (CG798_18155) - 3635130..3635699 (-) 570 WP_007408258.1 nicotinate-nucleotide adenylyltransferase -
  CG798_RS18160 (CG798_18160) yhbY 3635710..3636000 (-) 291 WP_007408257.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 783 a.a.        Molecular weight: 86502.42 Da        Isoelectric Point: 8.9041

>NTDB_id=240440 CG798_RS18120 WP_094031847.1 3628921..3631272(-) (comEC) [Bacillus velezensis strain TB1501]
MKYKYLLLPLAAVSATAGIAAAHVFLVLLLFLLYLLFIIVKTKQHAPVIVCLVSFCLYFFLYTVCDVANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATVPGGFDYKEYLYSQQI
HWLFSVTSIQQCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGILLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGESKLAGLAMASLIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFGMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPHRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALILTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILNMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPNLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFKKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Download         Length: 2352 bp        

>NTDB_id=240440 CG798_RS18120 WP_094031847.1 3628921..3631272(-) (comEC) [Bacillus velezensis strain TB1501]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTTGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATCATTGTAAAAACAAAGCAGCATGCTCCGGTTATCGTCTGCCTCGTTT
CTTTTTGTCTTTATTTCTTTCTTTATACGGTTTGTGACGTTGCGAATGTAACGCGGTATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTACAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGGTTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCAGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCGATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATTTGGGAGTCGTTCATTTAATGGCGATTTCCGGAATGCATGTC
GGTCTTATTACGGCGGGATTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAAGCAGGAATTTTGCTGCTGCT
GTTTTTGCCGGTGTATACACTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCCCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCAGGGGAAAGCAAACTTGCCGGGCTTGCGATGGCTTCATTGATTGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAGCAGATTTCACTTGTCAGTTTTCCGATGAATATGGTGATGGTGCCTTTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGTTTCCTTCTTCTTTTACTCTCAAGGCAGATGGGAGAATGTTTGTTTGGTATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCACTGTTTATAAGCGCGCCGCACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCATTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCACTGATTTTAACCCATGCGGATCAAGATCACATCGGGGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTAGGATTCGTAAAAGAACCGAAAGATCAGAACATATTAAATATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCCGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCGTCTGACGGAAAGAGTAAAAATGATTCGTCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACGGAAGTGCTGAAAACGTATCCGAATCTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAGCAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTAAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTAAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

57.124

98.595

0.563


Multiple sequence alignment