Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEC
Type: Machinery gene
Locus tag: LSG23_RS12975
Genome accession: NZ_CP088973
Coordinates: 2657472..2659823 (-)
Length: 783 a.a.
NCBI ID: WP_077722611.1
Uniprot ID: -
Organism: Bacillus velezensis strain CMML20-16
Function: ssDNA transport into the cell (predicted from homology)
DNA binding and uptake
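
The coordinates and lengths in this overview can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper name `span_length` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the coding sequence ends with one untranslated stop codon.

```python
def span_length(coords: str) -> int:
    """Return the length in bp of an inclusive 'start..end' span."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


# Coordinates of comEC as listed in this record.
gene_bp = span_length("2657472..2659823")

# Assumption: the final codon is a stop codon and is not translated.
protein_aa = gene_bp // 3 - 1

print(gene_bp, protein_aa)
```

Under those assumptions the result agrees with the 2352 bp and 783 a.a. reported in this record.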

Genomic Context


Location: 2652472..2664823
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
LSG23_RS12940 (LSG23_12920) | - | 2652835..2653173 (-) | 339 | WP_007408270.1 | YqxA family protein | -
LSG23_RS12945 (LSG23_12925) | spoIIP | 2653190..2654383 (-) | 1194 | WP_013352939.1 | stage II sporulation protein P | -
LSG23_RS12950 (LSG23_12930) | gpr | 2654451..2655557 (-) | 1107 | WP_003152878.1 | GPR endopeptidase | -
LSG23_RS12955 (LSG23_12935) | rpsT | 2655760..2656026 (+) | 267 | WP_003152876.1 | 30S ribosomal protein S20 | -
LSG23_RS12960 (LSG23_12940) | holA | 2656043..2657084 (-) | 1042 | Protein_2507 | DNA polymerase III subunit delta | -
LSG23_RS12965 (LSG23_12945) | - | 2657124..2657278 (-) | 155 | Protein_2508 | hypothetical protein | -
LSG23_RS12970 (LSG23_12950) | - | 2657319..2657453 (+) | 135 | WP_003152870.1 | YqzM family protein | -
LSG23_RS12975 (LSG23_12955) | comEC | 2657472..2659823 (-) | 2352 | WP_077722611.1 | DNA internalization-related competence protein ComEC/Rec2 | Machinery gene
LSG23_RS12980 (LSG23_12960) | - | 2659824..2660393 (-) | 570 | WP_003152868.1 | ComE operon protein 2 | -
LSG23_RS12985 (LSG23_12965) | comEA | 2660460..2661074 (-) | 615 | WP_013352942.1 | helix-hairpin-helix domain-containing protein | Machinery gene
LSG23_RS12990 (LSG23_12970) | comER | 2661133..2661954 (+) | 822 | WP_014305449.1 | late competence protein ComER | -
LSG23_RS12995 (LSG23_12975) | - | 2662023..2662760 (-) | 738 | WP_031378935.1 | class I SAM-dependent methyltransferase | -
LSG23_RS13000 (LSG23_12980) | rsfS | 2662757..2663113 (-) | 357 | WP_003152864.1 | ribosome silencing factor | -
LSG23_RS13005 (LSG23_12985) | yqeK | 2663131..2663691 (-) | 561 | WP_077722612.1 | bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK | -
LSG23_RS13010 (LSG23_12990) | - | 2663681..2664250 (-) | 570 | WP_003152860.1 | nicotinate-nucleotide adenylyltransferase | -
LSG23_RS13015 (LSG23_12995) | yhbY | 2664261..2664551 (-) | 291 | WP_003152858.1 | ribosome assembly RNA-binding protein YhbY | -
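
The genomic context location (2652472..2664823) is numerically consistent with the comEC coordinates extended by 5000 bp on each side. The sketch below reproduces that window. The 5000 bp flank is inferred from the numbers in this record rather than stated by it, and the function name `flanked_window` is hypothetical.

```python
def flanked_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend an inclusive span by `flank` bp on both sides.

    The lower bound is clamped at position 1. The upper bound is not
    clamped because the genome length is not given in this record.
    """
    return max(1, start - flank), end + flank


# comEC coordinates as listed in the table above.
window = flanked_window(2657472, 2659823)
print(f"{window[0]}..{window[1]}")
```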

Sequence


Protein


Length: 783 a.a.; Molecular weight: 86807.83 Da; Isoelectric point: 9.0931

>NTDB_id=634313 LSG23_RS12975 WP_077722611.1 2657472..2659823(-) (comEC) [Bacillus velezensis strain CMML20-16]
MKYKYLLLPLAAVSATAGIAAAHVFWVLLLFLLYLLFIMIKTKQPAPVIICLVSFCVYFFLYTVCDAANVTRYQAGSYTE
QAVITNIPKVDGAKMSAVIRTHDKEKWAASYKIRSLEEKRLIEQLEPGMRCTFTGSLEQPAHATIPGGFDYKEYLYSQQI
HWLFSVTSIQRCEKSKQPLFKLLNIRKNLISIIRNHVPESSAGIVEALTLGERFSIEDDILSAYQNLGVVHLMAISGMHV
GLITAGLFYALIRIGLTREKAGMLLLLFLPVYTLLSGAAPSVLRASLMLGFYIAGTLVKRGIHSSAALSLSYLLLLLFNP
YFLWQAGFQLSFAVSASLILSSSILKKAGKSRLAGLAMASFIAELSSLPFLLYHFQQISLVSFPMNMVMVPFYTLFVIPV
SVIGFLLLLLSRQMGECLFDMFDLVMKPVHDFITYAASVDLFTMIVSKPDFLSLLLLAVSVFTLFAALEKGGFLKLRKSA
LFFCAVLAYLICRPYFSPWGEADMLDIGQGDSLFISAPYRKGTVMVDTGGVIAYPGESWKEKRHPYSIGEKVLIPFLNGK
GVKKLDALFLTHADQDHIGEAGVLIKNHRVKRLIVPVGFVKEPKDQNILKMAKENNIPVAEAKRGDTITAGDLQFQVLSP
ESSDGKSKNDSSLVLWTVLGGVSWLLTGDLESDGETEVLKTYPKLKADILKAGHHGSKSSTSEAFLKQLQPEAALISAGK
ENRYHHPHEEVLDRLKAYSVNVLRTDISGTIQYRFEKGAGTFSVFPPYDIEETRAQEVKKTAD

Nucleotide


Length: 2352 bp

>NTDB_id=634313 LSG23_RS12975 WP_077722611.1 2657472..2659823(-) (comEC) [Bacillus velezensis strain CMML20-16]
ATGAAATATAAATACCTTCTTCTGCCTCTGGCGGCGGTTTCTGCAACTGCGGGAATTGCCGCCGCTCATGTCTTCTGGGT
TCTGCTCCTTTTTCTTCTGTATCTTCTCTTTATTATGATAAAAACAAAGCAGCCTGCTCCGGTTATCATCTGCCTCGTTT
CTTTTTGTGTTTATTTCTTTCTTTATACGGTTTGTGACGCTGCGAATGTAACGCGATATCAGGCCGGCAGTTATACTGAA
CAGGCCGTCATCACTAATATTCCGAAGGTTGACGGAGCGAAAATGTCAGCCGTTATCCGTACACATGACAAGGAAAAATG
GGCGGCTTCGTATAAAATCCGGTCTCTTGAGGAAAAGAGACTCATTGAACAGCTTGAACCGGGGATGCGCTGCACGTTTA
CAGGCTCTCTGGAACAGCCTGCACATGCGACGATTCCCGGAGGTTTTGATTATAAGGAATATCTTTACTCTCAGCAGATT
CACTGGTTATTTTCCGTGACTTCCATTCAGCGGTGTGAAAAATCCAAACAGCCGCTGTTTAAACTGCTGAACATCAGAAA
AAATTTGATTTCAATCATTCGGAATCACGTGCCTGAATCTTCCGCCGGAATTGTTGAAGCGCTGACCTTAGGTGAAAGAT
TTTCTATAGAGGACGATATACTGAGTGCATATCAAAATCTGGGAGTCGTTCATTTAATGGCGATTTCAGGAATGCATGTC
GGTCTTATTACGGCGGGACTATTTTATGCTCTGATCAGAATCGGGCTGACAAGAGAAAAGGCGGGGATGTTGCTGCTGCT
GTTTTTGCCGGTATATACGCTGCTGAGCGGTGCCGCCCCATCCGTATTGCGCGCATCCCTCATGCTGGGATTTTATATCG
CCGGAACTCTTGTTAAACGCGGCATTCATTCCTCTGCTGCATTGTCTCTGTCTTATCTGCTGCTCCTGCTGTTTAATCCT
TACTTCCTTTGGCAGGCGGGCTTCCAGCTTTCCTTTGCGGTAAGCGCCTCTTTAATTCTGTCATCCTCCATTTTAAAGAA
AGCGGGGAAAAGCAGACTTGCCGGGCTTGCGATGGCCTCATTCATTGCAGAGCTCAGCTCACTTCCGTTTCTTCTCTATC
ATTTTCAACAAATTTCACTTGTCAGTTTTCCGATGAACATGGTGATGGTGCCATTTTATACGTTATTTGTCATTCCGGTT
TCTGTCATCGGCTTCCTTCTTCTTTTGCTTTCAAGGCAGATGGGAGAATGTTTGTTTGATATGTTTGACCTTGTGATGAA
GCCTGTGCATGATTTCATTACATATGCGGCATCCGTTGATTTATTTACTATGATTGTGTCAAAGCCTGACTTTCTTTCCC
TTCTTCTGCTTGCGGTTTCCGTTTTTACGCTTTTTGCGGCTTTGGAAAAGGGAGGTTTTTTAAAACTCAGGAAATCGGCT
CTTTTTTTCTGCGCGGTTTTGGCTTATTTAATATGCCGTCCGTATTTCAGTCCATGGGGAGAAGCGGATATGCTTGATAT
CGGGCAGGGAGACTCGCTGTTTATAAGCGCGCCATACCGCAAAGGGACCGTAATGGTTGATACAGGGGGAGTGATTGCTT
ATCCCGGAGAATCATGGAAAGAAAAACGCCACCCGTATTCTATCGGCGAGAAGGTTTTGATTCCGTTTTTAAACGGAAAA
GGGGTGAAAAAGCTGGATGCGCTGTTTTTAACCCATGCGGATCAGGATCACATCGGAGAAGCCGGAGTGTTAATCAAAAA
TCATAGAGTCAAACGGTTAATTGTCCCCGTGGGATTCGTAAAAGAACCGAAGGATCAGAACATATTAAAAATGGCGAAAG
AAAACAACATTCCCGTTGCCGAAGCAAAGCGGGGCGACACCATTACAGCGGGTGATCTTCAGTTTCAGGTGCTGTCTCCG
GAGTCATCTGACGGAAAGAGTAAAAATGATTCATCACTGGTGCTTTGGACGGTTTTAGGCGGAGTGAGCTGGCTTTTGAC
GGGAGATTTAGAATCGGACGGCGAAACAGAAGTGCTGAAAACGTATCCGAAACTGAAGGCTGATATATTGAAGGCGGGTC
ATCACGGCAGCAAAAGCTCAACGAGTGAAGCCTTTTTGAAACAGCTTCAGCCGGAAGCAGCGCTGATTTCAGCAGGAAAA
GAGAATCGATACCATCATCCGCATGAAGAAGTGCTGGATCGTTTGAAGGCGTACTCTGTCAATGTGCTTCGCACCGATAT
CAGCGGAACGATTCAATACAGATTTGAAAAAGGCGCCGGAACGTTTTCCGTCTTCCCTCCATATGATATAGAAGAAACCA
GGGCGCAAGAAGTAAAAAAGACTGCCGATTGA
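
The nucleotide record is given in the coding orientation (it starts with ATG and ends with TGA), so it can be translated directly to compare against the protein sequence. The following is a minimal sketch using only the standard library. The `translate` helper is hypothetical, and it assumes the standard codon assignments, which bacterial translation table 11 shares for all sense codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the comEC nucleotide sequence shown above.
print(translate("ATGAAATATAAATACCTTCTTCTGCCTCTG"))
```

Passing the full 2352 bp sequence to `translate` should yield a peptide that can be compared with the 783 a.a. protein record. Ambiguous bases such as N are not handled in this sketch and would raise a `KeyError`.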


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEC | Bacillus subtilis subsp. subtilis str. 168 | 56.218 | 98.595 | 0.554
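
The Ha-value listed for this hit is numerically consistent with the product of the identity and coverage fractions. This record does not define the Ha-value, so the formula in the sketch below is an assumption inferred from the three numbers, and the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Identity and coverage of the B. subtilis 168 comEC hit listed above.
ha = ha_value(56.218, 98.595)
print(round(ha, 3))
```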