Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC   Type   Machinery gene
Locus tag   G4P54_RS13420 Genome accession   NZ_CP048852
Coordinates   2509639..2511969 (-) Length   776 a.a.
NCBI ID   WP_167873880.1    Uniprot ID   A0A6H0WSE2
Organism   Bacillus tequilensis strain EA-CB0015     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2504639..2516969
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  G4P54_RS13385 (G4P54_13340) - 2504977..2505315 (-) 339 WP_024714995.1 YqxA family protein -
  G4P54_RS13390 (G4P54_13345) spoIIP 2505332..2506537 (-) 1206 WP_167872906.1 spore autolysin SpoIIP -
  G4P54_RS13395 (G4P54_13350) gpr 2506600..2507706 (-) 1107 WP_024714993.1 GPR endopeptidase -
  G4P54_RS13400 (G4P54_13355) rpsT 2507910..2508176 (+) 267 WP_003229989.1 30S ribosomal protein S20 -
  G4P54_RS13405 (G4P54_13360) holA 2508191..2509234 (-) 1044 WP_024714992.1 DNA polymerase III subunit delta -
  G4P54_RS13410 (G4P54_13365) - 2509274..2509423 (-) 150 WP_167872907.1 hypothetical protein -
  G4P54_RS13415 (G4P54_13370) - 2509464..2509598 (+) 135 WP_003229983.1 YqzM family protein -
  G4P54_RS13420 (G4P54_13375) comEC 2509639..2511969 (-) 2331 WP_167873880.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  G4P54_RS13425 (G4P54_13380) - 2511972..2512541 (-) 570 WP_024714990.1 ComE operon protein 2 -
  G4P54_RS13430 (G4P54_13385) comEA 2512609..2513226 (-) 618 WP_167872908.1 helix-hairpin-helix domain-containing protein Machinery gene
  G4P54_RS13435 (G4P54_13390) comER 2513310..2514131 (+) 822 WP_167872909.1 late competence protein ComER -
  G4P54_RS13440 (G4P54_13395) - 2514197..2514940 (-) 744 WP_167872910.1 class I SAM-dependent DNA methyltransferase -
  G4P54_RS13445 (G4P54_13400) rsfS 2514937..2515293 (-) 357 WP_167872911.1 ribosome silencing factor -
  G4P54_RS13450 (G4P54_13405) yqeK 2515311..2515871 (-) 561 WP_167872912.1 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) YqeK -
  G4P54_RS13455 (G4P54_13410) - 2515861..2516430 (-) 570 WP_024714985.1 nicotinate-nucleotide adenylyltransferase -
  G4P54_RS13460 (G4P54_13415) yhbY 2516442..2516732 (-) 291 WP_024714984.1 ribosome assembly RNA-binding protein YhbY -

Sequence


Protein


Download         Length: 776 a.a.        Molecular weight: 86718.15 Da        Isoelectric Point: 7.8414

>NTDB_id=423575 G4P54_RS13420 WP_167873880.1 2509639..2511969(-) (comEC) [Bacillus tequilensis strain EA-CB0015]
MRNSRLLLPLAAASATAGISAAVYFSAVILFFLFLLIIFIKTRHAFLIFVCFFSFLLFFALYTLTDSRNASSYQQGTYHF
KAVIDSIPKIDGDRMSMVVKTPDGEKWAASYRIQSANEKDRLSYIEPGMTCELTGALEEPKHATVPGIFDYHNYLYRQHI
HWNYSVTSVHNCSGPVNFKYRLLSLRKYIVSFTNESLPPDSAGIVQALTVGDRFYVEDDVLNAYQKLGVVHLLAISGLHV
GILTAGMFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVHSAGAICLSYLVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSLFLQLKTSLGQLSMVSLIAQLGSLPILLYHFQQFSIISIPMNMLMVPFYTCVILPGA
VAGVFLLLLSASAGRLFFSWFDFLMSLTNRLITKIAEIDVFTVIIARPEPALLFLFTVTMILLLMAIEKRSFTHLMVTGG
MCFAALCLLFVYPRLSSEGEVDMIDIGQGDSMYVGAPHQQGHVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAETLLRHHKVKRLVIPKGFVSEPKDEKVLRAAREEGVPIEEVIRGDVLQIKNLQFQVLSPG
TPDPENKNNSSLVLWMKTGDMSWILTGDLEKEGEQEVMNVFPNITADVLKVGHHGSKGSTGEEFIKQLRPKTAVISAGEN
NRYHHPHQEVLQILQRHSIRVLRTDQSGTLQYRYKNRVGTFSVYPPYDTSDITETN

Nucleotide


Download         Length: 2331 bp        

>NTDB_id=423575 G4P54_RS13420 WP_167873880.1 2509639..2511969(-) (comEC) [Bacillus tequilensis strain EA-CB0015]
ATGCGTAATTCGCGTTTGTTATTGCCTCTGGCGGCAGCTTCAGCAACGGCTGGAATTTCTGCCGCCGTTTATTTCTCCGC
TGTTATTCTTTTCTTCCTCTTTCTCCTCATCATTTTCATCAAAACGAGACACGCTTTTCTCATTTTTGTTTGTTTCTTCT
CTTTTCTATTGTTTTTTGCACTGTATACGCTCACAGATTCTCGGAATGCCTCTTCATATCAGCAGGGCACCTATCATTTC
AAGGCAGTGATTGATAGTATTCCTAAAATCGATGGCGACCGTATGTCTATGGTTGTTAAGACGCCTGATGGCGAAAAATG
GGCTGCTTCGTATCGCATTCAGTCTGCTAATGAAAAAGATCGGCTGTCATACATAGAACCAGGAATGACATGTGAATTAA
CTGGTGCATTGGAAGAGCCTAAACACGCAACTGTGCCTGGCATATTTGATTATCACAACTATCTGTACAGGCAGCATATT
CACTGGAACTATTCTGTCACATCTGTCCATAACTGCAGCGGACCTGTTAATTTCAAATATAGGCTGCTCAGCTTGAGAAA
ATACATCGTATCATTCACAAATGAATCGCTGCCCCCCGACTCGGCAGGGATCGTACAGGCTCTTACAGTGGGCGACAGAT
TTTATGTTGAGGACGATGTGCTTAATGCTTATCAAAAGCTTGGTGTCGTCCATCTCTTGGCGATTTCGGGGCTTCATGTC
GGCATTTTGACAGCCGGGATGTTTTATATCATGATCCGTCTAGGCATCACAAGAGAAAAGGCGTCAATCCTGCTGCTGTT
ATTTCTGCCGCTTTATGTGATGCTGACAGGCGCAGCTCCTTCAGTGCTTCGCGCCGCTCTGATGTCGGGTGTTTATTTAG
CGGGAAGTCTCGTTAAATGGCGGGTGCATTCAGCTGGTGCAATCTGTCTTTCATACCTTGTTCTATTGCTCTTCAATCCT
TATCATCTCTTTGAGGCCGGTTTTCAGCTGTCATTCGCCGTCAGTTTTTCTTTAATTCTTTCTTCTTCTCTTTTTCTCCA
GCTCAAAACCTCTCTGGGTCAGCTGTCAATGGTATCTCTGATCGCTCAATTGGGTTCACTTCCTATTCTTCTATATCATT
TCCAGCAGTTTTCTATAATCAGCATACCGATGAACATGTTGATGGTTCCATTTTATACATGCGTTATTTTGCCGGGGGCT
GTAGCAGGTGTTTTTCTGTTACTCCTATCCGCTTCGGCTGGCCGATTGTTCTTCAGCTGGTTTGATTTTTTGATGAGCTT
GACCAACAGACTGATTACAAAAATTGCGGAGATTGATGTATTTACAGTTATTATTGCACGTCCCGAACCCGCTCTGCTCT
TTTTATTCACAGTCACAATGATCCTATTGCTTATGGCGATTGAAAAACGTTCCTTCACGCACCTGATGGTAACTGGCGGT
ATGTGCTTCGCGGCGCTTTGTCTTCTCTTTGTTTATCCGCGGCTTAGTTCCGAAGGGGAAGTCGATATGATTGATATTGG
GCAGGGGGACAGCATGTATGTAGGTGCTCCGCATCAGCAAGGGCACGTCTTAATTGATACTGGCGGCACTTTGTCTTACT
CGTCAGAGCCTTGGCGCGAAAAACAACATCCGTTTTCACTGGGGGAAAAGGTGCTGATTCCGTTTTTAACTGCTAAGGGA
ATCAAACAGCTTGATGCTTTGATTTTGACGCATGCTGACCAAGACCACATCGGAGAGGCAGAGACTCTGCTGAGGCATCA
TAAAGTGAAGCGCCTCGTGATTCCGAAAGGGTTCGTTTCTGAACCAAAAGACGAGAAAGTGCTGCGGGCAGCCAGAGAAG
AGGGCGTGCCAATTGAAGAGGTGATAAGAGGCGATGTATTGCAAATAAAGAATTTGCAGTTTCAAGTACTTTCACCGGGC
ACACCTGATCCGGAAAACAAAAATAATTCTTCTCTCGTTTTGTGGATGAAAACGGGCGATATGAGCTGGATATTGACGGG
TGATCTTGAGAAGGAAGGGGAACAGGAGGTAATGAACGTGTTTCCGAATATAACAGCAGACGTGTTAAAGGTGGGTCACC
ATGGAAGCAAAGGCTCTACCGGTGAAGAGTTCATCAAACAGCTCCGGCCAAAAACGGCCGTTATCTCAGCGGGAGAAAAC
AATCGGTACCATCACCCCCATCAAGAAGTTCTGCAAATATTACAGAGACATTCCATCCGCGTGCTGCGCACGGATCAAAG
CGGAACGCTCCAATATAGATATAAAAACAGAGTTGGAACCTTTTCTGTCTATCCCCCATATGATACATCAGATATAACAG
AGACGAACTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6H0WSE2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC Bacillus subtilis subsp. subtilis str. 168

85.825

100

0.858