Detailed information    

in silico (bioinformatically predicted)

Overview


Name   comEA/celA/cilE
Type   Machinery gene
Locus tag   PUW62_RS05770
Genome accession   NZ_CP118046
Coordinates   1162148..1162852 (+)
Length   234 a.a.
NCBI ID   WP_024051601.1
Uniprot ID   A0A412PQ70
Organism   Streptococcus anginosus strain VSI52
Function   dsDNA binding to the cell surface (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 1157148..1167852
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PUW62_RS05745 (PUW62_05745) - 1157493..1158698 (+) 1206 Protein_1094 multidrug efflux MFS transporter -
  PUW62_RS05750 (PUW62_05750) rpmG 1158682..1158831 (+) 150 WP_003024746.1 50S ribosomal protein L33 -
  PUW62_RS05755 (PUW62_05755) secG 1158871..1159104 (+) 234 WP_003024744.1 preprotein translocase subunit SecG -
  PUW62_RS05760 (PUW62_05760) rnr 1159196..1161535 (+) 2340 WP_024051603.1 ribonuclease R -
  PUW62_RS05765 (PUW62_05765) smpB 1161498..1161965 (+) 468 WP_024051602.1 SsrA-binding protein SmpB -
  PUW62_RS05770 (PUW62_05770) comEA/celA/cilE 1162148..1162852 (+) 705 WP_024051601.1 helix-hairpin-helix domain-containing protein Machinery gene
  PUW62_RS05775 (PUW62_05775) comEC/celB 1162836..1165064 (+) 2229 WP_274996652.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  PUW62_RS05780 (PUW62_05780) - 1165146..1165706 (+) 561 WP_224783657.1 DUF805 domain-containing protein -
  PUW62_RS05785 (PUW62_05785) - 1166336..1167322 (+) 987 Protein_1102 Gfo/Idh/MocA family oxidoreductase -
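The coordinates in this table can be cross-checked with simple interval arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and it assumes the Location window is the gene span extended by 5000 bp on each side (a value inferred from the numbers above, not stated by the source). The names `GENES`, `FLANK`, `span_length`, and `overlap` are hypothetical helpers.

```python
# Coordinates copied from the table above (assumed 1-based, inclusive).
GENES = {
    "smpB": (1161498, 1161965),
    "comEA/celA/cilE": (1162148, 1162852),
    "comEC/celB": (1162836, 1165064),
}
FLANK = 5000  # assumption: flank size inferred from the Location line


def span_length(start, end):
    """Number of bases in an inclusive interval."""
    return end - start + 1


def overlap(a, b):
    """Number of bases shared by two inclusive intervals (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


comea = GENES["comEA/celA/cilE"]
gene_bp = span_length(*comea)            # size of the coding region in bp
protein_aa = gene_bp // 3 - 1            # codons minus the stop codon
window = (comea[0] - FLANK, comea[1] + FLANK)
shared_bp = overlap(comea, GENES["comEC/celB"])
gap_bp = comea[0] - GENES["smpB"][1] - 1  # intergenic bases upstream of comEA

print(gene_bp, protein_aa, window, shared_bp, gap_bp)
```

Under these assumptions the gene size, protein length, and Location window agree with the values listed in this record, and the comEA and comEC/celB reading frames overlap by a few bases on the same strand.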

Sequence


Protein


Length: 234 a.a.        Molecular weight: 24966.91 Da        Isoelectric Point: 4.7586

>NTDB_id=789479 PUW62_RS05770 WP_024051601.1 1162148..1162852(+) (comEA/celA/cilE) [Streptococcus anginosus strain VSI52]
MLETIIEKMKEYKILIGLSLIGLIIAGFFMINVQSSRQSNVAELAQETVTSSGAESEEISTGTKKNSQKEKAEPQTSSSE
ESEFLTVDVKGAVKNPGIYQLKKTSRINDAIQKAGGLTTDADSKSINLAQKLTDEAVVYVATMGENAASVSSNTGQSSTS
GTSEVASQKGNKVNLNTADLSELQTISGIGQKRAQDILDYREANGKFNSVDDLKNVSGVGAKTLEKLKEYVTVD

Nucleotide


Length: 705 bp

>NTDB_id=789479 PUW62_RS05770 WP_024051601.1 1162148..1162852(+) (comEA/celA/cilE) [Streptococcus anginosus strain VSI52]
ATGTTAGAGACAATCATTGAAAAGATGAAAGAGTATAAAATTTTGATTGGTTTAAGTTTGATTGGTTTGATAATAGCAGG
ATTTTTTATGATAAATGTTCAATCTAGTAGACAATCAAATGTAGCAGAGCTCGCACAGGAAACAGTTACGAGTTCGGGGG
CAGAATCAGAAGAAATTTCAACTGGAACGAAGAAAAACTCACAGAAAGAAAAAGCAGAGCCTCAAACGAGTTCAAGTGAA
GAATCAGAATTTTTAACCGTAGATGTCAAGGGTGCAGTCAAAAATCCAGGTATTTATCAACTGAAAAAGACTAGCCGTAT
CAATGATGCGATTCAAAAAGCAGGTGGTTTAACGACAGACGCTGACAGCAAGTCTATCAATTTGGCGCAGAAACTAACGG
ATGAAGCTGTTGTTTATGTGGCAACTATGGGTGAGAATGCGGCGAGTGTTTCAAGTAATACAGGACAATCTTCAACGTCA
GGAACTAGTGAAGTCGCATCGCAAAAAGGAAATAAGGTCAATCTAAATACAGCTGATTTATCTGAGCTGCAGACCATTTC
TGGTATTGGTCAAAAACGTGCGCAAGATATTTTAGATTATCGTGAGGCAAACGGGAAATTTAATTCCGTTGATGACTTGA
AAAATGTATCAGGAGTAGGCGCTAAAACACTAGAAAAATTGAAAGAATATGTCACAGTGGATTAA
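The nucleotide and protein records above can be compared by translating the coding sequence. The following is a minimal sketch using only the standard library; it assumes the standard genetic code for the amino-acid assignments, and `translate`, `first_codons`, and `last_codons` are hypothetical names. Only the first 21 and last 18 bases of the sequence above are used, to keep the example short.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order,
# with the third base varying fastest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string codon by codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Fragments copied from the nucleotide record above.
first_codons = "ATGTTAGAGACAATCATTGAA"   # first 21 bases
last_codons = "TATGTCACAGTGGATTAA"       # last 18 bases, including the stop

print(translate(first_codons))
print(translate(last_codons))
```

The two translated fragments can be compared with the start and end of the protein record; the trailing `*` corresponds to the stop codon, which is why 705 bp encode 234 amino acids.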

Domains


Predicted by InterproScan.

(87-140)

(169-232)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A412PQ70

Transmembrane helices


Transmembrane helices were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comEA/celA/cilE   Streptococcus pneumoniae D39   53.219   99.573   0.53
  comEA/celA/cilE   Streptococcus pneumoniae TIGR4   53.219   99.573   0.53
  comEA/celA/cilE   Streptococcus pneumoniae R6   53.219   99.573   0.53
  comEA/celA/cilE   Streptococcus pneumoniae Rx1   53.219   99.573   0.53
  comEA/celA/cilE   Streptococcus mitis NCTC 12261   51.502   99.573   0.513
  comEA/celA/cilE   Streptococcus mitis SK321   50.215   99.573   0.5
  comEA   Streptococcus thermophilus LMD-9   44.255   100   0.444
  comEA   Lactococcus lactis subsp. cremoris KW2   42.06   99.573   0.419
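This record does not define the Ha-value. The listed values are numerically close to the product of the identity and coverage fractions, so the sketch below checks that relationship as a working assumption rather than as the database's documented formula. The name `ROWS` and the 0.002 tolerance are choices made for this illustration; the query length of 234 a.a. is taken from the Overview.

```python
# (protein, organism, identity %, coverage %, listed Ha-value), one tuple per
# distinct row of the table above (the four S. pneumoniae rows are identical).
ROWS = [
    ("comEA/celA/cilE", "Streptococcus pneumoniae D39", 53.219, 99.573, 0.53),
    ("comEA/celA/cilE", "Streptococcus mitis NCTC 12261", 51.502, 99.573, 0.513),
    ("comEA/celA/cilE", "Streptococcus mitis SK321", 50.215, 99.573, 0.5),
    ("comEA", "Streptococcus thermophilus LMD-9", 44.255, 100.0, 0.444),
    ("comEA", "Lactococcus lactis subsp. cremoris KW2", 42.06, 99.573, 0.419),
]
QUERY_LENGTH = 234  # a.a., from the Overview

# Assumption: Ha-value is approximately identity fraction * coverage fraction.
estimates = [
    (organism, identity / 100 * coverage / 100, listed)
    for _, organism, identity, coverage, listed in ROWS
]
max_deviation = max(abs(est - listed) for _, est, listed in estimates)

# Coverage of 99.573% of the query corresponds to this many aligned residues.
aligned_residues = round(99.573 / 100 * QUERY_LENGTH)

for organism, est, listed in estimates:
    print(f"{organism}: estimated {est:.4f}, listed {listed}")
print("largest deviation:", round(max_deviation, 4))
print("aligned residues at 99.573% coverage:", aligned_residues)
```

The estimate does not reproduce every listed value exactly (the Streptococcus thermophilus row deviates most), which is one reason to treat the formula as an approximation inferred from the numbers rather than a definition.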