Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA/celA/cilE   Type   Machinery gene
Locus tag   SAIN_RS04465 Genome accession   NC_022244
Coordinates   903562..904266 (+) Length   234 a.a.
NCBI ID   WP_003024731.1    Uniprot ID   A0AAP6BPY6
Organism   Streptococcus anginosus C1051     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 902080..903431 903562..904266 flank 131


Gene organization within MGE regions


Location: 902080..904266
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SAIN_RS04465 (SAIN_0867) comEA/celA/cilE 903562..904266 (+) 705 WP_003024731.1 helix-hairpin-helix domain-containing protein Machinery gene

Sequence


Protein


Download         Length: 234 a.a.        Molecular weight: 24996.90 Da        Isoelectric Point: 4.6812

>NTDB_id=61766 SAIN_RS04465 WP_003024731.1 903562..904266(+) (comEA/celA/cilE) [Streptococcus anginosus C1051]
MLETIIEKMKEYKILIGLSLIGLIIAGFFMINGQSSRQSNVAELAQETVTSSEAESEEISTGTKKNSQKEKAEPQTSSSE
ESEFLTVDVKGAVKNPGIYQLKKTSRINDAIQKAGGLTTDADSKSINLAQKLTDEAVVYVATMGENAASVSSNTGQSSTS
GTSEVASQKGNKVNLNTADLSELQTISGIGQKRAQDILDYREANGKFNSVDDLKNVSGVGAKTLEKLKEYVTVD

Nucleotide


Download         Length: 705 bp        

>NTDB_id=61766 SAIN_RS04465 WP_003024731.1 903562..904266(+) (comEA/celA/cilE) [Streptococcus anginosus C1051]
ATGTTAGAGACAATCATTGAAAAGATGAAAGAGTATAAAATTTTGATTGGTTTAAGTTTGATTGGTTTGATAATAGCAGG
ATTTTTTATGATAAATGGTCAATCTAGTAGACAATCAAATGTAGCAGAGCTCGCACAGGAAACAGTTACGAGTTCGGAGG
CAGAATCAGAAGAAATTTCAACTGGAACGAAGAAAAACTCACAGAAAGAAAAAGCAGAGCCTCAAACGAGTTCCAGTGAA
GAATCAGAATTTTTAACCGTAGATGTCAAGGGTGCAGTCAAAAATCCAGGTATTTATCAACTGAAAAAGACTAGCCGTAT
CAATGATGCGATTCAAAAAGCAGGTGGTTTAACGACAGACGCTGACAGCAAGTCTATCAATTTGGCGCAGAAACTAACGG
ATGAAGCTGTTGTTTATGTGGCAACTATGGGTGAGAATGCGGCGAGTGTTTCAAGCAATACAGGACAATCTTCAACATCA
GGAACTAGTGAAGTCGCATCGCAAAAAGGGAATAAGGTCAATCTAAATACAGCTGATTTATCTGAGTTGCAGACCATTTC
TGGTATTGGTCAAAAACGCGCGCAAGATATTTTAGATTATCGGGAGGCAAACGGGAAATTTAATTCCGTTGATGACCTGA
AAAATGTATCAGGAGTAGGTGCTAAAACACTAGAGAAATTGAAAGAATATGTCACAGTGGATTAA

Domains


Predicted by InterproScan.

(87-140)

(169-232)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/celA/cilE Streptococcus pneumoniae R6

53.219

99.573

0.53

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

53.219

99.573

0.53

  comEA/celA/cilE Streptococcus pneumoniae Rx1

53.219

99.573

0.53

  comEA/celA/cilE Streptococcus pneumoniae D39

53.219

99.573

0.53

  comEA/celA/cilE Streptococcus mitis SK321

52.361

99.573

0.521

  comEA/celA/cilE Streptococcus mitis NCTC 12261

51.502

99.573

0.513

  comEA Streptococcus thermophilus LMD-9

44.255

100

0.444

  comEA Lactococcus lactis subsp. cremoris KW2

42.06

99.573

0.419


Multiple sequence alignment