Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA/celA/cilE   Type   Machinery gene
Locus tag   SK637_RS04295 Genome accession   NZ_CP028415
Coordinates   784618..785268 (+) Length   216 a.a.
NCBI ID   WP_033688715.1    Uniprot ID   -
Organism   Streptococcus mitis strain SK637     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 779618..790268
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SK637_RS04265 (SK637_00833) pyrH 780241..780978 (+) 738 WP_033688711.1 UMP kinase -
  SK637_RS04270 (SK637_00834) frr 780987..781544 (+) 558 WP_001262225.1 ribosome recycling factor -
  SK637_RS04275 (SK637_00835) cvfB 781604..782458 (+) 855 WP_033688712.1 RNA-binding virulence regulatory protein CvfB -
  SK637_RS04280 (SK637_00836) - 782466..782681 (+) 216 WP_001232084.1 YozE family protein -
  SK637_RS04285 (SK637_00837) - 782766..783770 (+) 1005 WP_033688713.1 PhoH family protein -
  SK637_RS04290 (SK637_00838) - 783982..784551 (+) 570 WP_033688714.1 GNAT family N-acetyltransferase -
  SK637_RS04295 (SK637_00839) comEA/celA/cilE 784618..785268 (+) 651 WP_033688715.1 helix-hairpin-helix domain-containing protein Machinery gene
  SK637_RS04300 (SK637_00840) comEC/celB 785252..787492 (+) 2241 WP_033688716.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  SK637_RS10105 - 787634..787858 (+) 225 WP_033688717.1 hypothetical protein -
  SK637_RS04310 (SK637_00841) - 787891..788478 (+) 588 WP_033688718.1 ATP-binding cassette domain-containing protein -
  SK637_RS04315 (SK637_00842) - 788482..789666 (+) 1185 WP_033688719.1 membrane protein -

Sequence


Protein


Download         Length: 216 a.a.        Molecular weight: 23178.61 Da        Isoelectric Point: 6.5093

>NTDB_id=284706 SK637_RS04295 WP_033688715.1 784618..785268(+) (comEA/celA/cilE) [Streptococcus mitis strain SK637]
MEAIIEKIKEYKIIVICTGLGLLVGGFFLLKPAPHIPVKETNLQAEVAAVSKDSLTEKEVKKEEKEEPIEQDLITVDVKG
AVKAPGIYDLPVGSRVNDAVQKAGGLTEQADSKSLNLAQKVSDEALVYVPTKGEEAASQQAGSGAPSSTSKEKKLNLNKA
SLEELKQVKGLGGKRAQDIIDHREANGKFKSVDELKKVSGIGAKTIEKLKDYVTVD

Nucleotide


Download         Length: 651 bp        

>NTDB_id=284706 SK637_RS04295 WP_033688715.1 784618..785268(+) (comEA/celA/cilE) [Streptococcus mitis strain SK637]
ATGGAAGCAATTATCGAGAAAATCAAAGAGTATAAAATCATTGTCATCTGTACTGGTCTAGGCTTGCTTGTAGGAGGATT
TTTCCTGCTAAAGCCAGCTCCACACATACCTGTCAAAGAAACGAATTTGCAGGCAGAAGTTGCAGCTGTTTCCAAGGATT
CATTAACCGAAAAGGAAGTGAAGAAGGAAGAAAAAGAAGAACCTATTGAACAAGATCTAATCACAGTAGATGTGAAAGGA
GCTGTTAAAGCACCAGGAATTTATGATTTGCCGGTAGGTAGCAGAGTCAATGACGCTGTTCAGAAGGCTGGTGGGTTGAC
AGAGCAAGCCGACAGCAAGTCACTCAATCTAGCTCAGAAAGTTAGTGACGAGGCTCTGGTTTACGTTCCCACTAAGGGAG
AAGAAGCAGCTAGTCAACAGGCAGGTTCTGGAGCGCCTTCTTCAACAAGCAAGGAAAAGAAGCTCAATCTCAACAAGGCC
AGTCTGGAAGAACTTAAGCAGGTCAAGGGACTGGGAGGAAAACGAGCTCAGGACATTATTGATCATCGTGAGGCAAATGG
GAAATTCAAGTCGGTCGATGAACTCAAGAAAGTATCTGGCATTGGTGCCAAGACAATAGAAAAGCTAAAAGATTATGTTA
CAGTGGATTAA

Domains


Predicted by InterproScan.

(76-126)

(151-214)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/celA/cilE Streptococcus pneumoniae TIGR4

94.907

100

0.949

  comEA/celA/cilE Streptococcus mitis NCTC 12261

94.907

100

0.949

  comEA/celA/cilE Streptococcus mitis SK321

93.519

100

0.935

  comEA/celA/cilE Streptococcus pneumoniae Rx1

92.593

100

0.926

  comEA/celA/cilE Streptococcus pneumoniae D39

92.593

100

0.926

  comEA/celA/cilE Streptococcus pneumoniae R6

92.593

100

0.926

  comEA Lactococcus lactis subsp. cremoris KW2

40.969

100

0.431

  comEA Latilactobacillus sakei subsp. sakei 23K

33.476

100

0.361


Multiple sequence alignment