Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA/celA/cilE   Type   Machinery gene
Locus tag   SMI_RS04820 Genome accession   NC_013853
Coordinates   907845..908474 (+) Length   209 a.a.
NCBI ID   WP_000387350.1    Uniprot ID   D3H8K7
Organism   Streptococcus mitis B6     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 902845..913474
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SMI_RS04790 (smi_0949) pyrH 903592..904329 (+) 738 WP_000002996.1 UMP kinase -
  SMI_RS04795 (smi_0950) frr 904338..904895 (+) 558 WP_001262225.1 ribosome recycling factor -
  SMI_RS04800 (smi_0951) cvfB 904955..905809 (+) 855 WP_001095453.1 RNA-binding virulence regulatory protein CvfB -
  SMI_RS04805 (smi_0952) - 905818..906033 (+) 216 WP_001232084.1 YozE family protein -
  SMI_RS04810 (smi_0953) - 906118..907122 (+) 1005 WP_000658206.1 PhoH family protein -
  SMI_RS04815 (smi_0954) - 907209..907778 (+) 570 WP_000443760.1 GNAT family N-acetyltransferase -
  SMI_RS04820 (smi_0955) comEA/celA/cilE 907845..908474 (+) 630 WP_000387350.1 helix-hairpin-helix domain-containing protein Machinery gene
  SMI_RS04825 (smi_0956) comEC/celB 908458..910698 (+) 2241 WP_000942437.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  SMI_RS10765 - 910840..911064 (+) 225 WP_231840218.1 hypothetical protein -
  SMI_RS04830 (smi_0957) - 911097..911684 (+) 588 WP_000939892.1 ATP-binding cassette domain-containing protein -
  SMI_RS04835 (smi_0958) - 911688..912872 (+) 1185 WP_000655831.1 hypothetical protein -

Sequence


Protein


Download         Length: 209 a.a.        Molecular weight: 22346.59 Da        Isoelectric Point: 6.0485

>NTDB_id=36315 SMI_RS04820 WP_000387350.1 907845..908474(+) (comEA/celA/cilE) [Streptococcus mitis B6]
MEAIIEKIKEYKIIVICTGLGLLVGGFFLLKPAPQTPVKETNLQAEVVAVSKDSLTEKEEPLEQDLITVDVKGAVKSPGI
YDLPVGSRVNDAVQKAGGLTEQADSKSLNLAQKVSDEALVYVPTKGEESASQQAGSGAPSSTSKEKKINLNKASLEELKQ
VKGLGGKRAQDIIDHREANGKFKSVDELKKVSGIGAKTIEKLKDYVTVD

Nucleotide


Download         Length: 630 bp        

>NTDB_id=36315 SMI_RS04820 WP_000387350.1 907845..908474(+) (comEA/celA/cilE) [Streptococcus mitis B6]
ATGGAAGCAATTATCGAGAAAATCAAAGAGTATAAAATCATTGTCATCTGTACTGGTCTGGGCTTGCTTGTAGGAGGATT
TTTCCTGCTAAAACCAGCTCCACAAACACCTGTCAAGGAAACGAATTTGCAGGCAGAAGTTGTAGCAGTTTCCAAGGATT
CATTAACCGAAAAGGAAGAACCCCTTGAACAAGATCTAATCACGGTAGATGTAAAAGGTGCTGTCAAATCGCCAGGAATT
TATGATTTGCCGGTAGGTAGTCGAGTTAATGACGCTGTTCAGAAGGCTGGTGGCTTGACAGAGCAAGCAGACAGCAAGTC
GCTCAATCTAGCTCAGAAAGTTAGTGATGAGGCTCTGGTTTACGTTCCAACTAAGGGAGAAGAATCAGCTAGTCAACAGG
CAGGTTCTGGAGCGCCTTCTTCAACGAGCAAGGAAAAGAAGATCAACCTCAATAAGGCCAGTCTGGAAGAACTCAAGCAA
GTCAAGGGACTGGGAGGAAAACGAGCTCAAGACATTATTGACCATCGTGAGGCAAATGGGAAATTCAAGTCGGTTGATGA
ACTCAAGAAAGTCTCTGGAATTGGTGCCAAGACCATAGAAAAGTTAAAAGATTATGTTACAGTGGATTAA

Domains


Predicted by InterproScan.

(69-119)

(144-207)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB D3H8K7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/celA/cilE Streptococcus mitis NCTC 12261

93.519

100

0.967

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

92.13

100

0.952

  comEA/celA/cilE Streptococcus mitis SK321

91.204

100

0.943

  comEA/celA/cilE Streptococcus pneumoniae D39

90.278

100

0.933

  comEA/celA/cilE Streptococcus pneumoniae Rx1

90.278

100

0.933

  comEA/celA/cilE Streptococcus pneumoniae R6

90.278

100

0.933

  comEA Lactococcus lactis subsp. cremoris KW2

41.284

100

0.431

  comEA Latilactobacillus sakei subsp. sakei 23K

34.052

100

0.378

  comEA Streptococcus thermophilus LMD-9

51.351

70.813

0.364


Multiple sequence alignment