Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEA/celA/cilE
Type: Machinery gene
Locus tag: GPW48_RS04430
Genome accession: NZ_LR738720
Coordinates: 838005..838667 (+)
Length: 220 a.a.
NCBI ID: WP_024405509.1
Uniprot ID: -
Organism: Streptococcus suis isolate GD-0001
Function: dsDNA binding to the cell surface (predicted from homology); DNA binding and uptake
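The coordinates and lengths in this overview agree with each other if the coordinates are read as 1-based and inclusive and the nucleotide length includes the stop codon. A minimal Python sketch of that arithmetic follows; `parse_location` is a hypothetical helper introduced here for illustration, not something the database provides.

```python
def parse_location(loc):
    """Parse a 'start..end (strand)' string with 1-based inclusive coordinates."""
    span, strand = loc.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_location("838005..838667 (+)")

# Inclusive coordinates: both endpoints count toward the length.
gene_length_bp = end - start + 1

# Assumption: the annotated span ends with a stop codon, which is not translated.
codon_count = gene_length_bp // 3
protein_length = codon_count - 1

print(gene_length_bp, codon_count, protein_length, strand)
```

Under these assumptions the span is 663 bp, which corresponds to 221 codons and a 220-residue protein.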

Genomic Context


Location: 833005..843667
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GPW48_RS04410 - 833722..834651 (-) 930 WP_012775038.1 ABC transporter substrate-binding protein -
  GPW48_RS04415 - 834664..835449 (-) 786 WP_011922762.1 ABC transporter ATP-binding protein -
  GPW48_RS04420 - 835601..837046 (-) 1446 WP_024405510.1 UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase -
  GPW48_RS04425 - 837192..837938 (+) 747 WP_014735985.1 lysophospholipid acyltransferase family protein -
  GPW48_RS04430 comEA/celA/cilE 838005..838667 (+) 663 WP_024405509.1 helix-hairpin-helix domain-containing protein Machinery gene
  GPW48_RS04435 comEC/celB 838651..840888 (+) 2238 WP_024405508.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  GPW48_RS04440 - 840905..841030 (+) 126 Protein_816 IS982 family transposase -
  GPW48_RS04445 - 841213..841869 (+) 657 WP_011922763.1 CBS domain-containing protein -
  GPW48_RS04450 - 841963..842790 (+) 828 WP_011922194.1 hypothetical protein -
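The table above can be checked with simple interval arithmetic. The sketch below, in Python, uses three rows copied from the table and assumes 1-based inclusive coordinates. The function names and the 5000 bp flank are assumptions chosen for illustration; the flank value is inferred from the listed Location and is not documented in this record.

```python
# (locus tag, start, end) copied from the genomic context table.
genes = [
    ("GPW48_RS04425", 837192, 837938),
    ("GPW48_RS04430", 838005, 838667),  # comEA/celA/cilE
    ("GPW48_RS04435", 838651, 840888),  # comEC/celB
]


def size_bp(gene):
    """Length of a feature with inclusive coordinates."""
    _, start, end = gene
    return end - start + 1


def gap_bp(upstream, downstream):
    """Bases between two features; a negative value means they overlap."""
    return downstream[1] - upstream[2] - 1


def window(gene, flank=5000):
    """Region spanning the gene plus a fixed flank on each side (assumed value)."""
    _, start, end = gene
    return start - flank, end + flank


sizes = [size_bp(g) for g in genes]
gaps = [gap_bp(a, b) for a, b in zip(genes, genes[1:])]
print(sizes, gaps, window(genes[1]))
```

With these inputs the comEA and comEC coding spans overlap by 17 bp (gap of -17), and a 5000 bp flank around comEA reproduces the listed Location of 833005..843667.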

Sequence


Protein


Length: 220 a.a.        Molecular weight: 23233.97 Da        Isoelectric Point: 4.1447

>NTDB_id=1129390 GPW48_RS04430 WP_024405509.1 838005..838667(+) (comEA/celA/cilE) [Streptococcus suis isolate GD-0001]
MDTIKTYIEMLKEYKWQIALPTVAGLLMATFLIFSQPAKSDQTGLTDFPQTAQTSSSSDLVEETSTEASEEPSQLVVDVK
GAVVKPGLYTLEAGARVNDAVEAAGGLTSQADPKSINLAQKLSDEAVVYVASKDENISVVASTTASSAMSPEEKSTSLVN
LNTATEADLQTISGIGAKRAADIIAYREANGGFKSVNDLNNVSGIGDKTMESIRPYVTVE

Nucleotide


Length: 663 bp

>NTDB_id=1129390 GPW48_RS04430 WP_024405509.1 838005..838667(+) (comEA/celA/cilE) [Streptococcus suis isolate GD-0001]
ATGGATACGATTAAAACTTATATAGAAATGCTTAAAGAATACAAGTGGCAAATTGCTCTGCCCACAGTGGCTGGCTTGCT
AATGGCGACGTTCTTAATATTCAGTCAACCAGCCAAGTCTGACCAGACAGGACTGACAGATTTTCCGCAGACCGCACAAA
CTTCTAGCAGCTCTGACTTGGTCGAGGAAACCAGTACAGAAGCAAGTGAGGAACCCAGCCAGCTGGTCGTTGATGTCAAA
GGAGCGGTAGTAAAACCAGGGCTCTACACTTTAGAAGCTGGTGCGCGTGTCAATGACGCAGTTGAAGCAGCTGGCGGCTT
GACCAGTCAGGCAGACCCCAAGTCTATCAATCTGGCTCAGAAGCTCAGCGATGAGGCGGTGGTCTATGTAGCCAGCAAGG
ACGAAAACATCTCGGTGGTGGCCAGCACGACTGCCAGCTCTGCTATGTCTCCAGAAGAAAAAAGCACCAGTCTGGTCAAT
CTGAATACGGCGACTGAGGCGGACTTGCAGACCATTTCGGGTATCGGTGCCAAGCGGGCGGCGGACATTATCGCCTATCG
TGAGGCAAACGGTGGCTTCAAGTCGGTGAACGACCTCAACAATGTTTCGGGCATTGGCGACAAGACCATGGAAAGCATTC
GGCCTTATGTCACGGTTGAGTAA
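The relationship between the nucleotide and protein records can be illustrated by translating the coding sequence codon by codon. The sketch below is plain Python and assumes the standard codon assignments (which the bacterial genetic code shares for sense and stop codons). To stay short it translates only the first 30 and last 21 nucleotides of the sequence above; `translate` is a hypothetical helper, and the full sequence would be handled the same way.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_30_nt = "ATGGATACGATTAAAACTTATATAGAAATG"
last_21_nt = "CCTTATGTCACGGTTGAGTAA"  # in frame: 663 - 21 is divisible by 3

print(translate(first_30_nt))
print(translate(last_21_nt))
```

The two fragments translate to the first ten and last six residues of the protein record, with the terminal TAA acting as the stop codon.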

Domains


Predicted by InterProScan.

(157-218)

(76-130)
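The two residue ranges above can be pulled out of the protein sequence by slicing. This Python sketch assumes the ranges are 1-based and inclusive, as is usual for domain annotations. The domain names did not survive in this record, so the segments are labelled only by their coordinates, and `subseq` is a hypothetical helper.

```python
# Protein sequence copied from the Sequence section of this record.
protein = (
    "MDTIKTYIEMLKEYKWQIALPTVAGLLMATFLIFSQPAKSDQTGLTDFPQTAQTSSSSDLVEETSTEASEEPSQLVVDVK"
    "GAVVKPGLYTLEAGARVNDAVEAAGGLTSQADPKSINLAQKLSDEAVVYVASKDENISVVASTTASSAMSPEEKSTSLVN"
    "LNTATEADLQTISGIGAKRAADIIAYREANGGFKSVNDLNNVSGIGDKTMESIRPYVTVE"
)


def subseq(seq, start, end):
    """Return residues start..end, with 1-based inclusive coordinates."""
    return seq[start - 1:end]


domain_ranges = [(157, 218), (76, 130)]
segments = {rng: subseq(protein, *rng) for rng in domain_ranges}

for (start, end), segment in segments.items():
    print(f"{start}-{end} ({len(segment)} residues): {segment}")
```

Converting a 1-based inclusive range to a Python slice only requires subtracting one from the start, since slice ends are already exclusive.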


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/celA/cilE Streptococcus mitis NCTC 12261 48.402 99.545 0.482
  comEA/celA/cilE Streptococcus pneumoniae Rx1 48.402 99.545 0.482
  comEA/celA/cilE Streptococcus pneumoniae D39 48.402 99.545 0.482
  comEA/celA/cilE Streptococcus pneumoniae R6 48.402 99.545 0.482
  comEA/celA/cilE Streptococcus pneumoniae TIGR4 48.624 99.091 0.482
  comEA/celA/cilE Streptococcus mitis SK321 45.946 100 0.464
  comEA Streptococcus thermophilus LMD-9 59.259 73.636 0.436
  comEA Lactococcus lactis subsp. cremoris KW2 36.283 100 0.373
  comEA Bacillus subtilis subsp. subtilis str. 168 37.915 95.909 0.364


Multiple sequence alignment