Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   DQM60_RS08020 Genome accession   NZ_LS483366
Coordinates   1733236..1733934 (-) Length   232 a.a.
NCBI ID   WP_111686659.1    Uniprot ID   -
Organism   Streptococcus salivarius strain NCTC7366     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1728236..1738934
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQM60_RS08005 (NCTC7366_01613) - 1729047..1729592 (-) 546 WP_084870984.1 GNAT family N-acetyltransferase -
  DQM60_RS08010 (NCTC7366_01614) - 1729658..1730920 (-) 1263 WP_013991032.1 UDP-N-acetylglucosamine 1-carboxyvinyltransferase -
  DQM60_RS08015 (NCTC7366_01615) comEC/celB 1731006..1733246 (-) 2241 WP_084870985.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  DQM60_RS08020 (NCTC7366_01616) comEA 1733236..1733934 (-) 699 WP_111686659.1 helix-hairpin-helix domain-containing protein Machinery gene
  DQM60_RS08025 (NCTC7366_01617) - 1734042..1734797 (-) 756 WP_013991035.1 lysophospholipid acyltransferase family protein -
  DQM60_RS08030 (NCTC7366_01618) - 1734927..1735862 (+) 936 WP_045002039.1 polysaccharide deacetylase family protein -
  DQM60_RS08035 (NCTC7366_01619) - 1735912..1736676 (+) 765 WP_013991037.1 tRNA1(Val) (adenine(37)-N6)-methyltransferase -
  DQM60_RS08040 (NCTC7366_01620) - 1736680..1736949 (+) 270 WP_084870988.1 GIY-YIG nuclease family protein -
  DQM60_RS08045 (NCTC7366_01621) - 1737041..1737427 (-) 387 WP_002886743.1 IS110 family transposase -
  DQM60_RS08050 (NCTC7366_01622) - 1737447..1738208 (-) 762 WP_002886742.1 IS110 family transposase -

Sequence


Protein


Download         Length: 232 a.a.        Molecular weight: 24489.33 Da        Isoelectric Point: 4.4904

>NTDB_id=1138791 DQM60_RS08020 WP_111686659.1 1733236..1733934(-) (comEA) [Streptococcus salivarius strain NCTC7366]
MKEKILAYVKDNCLFVSVIAVLMVIFCFFLWMTYCAGNSMEAETSYTDVTALSTSSSSKQSSQSLSEASSQSKTEGSEKD
ESKVTVDVKGAVANPGVYTLKASARVTDAIKAAGGMTEDADAKSVNLAASLSDEEVIYVATKDENLSVLGQSGTGQVSDK
GGQTSAKDGKINLNTATSEELQTISGIGAKRAEDIIAYRESHGGFQSVDDLKNVSGIGDKTLDKIRESLYVA

Nucleotide


Download         Length: 699 bp        

>NTDB_id=1138791 DQM60_RS08020 WP_111686659.1 1733236..1733934(-) (comEA) [Streptococcus salivarius strain NCTC7366]
GTGAAGGAAAAGATTCTAGCCTATGTCAAAGATAATTGTCTGTTTGTGAGTGTTATTGCTGTACTGATGGTGATTTTTTG
CTTCTTCCTATGGATGACTTACTGTGCCGGCAACAGCATGGAGGCGGAGACGTCTTATACAGATGTGACAGCTTTGTCAA
CCTCCTCCTCCTCCAAACAAAGCTCACAGTCTCTTTCTGAGGCGTCTTCCCAGTCAAAGACTGAAGGAAGTGAAAAGGAT
GAGTCAAAAGTAACGGTAGATGTTAAGGGGGCTGTGGCTAATCCGGGTGTTTATACCTTAAAAGCAAGCGCTAGGGTGAC
TGATGCCATCAAAGCCGCTGGGGGAATGACTGAGGATGCGGATGCTAAGAGTGTTAACTTAGCTGCAAGTCTGTCGGACG
AAGAGGTTATCTATGTGGCAACTAAGGATGAAAACCTCTCTGTTCTTGGTCAATCAGGAACTGGTCAGGTCTCTGACAAA
GGAGGGCAAACTAGTGCTAAGGATGGCAAAATTAACTTAAATACAGCAACCTCAGAGGAGTTGCAAACTATTTCTGGAAT
TGGAGCTAAGCGGGCAGAGGATATCATTGCCTATCGTGAAAGTCATGGAGGCTTTCAATCCGTAGATGACTTGAAAAATG
TCTCAGGAATTGGTGATAAAACTTTAGATAAAATCAGAGAGTCCCTCTATGTGGCTTAA

Domains


Predicted by InterproScan.

(169-229)

(86-139)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Streptococcus thermophilus LMD-9

88.793

100

0.888

  comEA/celA/cilE Streptococcus mitis SK321

40

99.138

0.397

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

39.565

99.138

0.392


Multiple sequence alignment