Detailed information    

In silico: bioinformatically predicted

Overview


Name   comE1/comEA
Type   Machinery gene
Locus tag   G3M63_RS13920
Genome accession   NZ_CP048629
Coordinates   2991365..2991655 (+)
Length   96 a.a.
NCBI ID   WP_150301204.1
UniProt ID   -
Organism   Pseudomonas sp. OIL-1
Function   dsDNA binding (predicted from homology); DNA binding and uptake
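The coordinates and the protein length above are related by simple arithmetic, sketched below. The sketch assumes the coordinates are 1-based and inclusive (the usual convention for GenBank-style feature locations) and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Minimal sketch: relate genomic coordinates to CDS and protein length.
# Assumption: coordinates are 1-based and inclusive, and the CDS ends
# with one stop codon that is not counted in the protein length.

def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end (inclusive)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Number of amino acids encoded by a CDS of cds_bp base pairs."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


cds_bp = feature_length(2991365, 2991655)
print(cds_bp, "bp ->", protein_length(cds_bp), "a.a.")  # 291 bp -> 96 a.a.
```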

Genomic Context


Location: 2986365..2996655
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  G3M63_RS13895 (G3M63_13895)   -   2987230..2987868 (+)   639   WP_163983524.1   SPOR domain-containing protein   -
  G3M63_RS13900 (G3M63_13900)   -   2987931..2988413 (+)   483   WP_150301201.1   CvpA family protein   -
  G3M63_RS13910 (G3M63_13910)   purF   2988465..2989976 (+)   1512   WP_150301202.1   amidophosphoribosyltransferase   -
  G3M63_RS13915 (G3M63_13915)   -   2989984..2991195 (+)   1212   WP_150301203.1   O-succinylhomoserine sulfhydrylase   -
  G3M63_RS13920 (G3M63_13920)   comE1/comEA   2991365..2991655 (+)   291   WP_150301204.1   helix-hairpin-helix domain-containing protein   Machinery gene
  G3M63_RS19760   -   2991724..2992269 (-)   546   WP_371923588.1   methyl-accepting chemotaxis protein   -
  G3M63_RS19765   -   2992588..2993172 (-)   585   Protein_2757   HAMP domain-containing protein   -
  G3M63_RS13930 (G3M63_13930)   rnd   2993647..2994783 (+)   1137   WP_223825051.1   ribonuclease D   -
  G3M63_RS13935 (G3M63_13935)   -   2994786..2995079 (+)   294   WP_150301206.1   YcgL domain-containing protein   -
  G3M63_RS13940 (G3M63_13940)   -   2995111..2996334 (-)   1224   WP_163983526.1   pyridoxal-dependent decarboxylase, exosortase A system-associated   -
  G3M63_RS13945 (G3M63_13945)   -   2996354..2996584 (-)   231   WP_150301208.1   DUF3820 family protein   -
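The displayed location appears to be the gene extended by a fixed flank on each side. The sketch below reproduces the window from the gene coordinates and computes the intergenic gaps to the two neighbouring genes. The 5000 bp flank is an assumption inferred from the numbers in this record, not a documented parameter, and the names are illustrative.

```python
# Minimal sketch: derive the displayed context window and neighbour gaps.
# Assumption: the window is the gene plus FLANK_BP on each side; the value
# 5000 is inferred from the coordinates in this record.

FLANK_BP = 5000
GENE = (2991365, 2991655)            # comE1/comEA, 1-based inclusive
UPSTREAM_GENE = (2989984, 2991195)   # G3M63_RS13915
DOWNSTREAM_GENE = (2991724, 2992269) # G3M63_RS19760


def context_window(gene, flank=FLANK_BP):
    """Return (start, end) of the gene extended by flank bp on both sides."""
    start, end = gene
    return max(1, start - flank), end + flank


def gap_bp(left, right):
    """Number of bases strictly between two non-overlapping features."""
    return right[0] - left[1] - 1


window = context_window(GENE)
upstream_gap = gap_bp(UPSTREAM_GENE, GENE)
downstream_gap = gap_bp(GENE, DOWNSTREAM_GENE)
print(window, upstream_gap, downstream_gap)
```

The window matches the Location line above, and the gaps give a quick sense of how tightly the gene is packed against its neighbours.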

Sequence


Protein


Length: 96 a.a.   Molecular weight: 9809.21 Da   Isoelectric point: 4.5822

>NTDB_id=421976 G3M63_RS13920 WP_150301204.1 2991365..2991655(+) (comE1/comEA) [Pseudomonas sp. OIL-1]
MKAIYLAVLFVLSSWFALPVVAETSAAPASVISVNINNASAAEIAETLQGVGVAKADAIVAYREANGSFASVESLSEVSG
IGAATVEKNRARIALQ

Nucleotide


Length: 291 bp

>NTDB_id=421976 G3M63_RS13920 WP_150301204.1 2991365..2991655(+) (comE1/comEA) [Pseudomonas sp. OIL-1]
ATGAAAGCCATATATCTTGCTGTTCTGTTTGTCCTGTCGTCCTGGTTCGCGCTCCCGGTTGTTGCCGAGACATCCGCGGC
CCCGGCATCGGTTATCTCGGTCAACATCAATAACGCCAGCGCCGCGGAAATCGCCGAAACCCTGCAGGGTGTCGGTGTTG
CCAAGGCGGATGCCATTGTTGCCTACCGCGAAGCCAATGGTAGTTTTGCTTCCGTCGAGTCCCTGTCCGAAGTATCGGGT
ATTGGTGCGGCTACCGTCGAAAAAAACCGTGCCCGTATTGCTCTGCAGTAA
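As a consistency check, the nucleotide record can be translated and compared with the protein record. The sketch below uses a hand-built standard codon table rather than any particular library; the standard table and the bacterial table 11 assign the same amino acids to every codon, so either suits this check. The helper name `translate_cds` is illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS and return the protein without the stop."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues[-1] != "*" or "*" in residues[:-1]:
        raise ValueError("expected exactly one stop codon, at the 3' end")
    return "".join(residues[:-1])


# Sequences copied from the records above.
NUCLEOTIDE = (
    "ATGAAAGCCATATATCTTGCTGTTCTGTTTGTCCTGTCGTCCTGGTTCGCGCTCCCGGTTGTTGCCGAGACATCCGCGGC"
    "CCCGGCATCGGTTATCTCGGTCAACATCAATAACGCCAGCGCCGCGGAAATCGCCGAAACCCTGCAGGGTGTCGGTGTTG"
    "CCAAGGCGGATGCCATTGTTGCCTACCGCGAAGCCAATGGTAGTTTTGCTTCCGTCGAGTCCCTGTCCGAAGTATCGGGT"
    "ATTGGTGCGGCTACCGTCGAAAAAAACCGTGCCCGTATTGCTCTGCAGTAA"
)
PROTEIN = (
    "MKAIYLAVLFVLSSWFALPVVAETSAAPASVISVNINNASAAEIAETLQGVGVAKADAIVAYREANGSFASVESLSEVSG"
    "IGAATVEKNRARIALQ"
)

translated = translate_cds(NUCLEOTIDE)
print(len(NUCLEOTIDE), len(translated), translated == PROTEIN)
```

Running it confirms that the 291 bp sequence encodes the 96-residue protein listed above.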

Domains


Predicted by InterProScan.

Domain position: residues 33-93


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comE1/comEA   Haemophilus influenzae Rd KW20   44.545   100   0.51
  comEA   Vibrio cholerae C6706   48.454   100   0.49
  comEA   Vibrio cholerae strain A1552   48.454   100   0.49
  comEA/comE1   Glaesserella parasuis strain SC1401   52.273   91.667   0.479
  comEA   Vibrio campbellii strain DS40M4   41.584   100   0.437
  comE   Neisseria gonorrhoeae MS11   39.773   91.667   0.365
  comE   Neisseria gonorrhoeae MS11   39.773   91.667   0.365
  comE   Neisseria gonorrhoeae MS11   39.773   91.667   0.365
  comE   Neisseria gonorrhoeae MS11   39.773   91.667   0.365
  comEA   Legionella pneumophila strain ERS1305867   34.653   100   0.365
  comEA   Legionella pneumophila str. Paris   34.653   100   0.365


Multiple sequence alignment