Detailed information    

in silico (bioinformatically predicted)

Overview


Name: comE1/comEA
Type: Machinery gene
Locus tag: MRK42_RS03855
Genome accession: NZ_CP095328
Coordinates: 811541..811861 (-)
Length: 106 a.a.
NCBI ID: WP_026455883.1
UniProt ID: A0AAU6TAA5
Organism: Aeromonas sp. 19NY04SH05-1
Function: dsDNA binding (predicted from homology); DNA binding and uptake
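
The coordinates and the protein length in this record are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the convention that reproduces the 321 bp reported under Sequence) and that the coding sequence ends with a single stop codon. The function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


gene_bp = span_length(811541, 811861)
print(gene_bp, protein_length(gene_bp))  # 321 106
```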

Genomic Context


Location: 806541..816861
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  MRK42_RS03835 (MRK42_03835) - 806872..807558 (-) 687 WP_042070154.1 DnaT-like ssDNA-binding domain-containing protein -
  MRK42_RS03840 (MRK42_03840) - 807878..808849 (+) 972 WP_042070153.1 response regulator -
  MRK42_RS03845 (MRK42_03845) - 808923..810320 (-) 1398 WP_042028771.1 peptide MFS transporter -
  MRK42_RS03850 (MRK42_03850) galU 810534..811445 (-) 912 WP_026455884.1 UTP--glucose-1-phosphate uridylyltransferase GalU -
  MRK42_RS03855 (MRK42_03855) comE1/comEA 811541..811861 (-) 321 WP_026455883.1 ComEA family DNA-binding protein Machinery gene
  MRK42_RS03860 (MRK42_03860) pta 812236..814407 (-) 2172 WP_338611771.1 phosphate acetyltransferase -
  MRK42_RS03865 (MRK42_03865) - 814460..815662 (-) 1203 WP_026455881.1 acetate kinase -
  MRK42_RS03870 (MRK42_03870) yfbV 815968..816408 (+) 441 WP_026455880.1 terminus macrodomain insulation protein YfbV -
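
The Location window appears to extend 5,000 bp on either side of the gene (806541 = 811541 - 5000 and 816861 = 811861 + 5000), and each Size (bp) value equals end - start + 1. The sketch below re-derives both from the table. The `FLANK` value is inferred from these numbers rather than taken from any documented setting, and the helper names are hypothetical.

```python
import re

FLANK = 5000  # assumption: inferred from the record, not a documented parameter

gene_start, gene_end = 811541, 811861
window = (gene_start - FLANK, gene_end + FLANK)

# (coordinates, reported size in bp) copied from the table above
rows = [
    ("806872..807558", 687),
    ("807878..808849", 972),
    ("808923..810320", 1398),
    ("810534..811445", 912),
    ("811541..811861", 321),
    ("812236..814407", 2172),
    ("814460..815662", 1203),
    ("815968..816408", 441),
]


def parse_range(text):
    """Split 'start..end' into a pair of integers."""
    start, end = re.fullmatch(r"(\d+)\.\.(\d+)", text).groups()
    return int(start), int(end)


def size_matches(coords, size):
    """True when the reported size equals the inclusive span length."""
    start, end = parse_range(coords)
    return end - start + 1 == size


mismatches = [coords for coords, size in rows if not size_matches(coords, size)]
print(window)      # (806541, 816861)
print(mismatches)  # []
```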

Sequence


Protein


Length: 106 a.a.        Molecular weight: 11456.27 Da        Isoelectric Point: 9.6917

>NTDB_id=676353 MRK42_RS03855 WP_026455883.1 811541..811861(-) (comE1/comEA) [Aeromonas sp. 19NY04SH05-1]
MQIHKFAAVMLLSTLPLFSQPSLAAEQTQTKQTTTTTAAKESGKLNLNTATLAELTTLKGIGEKKAQAILEYREKQGKFT
SVDQLADVTGIGPATLEANRDMMMVK

Nucleotide


Length: 321 bp

>NTDB_id=676353 MRK42_RS03855 WP_026455883.1 811541..811861(-) (comE1/comEA) [Aeromonas sp. 19NY04SH05-1]
ATGCAGATACACAAGTTCGCCGCCGTGATGTTGCTCTCAACCCTGCCCCTGTTCAGCCAACCCTCGCTGGCAGCCGAACA
GACGCAGACCAAGCAAACCACTACCACAACCGCCGCCAAGGAGAGTGGCAAACTCAACCTCAATACGGCCACCCTGGCCG
AGCTCACTACCCTCAAGGGTATTGGCGAGAAGAAGGCTCAGGCCATCCTCGAATATCGTGAGAAACAGGGGAAATTTACC
TCTGTAGATCAATTGGCAGATGTCACGGGGATCGGCCCGGCCACGCTGGAAGCCAACAGGGATATGATGATGGTCAAATA
A
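
The nucleotide and protein records can be cross-checked by translating the coding sequence. The following is a minimal sketch using only the Python standard library and the standard genetic code (NCBI translation table 1). The sequence shown is already the coding strand as listed above, so no reverse complement is applied. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Sequences copied from the FASTA records above
CDS = (
    "ATGCAGATACACAAGTTCGCCGCCGTGATGTTGCTCTCAACCCTGCCCCTGTTCAGCCAACCCTCGCTGGCAGCCGAACA"
    "GACGCAGACCAAGCAAACCACTACCACAACCGCCGCCAAGGAGAGTGGCAAACTCAACCTCAATACGGCCACCCTGGCCG"
    "AGCTCACTACCCTCAAGGGTATTGGCGAGAAGAAGGCTCAGGCCATCCTCGAATATCGTGAGAAACAGGGGAAATTTACC"
    "TCTGTAGATCAATTGGCAGATGTCACGGGGATCGGCCCGGCCACGCTGGAAGCCAACAGGGATATGATGATGGTCAAATA"
    "A"
)
PROTEIN = (
    "MQIHKFAAVMLLSTLPLFSQPSLAAEQTQTKQTTTTTAAKESGKLNLNTATLAELTTLKGIGEKKAQAILEYREKQGKFT"
    "SVDQLADVTGIGPATLEANRDMMMVK"
)


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence codon by codon ('*' marks a stop)."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


translated = translate(CDS)
print(len(CDS), len(PROTEIN))        # 321 106
print(translated == PROTEIN + "*")   # True
```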

Domains


Predicted by InterProScan.

(42-101)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comE1/comEA   Haemophilus influenzae Rd KW20   46.789   100   0.481
  comEA/comE1   Glaesserella parasuis strain SC1401   46.465   93.396   0.434
  comEA   Vibrio cholerae C6706   45.055   85.849   0.387
  comEA   Vibrio cholerae strain A1552   45.055   85.849   0.387
  comEA   Vibrio parahaemolyticus RIMD 2210633   41   94.34   0.387
  comEA/celA/cilE   Streptococcus mitis NCTC 12261   47.561   77.358   0.368