Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE1/comEA   Type   Machinery gene
Locus tag   LA341_RS10295 Genome accession   NZ_CP083626
Coordinates   2291486..2291806 (+) Length   106 a.a.
NCBI ID   WP_026455883.1    Uniprot ID   A0AAU6TAA5
Organism   Aeromonas enteropelogenes strain FDAARGOS_1509     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2286486..2296806
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LA341_RS10280 (LA341_10280) yfbV 2286939..2287379 (-) 441 WP_026455880.1 terminus macrodomain insulation protein YfbV -
  LA341_RS10285 (LA341_10285) - 2287685..2288887 (+) 1203 WP_026455881.1 acetate kinase -
  LA341_RS10290 (LA341_10290) pta 2288940..2291111 (+) 2172 WP_026455882.1 phosphate acetyltransferase -
  LA341_RS10295 (LA341_10295) comE1/comEA 2291486..2291806 (+) 321 WP_026455883.1 ComEA family DNA-binding protein Machinery gene
  LA341_RS10300 (LA341_10300) galU 2291902..2292813 (+) 912 WP_042028773.1 UTP--glucose-1-phosphate uridylyltransferase GalU -
  LA341_RS10305 (LA341_10305) - 2293027..2294424 (+) 1398 WP_042028771.1 peptide MFS transporter -
  LA341_RS10310 (LA341_10310) - 2294498..2295469 (-) 972 WP_042028768.1 response regulator -
  LA341_RS10315 (LA341_10315) - 2295789..2296475 (+) 687 WP_042028767.1 DnaT-like ssDNA-binding domain-containing protein -

Sequence


Protein


Download         Length: 106 a.a.        Molecular weight: 11456.27 Da        Isoelectric Point: 9.6917

>NTDB_id=606769 LA341_RS10295 WP_026455883.1 2291486..2291806(+) (comE1/comEA) [Aeromonas enteropelogenes strain FDAARGOS_1509]
MQIHKFAAVMLLSTLPLFSQPSLAAEQTQTKQTTTTTAAKESGKLNLNTATLAELTTLKGIGEKKAQAILEYREKQGKFT
SVDQLADVTGIGPATLEANRDMMMVK

Nucleotide


Download         Length: 321 bp        

>NTDB_id=606769 LA341_RS10295 WP_026455883.1 2291486..2291806(+) (comE1/comEA) [Aeromonas enteropelogenes strain FDAARGOS_1509]
ATGCAGATACACAAGTTCGCCGCCGTGATGTTGCTCTCAACCCTGCCCCTGTTCAGCCAGCCCTCGCTGGCAGCCGAACA
GACGCAGACCAAGCAAACCACTACCACAACCGCCGCCAAGGAGAGTGGCAAACTCAACCTCAATACGGCCACCCTGGCCG
AGCTCACTACCCTCAAGGGTATTGGCGAGAAGAAGGCTCAGGCCATCCTCGAATATCGTGAGAAACAGGGGAAATTTACC
TCTGTAGATCAATTGGCAGATGTCACGGGGATCGGCCCGGCCACGCTGGAAGCCAACAGGGATATGATGATGGTCAAATA
A

Domains


Predicted by InterproScan.

(42-101)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE1/comEA Haemophilus influenzae Rd KW20

46.789

100

0.481

  comEA/comE1 Glaesserella parasuis strain SC1401

46.465

93.396

0.434

  comEA Vibrio cholerae C6706

45.055

85.849

0.387

  comEA Vibrio cholerae strain A1552

45.055

85.849

0.387

  comEA Vibrio parahaemolyticus RIMD 2210633

41

94.34

0.387

  comEA/celA/cilE Streptococcus mitis NCTC 12261

47.561

77.358

0.368