Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA/comE1   Type   Machinery gene
Locus tag   U3H55_RS20010 Genome accession   NZ_CP140792
Coordinates   4239796..4240116 (+) Length   106 a.a.
NCBI ID   WP_322862711.1    Uniprot ID   -
Organism   Aeromonas allosaccharophila strain M01     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4234796..4245116
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  U3H55_RS19995 yfbV 4235232..4235672 (-) 441 WP_005339116.1 terminus macrodomain insulation protein YfbV -
  U3H55_RS20000 - 4235977..4237179 (+) 1203 WP_042059883.1 acetate kinase -
  U3H55_RS20005 pta 4237236..4239416 (+) 2181 WP_042059882.1 phosphate acetyltransferase -
  U3H55_RS20010 comEA/comE1 4239796..4240116 (+) 321 WP_322862711.1 ComEA family DNA-binding protein Machinery gene
  U3H55_RS20015 galU 4240213..4241124 (+) 912 WP_058053048.1 UTP--glucose-1-phosphate uridylyltransferase GalU -
  U3H55_RS20020 - 4241338..4242735 (+) 1398 WP_042059875.1 peptide MFS transporter -
  U3H55_RS20025 - 4242810..4243781 (-) 972 WP_042059874.1 response regulator -
  U3H55_RS20030 - 4244080..4244778 (+) 699 WP_042060132.1 DnaT-like ssDNA-binding domain-containing protein -

Sequence


Protein


Download         Length: 106 a.a.        Molecular weight: 11253.24 Da        Isoelectric Point: 9.9854

>NTDB_id=915609 U3H55_RS20010 WP_322862711.1 4239796..4240116(+) (comEA/comE1) [Aeromonas allosaccharophila strain M01]
MLMKKLSAVMLLACLPLFSQPALAAEKTAPKQTTTATVAKEGGKLNIHTATLAELTSLKGIGDKKAQAIVDYREKQGKFT
SVDQLADVNGIGPATLEANRDMIIVK

Nucleotide


Download         Length: 321 bp        

>NTDB_id=915609 U3H55_RS20010 WP_322862711.1 4239796..4240116(+) (comEA/comE1) [Aeromonas allosaccharophila strain M01]
ATGCTGATGAAGAAACTCTCTGCAGTCATGCTGCTGGCCTGCTTGCCCCTGTTCAGCCAACCCGCGCTGGCCGCGGAGAA
AACTGCGCCCAAGCAAACAACCACAGCCACAGTCGCCAAGGAGGGCGGCAAACTGAATATCCATACCGCCACCCTTGCTG
AACTCACCAGCCTGAAAGGGATTGGCGACAAGAAGGCACAAGCCATCGTCGACTATCGGGAAAAACAGGGAAAGTTCACC
TCGGTCGATCAGCTGGCGGATGTCAACGGCATTGGCCCGGCCACGCTGGAAGCCAATCGTGACATGATCATCGTCAAATA
A

Domains


Predicted by InterproScan.

(43-103)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/comE1 Glaesserella parasuis strain SC1401

46.903

100

0.5

  comE1/comEA Haemophilus influenzae Rd KW20

46.903

100

0.5

  comEA Vibrio cholerae C6706

49.524

99.057

0.491

  comEA Vibrio cholerae strain A1552

49.524

99.057

0.491

  comEA Vibrio parahaemolyticus RIMD 2210633

43.434

93.396

0.406

  comEA Legionella pneumophila str. Paris

39.394

93.396

0.368

  comEA Legionella pneumophila strain ERS1305867

39.394

93.396

0.368

  comEA/celA/cilE Streptococcus mitis NCTC 12261

41.489

88.679

0.368