Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEA/comE1
Type: Machinery gene
Locus tag: I6L41_RS12840
Genome accession: NZ_CP077214
Coordinates: 2884575..2884895 (+)
Length: 106 a.a.
NCBI ID: WP_111900685.1
UniProt ID: -
Organism: Aeromonas sp. FDAARGOS 1411
Function: dsDNA binding (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 2879575..2889895
Locus tag                    Gene name    Coordinates (strand)  Size (bp)  Protein ID      Product                                            Description
I6L41_RS12825 (I6L41_12825)  yfbV         2880015..2880455 (-)  441        WP_005339116.1  terminus macrodomain insulation protein YfbV       -
I6L41_RS12830 (I6L41_12830)  -            2880760..2881962 (+)  1203       WP_216959287.1  acetate kinase                                     -
I6L41_RS12835 (I6L41_12835)  pta          2882017..2884197 (+)  2181       WP_005339119.1  phosphate acetyltransferase                        -
I6L41_RS12840 (I6L41_12840)  comEA/comE1  2884575..2884895 (+)  321        WP_111900685.1  ComEA family DNA-binding protein                   Machinery gene
I6L41_RS12845 (I6L41_12845)  galU         2884991..2885902 (+)  912        WP_139405564.1  UTP--glucose-1-phosphate uridylyltransferase GalU  -
I6L41_RS12850 (I6L41_12850)  -            2886116..2887513 (+)  1398       WP_040065012.1  peptide MFS transporter                            -
I6L41_RS12855 (I6L41_12855)  -            2887588..2888559 (-)  972        WP_204484104.1  response regulator                                 -
I6L41_RS12860 (I6L41_12860)  -            2888800..2889498 (+)  699        WP_216959290.1  DnaT-like ssDNA-binding domain-containing protein  -
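
The coordinates in this record are 1-based and inclusive, so each gene's size in bp should equal end - start + 1. The sketch below re-checks the listed sizes. It also reproduces the displayed location window, on the assumption that the window is the comEA/comE1 gene extended by 5000 bp on each side. That flank is inferred from the numbers shown here, not from documented database behaviour, and the names GENES, FLANK, span_length and context_window are illustrative.

```python
# Rows copied from the genomic context table:
# (locus tag, start, end, strand, listed size in bp).
GENES = [
    ("I6L41_RS12825", 2880015, 2880455, "-", 441),
    ("I6L41_RS12830", 2880760, 2881962, "+", 1203),
    ("I6L41_RS12835", 2882017, 2884197, "+", 2181),
    ("I6L41_RS12840", 2884575, 2884895, "+", 321),
    ("I6L41_RS12845", 2884991, 2885902, "+", 912),
    ("I6L41_RS12850", 2886116, 2887513, "+", 1398),
    ("I6L41_RS12855", 2887588, 2888559, "-", 972),
    ("I6L41_RS12860", 2888800, 2889498, "+", 699),
]

TARGET = "I6L41_RS12840"
FLANK = 5000  # assumption: inferred from the displayed location, not documented


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def context_window(start, end, flank=FLANK):
    """Extend a coordinate range by `flank` bp on both sides."""
    return start - flank, end + flank


# Every listed size should match its coordinates.
mismatches = [g[0] for g in GENES if span_length(g[1], g[2]) != g[4]]
print("size mismatches:", mismatches)

# The window around the target gene should match the displayed location.
target = next(g for g in GENES if g[0] == TARGET)
print("window:", context_window(target[1], target[2]))
```

A 321 bp coding sequence holds 107 codons, which is consistent with the 106-residue protein plus one stop codon.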

Sequence


Protein


Length: 106 a.a.; Molecular weight: 11272.29 Da; Isoelectric point: 9.9853

>NTDB_id=579919 I6L41_RS12840 WP_111900685.1 2884575..2884895(+) (comEA/comE1) [Aeromonas sp. FDAARGOS 1411]
MLMKKLSAVMLLACLPLFSQPVLAVDKAAPKQTTTTAVAKESGKLNINTATLAELTSLKGIGDKKAQAIVDYREKQGKFT
SVDQLADVNGIGPATLEANRDMIIVK

Nucleotide


Length: 321 bp

>NTDB_id=579919 I6L41_RS12840 WP_111900685.1 2884575..2884895(+) (comEA/comE1) [Aeromonas sp. FDAARGOS 1411]
ATGCTGATGAAGAAGCTCTCTGCTGTCATGCTGCTGGCCTGCTTGCCCCTGTTCAGCCAACCTGTGCTGGCCGTAGATAA
GGCAGCGCCCAAGCAAACAACCACGACCGCGGTTGCCAAGGAGTCCGGCAAGCTGAATATCAATACCGCCACCCTTGCCG
AACTCACCAGCCTGAAAGGGATTGGCGACAAGAAGGCACAAGCCATCGTCGACTATCGGGAAAAACAGGGAAAGTTCACC
TCGGTCGATCAGCTGGCAGATGTCAATGGCATTGGCCCGGCCACGCTGGAAGCCAACAGGGACATGATAATAGTCAAATA
A
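
As a consistency check between the two sequences above, the nucleotide record can be translated and compared with the protein record. This is a minimal sketch using only the Python standard library. It assumes the standard codon assignments (which bacterial translation table 11 shares for elongation codons) and frame 1 starting at the first base. The names CODON_TABLE and translate are illustrative.

```python
# Compact standard codon table: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Sequences copied from the FASTA records above.
NUCLEOTIDE = (
    "ATGCTGATGAAGAAGCTCTCTGCTGTCATGCTGCTGGCCTGCTTGCCCCTGTTCAGCCAACCTGTGCTGGCCGTAGATAA"
    "GGCAGCGCCCAAGCAAACAACCACGACCGCGGTTGCCAAGGAGTCCGGCAAGCTGAATATCAATACCGCCACCCTTGCCG"
    "AACTCACCAGCCTGAAAGGGATTGGCGACAAGAAGGCACAAGCCATCGTCGACTATCGGGAAAAACAGGGAAAGTTCACC"
    "TCGGTCGATCAGCTGGCAGATGTCAATGGCATTGGCCCGGCCACGCTGGAAGCCAACAGGGACATGATAATAGTCAAATA"
    "A"
)
PROTEIN = (
    "MLMKKLSAVMLLACLPLFSQPVLAVDKAAPKQTTTTAVAKESGKLNINTATLAELTSLKGIGDKKAQAIVDYREKQGKFT"
    "SVDQLADVNGIGPATLEANRDMIIVK"
)


def translate(dna):
    """Translate DNA in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


translated = translate(NUCLEOTIDE)
print(len(NUCLEOTIDE), "bp ->", len(translated) - 1, "residues plus stop")
print("matches protein record:", translated == PROTEIN + "*")
```

The comparison appends "*" to the protein because the nucleotide record includes the terminal TAA stop codon, which the protein record omits.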

Domains


Predicted by InterProScan.

(42-103)


Secondary structure


Protein secondary structure was predicted with S4PRED and visualized with seqviz.



Transmembrane helices


Transmembrane helices were predicted with TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein      Organism                                  Identities (%)  Coverage (%)  Ha-value
comEA/comE1  Glaesserella parasuis strain SC1401       46.018          100           0.491
comEA        Vibrio cholerae strain A1552              47.17           100           0.472
comEA        Vibrio cholerae C6706                     47.17           100           0.472
comEA        Vibrio parahaemolyticus RIMD 2210633      43.299          91.509        0.396
comE1/comEA  Haemophilus influenzae Rd KW20            65.625          60.377        0.396
comEA        Acinetobacter baylyi ADP1                 51.316          71.698        0.368
comEA        Legionella pneumophila strain ERS1305867  39.394          93.396        0.368
comEA        Legionella pneumophila str. Paris         39.394          93.396        0.368