Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA/comE1   Type   Machinery gene
Locus tag   LAG74_RS17780 Genome accession   NZ_CP083461
Coordinates   3900662..3900982 (+) Length   106 a.a.
NCBI ID   WP_111900685.1    Uniprot ID   -
Organism   Aeromonas veronii strain SW3814     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3895662..3905982
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LAG74_RS17765 (LAG74_17795) yfbV 3896102..3896542 (-) 441 WP_005339116.1 terminus macrodomain insulation protein YfbV -
  LAG74_RS17770 (LAG74_17800) - 3896847..3898049 (+) 1203 WP_005349210.1 acetate kinase -
  LAG74_RS17775 (LAG74_17805) pta 3898104..3900284 (+) 2181 WP_005339119.1 phosphate acetyltransferase -
  LAG74_RS17780 (LAG74_17810) comEA/comE1 3900662..3900982 (+) 321 WP_111900685.1 ComEA family DNA-binding protein Machinery gene
  LAG74_RS17785 (LAG74_17815) galU 3901078..3901989 (+) 912 WP_139405564.1 UTP--glucose-1-phosphate uridylyltransferase GalU -
  LAG74_RS17790 (LAG74_17820) - 3902203..3903600 (+) 1398 WP_021231963.1 peptide MFS transporter -
  LAG74_RS17795 (LAG74_17825) - 3903675..3904646 (-) 972 WP_224968716.1 response regulator -
  LAG74_RS17800 (LAG74_17830) - 3904887..3905585 (+) 699 WP_005349201.1 DnaT-like ssDNA-binding domain-containing protein -

Sequence


Protein


Download         Length: 106 a.a.        Molecular weight: 11272.29 Da        Isoelectric Point: 9.9853

>NTDB_id=606021 LAG74_RS17780 WP_111900685.1 3900662..3900982(+) (comEA/comE1) [Aeromonas veronii strain SW3814]
MLMKKLSAVMLLACLPLFSQPVLAVDKAAPKQTTTTAVAKESGKLNINTATLAELTSLKGIGDKKAQAIVDYREKQGKFT
SVDQLADVNGIGPATLEANRDMIIVK

Nucleotide


Download         Length: 321 bp        

>NTDB_id=606021 LAG74_RS17780 WP_111900685.1 3900662..3900982(+) (comEA/comE1) [Aeromonas veronii strain SW3814]
ATGCTGATGAAGAAGCTCTCTGCTGTCATGCTGCTGGCCTGCTTGCCCCTGTTCAGCCAACCTGTGCTGGCCGTAGATAA
GGCAGCGCCCAAGCAAACAACCACGACCGCGGTTGCCAAGGAGTCCGGCAAGCTGAATATCAATACCGCCACCCTTGCCG
AACTCACCAGCCTGAAAGGGATTGGTGACAAGAAGGCACAAGCCATCGTCGACTATCGGGAAAAACAGGGAAAGTTCACC
TCGGTCGATCAGCTGGCGGATGTCAATGGCATTGGCCCGGCCACGCTGGAAGCCAACCGTGACATGATAATAGTCAAATA
A

Domains


Predicted by InterproScan.

(42-103)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/comE1 Glaesserella parasuis strain SC1401

46.018

100

0.491

  comEA Vibrio cholerae strain A1552

47.17

100

0.472

  comEA Vibrio cholerae C6706

47.17

100

0.472

  comEA Vibrio parahaemolyticus RIMD 2210633

43.299

91.509

0.396

  comE1/comEA Haemophilus influenzae Rd KW20

65.625

60.377

0.396

  comEA Acinetobacter baylyi ADP1

51.316

71.698

0.368

  comEA Legionella pneumophila strain ERS1305867

39.394

93.396

0.368

  comEA Legionella pneumophila str. Paris

39.394

93.396

0.368