Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE1/comEA   Type   Machinery gene
Locus tag   PYU98_RS19380 Genome accession   NZ_CP118988
Coordinates   4158876..4159196 (+) Length   106 a.a.
NCBI ID   WP_075113689.1    Uniprot ID   A0AAX3NQU3
Organism   Aeromonas allosaccharophila strain K520     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 4153876..4164196
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PYU98_RS19365 (PYU98_19365) yfbV 4154312..4154752 (-) 441 WP_005339116.1 terminus macrodomain insulation protein YfbV -
  PYU98_RS19370 (PYU98_19370) - 4155057..4156259 (+) 1203 WP_042059883.1 acetate kinase -
  PYU98_RS19375 (PYU98_19375) pta 4156316..4158496 (+) 2181 WP_042059882.1 phosphate acetyltransferase -
  PYU98_RS19380 (PYU98_19380) comE1/comEA 4158876..4159196 (+) 321 WP_075113689.1 ComEA family DNA-binding protein Machinery gene
  PYU98_RS19385 (PYU98_19385) galU 4159293..4160204 (+) 912 WP_275056937.1 UTP--glucose-1-phosphate uridylyltransferase GalU -
  PYU98_RS19390 (PYU98_19390) - 4160418..4161815 (+) 1398 WP_042059875.1 peptide MFS transporter -
  PYU98_RS19395 (PYU98_19395) - 4161890..4162861 (-) 972 WP_042059874.1 response regulator -
  PYU98_RS19400 (PYU98_19400) - 4163160..4163858 (+) 699 WP_275056938.1 DnaT-like ssDNA-binding domain-containing protein -

Sequence


Protein


Download         Length: 106 a.a.        Molecular weight: 11300.30 Da        Isoelectric Point: 9.6564

>NTDB_id=797179 PYU98_RS19380 WP_075113689.1 4158876..4159196(+) (comE1/comEA) [Aeromonas allosaccharophila strain K520]
MLMKKLSAVMLLACLPLFSQPVLAAEKTAPKQTTTIAAAKEDGKLNINTATLAELTSLKGIGDKKAQAIVDYREKQGKFT
SVDQLADVNGIGPATLEANRDMIIVK

Nucleotide


Download         Length: 321 bp        

>NTDB_id=797179 PYU98_RS19380 WP_075113689.1 4158876..4159196(+) (comE1/comEA) [Aeromonas allosaccharophila strain K520]
ATGCTGATGAAGAAGCTCTCTGCAGTCATGCTGCTGGCCTGCTTGCCCCTGTTCAGCCAACCCGTGCTGGCAGCGGAGAA
AACTGCGCCCAAGCAGACCACCACAATCGCAGCCGCCAAGGAGGACGGCAAACTGAATATCAATACAGCCACCCTTGCCG
AACTCACCAGCCTGAAAGGGATTGGCGACAAGAAGGCACAAGCCATCGTCGACTATCGGGAAAAACAGGGAAAGTTCACC
TCGGTCGATCAGCTGGCGGATGTCAATGGCATTGGTCCGGCCACTCTGGAAGCCAACCGTGACATGATCATCGTCAAATA
A

Domains


Predicted by InterproScan.

(43-103)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE1/comEA Haemophilus influenzae Rd KW20

50

100

0.528

  comEA/comE1 Glaesserella parasuis strain SC1401

48.673

100

0.519

  comEA Vibrio cholerae strain A1552

50.943

100

0.509

  comEA Vibrio cholerae C6706

50.943

100

0.509

  comEA Vibrio parahaemolyticus RIMD 2210633

43.299

91.509

0.396

  comEA Legionella pneumophila str. Paris

40.404

93.396

0.377

  comEA Legionella pneumophila strain ERS1305867

40.404

93.396

0.377