Detailed information    

In silico: bioinformatically predicted

Overview


Name: comEA
Type: Machinery gene
Locus tag: HED66_RS20215
Genome accession: NZ_CP051883
Coordinates: 4395095..4395415 (-)
Length: 106 a.a.
NCBI ID: WP_021140625.1
UniProt ID: T0QN50
Organism: Aeromonas salmonicida strain SRW-OG1
Function: dsDNA binding (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 4390095..4400415
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
HED66_RS20195 | - | 4390871..4391335 (+) | 465 | WP_197665895.1 | response regulator | -
HED66_RS20200 | - | 4391431..4392402 (+) | 972 | WP_011898383.1 | response regulator | -
HED66_RS20205 | - | 4392478..4393875 (-) | 1398 | WP_169047469.1 | peptide MFS transporter | -
HED66_RS20210 | galU | 4394089..4395000 (-) | 912 | WP_169047470.1 | UTP--glucose-1-phosphate uridylyltransferase GalU | -
HED66_RS20215 | comEA | 4395095..4395415 (-) | 321 | WP_021140625.1 | ComEA family DNA-binding protein | Machinery gene
HED66_RS20220 | cysQ | 4395582..4396352 (+) | 771 | WP_169047471.1 | 3'(2'),5'-bisphosphate nucleotidase CysQ | -
HED66_RS20225 | - | 4396434..4396730 (-) | 297 | WP_059114085.1 | YciI family protein | -
HED66_RS20230 | - | 4396757..4397308 (-) | 552 | WP_099994232.1 | septation protein A | -
HED66_RS20235 | - | 4397394..4398929 (-) | 1536 | WP_169047472.1 | aminotransferase class V-fold PLP-dependent enzyme | -
HED66_RS20240 | trhA | 4399154..4399789 (+) | 636 | WP_021140622.1 | PAQR family membrane homeostasis protein TrhA | -
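
The Size (bp) column can be cross-checked against the coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, not stated in the record) and that the comEA size includes the stop codon; `span_length` and `rows` are illustrative names only.

```python
# (locus tag, start, end, listed size in bp) copied from the Genomic Context table.
rows = [
    ("HED66_RS20195", 4390871, 4391335, 465),
    ("HED66_RS20200", 4391431, 4392402, 972),
    ("HED66_RS20205", 4392478, 4393875, 1398),
    ("HED66_RS20210", 4394089, 4395000, 912),
    ("HED66_RS20215", 4395095, 4395415, 321),
    ("HED66_RS20220", 4395582, 4396352, 771),
    ("HED66_RS20225", 4396434, 4396730, 297),
    ("HED66_RS20230", 4396757, 4397308, 552),
    ("HED66_RS20235", 4397394, 4398929, 1536),
    ("HED66_RS20240", 4399154, 4399789, 636),
]


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


# Locus tags whose listed size disagrees with the coordinates (empty if all agree).
mismatches = [tag for tag, start, end, size in rows if span_length(start, end) != size]

# comEA: number of codons minus one stop codon gives the protein length.
comea_protein_length = span_length(4395095, 4395415) // 3 - 1

print(mismatches)
print(comea_protein_length)
```

Under these assumptions every listed size agrees with its coordinates, and the comEA span corresponds to the 106 a.a. length given in the overview.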

Sequence


Protein


Length: 106 a.a.        Molecular weight: 11211.08 Da        Isoelectric point: 9.6863

>NTDB_id=441340 HED66_RS20215 WP_021140625.1 4395095..4395415(-) (comEA) [Aeromonas salmonicida strain SRW-OG1]
MNYKTLTATLLLSCLPLLSQPLLAADKPAAKPATTVTTAKESGKVNLNTASINELTALKGIGEKKAQAIVDFREKQGKFT
TVEQLADVSGIGPATLEANRDMIIVK
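
The FASTA header above carries several of the overview fields on one line. The following is a minimal sketch of splitting it into named parts, assuming the layout seen in this record holds for other entries as well; the group names (`ntdb_id`, `locus_tag`, and so on) are hypothetical labels, not identifiers defined by the database.

```python
import re

header = (
    ">NTDB_id=441340 HED66_RS20215 WP_021140625.1 "
    "4395095..4395415(-) (comEA) [Aeromonas salmonicida strain SRW-OG1]"
)

# Pattern inferred from this single header; other records may differ.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) "
    r"\[(?P<organism>[^\]]*)\]$"
)

match = HEADER_RE.match(header)
fields = match.groupdict() if match else {}

for key, value in fields.items():
    print(f"{key}: {value}")
```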

Nucleotide


Length: 321 bp

>NTDB_id=441340 HED66_RS20215 WP_021140625.1 4395095..4395415(-) (comEA) [Aeromonas salmonicida strain SRW-OG1]
ATGAACTACAAGACCCTGACCGCCACCCTGCTGCTGAGCTGCCTGCCCCTGTTGAGCCAGCCTTTGCTGGCCGCCGACAA
GCCGGCTGCCAAGCCAGCGACCACGGTCACCACCGCCAAGGAGAGTGGCAAGGTGAACCTGAATACGGCCAGTATCAATG
AGTTAACTGCTCTCAAAGGGATCGGAGAGAAGAAGGCGCAGGCCATCGTCGATTTTCGTGAGAAACAGGGCAAGTTCACC
ACGGTTGAACAACTGGCGGATGTCAGCGGCATAGGGCCGGCAACTCTGGAAGCAAATCGGGACATGATCATCGTCAAATA
G
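
The nucleotide record is given in coding orientation (it starts with ATG), so it can be translated directly and compared with the protein record. The sketch below assumes the standard genetic code and stops at the first stop codon; `translate` and the variable names are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Nucleotide sequence copied from the record above.
cds = (
    "ATGAACTACAAGACCCTGACCGCCACCCTGCTGCTGAGCTGCCTGCCCCTGTTGAGCCAGCCTTTGCTGGCCGCCGACAA"
    "GCCGGCTGCCAAGCCAGCGACCACGGTCACCACCGCCAAGGAGAGTGGCAAGGTGAACCTGAATACGGCCAGTATCAATG"
    "AGTTAACTGCTCTCAAAGGGATCGGAGAGAAGAAGGCGCAGGCCATCGTCGATTTTCGTGAGAAACAGGGCAAGTTCACC"
    "ACGGTTGAACAACTGGCGGATGTCAGCGGCATAGGGCCGGCAACTCTGGAAGCAAATCGGGACATGATCATCGTCAAATA"
    "G"
)

# Protein sequence copied from the Protein record.
expected = (
    "MNYKTLTATLLLSCLPLLSQPLLAADKPAAKPATTVTTAKESGKVNLNTASINELTALKGIGEKKAQAIVDFREKQGKFT"
    "TVEQLADVSGIGPATLEANRDMIIVK"
)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


protein = translate(cds)
print(len(cds), len(protein), protein == expected)
```

The 321 bp sequence yields 106 residues followed by a TAG stop codon, in agreement with the lengths listed for both records.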

Domains


Predicted by InterProScan.

(42-103)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB T0QN50

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comEA | Vibrio cholerae strain A1552 | 51.818 | 100 | 0.538
comEA | Vibrio cholerae C6706 | 51.818 | 100 | 0.538
comE1/comEA | Haemophilus influenzae Rd KW20 | 44.144 | 100 | 0.462
comEA | Vibrio parahaemolyticus RIMD 2210633 | 46.392 | 91.509 | 0.425
comEA | Legionella pneumophila str. Paris | 41.837 | 92.453 | 0.387
comEA | Legionella pneumophila strain ERS1305867 | 41.837 | 92.453 | 0.387
comEA/comE1 | Glaesserella parasuis strain SC1401 | 63.492 | 59.434 | 0.377
comEA/celA/cilE | Streptococcus mitis NCTC 12261 | 51.316 | 71.698 | 0.368