Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   WJM57_RS04290 Genome accession   NZ_CP149127
Coordinates   870199..870519 (-) Length   106 a.a.
NCBI ID   WP_021140625.1    Uniprot ID   T0QN50
Organism   Aeromonas salmonicida strain BE4     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 865199..875519
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  WJM57_RS04270 - 865479..866177 (-) 699 WP_282683693.1 DnaT-like ssDNA-binding domain-containing protein -
  WJM57_RS04275 - 866535..867506 (+) 972 WP_339022809.1 response regulator -
  WJM57_RS04280 - 867582..868979 (-) 1398 WP_017412884.1 peptide MFS transporter -
  WJM57_RS04285 galU 869193..870104 (-) 912 WP_339022811.1 UTP--glucose-1-phosphate uridylyltransferase GalU -
  WJM57_RS04290 comEA 870199..870519 (-) 321 WP_021140625.1 ComEA family DNA-binding protein Machinery gene
  WJM57_RS04295 cysQ 870687..871457 (+) 771 WP_339022813.1 3'(2'),5'-bisphosphate nucleotidase CysQ -
  WJM57_RS04300 - 871539..871835 (-) 297 WP_005318066.1 YciI family protein -
  WJM57_RS04305 - 871863..872414 (-) 552 WP_005318063.1 septation protein A -
  WJM57_RS04310 - 872500..874035 (-) 1536 WP_339022816.1 aminotransferase class I/II-fold pyridoxal phosphate-dependent enzyme -
  WJM57_RS04315 - 874260..874895 (+) 636 WP_339022818.1 hemolysin III family protein -

Sequence


Protein


Download         Length: 106 a.a.        Molecular weight: 11211.08 Da        Isoelectric Point: 9.6863

>NTDB_id=956709 WJM57_RS04290 WP_021140625.1 870199..870519(-) (comEA) [Aeromonas salmonicida strain BE4]
MNYKTLTATLLLSCLPLLSQPLLAADKPAAKPATTVTTAKESGKVNLNTASINELTALKGIGEKKAQAIVDFREKQGKFT
TVEQLADVSGIGPATLEANRDMIIVK

Nucleotide


Download         Length: 321 bp        

>NTDB_id=956709 WJM57_RS04290 WP_021140625.1 870199..870519(-) (comEA) [Aeromonas salmonicida strain BE4]
ATGAACTACAAGACCCTGACCGCCACCCTGCTGCTGAGCTGCCTGCCCCTGTTGAGCCAGCCTTTGCTGGCCGCCGACAA
GCCGGCAGCCAAGCCAGCGACCACGGTCACCACCGCCAAGGAGAGTGGCAAGGTGAACCTGAATACGGCCAGTATCAATG
AGTTAACTGCCCTCAAGGGGATCGGAGAGAAGAAGGCGCAGGCCATCGTCGATTTTCGTGAGAAACAGGGCAAGTTCACC
ACGGTTGAACAACTGGCGGATGTCAGCGGCATAGGGCCGGCCACTCTGGAAGCAAATCGGGACATGATCATCGTCAAATA
G

Domains


Predicted by InterproScan.

(42-103)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB T0QN50

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio cholerae strain A1552

51.818

100

0.538

  comEA Vibrio cholerae C6706

51.818

100

0.538

  comE1/comEA Haemophilus influenzae Rd KW20

44.144

100

0.462

  comEA Vibrio parahaemolyticus RIMD 2210633

46.392

91.509

0.425

  comEA Legionella pneumophila str. Paris

41.837

92.453

0.387

  comEA Legionella pneumophila strain ERS1305867

41.837

92.453

0.387

  comEA/comE1 Glaesserella parasuis strain SC1401

63.492

59.434

0.377

  comEA/celA/cilE Streptococcus mitis NCTC 12261

51.316

71.698

0.368