Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE1/comEA   Type   Machinery gene
Locus tag   C0073_RS17265 Genome accession   NZ_CP032839
Coordinates   3765665..3765985 (+) Length   106 a.a.
NCBI ID   WP_047436205.1    Uniprot ID   A0A494UAU0
Organism   Aeromonas veronii strain FC951     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3760665..3770985
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C0073_RS17250 (C0073_017250) yfbV 3761104..3761544 (-) 441 WP_047436201.1 terminus macrodomain insulation protein YfbV -
  C0073_RS17255 (C0073_017255) - 3761849..3763051 (+) 1203 WP_005349210.1 acetate kinase -
  C0073_RS17260 (C0073_017260) pta 3763107..3765287 (+) 2181 WP_005339119.1 phosphate acetyltransferase -
  C0073_RS17265 (C0073_017265) comE1/comEA 3765665..3765985 (+) 321 WP_047436205.1 ComEA family DNA-binding protein Machinery gene
  C0073_RS17270 (C0073_017270) galU 3766082..3766993 (+) 912 WP_047436208.1 UTP--glucose-1-phosphate uridylyltransferase GalU -
  C0073_RS17275 (C0073_017275) - 3767207..3768604 (+) 1398 WP_047436211.1 peptide MFS transporter -
  C0073_RS17280 (C0073_017280) - 3768679..3769650 (-) 972 WP_101531302.1 response regulator -
  C0073_RS17285 (C0073_017285) - 3769891..3770589 (+) 699 WP_005349201.1 DnaT-like ssDNA-binding domain-containing protein -

Sequence


Protein


Download         Length: 106 a.a.        Molecular weight: 11274.26 Da        Isoelectric Point: 9.9853

>NTDB_id=319069 C0073_RS17265 WP_047436205.1 3765665..3765985(+) (comE1/comEA) [Aeromonas veronii strain FC951]
MLMKKLSAVMLLACLPLFSQPVLAADKATPKQTTTTAVAKESGKLNINTATLAELTSLKGIGDKKAQAIVDYREKQGKFT
SVDQLADVNGIGPATLEANRDMIIVK

Nucleotide


Download         Length: 321 bp        

>NTDB_id=319069 C0073_RS17265 WP_047436205.1 3765665..3765985(+) (comE1/comEA) [Aeromonas veronii strain FC951]
ATGCTGATGAAGAAGCTCTCTGCAGTCATGTTGCTGGCCTGCTTGCCACTGTTCAGCCAACCCGTGCTGGCCGCAGATAA
GGCAACGCCCAAGCAAACAACCACGACCGCGGTTGCCAAGGAGTCCGGCAAGCTGAATATCAATACGGCCACCCTTGCCG
AACTCACCAGCCTTAAAGGGATTGGTGACAAGAAGGCACAAGCCATCGTCGACTATCGGGAAAAACAGGGAAAGTTCACC
TCGGTCGATCAGCTGGCGGATGTCAATGGCATTGGCCCGGCCACGCTGGAAGCCAACCGTGACATGATCATCGTCAAATA
A

Domains


Predicted by InterproScan.

(42-103)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A494UAU0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE1/comEA Haemophilus influenzae Rd KW20

49.107

100

0.519

  comEA/comE1 Glaesserella parasuis strain SC1401

46.903

100

0.5

  comEA Vibrio cholerae strain A1552

47.17

100

0.472

  comEA Vibrio cholerae C6706

47.17

100

0.472

  comEA Vibrio parahaemolyticus RIMD 2210633

43.299

91.509

0.396

  comEA Acinetobacter baylyi ADP1

51.316

71.698

0.368

  comEA Vibrio campbellii strain DS40M4

37.143

99.057

0.368

  comEA Legionella pneumophila str. Paris

39.394

93.396

0.368

  comEA Legionella pneumophila strain ERS1305867

39.394

93.396

0.368


Multiple sequence alignment