Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE1/comEA   Type   Machinery gene
Locus tag   QAO71_RS09145 Genome accession   NZ_CP123445
Coordinates   1964607..1964891 (-) Length   94 a.a.
NCBI ID   WP_036989606.1    Uniprot ID   A0A031MIR7
Organism   Halopseudomonas sp. SMJS2     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1959607..1969891
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  QAO71_RS09120 (QAO71_09120) - 1959643..1961001 (-) 1359 WP_051610815.1 YcjX family protein -
  QAO71_RS09125 (QAO71_09125) - 1961069..1961470 (+) 402 WP_036989612.1 TM2 domain-containing protein -
  QAO71_RS09130 (QAO71_09130) - 1961471..1961764 (-) 294 WP_074778477.1 YcgL domain-containing protein -
  QAO71_RS09135 (QAO71_09135) rnd 1961761..1962888 (-) 1128 WP_280165672.1 ribonuclease D -
  QAO71_RS09140 (QAO71_09140) - 1963124..1964563 (+) 1440 WP_036989607.1 methyl-accepting chemotaxis protein -
  QAO71_RS09145 (QAO71_09145) comE1/comEA 1964607..1964891 (-) 285 WP_036989606.1 ComEA family DNA-binding protein Machinery gene
  QAO71_RS09150 (QAO71_09150) - 1965144..1966148 (+) 1005 WP_036989605.1 response regulator -
  QAO71_RS09155 (QAO71_09155) - 1966174..1967148 (+) 975 WP_280164204.1 diguanylate cyclase -
  QAO71_RS09160 (QAO71_09160) - 1967172..1968389 (-) 1218 WP_280164205.1 O-succinylhomoserine sulfhydrylase -

Sequence


Protein


Download         Length: 94 a.a.        Molecular weight: 9888.41 Da        Isoelectric Point: 4.3355

>NTDB_id=821136 QAO71_RS09145 WP_036989606.1 1964607..1964891(-) (comE1/comEA) [Halopseudomonas sp. SMJS2]
MKNILAAVILCLTAWSVVPAFAEIPAEPLVTVNINNASAAEIAETLNGIGLAKAEAIVAYREENGGFESVDQLTVVKGIG
PATIDKNRERIALQ

Nucleotide


Download         Length: 285 bp        

>NTDB_id=821136 QAO71_RS09145 WP_036989606.1 1964607..1964891(-) (comE1/comEA) [Halopseudomonas sp. SMJS2]
ATGAAAAATATTCTTGCTGCCGTCATCCTCTGTCTGACCGCCTGGTCTGTGGTACCTGCGTTCGCGGAAATCCCTGCTGA
ACCGCTGGTGACAGTCAACATCAACAATGCCAGCGCTGCGGAAATAGCCGAGACCCTGAACGGTATCGGTCTGGCCAAGG
CAGAGGCCATTGTTGCCTATCGGGAAGAGAACGGCGGATTCGAATCGGTTGACCAACTTACCGTGGTCAAGGGAATCGGC
CCTGCTACCATCGACAAGAACCGCGAGCGAATTGCGTTGCAGTGA

Domains


Predicted by InterproScan.

(30-91)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A031MIR7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE1/comEA Haemophilus influenzae Rd KW20

44.545

100

0.521

  comEA Vibrio parahaemolyticus RIMD 2210633

49.485

100

0.511

  comEA Vibrio campbellii strain DS40M4

46.067

94.681

0.436

  comEA Vibrio cholerae C6706

61.538

69.149

0.426

  comEA Vibrio cholerae strain A1552

61.538

69.149

0.426

  comEA/comE1 Glaesserella parasuis strain SC1401

58.73

67.021

0.394

  comEA Legionella pneumophila strain ERS1305867

37.5

100

0.383

  comEA Acinetobacter baylyi ADP1

56.25

68.085

0.383

  comEA Legionella pneumophila str. Paris

37.5

100

0.383

  comE Neisseria gonorrhoeae MS11

56.452

65.957

0.372

  comE Neisseria gonorrhoeae MS11

56.452

65.957

0.372

  comE Neisseria gonorrhoeae MS11

56.452

65.957

0.372

  comE Neisseria gonorrhoeae MS11

56.452

65.957

0.372