Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   EBAPG3_RS14270 Genome accession   NZ_CP021106
Coordinates   3120858..3121154 (-) Length   98 a.a.
NCBI ID   WP_004180083.1    Uniprot ID   A0A1W6SST4
Organism   Nitrosospira lacus strain APG3     
Function   DNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3115858..3126154
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EBAPG3_RS14245 (EBAPG3_014240) - 3115958..3116731 (-) 774 WP_004180099.1 3'-5' exonuclease -
  EBAPG3_RS14250 (EBAPG3_014245) - 3116793..3117215 (-) 423 WP_040853036.1 YdbL family protein -
  EBAPG3_RS14255 (EBAPG3_014250) - 3117251..3117460 (-) 210 WP_040853024.1 hypothetical protein -
  EBAPG3_RS14260 (EBAPG3_014255) - 3117468..3119510 (-) 2043 WP_004180093.1 hypothetical protein -
  EBAPG3_RS14265 (EBAPG3_014260) cysM 3119766..3120653 (-) 888 WP_004180085.1 cysteine synthase CysM -
  EBAPG3_RS14270 (EBAPG3_014265) comE 3120858..3121154 (-) 297 WP_004180083.1 ComEA family DNA-binding protein Machinery gene
  EBAPG3_RS14275 (EBAPG3_014270) ssb 3121186..3121647 (-) 462 WP_004180081.1 single-stranded DNA-binding protein Machinery gene
  EBAPG3_RS14280 (EBAPG3_014275) - 3121648..3123006 (-) 1359 WP_004180079.1 MFS transporter -
  EBAPG3_RS14285 (EBAPG3_014280) uvrA 3123133..3125952 (+) 2820 WP_085922070.1 excinuclease ABC subunit UvrA -

Sequence


Protein


Download         Length: 98 a.a.        Molecular weight: 10436.40 Da        Isoelectric Point: 10.7726

>NTDB_id=228248 EBAPG3_RS14270 WP_004180083.1 3120858..3121154(-) (comE) [Nitrosospira lacus strain APG3]
MKKLLFLIVMLFALAGPVYAAVNINTATQGELETLQGIGPAKAKAIVDYRKKNGPFKLPGDLEKVNGIGPATMKKLRKDI
TVGGGTTAKTENKPAVRQ

Nucleotide


Download         Length: 297 bp        

>NTDB_id=228248 EBAPG3_RS14270 WP_004180083.1 3120858..3121154(-) (comE) [Nitrosospira lacus strain APG3]
ATGAAAAAACTGTTGTTTCTTATTGTCATGTTGTTCGCGCTTGCCGGTCCGGTCTACGCGGCTGTCAACATCAATACCGC
TACTCAGGGGGAGCTGGAAACCCTTCAGGGAATCGGTCCGGCCAAAGCCAAGGCTATCGTCGATTATCGTAAGAAGAATG
GTCCCTTCAAATTGCCGGGCGATCTGGAAAAGGTCAATGGCATTGGTCCCGCCACCATGAAGAAACTACGTAAGGATATT
ACGGTGGGCGGCGGCACCACGGCCAAGACGGAAAATAAACCAGCTGTCAGGCAATAA

Domains


Predicted by InterproScan.

(22-81)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1W6SST4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Neisseria gonorrhoeae MS11

58.333

85.714

0.5

  comE Neisseria gonorrhoeae MS11

58.333

85.714

0.5

  comE Neisseria gonorrhoeae MS11

58.333

85.714

0.5

  comE Neisseria gonorrhoeae MS11

58.333

85.714

0.5

  Cj0011c Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819

54.878

83.673

0.459

  comEA Vibrio parahaemolyticus RIMD 2210633

48.276

88.776

0.429


Multiple sequence alignment