Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   BSQ33_RS14765 Genome accession   NZ_CP018835
Coordinates   3237915..3238220 (+) Length   101 a.a.
NCBI ID   WP_021020077.1    Uniprot ID   A0A1Z2SI26
Organism   Vibrio gazogenes strain ATCC 43942     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3232915..3243220
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  BSQ33_RS14755 (BSQ33_14805) - 3235405..3235677 (+) 273 WP_021020075.1 HU family DNA-binding protein -
  BSQ33_RS14760 (BSQ33_14810) ppiD 3235927..3237780 (+) 1854 WP_088134415.1 peptidylprolyl isomerase -
  BSQ33_RS14765 (BSQ33_14815) comEA 3237915..3238220 (+) 306 WP_021020077.1 ComEA family DNA-binding protein Machinery gene
  BSQ33_RS14770 (BSQ33_14820) - 3238304..3240304 (-) 2001 WP_198298115.1 hybrid sensor histidine kinase/response regulator -
  BSQ33_RS14775 (BSQ33_14825) - 3240861..3242033 (-) 1173 WP_232471984.1 acyl-homoserine-lactone synthase -
  BSQ33_RS14780 (BSQ33_14830) - 3242306..3242566 (-) 261 WP_232471933.1 response regulator -

Sequence


Protein


Download         Length: 101 a.a.        Molecular weight: 10718.47 Da        Isoelectric Point: 5.7833

>NTDB_id=210790 BSQ33_RS14765 WP_021020077.1 3237915..3238220(+) (comEA) [Vibrio gazogenes strain ATCC 43942]
MLKLLLLSVLSVVCLPLQTVYAATGEPASVTPLESQANIVNINSATAEELATVLSGIGLKKAQALVNYREEHGPFAQVED
VTAVKGIGMSLVERNRSRISL

Nucleotide


Download         Length: 306 bp        

>NTDB_id=210790 BSQ33_RS14765 WP_021020077.1 3237915..3238220(+) (comEA) [Vibrio gazogenes strain ATCC 43942]
ATGCTTAAATTATTACTTCTTTCTGTACTTTCAGTTGTGTGTCTGCCACTACAAACGGTTTATGCTGCGACTGGTGAGCC
AGCTTCTGTTACACCTTTGGAAAGTCAGGCCAATATTGTCAATATCAACAGTGCAACAGCGGAAGAACTTGCCACAGTGC
TCAGCGGTATTGGTTTGAAAAAAGCACAGGCTCTCGTTAACTATCGAGAAGAACATGGGCCATTTGCTCAAGTAGAGGAT
GTGACAGCAGTCAAAGGGATCGGTATGTCGCTTGTTGAGAGGAACCGTAGTCGAATCAGCTTATAG

Domains


Predicted by InterproScan.

(36-99)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1Z2SI26

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio cholerae C6706

48.515

100

0.485

  comEA Vibrio cholerae strain A1552

48.515

100

0.485

  comE1/comEA Haemophilus influenzae Rd KW20

40

100

0.436

  comEA Bacillus subtilis subsp. subtilis str. 168

39.796

97.03

0.386

  comEA Vibrio campbellii strain DS40M4

61.29

61.386

0.376

  comEA Vibrio parahaemolyticus RIMD 2210633

59.677

61.386

0.366


Multiple sequence alignment