Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   PGW97_RS04695 Genome accession   NZ_CP117517
Coordinates   1039395..1039688 (+) Length   97 a.a.
NCBI ID   WP_047887563.1    Uniprot ID   A0A0J1GZN8
Organism   Photobacterium sp. GSS17     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1034395..1044688
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PGW97_RS04685 hupB 1036832..1037104 (+) 273 WP_047887561.1 nucleoid-associated protein HU-beta -
  PGW97_RS04690 ppiD 1037357..1039246 (+) 1890 WP_273858956.1 peptidylprolyl isomerase -
  PGW97_RS04695 comEA 1039395..1039688 (+) 294 WP_047887563.1 helix-hairpin-helix domain-containing protein Machinery gene
  PGW97_RS04700 rrtA 1039853..1040416 (-) 564 WP_273858959.1 rhombosortase -
  PGW97_RS04705 - 1040427..1041143 (+) 717 WP_217391116.1 tRNA-uridine aminocarboxypropyltransferase -
  PGW97_RS04710 - 1041262..1042644 (-) 1383 WP_273858961.1 anti-phage deoxyguanosine triphosphatase -
  PGW97_RS04715 yfbR 1042644..1043228 (-) 585 WP_047887566.1 5'-deoxynucleotidase -
  PGW97_RS04720 - 1043434..1044648 (-) 1215 WP_047887567.1 pyridoxal phosphate-dependent aminotransferase -

Sequence


Protein


Download         Length: 97 a.a.        Molecular weight: 10463.01 Da        Isoelectric Point: 4.8177

>NTDB_id=786172 PGW97_RS04695 WP_047887563.1 1039395..1039688(+) (comEA) [Photobacterium sp. GSS17]
MKTTLKALVLAVVVAGLPLGMAQANDNKHEGIEITVNINTANAEELDKLLLGVGPEKAVSIIEYRDTHGQFESIEQLSEV
KGIGPATVDKNRDRIQL

Nucleotide


Download         Length: 294 bp        

>NTDB_id=786172 PGW97_RS04695 WP_047887563.1 1039395..1039688(+) (comEA) [Photobacterium sp. GSS17]
ATGAAAACGACGTTGAAAGCACTGGTACTGGCCGTAGTTGTTGCCGGTCTGCCGCTGGGTATGGCACAAGCCAATGACAA
CAAGCATGAAGGCATTGAAATTACGGTCAACATCAATACGGCGAATGCCGAAGAGCTGGACAAACTCTTGCTGGGTGTAG
GGCCTGAGAAAGCTGTCAGTATCATCGAATACCGTGATACACATGGTCAGTTTGAATCGATTGAGCAACTCTCTGAAGTG
AAGGGGATTGGGCCGGCGACGGTGGACAAAAATCGTGACCGTATTCAGCTTTAA

Domains


Predicted by InterproScan.

(35-95)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0J1GZN8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio cholerae strain A1552

48.039

100

0.505

  comEA Vibrio cholerae C6706

48.039

100

0.505

  comEA Vibrio campbellii strain DS40M4

50

98.969

0.495

  comEA Vibrio parahaemolyticus RIMD 2210633

50

96.907

0.485

  comEA Legionella pneumophila strain ERS1305867

36.634

100

0.381

  comEA Legionella pneumophila str. Paris

36.634

100

0.381

  comE1/comEA Haemophilus influenzae Rd KW20

58.333

61.856

0.361