Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   ACEUKD_RS04520 Genome accession   NZ_CP169050
Coordinates   963983..964267 (+) Length   94 a.a.
NCBI ID   WP_031780151.1    Uniprot ID   -
Organism   Vibrio diabolicus strain SF42     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 958983..969267
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACEUKD_RS04510 (ACEUKD_04510) - 961424..961696 (+) 273 WP_005382341.1 HU family DNA-binding protein -
  ACEUKD_RS04515 (ACEUKD_04515) ppiD 961898..963757 (+) 1860 WP_012842021.1 peptidylprolyl isomerase -
  ACEUKD_RS04520 (ACEUKD_04520) comEA 963983..964267 (+) 285 WP_031780151.1 ComEA family DNA-binding protein Machinery gene
  ACEUKD_RS04525 (ACEUKD_04525) rrtA 964366..964917 (-) 552 WP_047009480.1 rhombosortase -
  ACEUKD_RS04530 (ACEUKD_04530) - 964921..965535 (+) 615 WP_114322553.1 tRNA-uridine aminocarboxypropyltransferase -
  ACEUKD_RS04535 (ACEUKD_04535) - 965542..966864 (-) 1323 WP_258586672.1 anti-phage deoxyguanosine triphosphatase -
  ACEUKD_RS04540 (ACEUKD_04540) yfbR 966880..967464 (-) 585 WP_047009477.1 5'-deoxynucleotidase -
  ACEUKD_RS04545 (ACEUKD_04545) - 967555..968769 (-) 1215 WP_005390543.1 pyridoxal phosphate-dependent aminotransferase -

Sequence


Protein


Download         Length: 94 a.a.        Molecular weight: 10322.94 Da        Isoelectric Point: 5.1423

>NTDB_id=1049538 ACEUKD_RS04520 WP_031780151.1 963983..964267(+) (comEA) [Vibrio diabolicus strain SF42]
MKWILTLCLFVFAPFSMAADSATDKYEGIEITVNINTASAEEIATMLKGIGEKKAQSIVDYREEHGPFKTAADLKSVKGI
GEATVKKNEDRILL

Nucleotide


Download         Length: 285 bp        

>NTDB_id=1049538 ACEUKD_RS04520 WP_031780151.1 963983..964267(+) (comEA) [Vibrio diabolicus strain SF42]
ATGAAATGGATTTTGACATTGTGCTTATTTGTTTTTGCGCCTTTTAGCATGGCTGCGGATTCAGCCACTGACAAATATGA
AGGAATTGAAATTACCGTAAATATCAACACGGCCTCTGCTGAAGAAATAGCGACCATGTTGAAAGGTATTGGTGAGAAGA
AAGCGCAGAGTATCGTGGACTACCGCGAGGAGCATGGCCCTTTTAAAACGGCAGCAGATTTGAAGAGCGTAAAAGGCATT
GGTGAAGCGACAGTTAAGAAAAACGAAGACCGCATTTTACTGTAG

Domains


Predicted by InterproScan.

(32-92)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio parahaemolyticus RIMD 2210633

89.362

100

0.894

  comEA Vibrio campbellii strain DS40M4

74.737

100

0.755

  comEA Vibrio cholerae C6706

61.957

97.872

0.606

  comEA Vibrio cholerae strain A1552

61.957

97.872

0.606

  comEA/comE1 Glaesserella parasuis strain SC1401

49.383

86.17

0.426

  comE1/comEA Haemophilus influenzae Rd KW20

58.065

65.957

0.383

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

39.326

94.681

0.372

  comE Neisseria gonorrhoeae MS11

38.889

95.745

0.372

  comE Neisseria gonorrhoeae MS11

38.889

95.745

0.372

  comE Neisseria gonorrhoeae MS11

38.889

95.745

0.372

  comE Neisseria gonorrhoeae MS11

38.889

95.745

0.372

  comEA Legionella pneumophila strain ERS1305867

36.842

100

0.372

  comEA Legionella pneumophila str. Paris

36.842

100

0.372


Multiple sequence alignment