Detailed information    

In silico: bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   PN837_RS04180   Genome accession   NZ_CP171822
Coordinates   1046843..1047097 (+)   Length   84 a.a.
NCBI ID   WP_395376650.1   UniProt ID   -
Organism   Marinicella sp. W31
Function   dsDNA binding (predicted from homology); DNA binding and uptake

Genomic Context


Location: 1041843..1052097
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  PN837_RS04165 (PN837_004165)   -   1042536..1044065 (-)   1530   WP_395376644.1   hypothetical protein   -
  PN837_RS04170 (PN837_004170)   -   1044079..1044687 (-)   609   WP_395376645.1   hypothetical protein   -
  PN837_RS04175 (PN837_004175)   -   1044687..1046549 (-)   1863   WP_395376647.1   hypothetical protein   -
  PN837_RS04180 (PN837_004180)   comEA   1046843..1047097 (+)   255   WP_395376650.1   ComEA family DNA-binding protein   Machinery gene
  PN837_RS04185 (PN837_004185)   -   1047149..1047544 (-)   396   WP_395376653.1   YhcB family protein   -
  PN837_RS04190 (PN837_004190)   -   1047675..1047977 (-)   303   WP_395376654.1   DUF2288 family protein   -
  PN837_RS04195 (PN837_004195)   -   1047983..1050847 (-)   2865   WP_395376655.1   insulinase family protein   -
  PN837_RS04200 (PN837_004200)   -   1050860..1051702 (-)   843   WP_395376658.1   tetratricopeptide repeat protein   -
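
The coordinates in this table can be cross-checked with a few lines of Python. The sketch below assumes 1-based inclusive coordinates, so a feature's size is end - start + 1. It also measures how far the "Location" window extends beyond comEA on each side. All variable and function names are illustrative and do not come from the database.

```python
# Gene of interest and the context window reported under "Location".
GENE_START, GENE_END = 1046843, 1047097
WINDOW_START, WINDOW_END = 1041843, 1052097

# (locus tag, start, end, reported size in bp), copied from the table above.
GENES = [
    ("PN837_RS04165", 1042536, 1044065, 1530),
    ("PN837_RS04170", 1044079, 1044687, 609),
    ("PN837_RS04175", 1044687, 1046549, 1863),
    ("PN837_RS04180", 1046843, 1047097, 255),
    ("PN837_RS04185", 1047149, 1047544, 396),
    ("PN837_RS04190", 1047675, 1047977, 303),
    ("PN837_RS04195", 1047983, 1050847, 2865),
    ("PN837_RS04200", 1050860, 1051702, 843),
]


def span_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Distance from the window edges to the comEA gene on each side.
upstream_flank = GENE_START - WINDOW_START
downstream_flank = WINDOW_END - GENE_END

# Rows whose reported size disagrees with their coordinates.
size_mismatches = [tag for tag, s, e, size in GENES if span_length(s, e) != size]

# Rows that are not fully contained in the reported window.
outside_window = [tag for tag, s, e, _ in GENES if s < WINDOW_START or e > WINDOW_END]

print(upstream_flank, downstream_flank)  # 5000 5000
print(size_mismatches, outside_window)   # [] []
```

Under these assumptions every listed size matches its coordinates, and the window corresponds to comEA plus 5000 bp on either side.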

Sequence


Protein


Length: 84 a.a.   Molecular weight: 9144.72 Da   Isoelectric point: 8.5619

>NTDB_id=1064421 PN837_RS04180 WP_395376650.1 1046843..1047097(+) (comEA) [Marinicella sp. W31]
MKTIKAILITLSLFINAAIAAVNVNSADAETIAKELKGIGMTKAQAIVEYREANGQFQTVEQLTEVKGIGLKTVEKNREE
ILLK

Nucleotide


Length: 255 bp

>NTDB_id=1064421 PN837_RS04180 WP_395376650.1 1046843..1047097(+) (comEA) [Marinicella sp. W31]
ATGAAAACCATAAAAGCAATTTTGATCACATTAAGCCTGTTCATCAATGCAGCCATTGCCGCAGTTAACGTCAACAGTGC
CGACGCTGAAACCATAGCCAAAGAACTGAAAGGTATTGGCATGACTAAAGCTCAGGCCATTGTTGAATACAGAGAAGCTA
ACGGCCAGTTTCAAACGGTTGAGCAATTGACGGAAGTGAAAGGCATTGGTTTGAAAACGGTTGAGAAAAACCGTGAAGAA
ATTCTATTGAAATAA
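
The two records can be checked against each other by translating the nucleotide sequence and comparing the result with the protein sequence. The sketch below uses only the standard library and builds the codon table by hand. It assumes the standard codon assignments, which the bacterial table shares for sense and stop codons. The names `CDS`, `PROTEIN` and `translate` are illustrative.

```python
# Nucleotide record from the entry above (255 bp, plus strand).
CDS = (
    "ATGAAAACCATAAAAGCAATTTTGATCACATTAAGCCTGTTCATCAATGCAGCCATTGCCGCAGTTAACGTCAACAGTGC"
    "CGACGCTGAAACCATAGCCAAAGAACTGAAAGGTATTGGCATGACTAAAGCTCAGGCCATTGTTGAATACAGAGAAGCTA"
    "ACGGCCAGTTTCAAACGGTTGAGCAATTGACGGAAGTGAAAGGCATTGGTTTGAAAACGGTTGAGAAAAACCGTGAAGAA"
    "ATTCTATTGAAATAA"
)

# Protein record from the entry above (84 a.a.).
PROTEIN = (
    "MKTIKAILITLSLFINAAIAAVNVNSADAETIAKELKGIGMTKAQAIVEYREANGQFQTVEQLTEVKGIGLKTVEKNREE"
    "ILLK"
)

# Standard codon table, enumerated in T, C, A, G order at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


translated = translate(CDS)
print(len(CDS), len(translated.rstrip("*")))  # 255 84
print(translated.rstrip("*") == PROTEIN)      # True
```

The 255 bp correspond to 84 sense codons plus the terminal TAA stop codon, which is why the protein length is 84 rather than 85.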

Domains


Predicted by InterProScan.

Predicted domain: residues 21-81


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comEA   Vibrio cholerae strain A1552   53.409   100   0.56
  comEA   Vibrio cholerae C6706   53.409   100   0.56
  comEA/comE1   Glaesserella parasuis strain SC1401   58.571   83.333   0.488
  comEA   Vibrio campbellii strain DS40M4   61.29   73.81   0.452
  comEA   Vibrio parahaemolyticus RIMD 2210633   58.065   73.81   0.429
  Cj0011c   Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819   42.683   97.619   0.417
  comE1/comEA   Haemophilus influenzae Rd KW20   54.839   73.81   0.405
  comE   Neisseria gonorrhoeae MS11   39.506   96.429   0.381
  comE   Neisseria gonorrhoeae MS11   39.506   96.429   0.381
  comE   Neisseria gonorrhoeae MS11   39.506   96.429   0.381
  comE   Neisseria gonorrhoeae MS11   39.506   96.429   0.381
  comEA   Acinetobacter baylyi ADP1   51.667   71.429   0.369
  comEA   Bacillus subtilis subsp. subtilis str. 168   47.692   77.381   0.369
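
The page does not define the Ha-value. As a rough consistency check, the sketch below tests one assumed relationship: Ha is approximately identity (%) × coverage (%) / 10000. This formula is an assumption, not the database's documented definition, and the names `HITS`, `ha_estimate` and `TOLERANCE` are illustrative.

```python
# (protein, organism, identity %, coverage %, reported Ha-value), from the table above.
HITS = [
    ("comEA", "Vibrio cholerae strain A1552", 53.409, 100.0, 0.56),
    ("comEA", "Vibrio cholerae C6706", 53.409, 100.0, 0.56),
    ("comEA/comE1", "Glaesserella parasuis strain SC1401", 58.571, 83.333, 0.488),
    ("comEA", "Vibrio campbellii strain DS40M4", 61.29, 73.81, 0.452),
    ("comEA", "Vibrio parahaemolyticus RIMD 2210633", 58.065, 73.81, 0.429),
    ("Cj0011c", "Campylobacter jejuni subsp. jejuni NCTC 11168", 42.683, 97.619, 0.417),
    ("comE1/comEA", "Haemophilus influenzae Rd KW20", 54.839, 73.81, 0.405),
    ("comE", "Neisseria gonorrhoeae MS11", 39.506, 96.429, 0.381),
    ("comE", "Neisseria gonorrhoeae MS11", 39.506, 96.429, 0.381),
    ("comE", "Neisseria gonorrhoeae MS11", 39.506, 96.429, 0.381),
    ("comE", "Neisseria gonorrhoeae MS11", 39.506, 96.429, 0.381),
    ("comEA", "Acinetobacter baylyi ADP1", 51.667, 71.429, 0.369),
    ("comEA", "Bacillus subtilis subsp. subtilis str. 168", 47.692, 77.381, 0.369),
]

# Allow for the three-decimal rounding of the reported values.
TOLERANCE = 0.001


def ha_estimate(identity_pct, coverage_pct):
    """Assumed relationship (not confirmed by the source): identity x coverage."""
    return identity_pct * coverage_pct / 10_000


# Rows where the assumed formula does not reproduce the reported Ha-value.
inconsistent = [
    (organism, round(ha_estimate(ident, cov), 3), ha)
    for _, organism, ident, cov, ha in HITS
    if abs(ha_estimate(ident, cov) - ha) > TOLERANCE
]

for row in inconsistent:
    print(row)
# ('Vibrio cholerae strain A1552', 0.534, 0.56)
# ('Vibrio cholerae C6706', 0.534, 0.56)
```

Eleven of the thirteen rows agree with the assumed formula to within rounding. The two Vibrio cholerae rows do not (0.534 estimated against 0.56 reported). One possible explanation, which the page does not confirm, is that the displayed coverage is capped at 100 while the Ha-value is computed from an alignment longer than the query.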


Multiple sequence alignment