Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   PUN47_RS04895 Genome accession   NZ_CP118599
Coordinates   1058939..1059235 (+) Length   98 a.a.
NCBI ID   WP_020327823.1    Uniprot ID   S7IAY2
Organism   Vibrio fluvialis strain 10M-VF     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1053939..1064235
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  PUN47_RS04880 (PUN47_04880) lon 1053967..1056318 (+) 2352 WP_020327825.1 endopeptidase La -
  PUN47_RS04885 (PUN47_04885) - 1056513..1056785 (+) 273 WP_004726502.1 HU family DNA-binding protein -
  PUN47_RS04890 (PUN47_04890) ppiD 1056941..1058800 (+) 1860 WP_202650089.1 peptidylprolyl isomerase -
  PUN47_RS04895 (PUN47_04895) comEA 1058939..1059235 (+) 297 WP_020327823.1 ComEA family DNA-binding protein Machinery gene
  PUN47_RS04900 (PUN47_04900) cmk 1059415..1060095 (+) 681 WP_024374736.1 (d)CMP kinase -
  PUN47_RS04905 (PUN47_04905) rpsA 1060200..1061870 (+) 1671 WP_020327821.1 30S ribosomal protein S1 -
  PUN47_RS04910 (PUN47_04910) ihfB 1062101..1062385 (+) 285 WP_020430001.1 integration host factor subunit beta -
  PUN47_RS04915 (PUN47_04915) - 1062525..1062809 (+) 285 WP_020327818.1 LapA family protein -
  PUN47_RS04920 (PUN47_04920) lapB 1062822..1063991 (+) 1170 WP_024374735.1 lipopolysaccharide assembly protein LapB -

Sequence


Protein


Download         Length: 98 a.a.        Molecular weight: 10694.42 Da        Isoelectric Point: 7.2029

>NTDB_id=793845 PUN47_RS04895 WP_020327823.1 1058939..1059235(+) (comEA) [Vibrio fluvialis strain 10M-VF]
MKIHKFWLAILLAVTWPVSHSWAEENVSATEGVEITVNINTAPAEELATLLKGIGLKKAQAIVDYREANGAFKSKEDLTQ
VKGIGPAIVAQNDKRILL

Nucleotide


Download         Length: 297 bp        

>NTDB_id=793845 PUN47_RS04895 WP_020327823.1 1058939..1059235(+) (comEA) [Vibrio fluvialis strain 10M-VF]
ATGAAAATTCACAAATTTTGGTTGGCGATTCTCCTGGCTGTCACTTGGCCGGTGTCCCACAGTTGGGCGGAAGAAAATGT
TTCTGCTACAGAAGGGGTCGAGATAACTGTCAATATCAATACGGCACCAGCGGAAGAGTTGGCGACTTTACTCAAAGGTA
TCGGCCTGAAAAAAGCCCAAGCAATAGTTGACTATCGTGAGGCCAATGGTGCGTTCAAATCTAAAGAAGATCTGACGCAA
GTGAAAGGCATTGGGCCAGCTATCGTTGCTCAGAATGATAAACGTATCTTGTTATAA

Domains


Predicted by InterproScan.

(36-96)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB S7IAY2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio cholerae C6706

63

100

0.643

  comEA Vibrio cholerae strain A1552

63

100

0.643

  comEA Vibrio parahaemolyticus RIMD 2210633

57.447

95.918

0.551

  comEA Vibrio campbellii strain DS40M4

55.208

97.959

0.541

  comEA/comE1 Glaesserella parasuis strain SC1401

48.276

88.776

0.429

  comE1/comEA Haemophilus influenzae Rd KW20

38.532

100

0.429

  comEA Legionella pneumophila str. Paris

41.489

95.918

0.398

  comEA Legionella pneumophila strain ERS1305867

41.489

95.918

0.398