Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   CEG15_RS05080 Genome accession   NZ_CP022099
Coordinates   1094485..1094784 (+) Length   99 a.a.
NCBI ID   WP_017044597.1    Uniprot ID   A0A233HCF5
Organism   Vibrio anguillarum strain S3 4/9     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1089485..1099784
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CEG15_RS05065 (CEG15_05065) lon 1089512..1091863 (+) 2352 WP_088728763.1 endopeptidase La -
  CEG15_RS05070 (CEG15_05070) - 1092059..1092331 (+) 273 WP_026027116.1 HU family DNA-binding protein -
  CEG15_RS05075 (CEG15_05075) ppiD 1092489..1094348 (+) 1860 WP_088728764.1 peptidylprolyl isomerase -
  CEG15_RS05080 (CEG15_05080) comEA 1094485..1094784 (+) 300 WP_017044597.1 ComEA family DNA-binding protein Machinery gene
  CEG15_RS05085 (CEG15_05085) cmk 1094948..1095628 (+) 681 WP_013857180.1 (d)CMP kinase -
  CEG15_RS05090 (CEG15_05090) rpsA 1095732..1097402 (+) 1671 WP_013857179.1 30S ribosomal protein S1 -
  CEG15_RS05095 (CEG15_05095) ihfB 1097479..1097763 (+) 285 WP_010317449.1 integration host factor subunit beta -
  CEG15_RS05100 (CEG15_05100) - 1097905..1098180 (+) 276 WP_088728765.1 LapA family protein -
  CEG15_RS05105 (CEG15_05105) lapB 1098200..1099369 (+) 1170 WP_088728766.1 lipopolysaccharide assembly protein LapB -

Sequence


Protein


Download         Length: 99 a.a.        Molecular weight: 10776.69 Da        Isoelectric Point: 9.6909

>NTDB_id=236730 CEG15_RS05080 WP_017044597.1 1094485..1094784(+) (comEA) [Vibrio anguillarum strain S3 4/9]
MKLKTKLGLLLLSIVLPLSPTLAEEKVVETHQGIEITVNINQASAEEIATLLKGIGLKKAQAIVEYRQQNGDFKTKEDLS
LVKGIGAATVRQNAERIIL

Nucleotide


Download         Length: 300 bp        

>NTDB_id=236730 CEG15_RS05080 WP_017044597.1 1094485..1094784(+) (comEA) [Vibrio anguillarum strain S3 4/9]
ATGAAATTGAAAACAAAACTCGGGTTGTTACTTCTAAGTATCGTATTGCCTCTTTCACCGACATTAGCAGAAGAAAAGGT
GGTAGAAACGCACCAAGGCATTGAAATTACGGTTAATATAAATCAGGCTTCCGCTGAGGAAATTGCTACTTTGTTGAAAG
GTATTGGACTTAAAAAAGCACAAGCGATTGTTGAATATCGTCAACAAAATGGCGATTTTAAAACCAAAGAAGATCTGAGC
TTGGTGAAAGGTATTGGTGCCGCTACTGTCAGGCAAAACGCTGAGCGTATTATTTTATAA

Domains


Predicted by InterproScan.

(37-97)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A233HCF5

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio cholerae C6706

60.194

100

0.626

  comEA Vibrio cholerae strain A1552

60.194

100

0.626

  comEA Vibrio campbellii strain DS40M4

59.184

98.99

0.586

  comEA Vibrio parahaemolyticus RIMD 2210633

60

90.909

0.545

  comE1/comEA Haemophilus influenzae Rd KW20

40.179

100

0.455

  comEA/comE1 Glaesserella parasuis strain SC1401

40.909

100

0.455

  comEA Legionella pneumophila str. Paris

36.735

98.99

0.364

  comEA Legionella pneumophila strain ERS1305867

36.735

98.99

0.364


Multiple sequence alignment