Detailed information of TA system    

insolicoBioinformatically predicted

Overview


TA module


Type IV Classification (family/domain) cptAB/Cpta(toxin)
Location 1021111..1021765 Replicon chromosome
Accession NZ_CP031902
Organism Escherichia coli O104:H4 strain FWSEC0009

Toxin (Protein)


Gene name cptA Uniprot ID F4NJ21
Locus tag C8200_RS05190 Protein ID WP_000244772.1
Coordinates 1021358..1021765 (+) Length 136 a.a.

Antitoxin (Protein)


Gene name cptB Uniprot ID S1PPB1
Locus tag C8200_RS05185 Protein ID WP_000354046.1
Coordinates 1021111..1021377 (+) Length 89 a.a.

Genomic Context


Locus tag Coordinates Strand Size (bp) Protein ID Product Description
C8200_RS05160 1016398..1017141 + 744 WP_000951964.1 SDR family oxidoreductase -
C8200_RS05165 1017198..1018631 - 1434 WP_001387034.1 6-phospho-beta-glucosidase BglA -
C8200_RS05170 1018676..1018987 + 312 WP_001182954.1 N(4)-acetylcytidine aminohydrolase -
C8200_RS05175 1019151..1019810 + 660 WP_000250274.1 hemolysin III family protein -
C8200_RS05180 1019888..1020868 - 981 WP_000886062.1 tRNA-modifying protein YgfZ -
C8200_RS05185 1021111..1021377 + 267 WP_000354046.1 FAD assembly factor SdhE Antitoxin
C8200_RS05190 1021358..1021765 + 408 WP_000244772.1 protein YgfX Toxin
C8200_RS05195 1021805..1022326 - 522 WP_001055874.1 flavodoxin FldB -
C8200_RS05200 1022438..1023334 + 897 WP_000806638.1 site-specific tyrosine recombinase XerD -
C8200_RS05205 1023359..1024069 + 711 WP_000715208.1 bifunctional protein-disulfide isomerase/oxidoreductase DsbC -
C8200_RS05210 1024075..1025808 + 1734 WP_000813220.1 single-stranded-DNA-specific exonuclease RecJ -

Associated MGEs


MGE
detail
Similar
MGEs
Relative
position
MGE Type Cargo ARG Virulence gene Coordinates Length (bp)


Relative position:
(1) inside: TA loci is completely located inside the MGE;
(2) overlap: TA loci is partially overlapped with the MGE;
(3) flank: The TA loci is located in the 5 kb flanking regions of MGE.


Domains


Predicted by InterproScan

Toxin

(3-131)

Antitoxin

(7-79)


Sequences


Toxin        


Download         Length: 136 a.a.        Molecular weight: 16048.02 Da        Isoelectric Point: 11.2511

>T110554 WP_000244772.1 NZ_CP031902:1021358-1021765 [Escherichia coli O104:H4]
VVLWQSDLRVSWRAQWLSLLIHGLVAAVILLMPWPLSYTPLWMVLLSLVVFDCVRSQRRINARQGEIRLLMDGRLRWQGQ
EWCIVKAPWMIKSGMMLRLRSDGGKRQHLWLAADSMDEAEWRDLRRILLQQETQR

Download         Length: 408 bp

>T110554 NZ_CP031902:1021358-1021765 [Escherichia coli O104:H4]
GTGGTCCTGTGGCAATCTGATTTGCGCGTCTCCTGGCGCGCACAGTGGCTTTCCTTGCTGATTCATGGGCTGGTTGCCGC
TGTTATTTTACTCATGCCCTGGCCACTCAGTTACACCCCGTTATGGATGGTGTTACTTTCGCTGGTGGTGTTTGATTGCG
TTCGCAGCCAGCGGCGTATTAATGCTCGCCAGGGGGAAATTCGCTTGTTGATGGACGGGCGTTTGCGTTGGCAAGGGCAG
GAGTGGTGCATCGTCAAAGCACCGTGGATGATTAAGAGCGGCATGATGCTGCGTTTACGTTCTGATGGCGGTAAACGGCA
ACATTTATGGCTGGCAGCCGACAGCATGGACGAAGCCGAATGGCGGGATTTACGGCGGATTTTGTTGCAACAAGAGACGC
AAAGATAA

Antitoxin


Download         Length: 89 a.a.        Molecular weight: 10547.12 Da        Isoelectric Point: 5.1599

>AT110554 WP_000354046.1 NZ_CP031902:1021111-1021377 [Escherichia coli O104:H4]
MDINNKARIHWACRRGMRELDISIMPFFEHEYDSLSDDEKRIFIRLLECDDPDLFNWLMNHGKPADAELEMMVRLIQTRN
RERGPVAI

Download         Length: 267 bp

>AT110554 NZ_CP031902:1021111-1021377 [Escherichia coli O104:H4]
ATGGACATTAACAACAAAGCCCGCATTCATTGGGCATGCCGCCGTGGTATGCGCGAACTCGATATTTCAATCATGCCGTT
TTTCGAACATGAGTACGACAGCTTAAGCGATGACGAAAAACGCATCTTTATTCGTCTGCTGGAATGTGACGATCCGGACC
TGTTTAACTGGCTGATGAATCACGGTAAACCGGCCGATGCAGAACTGGAAATGATGGTCCGACTCATCCAGACACGGAAC
CGGGAACGTGGTCCTGTGGCAATCTGA

Similar Proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
Protein Organism Identities (%) Coverage (%) Ha-value

Structures


Toxin

Source ID Structure
AlphaFold DB A0A0E0XYB4


Antitoxin

Source ID Structure
PDB 6B58
PDB 1X6I
PDB 1X6J
PDB 6C12
AlphaFold DB A0A7U9QD57

References