Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   ETA_RS01975 Genome accession   NC_010694
Coordinates   218319..219839 (-) Length   506 a.a.
NCBI ID   WP_012439949.1    Uniprot ID   B2VI68
Organism   Erwinia tasmaniensis Et1/99     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 213319..224839
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ETA_RS01965 (ETA_01810) hdfR 216999..217826 (-) 828 WP_012439947.1 HTH-type transcriptional regulator HdfR -
  ETA_RS01970 (ETA_01820) - 217947..218285 (+) 339 WP_012439948.1 DUF413 domain-containing protein -
  ETA_RS01975 (ETA_01830) comM 218319..219839 (-) 1521 WP_012439949.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ETA_RS19815 ilvL 220180..220278 (+) 99 WP_157037486.1 ilv operon leader peptide -
  ETA_RS01980 (ETA_01840) ilvG 220419..222065 (+) 1647 WP_012439950.1 acetolactate synthase 2 catalytic subunit -
  ETA_RS01985 (ETA_01850) ilvM 222062..222319 (+) 258 WP_012439951.1 acetolactate synthase 2 small subunit -
  ETA_RS01990 (ETA_01860) - 222338..223264 (+) 927 WP_012439952.1 branched-chain amino acid transaminase -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 54973.94 Da        Isoelectric Point: 7.1617

>NTDB_id=30887 ETA_RS01975 WP_012439949.1 218319..219839(-) (comM) [Erwinia tasmaniensis Et1/99]
MSLSVAFTRAAIGIQAPLVSVEVHLSNGLPALSLVGLPETTVKEARDRVRSAIINSGFNFPAKRITVSLAPADLPKEGGR
YDLPIAIAILAASEQIPAEKLTRYEFLGELALTGALRGVQGAIPAALAALNAQRQLILSADNQHDVSLISQGESLIAMHL
LEVCAFLQDEAKLEAAHGEPQECPPASGDLNEVIGQQQAKRALEIAAAGGHNLLFIGPPGTGKTMLASRLNGLMPPLSDR
EALESASVASLVNSGELQRNWRQRPYRAPHHSASLYALVGGGSLPKPGEISLAHNGILFLDELPEFERRALDALREPLES
GEITISRARAKITYPARFQLVAAMNPSPTGHYRGLHNRSSPQQTLRYLSRLSGPFLDRFDISLEVPLLAPGALSHRRDES
ESSQQVRERVVLARERQLARCSKMNSAMNNQEIRACCTLSPVDAEWLEQVILQLGLSVRAWQRILKVARTIADLAGEHII
SRDHLTEAVSYRAIDRLLIHLQNSLD

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=30887 ETA_RS01975 WP_012439949.1 218319..219839(-) (comM) [Erwinia tasmaniensis Et1/99]
ATGTCGTTATCTGTTGCTTTTACCCGTGCCGCGATTGGCATTCAGGCACCGTTGGTGTCTGTAGAAGTTCATCTCAGCAA
TGGGCTACCCGCGCTGTCACTGGTCGGATTGCCAGAAACAACCGTCAAGGAAGCGCGTGACAGAGTGCGTAGCGCTATCA
TCAACAGCGGGTTCAACTTCCCGGCTAAACGGATCACCGTCAGCCTGGCACCTGCGGATCTTCCCAAAGAAGGCGGCAGA
TATGACCTCCCTATCGCTATCGCCATTCTCGCCGCGTCAGAACAGATTCCTGCTGAAAAACTGACGCGGTATGAGTTCCT
CGGTGAATTAGCGCTTACAGGCGCGCTACGCGGCGTACAGGGCGCTATCCCTGCCGCGCTGGCAGCCCTGAATGCCCAGC
GGCAGCTGATCCTGTCAGCAGATAACCAGCATGATGTCAGCTTGATAAGTCAGGGAGAAAGTTTGATAGCCATGCACCTG
CTGGAGGTATGTGCGTTTTTACAGGACGAAGCAAAGTTGGAAGCGGCTCATGGCGAACCGCAGGAATGCCCTCCGGCCTC
CGGAGACCTGAACGAAGTCATCGGCCAGCAGCAGGCTAAGCGCGCGCTTGAGATTGCTGCCGCAGGAGGCCACAATCTGT
TGTTTATTGGCCCGCCGGGGACCGGAAAGACAATGCTTGCTTCCCGACTAAACGGCCTGATGCCGCCCCTGAGCGATCGC
GAGGCGTTAGAAAGCGCCAGCGTCGCCAGTCTGGTCAATAGCGGCGAGCTGCAGCGCAATTGGCGCCAGAGGCCCTATCG
GGCCCCTCACCACAGCGCATCACTGTATGCGCTGGTCGGCGGAGGGTCGCTGCCTAAGCCTGGCGAAATATCTTTAGCCC
ATAACGGGATACTGTTTCTCGATGAGTTGCCTGAGTTCGAGCGCCGGGCGCTTGATGCGCTACGCGAACCGCTTGAGTCG
GGGGAGATCACTATTTCACGCGCCAGAGCAAAGATTACCTATCCGGCACGCTTCCAGCTGGTTGCAGCAATGAATCCCAG
TCCTACCGGCCACTATCGCGGCCTGCATAATCGCTCATCCCCCCAGCAGACATTGCGCTATCTCAGTCGCCTGTCTGGCC
CCTTCCTCGATCGATTCGATATTTCTCTTGAAGTCCCGCTGCTGGCACCGGGAGCGCTAAGTCATCGACGCGACGAAAGC
GAGTCCAGCCAACAGGTCCGTGAGCGCGTAGTATTGGCTCGGGAGCGTCAGCTGGCACGCTGCAGTAAAATGAATTCGGC
AATGAACAACCAGGAAATACGGGCTTGCTGTACACTGTCGCCTGTGGATGCAGAATGGCTAGAGCAGGTCATTCTCCAAC
TCGGTCTGTCAGTGCGAGCCTGGCAACGCATACTCAAGGTTGCCCGCACGATCGCGGACTTAGCAGGGGAACATATTATT
AGTCGGGATCATCTGACCGAGGCCGTTAGCTATCGGGCTATCGATCGACTGCTAATCCATTTGCAAAATAGCCTGGACTG
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB B2VI68

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

63.189

100

0.634

  comM Glaesserella parasuis strain SC1401

62.13

100

0.623

  comM Vibrio cholerae strain A1552

61.753

99.209

0.613

  comM Vibrio campbellii strain DS40M4

61.554

99.209

0.611

  comM Legionella pneumophila str. Paris

48.509

99.407

0.482

  comM Legionella pneumophila strain ERS1305867

48.509

99.407

0.482

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.269

100

0.443


Multiple sequence alignment