Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   E2K93_RS07870 Genome accession   NZ_CP038493
Coordinates   1851993..1853531 (+) Length   512 a.a.
NCBI ID   WP_135438573.1    Uniprot ID   A0A4P7JSU3
Organism   Thalassotalea sp. HSM 43     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 1846993..1858531
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E2K93_RS07855 (E2K93_07855) ilvD 1848518..1850374 (-) 1857 WP_135438570.1 dihydroxy-acid dehydratase -
  E2K93_RS07860 (E2K93_07860) - 1850503..1851429 (-) 927 WP_135438571.1 branched-chain amino acid transaminase -
  E2K93_RS07865 (E2K93_07865) - 1851607..1851825 (+) 219 WP_135438572.1 hypothetical protein -
  E2K93_RS07870 (E2K93_07870) comM 1851993..1853531 (+) 1539 WP_135438573.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  E2K93_RS07875 (E2K93_07875) ilvY 1853528..1854418 (-) 891 WP_135438574.1 HTH-type transcriptional activator IlvY -
  E2K93_RS07880 (E2K93_07880) ilvC 1854551..1856035 (+) 1485 WP_135438575.1 ketol-acid reductoisomerase -
  E2K93_RS07885 (E2K93_07885) - 1856171..1857247 (+) 1077 WP_135438576.1 asparaginase -

Sequence


Protein


Download         Length: 512 a.a.        Molecular weight: 56253.57 Da        Isoelectric Point: 7.8216

>NTDB_id=354740 E2K93_RS07870 WP_135438573.1 1851993..1853531(+) (comM) [Thalassotalea sp. HSM 43]
MSLASIYSRARIGLDAPQVCIEIHLSNGLPAFHMVGMAETSVKEAKNRVRSAIINCGFEFPQKKIVVNLAPADLPKDGGH
FDLPIAVGILVASQQLPKNQIDDYEFAGELALNGQLRKIVGDIPMAMAVREANRQLFLPKANAAQASRVDGVAIIALQRL
DQLFAHFSGQQRLPLHQQAEDAAAQQVLKPPCHSADMQDVIGQPLAKRALELAAAGSHNLLFIGPPGTGKTMLASRLPGI
LPAMSDEESLQAAAIQSICSDDFDTSRWHIRPFRAPHHTASSAALVGGGSTPMPGEISLAHNGILFLDELPEYDRKVLDV
LREPMESGEVTISRALHKTTYPAQFQLVAAMNPSPTGFYSDRRSTPEQILRYLNKLSGPFLDRIDIQLEVARLPKGAWNQ
ANGNEESSEIIRQRVVACRQRQLTRQGKANAHLTSPQLKRYCRLVDDDAEFLDLAMEKLGLSTRAHHKILKIARTIADLK
QQTEIHRADLVEALSYRAMDRLIRHLTESVGP

Nucleotide


Download         Length: 1539 bp        

>NTDB_id=354740 E2K93_RS07870 WP_135438573.1 1851993..1853531(+) (comM) [Thalassotalea sp. HSM 43]
ATGAGTTTAGCCAGTATTTACAGTCGTGCCCGTATTGGTCTCGACGCACCGCAAGTATGTATTGAAATTCACCTCAGCAA
TGGCTTACCGGCATTTCACATGGTCGGCATGGCAGAGACCTCGGTAAAAGAAGCGAAAAACCGAGTACGCAGTGCCATCA
TCAATTGTGGCTTTGAGTTTCCACAAAAGAAAATCGTCGTCAACCTCGCGCCGGCAGACTTACCAAAAGACGGCGGCCAT
TTTGATTTGCCAATCGCCGTCGGTATCCTAGTCGCATCGCAGCAATTGCCAAAAAACCAAATCGACGATTATGAGTTTGC
CGGTGAACTGGCACTGAATGGTCAATTACGAAAAATCGTTGGCGATATTCCTATGGCAATGGCGGTGAGAGAGGCTAACC
GGCAATTGTTTTTGCCAAAAGCCAACGCCGCGCAAGCCAGTCGAGTTGACGGTGTGGCGATCATCGCCCTGCAGCGCCTT
GATCAGTTATTCGCGCATTTTAGTGGTCAGCAACGGTTACCATTGCATCAGCAAGCCGAAGATGCCGCTGCTCAACAAGT
CCTCAAACCACCATGCCACTCTGCCGATATGCAAGATGTCATCGGCCAACCGCTTGCCAAACGCGCACTCGAGCTTGCTG
CTGCCGGCTCACACAACCTGTTGTTTATCGGCCCACCGGGCACCGGTAAAACCATGTTGGCATCACGTCTACCTGGCATA
TTGCCCGCCATGTCCGATGAAGAATCACTACAAGCGGCGGCAATTCAATCGATTTGCAGCGATGATTTTGATACCAGCCG
ATGGCATATTCGGCCATTTCGCGCGCCGCACCATACCGCGTCATCAGCAGCCTTGGTTGGCGGCGGTTCAACGCCGATGC
CCGGTGAAATATCCTTAGCCCACAACGGCATTTTGTTTCTCGATGAACTGCCTGAGTACGATCGCAAGGTGCTTGATGTA
TTGCGCGAGCCAATGGAGTCTGGCGAAGTGACTATTTCCAGAGCGTTACACAAAACCACCTACCCGGCTCAGTTTCAATT
GGTGGCGGCAATGAATCCCAGCCCAACCGGTTTTTACAGCGATCGACGCAGCACCCCTGAGCAAATATTACGCTACCTCA
ACAAACTGTCCGGGCCGTTTTTAGACCGTATTGATATTCAGCTAGAAGTGGCTCGTCTACCCAAAGGCGCTTGGAATCAA
GCCAATGGCAATGAAGAGAGTAGTGAGATTATTCGCCAACGTGTGGTCGCCTGTCGCCAGCGGCAATTAACACGCCAAGG
CAAAGCCAATGCCCACCTAACCAGCCCACAGCTAAAGCGTTACTGTCGATTGGTCGATGACGATGCTGAGTTTCTTGATT
TGGCGATGGAAAAACTCGGTCTGTCGACCAGAGCACATCACAAAATTCTAAAAATTGCCCGTACCATTGCCGATCTTAAA
CAACAAACCGAGATACATCGCGCCGATTTAGTCGAAGCGTTAAGCTATCGAGCCATGGACCGGCTGATACGTCATTTGAC
CGAAAGTGTTGGCCCTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4P7JSU3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

56.189

99.414

0.559

  comM Vibrio campbellii strain DS40M4

55.206

99.414

0.549

  comM Haemophilus influenzae Rd KW20

54.224

99.414

0.539

  comM Glaesserella parasuis strain SC1401

53.045

99.414

0.527

  comM Legionella pneumophila str. Paris

46.414

98.047

0.455

  comM Legionella pneumophila strain ERS1305867

46.414

98.047

0.455

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.118

99.609

0.439


Multiple sequence alignment