Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   K7R23_RS06550 Genome accession   NZ_CP082833
Coordinates   1388748..1390208 (-) Length   486 a.a.
NCBI ID   WP_042623004.1    Uniprot ID   A0A482PRQ3
Organism   Citrobacter rodentium NBRC 105723 = DSM 16636     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 1383748..1395208
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  K7R23_RS06540 (K7R23_06455) hdfR 1387372..1388205 (-) 834 WP_012907957.1 HTH-type transcriptional regulator HdfR -
  K7R23_RS06545 (K7R23_06460) maoP 1388324..1388662 (+) 339 WP_012907956.1 macrodomain Ori organization protein MaoP -
  K7R23_RS06550 (K7R23_06465) comM 1388748..1390208 (-) 1461 WP_042623004.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  K7R23_RS06555 (K7R23_06470) ilvL 1390560..1390658 (+) 99 WP_001311244.1 ilv operon leader peptide -
  K7R23_RS25925 (K7R23_06475) ilvX 1390744..1390794 (+) 51 WP_217968170.1 peptide IlvX -
  K7R23_RS06560 (K7R23_06480) ilvG 1390797..1392443 (+) 1647 WP_012907955.1 acetolactate synthase 2 catalytic subunit -
  K7R23_RS06565 (K7R23_06485) ilvM 1392440..1392703 (+) 264 WP_012907954.1 acetolactate synthase 2 small subunit -
  K7R23_RS06570 (K7R23_06490) ilvE 1392722..1393651 (+) 930 WP_012907953.1 branched-chain-amino-acid transaminase -

Sequence


Protein


Download         Length: 486 a.a.        Molecular weight: 52810.32 Da        Isoelectric Point: 7.2585

>NTDB_id=602771 K7R23_RS06550 WP_042623004.1 1388748..1390208(-) (comM) [Citrobacter rodentium NBRC 105723 = DSM 16636]
MSLSIVHTRAALGVHAPAITIEVHISNGLPGLTLVGLPETTVKEARDRVRSAIINSGYEFPAKKITINLAPADLPKEGGR
YDLPIAVALLAASEQLTTHRLNQYELVGELALTGALRGVPGAISSATEAIRAGRGIIVAKENEEEVGLIGGDGCLVADHL
QAVCAFLEGKQSLDHPTASEAAPCATHEDLRDIIGQEQGKRSLEITAAGGHNLLLIGPPGTGKTMLASRLNGLLPPLSNE
EAQESAAILSLVNATCVQKQWKQRPFRAPHHSASLTAMVGGGSIPAPGEISLAHNGILFLDELPEFERRTLDALREPIES
GQIHLSRTRAKITYPARFQLVAAMNPSPTGHYQGNHNRCTPEQTLRYLARLSGPFLDRFDLSLEIPLLPPGILRQTELKG
ESSDTVKRRVIAAQSRQYQRQGKLNAHLHSREIRQYCSLAAEDSQWLEETLIHLGLSIRAWQRLLKVSRTIADLEQAEEI
TRRHLQ

Nucleotide


Download         Length: 1461 bp        

>NTDB_id=602771 K7R23_RS06550 WP_042623004.1 1388748..1390208(-) (comM) [Citrobacter rodentium NBRC 105723 = DSM 16636]
ATGTCACTGTCTATTGTTCATACGCGAGCTGCGCTTGGCGTTCACGCCCCAGCCATCACTATCGAAGTTCATATCAGTAA
TGGTCTGCCCGGTTTAACCCTGGTGGGTTTGCCGGAAACCACCGTTAAAGAGGCTCGCGACCGGGTGCGTAGCGCCATCA
TCAATAGCGGTTATGAATTTCCGGCGAAAAAAATCACCATTAACCTGGCTCCGGCCGATCTGCCGAAAGAGGGAGGAAGG
TACGATTTACCTATCGCTGTCGCGCTTCTGGCGGCTTCAGAACAGCTTACAACCCATAGGCTTAATCAATACGAGTTAGT
GGGCGAATTAGCGCTTACAGGCGCGCTACGCGGCGTTCCTGGCGCCATATCCAGCGCAACGGAAGCGATACGGGCAGGAA
GAGGCATCATTGTCGCCAAAGAGAACGAAGAGGAAGTCGGGCTTATTGGTGGCGATGGCTGCCTGGTGGCCGATCATTTG
CAGGCCGTCTGCGCTTTTCTGGAAGGAAAGCAGTCGCTTGATCATCCCACCGCCAGTGAAGCTGCGCCCTGCGCTACGCA
TGAGGATCTGCGCGATATCATTGGTCAGGAGCAAGGTAAACGCAGTCTCGAAATTACGGCGGCAGGCGGACATAACCTGC
TGTTGATAGGTCCACCAGGTACAGGGAAAACGATGCTGGCCAGCCGTCTCAATGGACTTTTGCCGCCGCTGAGCAACGAG
GAGGCGCAGGAGAGCGCGGCTATCCTCAGCCTGGTCAACGCAACCTGTGTGCAAAAGCAGTGGAAACAACGCCCTTTTCG
GGCGCCGCATCACAGCGCATCCCTGACGGCAATGGTCGGCGGCGGCTCAATCCCCGCGCCGGGAGAAATATCGCTGGCGC
ATAACGGCATACTTTTTCTGGATGAATTACCCGAATTTGAACGCCGCACCCTGGATGCGCTACGTGAACCCATTGAATCC
GGTCAGATTCATCTTTCGCGTACGCGAGCCAAGATCACTTATCCCGCCCGCTTTCAGCTGGTTGCGGCTATGAATCCCAG
CCCTACGGGGCATTATCAGGGCAATCATAACCGCTGTACGCCCGAACAGACGCTGCGCTATCTCGCCCGACTCTCCGGTC
CGTTTCTTGATCGCTTTGATCTCTCACTGGAGATCCCTTTACTGCCGCCTGGCATTCTCAGGCAAACGGAGCTCAAAGGG
GAAAGTAGCGATACGGTAAAACGGCGGGTCATCGCCGCACAGTCGCGCCAGTATCAGCGCCAGGGCAAACTGAATGCACA
TTTACACAGCAGGGAGATTCGCCAGTATTGTTCGCTGGCTGCTGAGGATTCGCAGTGGCTGGAGGAGACGCTTATCCATC
TCGGGTTATCAATACGGGCGTGGCAGCGGCTATTGAAAGTATCGAGGACCATTGCCGATCTTGAACAGGCGGAGGAGATT
ACACGCCGGCACTTGCAGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A482PRQ3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

59.146

100

0.599

  comM Vibrio cholerae strain A1552

59.465

100

0.595

  comM Glaesserella parasuis strain SC1401

58.215

100

0.591

  comM Vibrio campbellii strain DS40M4

58.436

100

0.584

  comM Legionella pneumophila str. Paris

47.561

100

0.481

  comM Legionella pneumophila strain ERS1305867

47.561

100

0.481

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

42.169

100

0.432