Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   CPS_RS21795 Genome accession   NC_003910
Coordinates   5132247..5133776 (-) Length   509 a.a.
NCBI ID   WP_011045560.1    Uniprot ID   Q47UP2
Organism   Colwellia psychrerythraea 34H     
Function   required for natural transformation (predicted from homology)
Unclear
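
The coordinates and lengths in the overview are consistent with each other, as the following minimal Python sketch shows. It assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# comM (CPS_RS21795) coordinates taken from the overview above
length_bp = gene_length_bp(5132247, 5133776)
print(length_bp, protein_length_aa(length_bp))
```

For this record the sketch gives 1530 bp and 509 a.a., matching the values listed on the page.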

Genomic Context


Location: 5127247..5138776
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CPS_RS21770 (CPS_4836) - 5127323..5127781 (-) 459 WP_011045555.1 NfeD family protein -
  CPS_RS21775 (CPS_4837) - 5127860..5128837 (-) 978 WP_011045556.1 SPFH domain-containing protein -
  CPS_RS21780 (CPS_4838) - 5129245..5129559 (-) 315 WP_011045557.1 pyrimidine/purine nucleoside phosphorylase -
  CPS_RS21785 (CPS_4839) ilvC 5129718..5131199 (-) 1482 WP_011045558.1 ketol-acid reductoisomerase -
  CPS_RS21790 (CPS_4840) ilvY 5131355..5132233 (+) 879 WP_011045559.1 HTH-type transcriptional activator IlvY -
  CPS_RS21795 (CPS_4841) comM 5132247..5133776 (-) 1530 WP_011045560.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  CPS_RS21800 (CPS_4843) ilvG 5134408..5136072 (+) 1665 WP_041737179.1 acetolactate synthase 2 catalytic subunit -
  CPS_RS21805 (CPS_4844) ilvM 5136074..5136352 (+) 279 WP_049757955.1 acetolactate synthase 2 small subunit -
  CPS_RS21810 (CPS_4845) - 5136475..5137401 (+) 927 WP_011045564.1 branched-chain amino acid transaminase -
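
The "Location" range above equals the comM coordinates extended by 5,000 bp on each side. The sketch below reproduces that window and checks a few of the listed neighbours against it. The 5,000 bp flank is inferred from the numbers on this page, not taken from a documented database setting, and the helper names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates shown on this page


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a gene's coordinates by `flank` bp on both sides (1-based, inclusive)."""
    return max(1, start - flank), end + flank


def size_bp(start: int, end: int) -> int:
    """Feature size in bp for 1-based, inclusive coordinates."""
    return end - start + 1


# A subset of the rows in the table above: locus tag -> (start, end)
neighbours = {
    "CPS_RS21770": (5127323, 5127781),
    "CPS_RS21790": (5131355, 5132233),
    "CPS_RS21795": (5132247, 5133776),  # comM
    "CPS_RS21810": (5136475, 5137401),
}

window = context_window(5132247, 5133776)
for tag, (start, end) in neighbours.items():
    inside = window[0] <= start and end <= window[1]
    print(tag, size_bp(start, end), "inside window" if inside else "outside window")
```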

Sequence


Protein


Length: 509 a.a.        Molecular weight: 55852.26 Da        Isoelectric point: 7.8010

>NTDB_id=22025 CPS_RS21795 WP_011045560.1 5132247..5133776(-) (comM) [Colwellia psychrerythraea 34H]
MSLSCVYSRARIGLESPLVTVEVHLANGLPAFNIVGLPEASVKESKDRVRSAIINCGYEFPAKRITVNLAPADLPKEGGR
FDLPIAVGILAASEQLPAVDLSQYEFAGELALSGELRAIVGEIPVAMASNQSKRTLIIPTQNIEQASWVKQAKIHAISHL
TQLFSHFAGQQVLPLVTDKIMAEEMAELDQTLDMSDVMGQPLAKRALEIAASGGHNLLFIGPPGTGKTMLASRLAGILPP
MTENEALQVAAIQSISHQGITRKSWFTRPFRAPHHTASSAALVGGGSQPKPGEITLAHNGVLFLDELPEFERKVLDVLRE
PMESGEVTISRALHKQCFPARFQLIAAMNPSPTGFYNDQRSTPEQVLRYLNRLSGPFLDRIDIQIEVARLPRGTWAQSSQ
KNESSAQVQQRVRVCRTIQLERQGKANAHISSSELKRYCDLNTDDNEFLELAVEKLGLSTRAHHKILKIARTLADMEGSL
NICNKHLVEALSYRAMDRLLRHLMNAVSV

Nucleotide


Length: 1530 bp

>NTDB_id=22025 CPS_RS21795 WP_011045560.1 5132247..5133776(-) (comM) [Colwellia psychrerythraea 34H]
ATGTCACTTTCTTGTGTGTACAGTCGGGCACGAATCGGCCTTGAATCTCCCCTTGTTACGGTTGAAGTTCATCTTGCCAA
TGGTTTACCAGCATTCAATATCGTCGGGTTACCCGAAGCCTCAGTAAAAGAGTCAAAAGACAGGGTGCGTAGCGCCATCA
TAAATTGTGGTTACGAATTTCCGGCTAAACGTATCACGGTTAATTTGGCGCCTGCTGATTTACCCAAAGAAGGTGGCCGT
TTCGATTTACCCATAGCGGTTGGAATTCTCGCTGCGTCGGAACAACTTCCAGCCGTTGATCTATCTCAATATGAATTTGC
GGGAGAGCTGGCCTTATCCGGTGAGCTACGGGCGATTGTCGGGGAAATTCCTGTAGCCATGGCAAGTAATCAAAGTAAGC
GGACTTTGATTATTCCTACACAAAATATCGAACAGGCCAGTTGGGTTAAACAAGCAAAAATTCATGCCATAAGCCATTTA
ACGCAGCTGTTTTCTCATTTTGCCGGTCAACAAGTTTTACCGCTAGTGACTGATAAAATAATGGCAGAAGAAATGGCTGA
ACTAGATCAAACATTGGATATGAGTGATGTTATGGGGCAACCCTTGGCAAAACGAGCGTTGGAAATTGCTGCTAGCGGAG
GTCATAATTTATTATTCATCGGCCCTCCAGGTACCGGTAAAACCATGTTAGCTAGTCGTTTAGCCGGTATTTTACCGCCG
ATGACCGAGAACGAGGCGCTTCAAGTCGCCGCTATTCAATCGATTAGCCATCAAGGTATTACACGTAAGTCGTGGTTTAC
TCGACCTTTTCGAGCACCGCATCATACTGCGTCATCAGCGGCTTTAGTCGGAGGAGGCAGTCAGCCCAAGCCCGGAGAAA
TCACGCTCGCACACAACGGTGTATTATTTTTAGATGAATTACCTGAGTTTGAACGAAAAGTATTAGACGTATTGCGAGAG
CCAATGGAGTCGGGTGAAGTTACTATTTCCAGAGCGTTACACAAACAATGCTTTCCCGCACGTTTTCAATTGATTGCAGC
GATGAACCCGAGTCCTACCGGCTTTTATAACGACCAACGGAGTACGCCAGAACAAGTTTTACGCTATTTAAATCGTTTGT
CAGGACCTTTTCTTGATCGCATTGATATTCAAATTGAAGTAGCAAGACTACCTCGAGGTACATGGGCACAGAGCAGCCAA
AAAAATGAGTCAAGCGCACAGGTACAACAACGAGTGCGAGTCTGTCGGACTATCCAACTTGAAAGGCAAGGCAAAGCCAA
CGCGCACATCAGCAGTAGTGAGTTAAAGCGGTATTGTGATCTAAATACGGATGACAATGAATTTTTAGAACTTGCCGTAG
AAAAGTTAGGTTTATCAACCCGCGCGCATCACAAAATTTTGAAAATTGCCCGTACCCTTGCAGACATGGAAGGGAGCTTG
AATATTTGTAATAAACATTTGGTTGAGGCCCTATCTTATCGGGCTATGGACAGATTGTTACGTCATTTAATGAATGCCGT
TTCGGTCTAA
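
Both FASTA headers above share one layout: database ID, locus tag, protein ID, coordinates with strand, gene name, and organism. The following is a minimal sketch of a parser for that layout, assuming the format seen in this record holds for other records as well; the regular expression and the `RecordHeader` type are illustrative and not part of any official tooling.

```python
import re
from typing import NamedTuple


class RecordHeader(NamedTuple):
    ntdb_id: int
    locus_tag: str
    protein_id: str
    start: int
    end: int
    strand: str
    gene: str
    organism: str


# Pattern written against the header shown on this page (an assumption,
# not a documented specification of the header format).
HEADER_RE = re.compile(
    r"^>NTDB_id=(\d+)\s+(\S+)\s+(\S+)\s+(\d+)\.\.(\d+)\(([+-])\)"
    r"\s+\(([^)]+)\)\s+\[(.+)\]\s*$"
)


def parse_header(line: str) -> RecordHeader:
    """Split a FASTA header line of this record type into its fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    ntdb_id, locus, protein, start, end, strand, gene, organism = match.groups()
    return RecordHeader(int(ntdb_id), locus, protein, int(start), int(end),
                        strand, gene, organism)


header = parse_header(
    ">NTDB_id=22025 CPS_RS21795 WP_011045560.1 5132247..5133776(-) "
    "(comM) [Colwellia psychrerythraea 34H]"
)
print(header.gene, header.strand, header.end - header.start + 1)
```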


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q47UP2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Haemophilus influenzae Rd KW20   59.136   100   0.591
  comM   Vibrio cholerae strain A1552   58.218   99.214   0.578
  comM   Vibrio campbellii strain DS40M4   56.436   99.214   0.56
  comM   Glaesserella parasuis strain SC1401   55.512   99.804   0.554
  comM   Legionella pneumophila str. Paris   47.809   98.625   0.472
  comM   Legionella pneumophila strain ERS1305867   47.809   98.625   0.472
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   45.669   99.804   0.456
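
The Ha-values listed above agree with the product of identity and coverage, each expressed as a fraction. The sketch below checks this for every row. This relationship is an observation drawn from the numbers on this page, not a definition quoted from the database, and the function name is hypothetical.

```python
def identity_coverage_product(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage as fractions, rounded to 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)


# (organism, identities %, coverage %, Ha-value as listed on the page)
rows = [
    ("Haemophilus influenzae Rd KW20", 59.136, 100.0, 0.591),
    ("Vibrio cholerae strain A1552", 58.218, 99.214, 0.578),
    ("Vibrio campbellii strain DS40M4", 56.436, 99.214, 0.56),
    ("Glaesserella parasuis strain SC1401", 55.512, 99.804, 0.554),
    ("Legionella pneumophila str. Paris", 47.809, 98.625, 0.472),
    ("Legionella pneumophila strain ERS1305867", 47.809, 98.625, 0.472),
    ("Riemerella anatipestifer ATCC 11845 = DSM 15868", 45.669, 99.804, 0.456),
]

for organism, identity, coverage, listed in rows:
    computed = identity_coverage_product(identity, coverage)
    print(f"{organism}: computed {computed}, listed {listed}")
```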


Multiple sequence alignment