Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: R1Y13_RS29055
Genome accession: NZ_CP136999
Coordinates: 6222120..6223610 (+)
Length: 496 a.a.
NCBI ID: WP_012274789.1
UniProt ID: B0KQ52
Organism: Pseudomonas sp. NY8938
Function: DNA uptake (predicted from homology)
DNA binding and uptake
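
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention, which the source does not state explicitly) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Overview of this record.
start, end = 6222120, 6223610  # assumed 1-based, inclusive, (+) strand

gene_length_bp = end - start + 1      # inclusive interval length
codons = gene_length_bp // 3          # total codons, including the stop codon
protein_length_aa = codons - 1        # the stop codon is not translated

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the 1491 bp and 496 a.a. reported in the Sequence section.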

Genomic Context


Location: 6217120..6228610
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R1Y13_RS29030 (R1Y13_28910) glnK 6218046..6218384 (-) 339 WP_002555808.1 P-II family nitrogen regulator -
  R1Y13_RS29035 (R1Y13_28915) - 6218757..6219017 (+) 261 WP_041166618.1 accessory factor UbiK family protein -
  R1Y13_RS29040 (R1Y13_28920) - 6219018..6219371 (+) 354 WP_012274786.1 gamma-glutamylcyclotransferase family protein -
  R1Y13_RS29045 (R1Y13_28925) - 6219410..6219838 (+) 429 WP_012274787.1 molybdopterin-binding protein -
  R1Y13_RS29050 (R1Y13_28930) - 6219868..6221862 (-) 1995 WP_012274788.1 DUF4034 domain-containing protein -
  R1Y13_RS29055 (R1Y13_28935) comM 6222120..6223610 (+) 1491 WP_012274789.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  R1Y13_RS29060 (R1Y13_28940) - 6223786..6225573 (+) 1788 WP_012274790.1 monovalent cation:proton antiporter-2 (CPA2) family protein -
  R1Y13_RS29065 (R1Y13_28945) - 6225965..6226258 (+) 294 WP_012274791.1 hypothetical protein -
  R1Y13_RS29070 (R1Y13_28950) ycaC 6226374..6227003 (-) 630 WP_371039631.1 isochorismate family cysteine hydrolase YcaC -
  R1Y13_RS29075 (R1Y13_28955) - 6227139..6228050 (+) 912 WP_012274793.1 LysR substrate-binding domain-containing protein -
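
The Location range above appears to extend 5,000 bp on either side of comM. That padding is inferred from the numbers in this record and is not stated by the source. A minimal sketch of the window calculation, with `flanking_window` and `FLANK_BP` as hypothetical names and the gene coordinates copied from the table:

```python
FLANK_BP = 5000  # assumed padding, inferred from this record only

def flanking_window(start, end, flank=FLANK_BP):
    """Return the (start, end) window extended by `flank` bp on both sides."""
    return max(1, start - flank), end + flank

# (locus tag, start, end) copied from the Genomic Context table.
genes = [
    ("R1Y13_RS29030", 6218046, 6218384),
    ("R1Y13_RS29035", 6218757, 6219017),
    ("R1Y13_RS29040", 6219018, 6219371),
    ("R1Y13_RS29045", 6219410, 6219838),
    ("R1Y13_RS29050", 6219868, 6221862),
    ("R1Y13_RS29055", 6222120, 6223610),  # comM
    ("R1Y13_RS29060", 6223786, 6225573),
    ("R1Y13_RS29065", 6225965, 6226258),
    ("R1Y13_RS29070", 6226374, 6227003),
    ("R1Y13_RS29075", 6227139, 6228050),
]

window = flanking_window(6222120, 6223610)

# Keep only genes that lie entirely inside the window.
inside = [tag for tag, s, e in genes if window[0] <= s and e <= window[1]]

print(window, len(inside))
```

With these inputs the computed window matches the listed Location, and every gene in the table falls entirely inside it.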

Sequence


Protein


Length: 496 a.a.
Molecular weight: 52824.68 Da
Isoelectric point: 7.6866

>NTDB_id=894162 R1Y13_RS29055 WP_012274789.1 6222120..6223610(+) (comM) [Pseudomonas sp. NY8938]
MSLALVHSRAQVGVQAPAVSVETHLANGLPHLTLVGLPETTVKESKDRVRSAIVNSGLNYPQRRITQNLAPADLPKDGGR
YDLAIALGILAADGQVPVAALTEVECLGELALSGKLRPVQGVLPAALAAREAGRALVVPQENAEEASLAGGLVVYAVGHL
LELVAHLNGQVPLPPYAANGLILQQRPYPDLSEVQGQLAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDE
HEALEVAAIQSISGHAPLNSWPQRPFRHPHHSASGPALVGGSSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLES
GEIVIARARDKVRFPARFQLVAAMNPCPCGYLGDPTGRCRCSTEQIARYRNKLSGPLLDRIDLHLTVARESTTLNNQPCG
EPSADVAAKVAQARAVQHKRQGCANAFLDLEGLRRHCGLAPADQAWLEGACERLTLSLRSAHRLLKVARTLADLECCETI
GRPHLAEALQYRPGSG
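
The FASTA header above packs several fields into one line. The sketch below shows one way to split it into named fields; the field layout is inferred from this single record rather than from any documented specification, and `HEADER_RE` and `parse_header` are hypothetical names.

```python
import re

# Layout assumed from this record:
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)

def parse_header(line):
    """Parse a header line into a dict; raise ValueError if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header: " + line)
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record

header = (">NTDB_id=894162 R1Y13_RS29055 WP_012274789.1 "
          "6222120..6223610(+) (comM) [Pseudomonas sp. NY8938]")
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```

Headers from other records may deviate from this layout, so the function raises an error rather than guessing.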

Nucleotide


Length: 1491 bp

>NTDB_id=894162 R1Y13_RS29055 WP_012274789.1 6222120..6223610(+) (comM) [Pseudomonas sp. NY8938]
ATGTCCCTAGCACTCGTCCACAGCCGCGCCCAAGTGGGCGTGCAGGCTCCAGCAGTCAGTGTCGAAACCCACCTGGCCAA
TGGCTTGCCCCATCTCACCCTAGTCGGCCTGCCGGAAACCACGGTCAAGGAAAGCAAGGACCGGGTGCGCAGTGCCATCG
TCAATTCCGGGCTGAACTACCCGCAGCGTCGCATTACCCAAAACCTCGCCCCCGCCGACCTGCCCAAGGATGGCGGGCGT
TACGACCTGGCCATTGCCTTGGGCATCCTCGCCGCCGACGGCCAGGTGCCGGTTGCCGCCCTTACCGAGGTCGAGTGCCT
GGGCGAACTGGCGCTGTCTGGCAAACTGCGCCCGGTGCAGGGTGTGCTGCCGGCGGCACTGGCGGCGCGCGAGGCAGGCC
GGGCACTGGTAGTACCACAGGAGAATGCCGAGGAAGCCAGCCTGGCAGGCGGGCTGGTGGTGTATGCGGTGGGGCACCTG
TTGGAGCTGGTCGCCCACCTGAATGGCCAGGTGCCGCTGCCGCCGTATGCTGCCAACGGCCTGATCCTGCAGCAACGGCC
TTACCCGGACCTGAGTGAAGTGCAGGGGCAGCTGGCGGCCAAGCGTGCCCTGTTGCTGGCGGCCGCCGGGGCACACAACC
TGCTGTTCACCGGGCCTCCCGGCACCGGTAAAACGTTGCTTGCCAGCCGCTTGCCGGGGCTGCTGCCGCCGCTGGACGAG
CACGAGGCGCTCGAAGTGGCGGCAATCCAGTCAATCAGCGGGCATGCACCGCTGAACAGCTGGCCGCAGCGGCCGTTCCG
CCACCCTCATCACTCCGCCTCCGGCCCGGCGCTGGTCGGGGGCAGCAGCCGACCGCAACCAGGCGAAATAACCCTGGCCC
ATCACGGCGTACTGTTTCTGGATGAGCTGCCAGAGTTCGAGCGGCGCGTGCTGGAGGTGCTGCGCGAGCCGCTCGAGTCG
GGCGAAATCGTGATTGCCCGGGCCCGCGACAAGGTGCGCTTCCCAGCGCGCTTCCAACTGGTGGCGGCGATGAACCCGTG
CCCTTGCGGCTACCTGGGCGACCCCACCGGCCGCTGCCGCTGCAGCACCGAGCAGATCGCACGGTATCGCAACAAGCTGT
CCGGGCCACTGCTGGACCGCATCGACCTGCACCTGACCGTGGCCCGCGAGAGCACCACGCTGAACAACCAACCCTGCGGT
GAGCCCAGTGCCGATGTGGCGGCCAAAGTGGCCCAGGCGCGCGCGGTGCAGCACAAGCGGCAAGGGTGTGCGAATGCTTT
TCTCGATCTCGAGGGCTTGCGCCGCCACTGCGGGTTGGCGCCGGCGGACCAGGCCTGGCTGGAAGGGGCCTGCGAGCGGC
TGACCTTGTCGTTGCGATCGGCACACCGGTTATTGAAAGTGGCGCGGACACTGGCGGACCTGGAGTGTTGTGAGACGATT
GGCAGGCCGCATCTGGCCGAGGCCCTGCAATACCGACCAGGGAGCGGTTAG
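
As a quick consistency check, the ends of the nucleotide sequence can be translated and compared with the ends of the protein sequence. The sketch below assumes the standard genetic code, which is an assumption rather than something the record states, and translates only the first seven codons and the last five codons copied from the sequence above. `translate` is a hypothetical helper.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

first = translate("ATGTCCCTAGCACTCGTCCAC")  # first 21 nt of the gene
last = translate("CCAGGGAGCGGTTAG")         # last 15 nt of the gene

print(first, last)
```

The translated ends match the start (MSLALVH) and the end (PGSG, followed by a stop codon) of the protein sequence listed above.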


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID
  AlphaFold DB B0KQ52

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4 55.444 100 0.554
  comM Haemophilus influenzae Rd KW20 54.6 100 0.55
  comM Vibrio cholerae strain A1552 54.747 99.798 0.546
  comM Glaesserella parasuis strain SC1401 53.094 100 0.536
  comM Legionella pneumophila str. Paris 48.992 100 0.49
  comM Legionella pneumophila strain ERS1305867 48.992 100 0.49
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 46.016 100 0.466