Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   HG549_RS25030 Genome accession   NZ_CP051857
Coordinates   5494627..5496117 (+) Length   496 a.a.
NCBI ID   WP_170033713.1    Uniprot ID   A0A7Z3CJI4
Organism   Pseudomonas sp. SK     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 5489627..5501117
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HG549_RS25005 (HG549_25020) - 5489637..5490968 (-) 1332 WP_170033705.1 ammonium transporter -
  HG549_RS25010 (HG549_25025) glnK 5491017..5491355 (-) 339 WP_002555808.1 P-II family nitrogen regulator -
  HG549_RS25015 (HG549_25030) - 5491754..5492020 (+) 267 WP_170033707.1 accessory factor UbiK family protein -
  HG549_RS25020 (HG549_25035) - 5492021..5492374 (+) 354 WP_170033709.1 gamma-glutamylcyclotransferase family protein -
  HG549_RS25025 (HG549_25040) - 5492328..5494374 (-) 2047 Protein_4933 DUF4034 domain-containing protein -
  HG549_RS25030 (HG549_25045) comM 5494627..5496117 (+) 1491 WP_170033713.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  HG549_RS25035 (HG549_25050) - 5496185..5496613 (-) 429 WP_170033715.1 DoxX family protein -
  HG549_RS25040 (HG549_25055) - 5496720..5497088 (-) 369 WP_170033717.1 response regulator transcription factor -
  HG549_RS25045 (HG549_25060) - 5497085..5498077 (-) 993 WP_170033719.1 response regulator -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 52738.42 Da        Isoelectric Point: 7.9062

>NTDB_id=440966 HG549_RS25030 WP_170033713.1 5494627..5496117(+) (comM) [Pseudomonas sp. SK]
MSLALVHSRAQVGVQAPAVSVETHLANGLPHLTLVGLPETTVKESKDRVRSAIVNSGLNYPPRRITQNLAPADLPKDGGR
YDLAIALGILAADGQVPTAPLTELECLGELALSGKLRPVQGVLPAALAARDAGRALVVPRENAEEASLAGGLVVYAVGHL
LELVAHLNGQVPLPPYAANGLILQQRPYPDLSEVQGQLAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDE
HEALEVAAIRSVSGHTPLSSWPQRPFRHPHHSASGPALVGGGSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLES
GEIVIARARDKVRFPARFQLVAAMNPCPCGYLGDPSGRCRCSTEQIARYRNKLSGPLLDRIDLHLTVARESTTLNNQPCG
ETSADVAAKVAEARDAQQRRQGCANAFLDLEGLRRNCGLAAADQAWLESACERLTLSLRAAHRLLKVARTLADLEGSQAI
GRAHLAEALQYRPGSS

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=440966 HG549_RS25030 WP_170033713.1 5494627..5496117(+) (comM) [Pseudomonas sp. SK]
ATGTCCCTAGCCCTCGTCCATAGCCGCGCCCAAGTGGGCGTACAGGCACCAGCGGTCAGCGTCGAAACTCACCTGGCCAA
TGGTTTGCCCCATCTCACCCTGGTCGGCCTGCCGGAAACCACGGTCAAGGAAAGCAAGGACCGGGTGCGCAGCGCCATTG
TCAATTCCGGGCTGAACTACCCGCCGCGGCGCATCACCCAGAACCTTGCGCCCGCCGACCTGCCCAAGGATGGCGGGCGT
TACGACCTGGCCATTGCCCTGGGCATCCTGGCCGCCGATGGCCAGGTGCCAACGGCGCCGCTAACCGAACTTGAATGCCT
GGGTGAACTGGCTTTGTCTGGCAAGCTGCGCCCGGTTCAGGGCGTGCTGCCCGCAGCGCTGGCAGCACGCGATGCAGGCA
GGGCGCTGGTGGTGCCGCGGGAAAATGCCGAGGAAGCCAGCCTGGCTGGCGGGCTGGTGGTGTATGCGGTGGGGCATCTG
CTGGAACTGGTCGCCCACCTGAACGGCCAGGTACCACTGCCGCCCTATGCCGCCAACGGCCTGATACTGCAACAACGTCC
CTACCCGGACCTCAGCGAGGTGCAAGGCCAGCTGGCGGCCAAGCGTGCATTGCTGTTGGCGGCGGCCGGGGCGCATAACC
TGTTGTTCACCGGGCCACCCGGCACTGGCAAGACCTTGCTCGCCAGCCGCCTGCCGGGGCTGCTGCCGCCGCTGGACGAG
CACGAGGCGCTGGAAGTGGCTGCGATCCGCTCGGTGAGTGGCCATACACCGCTGAGCAGTTGGCCGCAGCGGCCCTTTCG
CCATCCGCACCACTCGGCCTCCGGTCCGGCGCTGGTCGGTGGCGGCAGCCGACCGCAGCCGGGCGAAATCACCCTTGCCC
ACCATGGTGTGCTGTTTCTGGATGAGCTGCCGGAATTCGAACGGCGGGTACTGGAAGTGCTGCGCGAGCCCCTGGAATCC
GGCGAGATCGTGATTGCCCGGGCCCGCGACAAGGTGCGCTTCCCCGCCCGGTTCCAGTTAGTGGCGGCAATGAACCCGTG
CCCTTGCGGCTACCTGGGCGATCCATCCGGGCGCTGTCGCTGCAGCACCGAGCAGATCGCGCGGTACCGCAACAAGCTGT
CCGGGCCGTTGCTGGACCGTATCGACCTGCACCTGACCGTGGCCCGCGAGAGCACCACGCTGAACAACCAGCCTTGCGGT
GAAACCAGTGCCGACGTCGCCGCCAAGGTTGCCGAGGCACGGGATGCCCAGCAAAGACGGCAGGGATGCGCCAATGCGTT
TCTCGACCTTGAGGGGCTGCGCCGTAATTGCGGACTGGCAGCGGCAGACCAGGCCTGGCTGGAGAGTGCGTGTGAACGGC
TGACCCTGTCGTTGCGCGCGGCGCACCGCTTGCTGAAGGTGGCGCGAACACTGGCCGATCTGGAAGGTAGCCAGGCAATT
GGCCGGGCGCACCTGGCCGAGGCCCTGCAGTACCGGCCGGGGAGCAGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7Z3CJI4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio campbellii strain DS40M4

56.162

99.798

0.56

  comM Haemophilus influenzae Rd KW20

55.6

100

0.56

  comM Vibrio cholerae strain A1552

55.354

99.798

0.552

  comM Glaesserella parasuis strain SC1401

53.6

100

0.54

  comM Legionella pneumophila str. Paris

49.194

100

0.492

  comM Legionella pneumophila strain ERS1305867

49.194

100

0.492

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.614

100

0.472