Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: GU3_RS02415
Genome accession: NC_016745
Coordinates: 497935..499446 (-)
Length: 503 a.a.
NCBI ID: WP_014290964.1
UniProt ID: H2G0Y9
Organism: Oceanimonas sp. GK1
Function: ssDNA binding (predicted from homology); DNA processing

Genomic Context


Location: 492935..504446
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
GU3_RS02390 (GU3_02330) | cueR | 493931..494344 (+) | 414 | WP_014290959.1 | Cu(I)-responsive transcriptional regulator | -
GU3_RS02395 (GU3_02335) | ccoG | 494476..495897 (+) | 1422 | WP_041542862.1 | cytochrome c oxidase accessory protein CcoG | -
GU3_RS02400 (GU3_02340) | - | 495959..496927 (+) | 969 | WP_014290961.1 | serine/threonine protein kinase | -
GU3_RS02405 (GU3_02345) | - | 496970..497572 (+) | 603 | WP_014290962.1 | thiol:disulfide interchange protein DsbA/DsbL | -
GU3_RS02410 (GU3_02350) | - | 497616..497906 (-) | 291 | WP_014290963.1 | hypothetical protein | -
GU3_RS02415 (GU3_02355) | comM | 497935..499446 (-) | 1512 | WP_014290964.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
GU3_RS02420 (GU3_02360) | ilvG | 499825..501474 (+) | 1650 | WP_014290965.1 | acetolactate synthase 2 catalytic subunit | -
GU3_RS02425 (GU3_02365) | ilvM | 501471..501731 (+) | 261 | WP_014290966.1 | acetolactate synthase 2 small subunit | -
GU3_RS02430 (GU3_02370) | - | 501743..502666 (+) | 924 | WP_014290967.1 | branched-chain amino acid transaminase | -
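
The Location range appears to be the comM coordinates extended by 5,000 bp on each side (497935 - 5000 = 492935 and 499446 + 5000 = 504446). A minimal Python sketch of that arithmetic follows. It assumes 1-based inclusive coordinates. The flank size is inferred from the numbers on this page rather than documented by it, and the function name `flanking_window` is hypothetical.

```python
def flanking_window(start, end, flank=5000):
    """Return the 1-based inclusive window around a gene.

    `flank` is an assumption inferred from this record, not a documented
    parameter. The lower bound is clamped so it never drops below 1.
    """
    return max(1, start - flank), end + flank


# comM coordinates as listed in the Overview
window = flanking_window(497935, 499446)
print(window)
```

With the comM coordinates this reproduces the range shown under Location.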

Sequence


Protein


Length: 503 a.a.
Molecular weight: 54055.41 Da
Isoelectric point: 8.8742

>NTDB_id=43266 GU3_RS02415 WP_014290964.1 497935..499446(-) (comM) [Oceanimonas sp. GK1]
MSLAVVFARASLGVTAPLVTVEAHLANGLPAFNIVGLPETTVKEARDRVRSALINAGFEFPARRITVNLAPADLPKDGGR
FDLPIAMAILAASEQIPASSLDGLEFLGELALTGELRGIKGTLPAVLAARASDRRLIMPAANGREAGLITPCPALLAPHL
LAITAWLQQQGELAHPEPVPIHEDAITPCLSEVIGQETGKRALEIAAAGEHNLLFLGPPGTGKTMLAARLPGLLPPMTET
EALEAAAIRSISGQPLDPEHWRQRAFRQPHHSSSAAALVGGGSHPRPGEISLAHRGVLFLDELTEFERRVLDALREPLET
GTISISRAAHSVTFPARFQLIGAMNPSPCGHYQDGLSRSSPEQILRYLGKISGPFVDRFDLSVEIPLLPPGELSRPRQKS
ASTAEVRARVLAARQRQQARAGKPNARLGAADLDRLCPLSADDAAFLEQALHRMKLSIRAWHKLIRVARTIADLAGEPHI
GRPHLMEALGYRAMDRLLARLRQ
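
The FASTA header above packs several fields into one line: a database ID, the locus tag, the protein accession, coordinates with strand, the gene name, and the organism. The sketch below shows one way to split it with a regular expression. The field names (`ntdb_id`, `locus_tag`, and so on) and the function `parse_header` are hypothetical labels chosen for illustration, and the pattern is fitted only to the header layout seen in this record.

```python
import re

# Pattern fitted to the header layout in this record; other records may differ.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Split a header line into a dict; coordinates are converted to int."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized header layout: " + line)
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (">NTDB_id=43266 GU3_RS02415 WP_014290964.1 "
          "497935..499446(-) (comM) [Oceanimonas sp. GK1]")
record = parse_header(header)
print(record["gene"], record["strand"], record["start"], record["end"])
```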

Nucleotide


Length: 1512 bp

>NTDB_id=43266 GU3_RS02415 WP_014290964.1 497935..499446(-) (comM) [Oceanimonas sp. GK1]
ATGTCACTTGCCGTTGTGTTTGCACGGGCCAGCCTGGGCGTGACCGCCCCCCTGGTGACGGTAGAAGCTCACCTGGCCAA
CGGCCTGCCGGCCTTCAACATCGTCGGTTTGCCCGAAACCACGGTCAAGGAAGCCCGGGATCGGGTACGCAGTGCCCTGA
TCAACGCCGGCTTTGAATTTCCCGCCCGGCGCATTACCGTCAACCTGGCGCCGGCGGATCTGCCCAAGGACGGCGGTCGC
TTCGACCTGCCCATCGCCATGGCCATTCTCGCCGCCTCGGAGCAGATCCCGGCCTCGTCCCTGGACGGCCTGGAGTTCCT
CGGAGAGCTGGCACTGACCGGTGAACTGCGCGGCATCAAGGGGACCCTGCCGGCGGTGCTGGCCGCCCGCGCCAGCGATC
GCCGGCTGATCATGCCCGCCGCCAACGGCCGCGAAGCCGGGCTAATTACTCCCTGCCCGGCCCTGCTGGCTCCCCACCTG
CTGGCCATTACCGCCTGGTTGCAACAGCAGGGCGAGCTGGCCCACCCCGAGCCTGTGCCCATTCATGAGGATGCCATCAC
CCCCTGTCTGAGTGAGGTGATCGGCCAGGAAACCGGCAAACGGGCGCTGGAGATTGCCGCCGCCGGCGAACATAACCTGT
TGTTTCTCGGTCCGCCCGGTACCGGCAAGACCATGCTGGCGGCACGCCTGCCGGGCCTGCTGCCACCCATGACCGAAACC
GAAGCACTGGAGGCGGCCGCCATTCGTTCCATCAGCGGCCAGCCCCTTGACCCCGAGCACTGGCGGCAGCGAGCCTTTCG
CCAGCCTCACCACTCTTCCTCGGCTGCCGCCCTGGTAGGGGGCGGCAGCCATCCCAGACCGGGGGAGATATCCCTCGCCC
ACCGCGGGGTGCTGTTTCTCGACGAACTCACCGAGTTTGAACGCCGAGTGCTGGATGCCCTGCGCGAACCGCTGGAAACC
GGTACCATCAGCATTTCCCGGGCCGCCCACAGCGTGACCTTTCCGGCCCGCTTTCAGTTGATCGGCGCCATGAACCCCAG
TCCTTGTGGCCACTATCAGGACGGCCTGAGCCGCAGCTCGCCGGAGCAGATATTGCGTTATCTCGGCAAGATATCCGGCC
CCTTTGTCGACCGTTTCGATCTGTCGGTGGAAATTCCCCTGCTGCCACCGGGCGAGCTGAGCCGGCCACGGCAAAAGTCC
GCCAGCACGGCCGAGGTTCGCGCCCGGGTGCTGGCTGCGCGGCAACGCCAACAGGCCCGGGCCGGCAAACCCAACGCACG
CCTGGGTGCTGCCGATCTCGACCGGCTGTGCCCGCTGAGTGCCGACGATGCCGCCTTCCTGGAGCAGGCACTGCATCGCA
TGAAGCTGTCCATTCGAGCCTGGCATAAACTGATCCGGGTAGCCCGCACCATCGCCGATCTGGCCGGCGAGCCCCATATC
GGCCGGCCTCACCTGATGGAAGCCCTGGGTTACCGGGCCATGGACCGGCTGCTGGCCCGGCTGCGGCAATAA
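
The lengths reported in this record can be cross-checked against each other: the coordinate span gives the nucleotide length, and dividing by three and subtracting the terminal stop codon (the sequence ends in TAA) gives the protein length. A small Python sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that includes its stop codon; the function names are hypothetical.

```python
def cds_length_bp(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp, includes_stop=True):
    """Number of residues encoded by a CDS of `cds_bp` nucleotides."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon does not encode a residue.
    return codons - 1 if includes_stop else codons


bp = cds_length_bp(497935, 499446)
print(bp, protein_length_aa(bp))
```

For comM this gives 1512 bp and 503 residues, matching the lengths listed for the nucleotide and protein sequences.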


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source | ID
AlphaFold DB | H2G0Y9

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comM | Vibrio cholerae strain A1552 | 60.956 | 99.801 | 0.608
comM | Vibrio campbellii strain DS40M4 | 59.562 | 99.801 | 0.594
comM | Haemophilus influenzae Rd KW20 | 58.974 | 100 | 0.594
comM | Glaesserella parasuis strain SC1401 | 57.791 | 100 | 0.583
comM | Legionella pneumophila str. Paris | 48.589 | 98.608 | 0.479
comM | Legionella pneumophila strain ERS1305867 | 48.589 | 98.608 | 0.479
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 42.604 | 100 | 0.429


Multiple sequence alignment