Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: GKQ51_RS20045
Genome accession: NZ_CP066310
Coordinates: 4234914..4236407 (+)
Length: 497 a.a.
NCBI ID: WP_198866785.1
UniProt ID: A0AAQ0BYZ0
Organism: Azotobacter chroococcum strain HR1
Function: DNA uptake (predicted from homology)
DNA binding and uptake
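The coordinates and the protein length in the overview can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the helper names are hypothetical and not part of the database.

```python
def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


def span_length_bp(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


start, end, strand = parse_coordinates("4234914..4236407 (+)")
length_bp = span_length_bp(start, end)

# Assumption: the last codon is a stop codon and encodes no residue.
length_aa = length_bp // 3 - 1

print(length_bp, length_aa, strand)
```

Under these assumptions the span is 1494 bp, which corresponds to the 497 a.a. reported for the protein.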

Genomic Context


Location: 4229914..4241407
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
GKQ51_RS20010 (GKQ51_20010) | - | 4229969..4230670 (+) | 702 | WP_089169477.1 | HAD family hydrolase | -
GKQ51_RS20015 (GKQ51_20015) | - | 4230676..4231488 (+) | 813 | WP_198866783.1 | CPBP family intramembrane glutamic endopeptidase | -
GKQ51_RS20020 (GKQ51_20020) | sutA | 4231557..4231877 (-) | 321 | WP_039806498.1 | transcriptional regulator SutA | -
GKQ51_RS20025 (GKQ51_20025) | - | 4232000..4232425 (-) | 426 | WP_198866784.1 | secondary thiamine-phosphate synthase enzyme YjbQ | -
GKQ51_RS20030 (GKQ51_20030) | - | 4232582..4233895 (-) | 1314 | WP_089169480.1 | ammonium transporter | -
GKQ51_RS20035 (GKQ51_20035) | glnK | 4233928..4234266 (-) | 339 | WP_012703245.1 | P-II family nitrogen regulator | -
GKQ51_RS20040 (GKQ51_20040) | - | 4234607..4234882 (+) | 276 | WP_039806510.1 | accessory factor UbiK family protein | -
GKQ51_RS20045 (GKQ51_20045) | comM | 4234914..4236407 (+) | 1494 | WP_198866785.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
GKQ51_RS20050 (GKQ51_20050) | - | 4236513..4237376 (-) | 864 | WP_089169482.1 | SPFH domain-containing protein | -
GKQ51_RS20055 (GKQ51_20055) | - | 4237405..4237842 (-) | 438 | WP_198866786.1 | NfeD family protein | -
GKQ51_RS20060 (GKQ51_20060) | - | 4237953..4238834 (-) | 882 | WP_198866787.1 | LysR family transcriptional regulator | -
GKQ51_RS20065 (GKQ51_20065) | - | 4238952..4239374 (+) | 423 | WP_198866788.1 | DoxX family protein | -
GKQ51_RS20070 (GKQ51_20070) | - | 4239423..4240151 (+) | 729 | WP_198866789.1 | pirin family protein | -
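The reported location 4229914..4241407 is consistent with the comM coordinates extended by 5000 bp on each side. The sketch below reproduces that window and tests whether a neighbouring gene falls inside it. The 5000 bp flank is inferred from the numbers in this record, not documented by the source, and the function names are hypothetical.

```python
FLANK_BP = 5000  # Assumption: inferred from this record, not stated by the database.


def context_window(start, end, flank=FLANK_BP):
    """Return the (lo, hi) window around a gene, clamped at position 1."""
    return max(1, start - flank), end + flank


def within(window, gene_start, gene_end):
    """True if a gene lies entirely inside the window."""
    lo, hi = window
    return lo <= gene_start and gene_end <= hi


window = context_window(4234914, 4236407)
print(window)

# First and last neighbours listed in the genomic context table.
print(within(window, 4229969, 4230670))
print(within(window, 4239423, 4240151))
```

Only genes that lie entirely inside the window pass this check, which matches the first and last rows of the table.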

Sequence


Protein


Length: 497 a.a.
Molecular weight: 52151.93 Da
Isoelectric point: 7.3127

>NTDB_id=519263 GKQ51_RS20045 WP_198866785.1 4234914..4236407(+) (comM) [Azotobacter chroococcum strain HR1]
MSLAIVHSRAQVGVEAPAVTVEAHLANGLPALTLVGLPETAVRESKDRVRSAILTSGFDFPARRITLNLAPADLPKDGGR
FDLAIALGILAASEQLPAEALGNLECLGELALSGSLRPVRGVLPAALAARAAGRTLVVPRANAEEASLASGLNVLAVDHL
LELAAHLNGQSPLAPYQAQGLLRQTLPYPDLADVQGQAAAKRALLVAAAGSHNLLLSGPPGTGKTLLASRLPGLLPPLDE
GEALEVAAIHSVAGSAPLAAWPQRPFRQPHHSASGPALVGGGSRPRPGEITLAHQGVLFLDELPEFDRKVLEVLREPLES
GEIVIARASDKVRFPARFQLVAAMNPCPCGYLGDPAGRCRCTPEQIQRYRAKLSGPLLDRIDLHIGVTREATALGAPRLD
GPDSAGAAAQVAAARTLQLARQGCPNAFLDLPGLHQHCALDNEDRQWLERACERLGLSLRAAHRILKVARTLADLEAAPE
IARAHLAEALQYRASSA
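The FASTA header of this record packs several fields into one line. A minimal parsing sketch follows; the field layout is inferred from this single header and may not hold for every record, and the field names are hypothetical.

```python
import re

# Assumed layout: >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Split a header line into a dict of fields; raise if the layout differs."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=519263 GKQ51_RS20045 WP_198866785.1 "
    "4234914..4236407(+) (comM) [Azotobacter chroococcum strain HR1]"
)
record = parse_header(header)
print(record["gene"], record["locus_tag"], record["organism"])
```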

Nucleotide


Length: 1494 bp

>NTDB_id=519263 GKQ51_RS20045 WP_198866785.1 4234914..4236407(+) (comM) [Azotobacter chroococcum strain HR1]
ATGTCCCTGGCCATCGTCCACAGCCGCGCCCAGGTGGGCGTCGAGGCGCCCGCCGTCACCGTCGAGGCGCATCTGGCCAA
CGGCCTGCCGGCGCTGACCCTGGTCGGCCTGCCGGAAACCGCGGTCCGCGAGAGCAAGGACCGCGTGCGCAGCGCCATCC
TCACCTCCGGCTTCGACTTCCCGGCGCGGCGCATCACCCTCAACCTGGCCCCCGCCGACCTGCCCAAGGACGGCGGACGC
TTCGACCTGGCCATCGCCCTGGGCATCCTCGCCGCCAGTGAGCAGTTGCCCGCCGAGGCCCTCGGCAACCTGGAGTGCCT
CGGCGAGCTGGCCCTCTCCGGCAGCCTGCGGCCGGTCCGGGGCGTGCTGCCCGCCGCGCTGGCCGCCCGTGCCGCCGGAC
GCACCCTGGTGGTGCCACGGGCCAACGCCGAGGAAGCCAGCCTGGCCTCGGGGCTGAACGTGCTGGCGGTCGACCACCTG
CTGGAGCTGGCCGCCCACCTGAACGGCCAGTCCCCGCTGGCGCCCTACCAGGCCCAGGGCCTGCTGCGCCAGACGCTGCC
CTACCCCGACCTTGCCGACGTGCAGGGCCAGGCCGCGGCCAAGCGCGCCCTGCTGGTGGCCGCCGCCGGCAGCCACAACC
TGCTGCTCAGCGGCCCGCCGGGAACCGGCAAGACCCTGCTGGCCAGCCGCCTGCCGGGACTGCTGCCACCGCTGGACGAG
GGCGAGGCGCTGGAGGTGGCGGCGATCCATTCGGTGGCCGGCAGCGCGCCGCTCGCCGCCTGGCCGCAGCGGCCGTTCCG
CCAGCCGCACCACAGCGCCTCGGGACCGGCGCTGGTCGGCGGCGGCAGCCGGCCGCGTCCCGGCGAGATCACCCTGGCGC
ACCAGGGCGTACTGTTCCTCGACGAGTTGCCGGAGTTCGACCGCAAGGTGCTGGAAGTGCTGCGCGAACCGCTGGAAAGC
GGCGAGATCGTCATCGCCCGGGCCAGCGACAAGGTGCGCTTTCCGGCACGCTTCCAGCTGGTGGCGGCCATGAACCCCTG
CCCCTGCGGCTACCTGGGCGACCCTGCCGGCCGCTGCCGCTGTACCCCGGAGCAGATCCAGCGCTACCGTGCCAAGCTGT
CCGGCCCGCTGCTCGACCGCATCGACCTGCACATCGGCGTCACCCGCGAGGCCACCGCCCTGGGCGCACCGCGCCTGGAC
GGTCCGGACAGCGCTGGCGCCGCGGCCCAGGTGGCGGCGGCGCGCACCCTTCAGCTGGCGCGCCAGGGCTGCCCCAATGC
GTTCCTCGATCTGCCCGGATTGCACCAGCACTGTGCACTGGACAACGAGGACCGCCAGTGGCTGGAACGCGCCTGCGAGC
GCCTCGGCCTGTCGCTGCGCGCCGCCCACCGCATTCTCAAGGTGGCGCGCACCCTGGCCGATCTCGAGGCGGCGCCGGAG
ATCGCCCGCGCTCACCTGGCCGAAGCCCTGCAGTACCGGGCCAGCAGCGCCTGA
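The nucleotide sequence begins with ATG and ends with TGA, as expected for a complete coding sequence. The sketch below shows one way to check that shape and to derive the residue count. It assumes the standard stop codons and a small set of common bacterial start codons, and it is demonstrated on a toy sequence (the first three codons of this record plus a stop) rather than the full 1494 bp entry. The function name is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODONS = {"ATG", "GTG", "TTG"}  # Assumption: common bacterial start codons.


def orf_residue_count(seq):
    """Validate a coding sequence and return the number of encoded residues."""
    seq = "".join(seq.split()).upper()  # drop line breaks and whitespace
    if not seq or len(seq) % 3 != 0:
        raise ValueError("length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a start codon")
    if codons[-1] not in STOP_CODONS:
        raise ValueError("sequence does not end with a stop codon")
    if any(codon in STOP_CODONS for codon in codons[1:-1]):
        raise ValueError("internal stop codon found")
    return len(codons) - 1  # the stop codon encodes no residue


# Toy example: ATG TCC CTG TGA
print(orf_residue_count("ATGTCCCTGTGA"))

# The same arithmetic applied to the reported length of this record.
print(1494 // 3 - 1)
```

Passing the full sequence above (with or without line breaks) to `orf_residue_count` would apply the same checks to the complete entry.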


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comM | Vibrio campbellii strain DS40M4 | 57.43 | 100 | 0.575
comM | Vibrio cholerae strain A1552 | 56.827 | 100 | 0.569
comM | Haemophilus influenzae Rd KW20 | 54.89 | 100 | 0.553
comM | Glaesserella parasuis strain SC1401 | 54.6 | 100 | 0.549
comM | Legionella pneumophila str. Paris | 50 | 100 | 0.515
comM | Legionella pneumophila strain ERS1305867 | 50 | 100 | 0.515
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 45.527 | 100 | 0.461