Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   CDA09_RS21245 Genome accession   NZ_CP021731
Coordinates   4576399..4577898 (-) Length   499 a.a.
NCBI ID   WP_121430472.1    Uniprot ID   -
Organism   Azoarcus sp. DN11     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 4571399..4582898
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CDA09_RS21220 (CDA09_21195) - 4571818..4572918 (+) 1101 WP_121430467.1 ABC transporter ATP-binding protein -
  CDA09_RS21225 (CDA09_21200) - 4572915..4573691 (+) 777 WP_121430468.1 ABC transporter ATP-binding protein -
  CDA09_RS21230 (CDA09_21205) - 4573681..4574418 (+) 738 WP_121430469.1 ABC transporter ATP-binding protein -
  CDA09_RS21235 (CDA09_21210) - 4574539..4574964 (+) 426 WP_121430470.1 aldehyde-activating protein -
  CDA09_RS21240 (CDA09_21215) - 4575128..4576321 (-) 1194 WP_121430471.1 multidrug effflux MFS transporter -
  CDA09_RS21245 (CDA09_21220) comM 4576399..4577898 (-) 1500 WP_121430472.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  CDA09_RS21250 (CDA09_21225) - 4577998..4578261 (-) 264 WP_121430473.1 accessory factor UbiK family protein -
  CDA09_RS21255 (CDA09_21230) - 4578590..4579333 (+) 744 WP_121430474.1 TorF family putative porin -
  CDA09_RS21260 (CDA09_21235) glnK 4579384..4579722 (+) 339 WP_121430475.1 P-II family nitrogen regulator -
  CDA09_RS21265 (CDA09_21240) amt 4579733..4581196 (+) 1464 WP_121430476.1 ammonium transporter -
  CDA09_RS21270 (CDA09_21245) purU 4581293..4582180 (-) 888 WP_121430477.1 formyltetrahydrofolate deformylase -
  CDA09_RS21275 (CDA09_21250) thrH 4582185..4582799 (-) 615 WP_121430478.1 bifunctional phosphoserine phosphatase/homoserine phosphotransferase ThrH -

Sequence


Protein


Download         Length: 499 a.a.        Molecular weight: 53248.87 Da        Isoelectric Point: 8.5251

>NTDB_id=232934 CDA09_RS21245 WP_121430472.1 4576399..4577898(-) (comM) [Azoarcus sp. DN11]
MSLALVRTRALAGLGAPEVTVEVHLANGLPAFNLVGLPDTEVREARERVRAAIATSQFEFPQRRITVNLAPADLPKEGGR
FDLPIALGILAASGQVDAAALARHEFVGELSLDGSLRPVRGGLAMALESGRAGRALVLPAANADEAALARDASVLPAPSL
LAVCAHLNGHTPLTRRLAPPVAEGRDDEPDLAEVKGQLQARRALEVAAAGQHSLLMFGPPGTGKSMLARRLPGLLPPLDE
AEAIESASIQSLEGAFDARRWGRRPYRAPHHSASAPAVVGGGASPRPGEISLAHHGVLFLDELPEFERRVLEALREPLET
GTVTVSRARQRAEFPARFQLVAAMNPCPCGHAGDKNGRCRCTPDQVARYRGRLSGPLLDRMDIVIEVPLLDHADMLGQPA
GEPSAAVRERVTQAWAVQRERQGRANSHLAPGRVDALCAPDEQGKALLDHAIRRLNLSARGYHRILKVARTIADLAGAER
VGPAHLAEAIQYRRGLDSR

Nucleotide


Download         Length: 1500 bp        

>NTDB_id=232934 CDA09_RS21245 WP_121430472.1 4576399..4577898(-) (comM) [Azoarcus sp. DN11]
ATGTCGCTGGCTCTCGTACGCACCCGCGCGCTCGCGGGTCTGGGCGCACCCGAGGTGACGGTGGAAGTGCACCTCGCCAA
CGGCCTGCCGGCCTTCAACCTCGTCGGTCTGCCTGATACCGAGGTGCGCGAGGCGCGCGAGCGCGTGCGCGCCGCGATCG
CCACGTCGCAGTTCGAATTCCCGCAGCGCCGGATCACCGTCAATCTCGCCCCGGCCGACCTGCCCAAGGAAGGCGGGCGC
TTCGACCTGCCGATCGCGCTGGGCATCCTCGCCGCGTCGGGCCAGGTCGATGCGGCGGCGCTGGCCCGTCACGAATTCGT
CGGCGAGCTGTCGCTCGACGGCAGCCTGCGCCCGGTGCGCGGCGGTCTCGCGATGGCCCTCGAGAGCGGCCGCGCCGGCC
GCGCGCTGGTACTGCCCGCGGCGAATGCCGACGAAGCCGCGCTCGCGCGCGACGCGAGCGTGCTGCCGGCACCGAGCCTG
CTCGCCGTGTGTGCCCACCTCAACGGCCACACGCCGCTCACGCGCCGTCTTGCGCCGCCGGTGGCGGAAGGGCGGGACGA
CGAACCGGATCTCGCCGAGGTGAAGGGGCAGCTGCAGGCGCGCCGCGCGCTCGAAGTCGCCGCCGCCGGCCAGCACTCGC
TGCTGATGTTCGGCCCCCCGGGAACCGGCAAGTCCATGCTCGCGCGCCGCTTGCCGGGGCTCCTGCCGCCGCTCGACGAG
GCCGAGGCGATCGAGAGCGCGTCGATCCAGTCGCTCGAAGGGGCCTTCGATGCCCGCCGCTGGGGGCGGCGCCCCTACCG
CGCGCCGCACCACTCCGCCTCGGCCCCCGCGGTGGTCGGCGGCGGCGCCAGTCCGCGCCCCGGCGAGATCAGCCTCGCGC
ACCACGGCGTGCTGTTCCTCGACGAGCTGCCGGAGTTCGAGCGCCGCGTGCTCGAAGCCCTGCGCGAGCCGCTCGAGACC
GGCACTGTCACGGTGTCGCGGGCGCGGCAGCGCGCCGAGTTCCCCGCGCGCTTCCAGCTCGTCGCCGCGATGAATCCGTG
CCCGTGCGGGCATGCCGGCGACAAGAATGGGCGCTGCCGCTGCACGCCGGACCAGGTTGCACGCTACCGCGGGCGCCTGT
CGGGCCCGCTGCTCGACCGCATGGACATCGTCATCGAGGTGCCGCTGCTCGATCACGCCGACATGCTCGGTCAGCCCGCG
GGCGAGCCGAGTGCCGCGGTGCGCGAGCGCGTCACGCAGGCGTGGGCCGTGCAGCGCGAGCGCCAGGGGCGCGCCAACAG
CCATCTCGCGCCCGGCCGCGTCGATGCGCTGTGCGCGCCCGATGAGCAGGGCAAGGCGCTGCTCGATCACGCGATCCGCC
GGCTGAACCTGTCCGCGCGGGGGTATCATCGCATCCTCAAGGTTGCCCGCACGATCGCGGACCTGGCCGGCGCCGAGCGC
GTCGGCCCGGCGCATCTGGCCGAGGCGATCCAGTACCGGCGCGGCCTCGATTCCCGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

53.6

100

0.537

  comM Vibrio campbellii strain DS40M4

53.131

99.198

0.527

  comM Vibrio cholerae strain A1552

53.131

99.198

0.527

  comM Glaesserella parasuis strain SC1401

51.4

100

0.515

  comM Legionella pneumophila str. Paris

48.898

100

0.489

  comM Legionella pneumophila strain ERS1305867

48.898

100

0.489

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

44.466

100

0.451


Multiple sequence alignment