Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   CTR2_RS00970 Genome accession   NZ_AP026738
Coordinates   223673..225211 (+) Length   512 a.a.
NCBI ID   WP_087085443.1    Uniprot ID   -
Organism   Comamonas thiooxydans strain R2     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 218673..230211
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CTR2_RS00950 (CTR2_R01890) - 219412..220740 (-) 1329 WP_087085446.1 sulfatase -
  CTR2_RS00955 (CTR2_R01900) - 220854..221846 (-) 993 WP_087085769.1 Bug family tripartite tricarboxylate transporter substrate binding protein -
  CTR2_RS00960 (CTR2_R01910) - 221976..222947 (-) 972 WP_087085445.1 tripartite tricarboxylate transporter substrate binding protein -
  CTR2_RS00965 (CTR2_R01920) - 223040..223588 (+) 549 WP_087085444.1 MarR family winged helix-turn-helix transcriptional regulator -
  CTR2_RS00970 (CTR2_R01930) comM 223673..225211 (+) 1539 WP_087085443.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  CTR2_RS00975 (CTR2_R01940) - 225225..226154 (-) 930 WP_087085442.1 LysR substrate-binding domain-containing protein -
  CTR2_RS00980 (CTR2_R01950) - 226265..227446 (+) 1182 WP_087085441.1 YbfB/YjiJ family MFS transporter -
  CTR2_RS00985 (CTR2_R01960) - 227481..228548 (+) 1068 WP_087085440.1 nitronate monooxygenase family protein -

Sequence


Protein


Download         Length: 512 a.a.        Molecular weight: 54792.65 Da        Isoelectric Point: 7.1622

>NTDB_id=96771 CTR2_RS00970 WP_087085443.1 223673..225211(+) (comM) [Comamonas thiooxydans strain R2]
MGLALVQSRALLGLQAPAVTVEVHLANGLPSFTLVGLADVEVKEARERVRAAIVNAGLEFPNNQRITVNLAPADLPKDSG
RFDLPIALGILAASGQIDAQRLADYEFAGELSLTGALRPVRGALATALALQRQQQRVRLVLPPDSAQEAAFVPAIEVFGA
AHLLDVVRQFIAHDATLSEQGDGVEGWQRVHSRPAEASLQSLDLREVRGQMQAKRALEIAAAGAHGVLMIGPPGSGKSML
AQRFAGLLPGMTDEEALEAAAIASLSGRFTPQLWRQRPFAAPHHTASSIALVGGGSPPRPGEISYAHCGALFLDELPEFA
RSALEALREPLETGRITIVRAVQRAEFPARFQLVAAMNPCPCGYWGSRVRACRCSPDQVARYQARISGPLLDRIDLHVEV
AALSPEELLAAPEGESSAAVQQRVSAARDKALQRQGLPNHQLQGVQLDTHLQLEPEALTFAHKAAARLGWSARGTHRALK
VARTIADLADSDAITQAHLAEALQYRRALMQP

Nucleotide


Download         Length: 1539 bp        

>NTDB_id=96771 CTR2_RS00970 WP_087085443.1 223673..225211(+) (comM) [Comamonas thiooxydans strain R2]
ATGGGTCTGGCTCTGGTTCAAAGTCGTGCCTTGCTGGGCCTGCAGGCACCGGCCGTCACGGTGGAGGTTCATCTGGCCAA
CGGCCTGCCTTCGTTCACCCTGGTGGGCCTGGCGGATGTGGAGGTCAAGGAGGCGCGAGAGCGCGTGCGCGCGGCCATCG
TCAATGCGGGGCTGGAGTTCCCGAACAACCAGCGCATCACTGTCAACCTGGCTCCGGCGGATCTGCCCAAGGACTCAGGT
CGCTTTGACCTGCCGATAGCGCTGGGTATTCTGGCGGCCAGCGGGCAGATCGATGCACAGCGGCTCGCCGATTATGAGTT
TGCAGGAGAGCTGTCCCTGACAGGTGCCCTGCGCCCGGTACGCGGGGCCCTGGCGACGGCACTGGCCCTGCAGCGTCAGC
AACAGCGCGTGCGGCTGGTGCTGCCGCCGGACAGTGCGCAGGAGGCCGCCTTTGTGCCGGCTATCGAAGTCTTCGGCGCT
GCGCATCTGCTGGATGTGGTCAGGCAGTTCATCGCCCATGACGCCACCCTGTCGGAGCAGGGCGATGGCGTGGAGGGCTG
GCAGCGGGTGCACTCCAGACCTGCTGAAGCCTCTTTGCAGTCGCTGGATCTGCGCGAGGTGCGCGGTCAGATGCAGGCCA
AGCGTGCTCTTGAAATTGCAGCTGCCGGCGCACATGGCGTGCTGATGATCGGCCCTCCGGGTTCGGGGAAATCCATGCTG
GCCCAGCGCTTTGCGGGCTTGCTGCCTGGCATGACCGATGAGGAAGCGCTCGAAGCCGCAGCCATTGCCAGCCTCAGCGG
TCGTTTCACGCCGCAGTTGTGGCGTCAGCGGCCGTTTGCCGCTCCCCATCACACGGCCAGCTCCATCGCGCTGGTCGGCG
GCGGCTCTCCGCCCCGGCCTGGCGAAATCTCCTATGCCCATTGCGGGGCGCTGTTTCTCGACGAGTTGCCCGAGTTCGCG
CGCAGCGCCCTGGAGGCCCTGCGCGAGCCGCTGGAGACCGGGCGCATCACCATCGTGCGGGCCGTGCAGAGGGCGGAGTT
TCCGGCCCGTTTCCAGCTGGTGGCAGCCATGAACCCCTGTCCCTGCGGCTACTGGGGCTCGCGCGTGAGGGCCTGCCGCT
GCTCGCCTGATCAGGTGGCACGTTATCAGGCTCGTATCAGCGGACCCTTGCTGGACCGTATCGATCTGCATGTTGAGGTG
GCGGCACTGTCGCCCGAGGAGCTGCTGGCAGCGCCCGAAGGGGAGAGCAGCGCTGCCGTGCAGCAACGTGTGAGTGCCGC
CAGAGACAAGGCCTTGCAACGCCAGGGCCTGCCCAATCATCAGTTGCAGGGGGTGCAGCTCGACACGCATCTGCAGCTGG
AGCCCGAGGCGCTGACCTTCGCGCACAAGGCTGCAGCGCGCCTTGGCTGGTCGGCGCGCGGCACGCACCGGGCCTTGAAG
GTGGCGCGAACCATTGCCGATCTGGCGGACTCGGATGCCATCACGCAAGCCCATCTGGCCGAGGCATTGCAATACCGCCG
CGCGCTGATGCAGCCGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

52.838

99.805

0.527

  comM Vibrio campbellii strain DS40M4

51.176

99.609

0.51

  comM Haemophilus influenzae Rd KW20

50.984

99.219

0.506

  comM Glaesserella parasuis strain SC1401

49.511

99.805

0.494

  comM Legionella pneumophila str. Paris

47.5

100

0.482

  comM Legionella pneumophila strain ERS1305867

47.5

100

0.482

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.529

99.609

0.434


Multiple sequence alignment