Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: KUD94_RS11120
Genome accession: NZ_CP078069
Coordinates: 2318771..2320303 (+)
Length: 510 a.a.
NCBI ID: WP_218237269.1
Uniprot ID: -
Organism: Comamonas sp. NLF-1-9
Function: ssDNA binding (predicted from homology)
DNA processing
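The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon; the function name `gene_lengths` is illustrative and not part of the database.

```python
def gene_lengths(start: int, end: int) -> tuple:
    """Return (nucleotide length, protein length) for 1-based inclusive coordinates."""
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not counted in the protein.
    aa_len = nt_len // 3 - 1
    return nt_len, aa_len


# Coordinates of comM (KUD94_RS11120) as listed in this record.
nt_len, aa_len = gene_lengths(2318771, 2320303)
print(nt_len, aa_len)  # 1533 510
```

The result matches the 1533 bp nucleotide length and the 510 a.a. protein length reported for this gene.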

Genomic Context


Location: 2313771..2325303
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  KUD94_RS11100 (KUD94_11100) - 2314157..2314855 (-) 699 WP_218237265.1 ProQ/FinO family protein -
  KUD94_RS11105 (KUD94_11105) glcF 2314852..2316090 (-) 1239 WP_218237266.1 glycolate oxidase subunit GlcF -
  KUD94_RS11110 (KUD94_11110) glcE 2316094..2317191 (-) 1098 WP_218237267.1 glycolate oxidase subunit GlcE -
  KUD94_RS11115 (KUD94_11115) - 2317253..2318452 (-) 1200 WP_218237268.1 ammonium transporter -
  KUD94_RS11120 (KUD94_11120) comM 2318771..2320303 (+) 1533 WP_218237269.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  KUD94_RS11125 (KUD94_11125) - 2320464..2321117 (+) 654 WP_218237270.1 glutathione S-transferase family protein -
  KUD94_RS11130 (KUD94_11130) - 2321202..2322839 (-) 1638 WP_218237271.1 hypothetical protein -
  KUD94_RS11135 (KUD94_11135) - 2323415..2323762 (+) 348 WP_218237272.1 hypothetical protein -
  KUD94_RS11140 (KUD94_11140) - 2323962..2324477 (+) 516 WP_218237273.1 DUF305 domain-containing protein -
  KUD94_RS11145 (KUD94_11145) - 2324611..2325006 (+) 396 WP_218237274.1 tautomerase -
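The displayed location appears to extend 5,000 bp on either side of comM (2318771 - 5000 = 2313771 and 2320303 + 5000 = 2325303). The sketch below reproduces that window and selects the genes that fall entirely inside it. The flank size is inferred from the coordinates shown rather than documented here, only three rows of the table are included for brevity, and the names `FLANK` and `context_window` are hypothetical.

```python
FLANK = 5000  # assumed flank size, inferred from the coordinates in this record


def context_window(start, end, flank=FLANK):
    """Return the (start, end) of the window around a gene, clamped at position 1."""
    return max(1, start - flank), end + flank


# (locus tag, start, end, strand) for a few rows of the table above.
genes = [
    ("KUD94_RS11100", 2314157, 2314855, "-"),
    ("KUD94_RS11120", 2318771, 2320303, "+"),
    ("KUD94_RS11145", 2324611, 2325006, "+"),
]

window = context_window(2318771, 2320303)
inside = [tag for tag, s, e, _ in genes if window[0] <= s and e <= window[1]]
sizes = {tag: e - s + 1 for tag, s, e, _ in genes}

print(window)  # (2313771, 2325303)
print(inside)
print(sizes)
```

The computed sizes can be compared against the Size (bp) column as a quick sanity check on the coordinates.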

Sequence


Protein


Length: 510 a.a.        Molecular weight: 53058.74 Da        Isoelectric point: 8.2094

>NTDB_id=586547 KUD94_RS11120 WP_218237269.1 2318771..2320303(+) (comM) [Comamonas sp. NLF-1-9]
MSLALVQSRALLGLAAPAVTVEVHLANGLPAFTLVGLADVEVKEARERVRAAILNSGLEFPSNKRITVNLAPADLPKDSG
RFDLPIALGLLAASGQLDGARLAGWEFAGELSLSGQLRSVRGALATALALRSQGEAARLVLPLDSAREAALVPDAPIYGA
EHLLDVVAQFLPPGAEGQAGEPGGWQRVLPAPVADAEPGPDLADVKGQAAARRALEIAAAGGHGLLLAGPPGSGKSMLAQ
RFASILPPMTVQEALESAAVASLAGRFTPARWMRRPTGSPHHSASAVALVGGGSPPRPGEISLAHHGVLFLDEFPEFARS
ALEALREPLESGHITIARAAQRAEFPARFQLVAAMNPCPCGFAGSRQRACRCTPEQIARYQGKLSGPLLDRIDLHVEVPH
LPAEELLGAPPGEPSAAVRARVVAARERALARQGGPNQTLSGQALQDAAGLDEAASRFLQTAATRLAWSARATHRALKVA
RTIADLGGSEHVGLAHVGEAIQYRRVLTAK

Nucleotide


Length: 1533 bp

>NTDB_id=586547 KUD94_RS11120 WP_218237269.1 2318771..2320303(+) (comM) [Comamonas sp. NLF-1-9]
ATGAGCCTTGCCCTGGTTCAAAGCCGCGCCCTGCTGGGCCTGGCTGCGCCCGCCGTCACCGTCGAGGTGCACTTGGCCAA
CGGCCTGCCGGCGTTCACGCTGGTCGGCCTGGCCGACGTGGAAGTGAAGGAGGCGCGCGAGCGCGTGCGCGCCGCCATCC
TGAACAGCGGACTGGAGTTCCCGTCGAACAAGCGCATCACGGTGAACCTGGCGCCGGCCGATCTGCCCAAGGATTCAGGC
CGTTTTGATCTGCCGATCGCGCTGGGCCTGCTCGCGGCCAGCGGGCAGCTGGACGGCGCGCGCCTGGCCGGCTGGGAGTT
TGCCGGCGAGCTCTCGCTGTCCGGACAGTTGCGCAGCGTGCGCGGCGCGCTGGCCACCGCGCTGGCGCTGCGCAGCCAGG
GCGAGGCGGCGCGCCTGGTGCTGCCGCTGGACAGCGCCCGCGAGGCCGCGCTGGTGCCCGACGCGCCGATCTACGGCGCC
GAGCACCTGCTGGACGTGGTAGCGCAATTCCTGCCGCCCGGCGCCGAAGGCCAGGCGGGCGAGCCCGGCGGCTGGCAGCG
CGTGCTGCCCGCGCCCGTCGCAGATGCCGAGCCCGGCCCGGACCTGGCCGACGTCAAGGGCCAGGCCGCCGCCCGCCGCG
CGCTGGAAATCGCCGCGGCCGGCGGCCATGGCCTGCTGCTGGCCGGCCCGCCCGGCTCGGGCAAGTCCATGCTGGCGCAG
CGCTTTGCCTCCATCTTGCCGCCGATGACGGTGCAGGAGGCGCTCGAGAGTGCGGCAGTGGCCAGCCTGGCCGGGCGCTT
CACGCCCGCGCGCTGGATGCGCCGGCCCACCGGCAGCCCGCACCACAGCGCCAGCGCGGTGGCGCTGGTGGGCGGCGGTT
CGCCCCCGCGGCCGGGCGAGATATCGCTCGCGCACCACGGCGTGCTGTTTCTGGATGAATTCCCCGAGTTCGCGCGCAGC
GCGCTCGAAGCCCTGCGCGAGCCGCTGGAGAGCGGCCACATCACCATCGCGCGCGCCGCGCAGCGCGCCGAATTCCCGGC
GCGCTTTCAACTGGTCGCGGCCATGAACCCCTGCCCCTGCGGCTTTGCCGGCTCACGCCAGCGCGCCTGCCGCTGCACGC
CCGAGCAGATCGCGCGCTACCAGGGCAAGCTCAGCGGCCCGCTGCTGGACCGCATAGACCTGCACGTGGAAGTGCCGCAC
CTGCCTGCCGAAGAGCTGCTGGGCGCGCCGCCGGGCGAGCCCAGCGCCGCAGTGCGCGCGCGCGTGGTGGCCGCGCGCGA
GCGCGCCCTTGCACGCCAGGGCGGGCCCAACCAGACGCTGTCGGGCCAGGCGCTGCAGGATGCGGCCGGCCTGGACGAGG
CCGCGAGCCGCTTCCTGCAAACCGCCGCCACCCGCCTGGCCTGGTCGGCGCGCGCGACCCACCGTGCGCTGAAAGTGGCG
CGCACCATTGCCGACCTGGGCGGCTCGGAGCACGTGGGCCTGGCGCACGTGGGCGAAGCCATCCAGTACCGGCGTGTGCT
GACGGCCAAGTAG
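Both FASTA headers in this record carry the same fields: database ID, locus tag, protein ID, coordinates with strand, gene name, and organism. The sketch below parses such a header with a regular expression. The layout is inferred only from the two headers shown here, not from a documented format, so the pattern and the name `parse_header` should be treated as assumptions.

```python
import re

# Pattern inferred from the headers in this record; not an official specification.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Split a FASTA header of the form used in this record into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=586547 KUD94_RS11120 WP_218237269.1 "
    "2318771..2320303(+) (comM) [Comamonas sp. NLF-1-9]"
)
record = parse_header(header)
print(record["gene"], record["start"], record["end"], record["strand"])
```

Raising an error on a non-matching header, rather than returning partial fields, makes it obvious when a header deviates from the assumed layout.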


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552 52.174 99.216 0.518
  comM Haemophilus influenzae Rd KW20 51.772 99.608 0.516
  comM Vibrio campbellii strain DS40M4 51.383 99.216 0.51
  comM Glaesserella parasuis strain SC1401 49.706 100 0.498
  comM Legionella pneumophila str. Paris 48.362 100 0.492
  comM Legionella pneumophila strain ERS1305867 48.362 100 0.492
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 43.86 100 0.441