Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   K5607_RS11070 Genome accession   NZ_AP019782
Coordinates   2535641..2536399 (+) Length   252 a.a.
NCBI ID   WP_221047035.1    Uniprot ID   -
Organism   Methylogaea oryzae strain E10     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 2530641..2541399
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  K5607_RS11050 - 2531026..2532156 (-) 1131 WP_221047032.1 TniQ family protein -
  K5607_RS11055 (MoryE10_21560) - 2532114..2533004 (-) 891 WP_221047033.1 TniB family NTP-binding protein -
  K5607_RS11060 (MoryE10_21570) - 2533001..2534929 (-) 1929 WP_221047034.1 Mu transposase C-terminal domain-containing protein -
  K5607_RS11065 (MoryE10_21580) - 2534922..2535443 (-) 522 WP_217994865.1 TnsA endonuclease N-terminal domain-containing protein -
  K5607_RS11070 (MoryE10_21590) comM 2535641..2536399 (+) 759 WP_221047035.1 ATP-binding protein Machinery gene
  K5607_RS11075 (MoryE10_21600) - 2536405..2537736 (-) 1332 WP_221047036.1 ATP-binding protein -
  K5607_RS11080 (MoryE10_21610) - 2537733..2539682 (-) 1950 WP_221047037.1 SPOR domain-containing protein -
  K5607_RS11085 (MoryE10_21620) - 2539679..2540419 (-) 741 WP_221047038.1 type III pantothenate kinase -

Sequence


Protein


Download         Length: 252 a.a.        Molecular weight: 27363.34 Da        Isoelectric Point: 9.5509

>NTDB_id=73982 K5607_RS11070 WP_221047035.1 2535641..2536399(+) (comM) [Methylogaea oryzae strain E10]
MKNVAAISRNNSVSYCFLNRIVAFISATDSGGVPKPGEISLAHGGVLFLDELPEFDRKVLEVLREPLESGRIVISRAANR
AEFPANFQLLAAMNPCPCGYLGDASGRCHCTAEQVQRYRARISGPLLDRIDLHVEVPRVPHAVLRQGQPGGEESSAAVRA
RVERARRVARARSGTANAQLGVAEIKRHCALDGDGQKLMEQAMDKLGLSHRAYHRILKVARTIADLAGSETIAVNHLGEA
IGYRRLDRAISG

Nucleotide


Download         Length: 759 bp        

>NTDB_id=73982 K5607_RS11070 WP_221047035.1 2535641..2536399(+) (comM) [Methylogaea oryzae strain E10]
ATGAAAAACGTCGCAGCTATTTCAAGAAACAATAGCGTAAGCTATTGTTTCTTGAATCGCATTGTCGCATTTATTTCTGC
AACTGACAGCGGCGGCGTGCCCAAACCGGGGGAGATTTCCTTAGCCCATGGAGGCGTTTTATTTTTGGACGAGCTGCCGG
AATTCGACCGCAAAGTGCTGGAGGTGCTGCGCGAACCCTTGGAATCCGGCCGCATCGTCATCTCCCGCGCCGCCAACCGG
GCGGAATTCCCGGCGAACTTCCAGTTGTTGGCGGCCATGAACCCTTGTCCCTGCGGCTATTTGGGCGATGCCTCCGGCCG
CTGCCATTGCACCGCCGAACAAGTGCAACGCTACCGGGCGCGCATTTCCGGCCCTTTGCTGGACAGGATAGACCTGCACG
TGGAAGTGCCGCGCGTGCCCCACGCCGTGTTGCGGCAAGGCCAGCCCGGCGGCGAGGAAAGCAGCGCCGCCGTGCGGGCG
CGGGTGGAGCGGGCGCGCCGCGTCGCCCGGGCGCGCAGCGGCACGGCCAACGCCCAACTCGGCGTCGCGGAAATCAAACG
CCATTGCGCCTTGGACGGAGACGGGCAAAAGCTGATGGAGCAAGCCATGGACAAGCTCGGGCTGTCCCACCGCGCCTACC
ACCGCATCCTTAAAGTCGCGCGCACCATCGCCGACTTGGCCGGCAGCGAAACCATCGCCGTCAACCATCTCGGCGAAGCC
ATCGGCTACCGGCGCTTGGATCGCGCTATTTCGGGATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

55

87.302

0.48

  comM Vibrio campbellii strain DS40M4

54.545

87.302

0.476

  comM Glaesserella parasuis strain SC1401

52.193

90.476

0.472

  comM Haemophilus influenzae Rd KW20

53.636

87.302

0.468

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

50.45

88.095

0.444

  comM Legionella pneumophila str. Paris

50.698

85.317

0.433

  comM Legionella pneumophila strain ERS1305867

50.698

85.317

0.433


Multiple sequence alignment