Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: K5607_RS11000
Genome accession: NZ_AP019782
Coordinates: 2519380..2520348 (+)
Length: 322 a.a.
NCBI ID: WP_221047030.1
UniProt ID: A0A8D4VPT1
Organism: Methylogaea oryzae strain E10
Function: ssDNA binding (predicted from homology); DNA processing
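The coordinates and the protein length in this overview can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function name `cds_lengths` is illustrative only and is not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, protein length) for a 1-based inclusive CDS range."""
    nt_len = end - start + 1           # both end points are counted
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    aa_len = nt_len // 3 - 1           # the terminal stop codon encodes no residue
    return nt_len, aa_len


# comM coordinates from this record
print(cds_lengths(2519380, 2520348))   # (969, 322)
```

The result agrees with the 969 bp nucleotide length and the 322 a.a. protein length reported for this entry.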

Genomic Context


Location: 2514380..2525348
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  K5607_RS10980 (MoryE10_21430) - 2514621..2515913 (-) 1293 WP_221047028.1 alginate export family protein -
  K5607_RS10985 (MoryE10_21440) - 2516047..2517177 (-) 1131 WP_054772942.1 glucose-6-phosphate dehydrogenase assembly protein OpcA -
  K5607_RS10990 (MoryE10_21450) zwf 2517174..2518718 (-) 1545 WP_221047029.1 glucose-6-phosphate dehydrogenase -
  K5607_RS18255 (MoryE10_21460) - 2518934..2519353 (+) 420 WP_343222897.1 accessory factor UbiK family protein -
  K5607_RS11000 (MoryE10_21470) comM 2519380..2520348 (+) 969 WP_221047030.1 magnesium chelatase domain-containing protein Machinery gene
  K5607_RS11005 (MoryE10_21480) - 2520356..2522155 (+) 1800 WP_054772941.1 hypothetical protein -
  K5607_RS11010 (MoryE10_21490) - 2522215..2522673 (+) 459 WP_246598837.1 type II toxin-antitoxin system HicB family antitoxin -
  K5607_RS11015 (MoryE10_21500) - 2522731..2523432 (+) 702 WP_082411373.1 thermonuclease family protein -
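Rows of the table above can be checked programmatically. The following is a minimal sketch that pulls the coordinates, strand, and size out of a row and compares the size with the coordinate span. It also reproduces the Location window, which is consistent with the comM coordinates extended by 5,000 bp on each side; that flank size is inferred from the numbers shown here, not from a documented setting. The names `parse_row` and `flank_window` are hypothetical.

```python
import re

# Matches "start..end (strand) size" as laid out in the table rows
ROW = re.compile(r"(\d+)\.\.(\d+) \(([+-])\) (\d+)")

rows = [
    "K5607_RS10990 (MoryE10_21450) zwf 2517174..2518718 (-) 1545 "
    "WP_221047029.1 glucose-6-phosphate dehydrogenase -",
    "K5607_RS11000 (MoryE10_21470) comM 2519380..2520348 (+) 969 "
    "WP_221047030.1 magnesium chelatase domain-containing protein Machinery gene",
]


def parse_row(row: str) -> dict:
    """Extract locus tag, coordinates, strand, and size from one table row."""
    m = ROW.search(row)
    if m is None:
        raise ValueError("no coordinate field found")
    return {
        "locus_tag": row.split()[0],
        "start": int(m[1]),
        "end": int(m[2]),
        "strand": m[3],
        "size": int(m[4]),
    }


def flank_window(start: int, end: int, flank: int = 5000) -> tuple[int, int]:
    """Extend a range by `flank` bp on each side (assumed flank size).

    The lower bound is clamped to 1; the upper bound is not clamped
    because the genome length is not given in this record.
    """
    return max(1, start - flank), end + flank


for row in rows:
    gene = parse_row(row)
    span = gene["end"] - gene["start"] + 1
    print(gene["locus_tag"], gene["size"] == span)

print(flank_window(2519380, 2520348))  # (2514380, 2525348)
```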

Sequence


Protein


Length: 322 a.a.        Molecular weight: 33598.56 Da        Isoelectric point: 6.6855

>NTDB_id=73981 K5607_RS11000 WP_221047030.1 2519380..2520348(+) (comM) [Methylogaea oryzae strain E10]
MTLATVYSRGKQGIQAPLVTVEAHLSNGLPSLSIVGLPEAAVKESKDRVRSALLTCHFEFPAQRITINLAPADLPKEGGR
FDLAIAVTILAASGQIRQAELGRYELLGELSLSGELRPAKGALPVAVAARDCGRALILPGSNAAEAALAAGAEILAANHL
LEVCGHLNGEAPLPETPSDRPSAPPVFDVDLADVHGQYQAKRALEIAAAGRHNLLMLGPPGTGKSMLAARLPTLLPALSE
AEALETAAITSVSDLPLDPGRWLAPPYRAPHHTASAAALVGGGCQFSKLPRRFGNSCDNFVTTESYSPATPSLLAQGWRF
CL
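The FASTA header of this record packs several fields into one line. As a sketch of how it might be split into named fields, the pattern below is derived only from the header shown in this entry; other records in the database may deviate from it, and `parse_header` is a hypothetical helper.

```python
import re

# Pattern inferred from the single header in this record (an assumption)
HEADER = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line of this record's form into a dict of fields."""
    m = HEADER.match(line.strip())
    if m is None:
        raise ValueError("header does not follow the expected layout")
    fields = m.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])   # numeric fields as integers
    return fields


header = (">NTDB_id=73981 K5607_RS11000 WP_221047030.1 "
          "2519380..2520348(+) (comM) [Methylogaea oryzae strain E10]")
print(parse_header(header))
```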

Nucleotide


Length: 969 bp

>NTDB_id=73981 K5607_RS11000 WP_221047030.1 2519380..2520348(+) (comM) [Methylogaea oryzae strain E10]
ATGACCCTTGCCACCGTCTATAGCCGGGGCAAGCAAGGCATCCAAGCGCCGCTGGTGACGGTGGAGGCGCACCTGTCCAA
CGGCCTGCCCAGCCTGTCCATCGTCGGATTGCCGGAAGCGGCGGTCAAGGAAAGCAAGGACCGGGTGCGCAGCGCCCTGC
TCACCTGCCATTTCGAATTCCCGGCCCAACGCATCACCATCAACCTGGCGCCGGCCGATTTGCCCAAGGAAGGCGGACGC
TTCGACCTGGCCATCGCCGTGACCATCCTGGCCGCCTCCGGCCAGATCCGCCAGGCCGAATTGGGCCGCTACGAGTTGCT
GGGCGAACTGTCCCTCAGCGGCGAGCTGCGCCCCGCCAAGGGAGCCCTGCCCGTGGCCGTGGCGGCCCGCGATTGCGGCC
GCGCCTTGATCCTGCCCGGAAGCAACGCCGCCGAGGCCGCCCTGGCCGCCGGCGCGGAAATCCTGGCGGCCAATCACCTG
TTGGAAGTCTGCGGCCACCTCAACGGCGAGGCCCCCTTGCCGGAAACGCCCAGCGACCGGCCGTCCGCCCCGCCCGTCTT
CGACGTGGACCTGGCCGACGTGCACGGCCAATACCAAGCCAAGCGCGCCCTGGAAATCGCCGCCGCCGGCCGCCACAACC
TGCTCATGCTAGGCCCGCCCGGCACCGGCAAGTCCATGCTGGCGGCGCGTTTGCCGACCTTGCTGCCGGCACTGAGCGAA
GCGGAAGCCTTGGAAACCGCCGCCATCACCTCGGTCAGCGATCTACCCCTGGACCCCGGCCGCTGGCTGGCGCCGCCCTA
CAGGGCGCCGCACCATACGGCGTCCGCCGCCGCCCTGGTCGGCGGCGGTTGTCAGTTTTCGAAATTGCCGCGACGTTTTG
GAAATAGCTGCGACAACTTCGTAACCACCGAATCATATTCACCCGCAACGCCATCCTTGCTCGCACAAGGATGGCGTTTT
TGTTTGTGA
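The nucleotide record above encodes the protein record listed earlier. A minimal way to confirm this is to translate the sequence codon by codon. The sketch below assumes the standard genetic code and an unambiguous upper- or lower-case DNA string (an `N` or other ambiguity code would raise a `KeyError`); it is an illustration, not the tool used to generate this page.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the standard genetic code, in TCAG codon order; '*' is a stop
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate DNA in frame 1; trailing bases that do not fill a codon are ignored."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 9 nt of the comM sequence in this record
print(translate("ATGACCCTTGCCACCGTCTATAGCCGGGGC"))  # MTLATVYSRG
print(translate("TGTTTGTGA"))                       # CL*
```

Both fragments match the start (`MTLATVYSRG`) and end (`CL`) of the protein sequence, with `TGA` as the stop codon.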


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552 57.292 89.441 0.512
  comM Vibrio campbellii strain DS40M4 55.477 87.888 0.488
  comM Glaesserella parasuis strain SC1401 54.355 89.13 0.484
  comM Haemophilus influenzae Rd KW20 53.472 89.441 0.478
  comM Legionella pneumophila str. Paris 50.883 87.888 0.447
  comM Legionella pneumophila strain ERS1305867 50.883 87.888 0.447
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 44.444 89.441 0.398
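The Ha-value column is not defined on this page. The listed values are, however, numerically consistent with the product of the identity and coverage fractions, rounded to three decimals. The sketch below checks that relationship against the rows above; treat it as an observation about these seven hits, not as the database's documented formula.

```python
# (organism, identities %, coverage %, listed Ha-value), copied from the table
hits = [
    ("Vibrio cholerae strain A1552", 57.292, 89.441, 0.512),
    ("Vibrio campbellii strain DS40M4", 55.477, 87.888, 0.488),
    ("Glaesserella parasuis strain SC1401", 54.355, 89.13, 0.484),
    ("Haemophilus influenzae Rd KW20", 53.472, 89.441, 0.478),
    ("Legionella pneumophila str. Paris", 50.883, 87.888, 0.447),
    ("Legionella pneumophila strain ERS1305867", 50.883, 87.888, 0.447),
    ("Riemerella anatipestifer ATCC 11845 = DSM 15868", 44.444, 89.441, 0.398),
]


def identity_coverage_product(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, each expressed as a fraction of 1."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


for organism, identity, coverage, listed in hits:
    computed = identity_coverage_product(identity, coverage)
    # Allow for rounding of the listed value to three decimals
    print(f"{organism}: computed {computed:.4f}, listed {listed}")
```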


Multiple sequence alignment