Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comM
Type: Machinery gene
Locus tag: AAGR08_RS19500
Genome accession: NZ_CP155066
Coordinates: 4218129..4219649 (-)
Length: 506 a.a.
NCBI ID: WP_345827792.1
UniProt ID: -
Organism: Pantoea sp. BRR-3P
Function: ssDNA binding (predicted from homology); DNA processing
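The coordinates and the protein length above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank-style convention, not stated on this page) and that the gene ends with a single stop codon.

```python
# Coordinates copied from the Overview; assumed 1-based and inclusive.
start, end = 4218129, 4219649

gene_length_bp = end - start + 1       # inclusive interval length
codon_count = gene_length_bp // 3      # number of codons, stop included
protein_length_aa = codon_count - 1    # drop the terminal stop codon

print(gene_length_bp, protein_length_aa)  # 1521 506
```

Both numbers agree with the lengths reported in the Sequence section (1521 bp, 506 a.a.).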

Genomic Context


Location: 4213129..4224649
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AAGR08_RS19490 (AAGR08_19490) | hdfR | 4216818..4217639 (-) | 822 | WP_097095670.1 | HTH-type transcriptional regulator HdfR | - |
| AAGR08_RS19495 (AAGR08_19495) | - | 4217761..4218099 (+) | 339 | WP_007888811.1 | DUF413 domain-containing protein | - |
| AAGR08_RS19500 (AAGR08_19500) | comM | 4218129..4219649 (-) | 1521 | WP_345827792.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| AAGR08_RS19505 (AAGR08_19505) | ilvL | 4219978..4220076 (+) | 99 | WP_071531048.1 | ilv operon leader peptide | - |
| AAGR08_RS19510 (AAGR08_19510) | ilvG | 4220226..4221872 (+) | 1647 | WP_097095672.1 | acetolactate synthase 2 catalytic subunit | - |
| AAGR08_RS19515 (AAGR08_19515) | ilvM | 4221869..4222126 (+) | 258 | WP_097095673.1 | acetolactate synthase 2 small subunit | - |
| AAGR08_RS19520 (AAGR08_19520) | - | 4222146..4223075 (+) | 930 | WP_097095674.1 | branched-chain amino acid transaminase | - |
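The Location window and the spacing between neighbouring genes can be derived from the coordinates listed above. The following is a minimal sketch: the 5000 bp flank is an assumption inferred from the numbers (the page does not state how the window is chosen), and `genes`, `FLANK`, and `gaps` are illustrative names.

```python
# (locus tag, start, end) copied from the Genomic Context rows, 1-based inclusive.
genes = [
    ("AAGR08_RS19490", 4216818, 4217639),
    ("AAGR08_RS19495", 4217761, 4218099),
    ("AAGR08_RS19500", 4218129, 4219649),  # comM
    ("AAGR08_RS19505", 4219978, 4220076),
    ("AAGR08_RS19510", 4220226, 4221872),
    ("AAGR08_RS19515", 4221869, 4222126),
    ("AAGR08_RS19520", 4222146, 4223075),
]

FLANK = 5000  # assumed flank size on each side of the target gene

target = next(g for g in genes if g[0] == "AAGR08_RS19500")
window = (target[1] - FLANK, target[2] + FLANK)

# Size of each gene, and the gap to the next one (negative means overlap).
sizes = {tag: end - start + 1 for tag, start, end in genes}
gaps = {
    (a[0], b[0]): b[1] - a[2] - 1
    for a, b in zip(genes, genes[1:])
}

print(window)
for pair, gap in gaps.items():
    print(pair, gap)
```

Under these assumptions the window reproduces the listed Location, and the only negative gap is between ilvG and ilvM, whose coding regions overlap by 4 bp.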

Sequence


Protein


Length: 506 a.a.        Molecular weight: 55069.20 Da        Isoelectric point: 7.5726

>NTDB_id=996093 AAGR08_RS19500 WP_345827792.1 4218129..4219649(-) (comM) [Pantoea sp. BRR-3P]
MSLSKVLTRAALGVQAPLVTVEVHISNGLPALTLVGLPETTVKEARERVRSAIITSGFTFPAKRVTINLAPADLPKEGGR
YDLPIAIAILAASEQLPDDKLTQYEFLGELALNGALCGVQAAIPAAMAALQAGRQMVLAEQNQQDVGLIQQGETLVGNHL
VDICAFLHNKTTLTVARYHPEALDQQTQDLSDIIGQDQGRRALEITAAGGHNLLLIGPPGTGKTMLATRLPGIMPPLDDQ
EALECAAIASLVSSGNLHHQWRKRPFRAPHHSASLYALVGGGSLPRPGEISLAHNGVLFLDELPEFERKTLDALREPIES
GEISISRTRAKVTYPARFQLVGAMNPSPTGHYQGNHNRCTPQQVLRYLSRLSGPFLDRFDLSLEVPLLPSGTLSKKQQVS
ESSAEVLSRVIAARKIQTARSGKINAQLSNPEILRWCSLRQEEAEWLEGVLNSLGLSVRAWQRILKVARTIADLAGEEQI
TRSHLQEAVGYRSIDRLMIYLHKSLE
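The FASTA header of this record packs several fields into one line. A minimal parsing sketch follows; the field layout is inferred from this single header, so the pattern and the field names (`ntdb_id`, `locus_tag`, and so on) are assumptions rather than a documented format.

```python
import re

header = (">NTDB_id=996093 AAGR08_RS19500 WP_345827792.1 "
          "4218129..4219649(-) (comM) [Pantoea sp. BRR-3P]")

# Pattern inferred from the header above; other records may deviate from it.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]+)\)\s+"
    r"\[(?P<organism>[^\]]+)\]$"
)

match = HEADER_RE.match(header)
fields = match.groupdict() if match else {}
print(fields)
```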

Nucleotide


Length: 1521 bp

>NTDB_id=996093 AAGR08_RS19500 WP_345827792.1 4218129..4219649(-) (comM) [Pantoea sp. BRR-3P]
ATGTCTTTATCAAAGGTTTTAACACGGGCTGCGTTAGGGGTTCAGGCTCCGCTCGTCACGGTAGAAGTGCATATCAGTAA
TGGCTTGCCTGCCCTGACATTAGTCGGTTTGCCCGAAACCACCGTGAAAGAGGCACGAGAACGGGTTCGCAGTGCGATTA
TCACTAGCGGTTTCACTTTCCCGGCCAAACGCGTCACCATTAATCTTGCCCCTGCCGATCTGCCTAAAGAAGGGGGGCGT
TATGACCTACCCATTGCCATTGCGATTCTGGCAGCCTCTGAGCAGCTACCAGACGATAAACTGACGCAATATGAATTCCT
GGGAGAGCTAGCCCTCAACGGCGCGCTCTGTGGCGTTCAGGCCGCCATTCCGGCAGCAATGGCGGCGCTGCAAGCAGGAC
GACAAATGGTACTCGCTGAGCAGAATCAACAGGATGTTGGTTTGATTCAGCAAGGTGAAACCTTGGTGGGCAATCATCTG
GTGGATATCTGCGCCTTTCTCCATAACAAAACGACGCTCACCGTTGCCCGCTATCATCCTGAAGCCCTTGATCAGCAAAC
GCAGGACCTTAGCGATATCATTGGCCAGGATCAGGGCCGGCGTGCGCTGGAAATTACAGCTGCAGGTGGCCATAACCTAT
TACTGATTGGTCCGCCCGGTACGGGAAAAACCATGTTAGCGACACGTCTGCCCGGCATCATGCCCCCGTTAGACGACCAG
GAAGCGCTGGAGTGCGCCGCTATTGCCAGCCTGGTAAGTAGTGGCAACCTGCATCATCAATGGCGTAAACGGCCGTTCAG
GGCCCCTCACCACAGTGCTTCTCTGTACGCACTGGTAGGGGGCGGTTCATTGCCGCGCCCGGGTGAAATCTCTTTGGCGC
ACAATGGCGTGTTATTCCTGGATGAGTTACCGGAGTTTGAACGCAAAACGCTGGATGCATTGCGTGAGCCAATAGAATCG
GGCGAGATTTCTATTTCGCGCACCCGCGCCAAAGTGACCTATCCTGCGCGCTTTCAATTAGTGGGAGCCATGAATCCTAG
CCCCACGGGCCATTATCAGGGCAACCACAATCGTTGTACGCCCCAACAGGTTTTACGTTACCTCAGCCGTCTATCAGGAC
CTTTTCTTGACCGCTTTGATTTGTCGCTGGAAGTACCGCTCCTGCCCAGCGGTACCCTGAGTAAGAAACAGCAAGTTAGC
GAAAGCAGTGCTGAAGTGCTAAGCCGTGTTATAGCTGCCCGTAAGATTCAGACGGCGCGCAGTGGCAAGATAAACGCACA
ACTTTCAAACCCTGAGATTTTACGTTGGTGTTCCCTGAGGCAGGAGGAAGCTGAATGGCTGGAAGGGGTACTGAATTCGT
TAGGACTTTCAGTACGCGCCTGGCAGCGGATTTTAAAAGTCGCAAGGACAATTGCTGACTTAGCGGGTGAAGAACAGATT
ACGCGGAGCCATCTGCAGGAAGCGGTGGGATATCGTAGTATTGACCGTTTAATGATCTATTTGCACAAGAGCCTGGAATA
A
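The nucleotide record is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly to check it against the protein record. The sketch below builds the standard codon table and translates only the first and last few codons, copied from the sequence above. It assumes the standard genetic code; the bacterial table (NCBI table 11) assigns the same amino acids to codons and differs only in permitted start codons.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

first_codons = "ATGTCTTTATCAAAGGTT"  # first 18 nt of the record
last_codons = "AGCCTGGAATAA"         # last 12 nt of the record

print(translate(first_codons))  # MSLSKV
print(translate(last_codons))   # SLE*
```

These match the start (MSLSKV) and end (SLE) of the protein sequence listed above.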


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio cholerae strain A1552 | 60.757 | 99.209 | 0.603 |
| comM | Haemophilus influenzae Rd KW20 | 60.437 | 99.407 | 0.601 |
| comM | Glaesserella parasuis strain SC1401 | 59.841 | 99.407 | 0.595 |
| comM | Vibrio campbellii strain DS40M4 | 59.562 | 99.209 | 0.591 |
| comM | Legionella pneumophila str. Paris | 47.082 | 98.221 | 0.462 |
| comM | Legionella pneumophila strain ERS1305867 | 47.082 | 98.221 | 0.462 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 43.083 | 100 | 0.431 |
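The Ha-values listed above appear to equal the product of identity and coverage, each expressed as a fraction. This is an inference from the numbers on this page, not a definition the page provides, so the check below should be read as a consistency test under that assumption.

```python
# (organism, identity %, coverage %, listed Ha-value) copied from the hits above.
hits = [
    ("Vibrio cholerae strain A1552", 60.757, 99.209, 0.603),
    ("Haemophilus influenzae Rd KW20", 60.437, 99.407, 0.601),
    ("Glaesserella parasuis strain SC1401", 59.841, 99.407, 0.595),
    ("Vibrio campbellii strain DS40M4", 59.562, 99.209, 0.591),
    ("Legionella pneumophila str. Paris", 47.082, 98.221, 0.462),
    ("Legionella pneumophila strain ERS1305867", 47.082, 98.221, 0.462),
    ("Riemerella anatipestifer ATCC 11845 = DSM 15868", 43.083, 100.0, 0.431),
]

def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity times fractional coverage."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)

computed = [ha_value(identity, coverage) for _, identity, coverage, _ in hits]
listed = [ha for *_, ha in hits]
print(computed == listed)  # True
```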

