Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM
Type   Machinery gene
Locus tag   I3B46_RS20500
Genome accession   NZ_CP065420
Coordinates   4322615..4324141 (+)
Length   508 a.a.
NCBI ID   WP_004915428.1
UniProt ID   A0A140NMR7
Organism   Providencia sp. 2.29
Function   ssDNA binding (predicted from homology)
DNA processing
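
The coordinates and the protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon of the coding sequence is a stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Coordinates of comM (I3B46_RS20500) as given in the record.
cds_bp = span_length(4322615, 4324141)
print(cds_bp, protein_length(cds_bp))  # 1527 508
```

Under these assumptions the 1527 bp span corresponds to 509 codons, that is, 508 residues plus the stop codon, which agrees with the lengths listed in the record.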

Genomic Context


Location: 4317615..4329141
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I3B46_RS20480 (I3B46_20480) - 4319049..4319975 (-) 927 WP_004915437.1 branched-chain amino acid transaminase -
  I3B46_RS20485 (I3B46_20485) ilvM 4319988..4320257 (-) 270 WP_004915433.1 acetolactate synthase 2 small subunit -
  I3B46_RS20490 (I3B46_20490) ilvG 4320254..4321900 (-) 1647 WP_004915430.1 acetolactate synthase 2 catalytic subunit -
  I3B46_RS20495 (I3B46_20495) ilvL 4322034..4322135 (-) 102 WP_071599630.1 ilv operon leader peptide -
  I3B46_RS20500 (I3B46_20500) comM 4322615..4324141 (+) 1527 WP_004915428.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  I3B46_RS20505 (I3B46_20505) - 4324326..4324664 (-) 339 WP_004915426.1 DUF413 domain-containing protein -
  I3B46_RS20510 (I3B46_20510) hdfR 4324776..4325621 (+) 846 WP_004915423.1 HTH-type transcriptional regulator HdfR -
  I3B46_RS20515 (I3B46_20515) - 4325862..4326657 (+) 796 Protein_4030 transposase -
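
The displayed location appears to be the comM coordinates extended by 5,000 bp on each side. That flank size is inferred from the numbers in this record rather than stated by the database, so the sketch below treats it as an assumption; `flanking_window` is a hypothetical helper.

```python
def flanking_window(start: int, end: int, flank: int) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


# comM coordinates from the record; the 5,000 bp flank is an inferred value.
window = flanking_window(4322615, 4324141, 5000)
print(window)  # (4317615, 4329141)
```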

Sequence


Protein


Length: 508 a.a.        Molecular weight: 55554.29 Da        Isoelectric Point: 8.7576

>NTDB_id=509359 I3B46_RS20500 WP_004915428.1 4322615..4324141(+) (comM) [Providencia sp. 2.29]
MALAIVYTRASIGMNAPLVTVEAHISNGLPGLTLVGLPETAVKESRDRVRSALLNSGFEFPAKKMTINLAPADLPKEGGR
YDLAIAIAILASSGQVPTKILHQYEFLGELALSGHIRHVNGAIPAALAALTQKRQLVLSSENQYELNLLPDNSVKFAGTL
LELCHFLYEKMTLSGNFHPTEIPEPPICDGNISDIIGQEQGKRALEICASGGHNLLLLGPPGTGKTMLASRLKTLLPSLT
PQEALEVASIHSLCHSSDLTSPWPPRPFRAPHHSASMAALIGGGSLPKPGEISLAHNGILFLDELPEFSRSVLDALREPL
ESRQVIISRAKAKVCFPANFQLIAALNPSPTGQYQGDYCRSSSTKILRYLSRVSGPFLDRFDLSIEIPLLPLGTLSHQQY
QSENSEQIRARVILAREIQMKRMGKLNSQLTARETTNLCQLTSKDSLFLEHALNKLGLSIRAWHRILRVARTIADLQNTN
NIEKPHLLEALGYRAMDKLLLHLQKQVS

Nucleotide


Length: 1527 bp

>NTDB_id=509359 I3B46_RS20500 WP_004915428.1 4322615..4324141(+) (comM) [Providencia sp. 2.29]
ATGGCATTAGCAATTGTGTATACAAGAGCATCGATCGGAATGAATGCGCCACTGGTTACTGTTGAAGCACATATTAGTAA
TGGGCTACCTGGATTGACACTTGTAGGTTTACCTGAAACGGCTGTAAAAGAATCGAGAGATAGAGTACGGAGCGCATTAT
TAAACAGCGGTTTTGAATTCCCTGCAAAAAAAATGACGATTAACTTAGCCCCTGCCGATCTCCCGAAAGAAGGGGGACGT
TACGATCTTGCGATAGCAATCGCTATCTTAGCGTCATCAGGCCAAGTGCCTACAAAAATACTTCATCAGTATGAATTTCT
CGGTGAACTCGCTTTGTCTGGTCATATTCGCCATGTGAATGGTGCTATCCCTGCCGCACTCGCGGCATTAACACAAAAAA
GGCAACTCGTTCTTTCATCTGAAAATCAATATGAGCTGAATTTATTACCTGATAATAGTGTTAAGTTTGCAGGAACATTA
TTAGAACTTTGTCATTTTTTATACGAGAAAATGACATTATCAGGTAATTTCCATCCGACCGAAATCCCTGAGCCCCCCAT
TTGCGACGGGAATATTAGCGATATTATTGGACAAGAGCAGGGAAAAAGAGCGTTAGAGATCTGTGCAAGTGGTGGGCACA
ACTTATTGTTATTAGGCCCTCCTGGCACAGGAAAAACGATGCTAGCAAGTCGCCTAAAAACATTACTACCGTCCCTAACG
CCACAGGAAGCTCTAGAAGTCGCCTCTATACATAGCCTATGCCATTCAAGCGATTTAACTTCTCCTTGGCCTCCTAGACC
CTTTAGAGCTCCTCACCATAGCGCGTCAATGGCAGCACTCATTGGTGGAGGGAGCCTTCCTAAACCCGGTGAAATATCAT
TAGCCCATAATGGCATTTTATTTCTGGATGAATTGCCTGAGTTTAGCCGCTCGGTACTTGACGCATTACGTGAGCCACTT
GAATCAAGGCAAGTTATTATTTCCCGCGCTAAAGCAAAAGTGTGTTTCCCAGCCAACTTTCAACTCATCGCCGCACTGAA
CCCAAGCCCTACAGGCCAATATCAAGGTGATTACTGCCGAAGCTCCTCCACGAAAATCTTACGCTACTTATCACGCGTTT
CAGGTCCTTTTCTTGATCGATTTGATCTGTCTATTGAAATACCGCTGCTTCCGTTGGGTACACTCAGTCATCAACAGTAC
CAAAGCGAAAATAGTGAGCAAATCCGCGCTAGAGTCATACTTGCAAGAGAGATACAAATGAAAAGAATGGGGAAACTCAA
TAGCCAATTAACCGCCAGAGAAACGACAAACCTATGCCAACTCACATCCAAGGATTCACTTTTTTTGGAGCATGCCTTAA
ACAAACTTGGGTTATCGATACGTGCTTGGCATCGCATTTTGAGAGTCGCTCGAACTATTGCCGACTTACAAAATACCAAT
AATATCGAAAAACCCCATCTTCTCGAAGCTCTCGGTTATAGAGCGATGGATAAGTTATTGCTTCATTTGCAAAAACAAGT
AAGCTGA
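
The nucleotide record can be related to the protein record by translating it codon by codon. The following is a minimal sketch using the standard genetic code (for the codons shown here it gives the same result as the bacterial table 11); `translate` is an illustrative helper, not a database tool, and the two inputs are short fragments copied from the start and end of the sequence above.

```python
from itertools import product

# Standard genetic code, codons ordered by base order T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDDDEEEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt and last 18 nt of the comM coding sequence.
print(translate("ATGGCATTAGCAATTGTGTATACAAGAGCATCG"))  # MALAIVYTRAS
print(translate("CAAAAACAAGTAAGCTGA"))                 # QKQVS
```

Both fragments reproduce the corresponding ends of the protein sequence listed above (MALAIVYTRAS... and ...QKQVS), with the final TGA read as the stop codon.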


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A140NMR7

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552 60.672 99.606 0.604
  comM Glaesserella parasuis strain SC1401 60.314 100 0.604
  comM Haemophilus influenzae Rd KW20 59.883 100 0.602
  comM Vibrio campbellii strain DS40M4 59.055 100 0.591
  comM Legionella pneumophila str. Paris 48.104 98.622 0.474
  comM Legionella pneumophila strain ERS1305867 48.104 98.622 0.474
  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868 44.902 100 0.451