Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   EBL_RS18480 Genome accession   NC_017910
Coordinates   3951491..3953011 (+) Length   506 a.a.
NCBI ID   WP_002444127.1    Uniprot ID   I2BE97
Organism   Shimwellia blattae DSM 4481 = NBRC 105725     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 3946491..3958011
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EBL_RS18465 (EBL_c38120) - 3948072..3949001 (-) 930 WP_002444132.1 branched-chain amino acid transaminase -
  EBL_RS18470 (EBL_c38130) ilvM 3949019..3949282 (-) 264 WP_002444130.1 acetolactate synthase 2 small subunit -
  EBL_RS18475 (EBL_c38140) ilvG 3949279..3950925 (-) 1647 WP_002444128.1 acetolactate synthase 2 catalytic subunit -
  EBL_RS20200 ilvL 3951065..3951163 (-) 99 WP_071840843.1 ilv operon leader peptide -
  EBL_RS18480 (EBL_c38160) comM 3951491..3953011 (+) 1521 WP_002444127.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  EBL_RS18485 (EBL_c38170) - 3953039..3953377 (-) 339 WP_002444126.1 DUF413 domain-containing protein -
  EBL_RS18490 (EBL_c38180) hdfR 3953497..3954357 (+) 861 WP_002444125.1 HTH-type transcriptional regulator HdfR -

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 54708.03 Da        Isoelectric Point: 7.8849

>NTDB_id=51201 EBL_RS18480 WP_002444127.1 3951491..3953011(+) (comM) [Shimwellia blattae DSM 4481 = NBRC 105725]
MTLARIHTRAALGIAAPYVTVEAHISAGLPALTIVGLPETTVKESRDRVRSAIINSGYTFPARKITINLAPADLPKEGSR
YDLPIAIALLAASEQLSATALNQYEFIGELALTGAMRGVCGAISSAIQASHAGREMIVPQDNEAEVGLIDGPGCYVAGTL
LEVCAILEGKQPPLRPRAPDPTPAEQAEDLRDIIGQEQGKRALEITAAGGHNLLFIGPPGTGKTMLATRLIGLLPPLTLS
EALESAAIKSLADNRELAATWRQRPFRAPHHSASLNAMVGGGRIPLPGEISLAHNGVLFLDELPQFERRALDALREPIET
GEIHLSRTRAKIHYPARFQLIAAMNPSPGGHYQGELNRSTPGQILRYLSRLSGPFLDRFDITLEVPLLAPGMLSQGHTAG
ETSATVRERVIQARALQLARQHKLNAQLSSQEVNALCPLEKPDARWLEEAITQLGLSIRAWQRILKVARTIADLAGAEQL
EKKHLQEALSYRAMDRLLIHLQKQLA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=51201 EBL_RS18480 WP_002444127.1 3951491..3953011(+) (comM) [Shimwellia blattae DSM 4481 = NBRC 105725]
ATGACACTTGCAAGAATTCATACCCGGGCAGCGCTGGGGATCGCGGCACCTTATGTCACCGTAGAGGCACATATCAGCGC
CGGGTTGCCCGCGCTCACCATTGTCGGCTTACCCGAAACCACAGTGAAAGAGTCCCGGGATCGGGTGCGCAGTGCCATCA
TTAACAGTGGTTATACCTTCCCGGCACGTAAAATTACTATCAACCTCGCACCGGCTGATTTACCCAAAGAGGGCAGCCGC
TATGATCTCCCCATTGCTATTGCACTTCTGGCAGCGTCAGAGCAGCTTTCTGCCACTGCACTGAACCAGTATGAGTTCAT
TGGTGAACTGGCGCTCACAGGCGCAATGCGGGGGGTTTGCGGGGCCATATCCAGTGCCATCCAGGCAAGCCACGCTGGCA
GAGAAATGATTGTCCCTCAGGATAACGAAGCTGAAGTGGGGCTGATTGACGGGCCTGGCTGTTATGTTGCCGGAACGCTG
CTGGAAGTCTGCGCCATTCTTGAAGGCAAACAGCCTCCACTGCGGCCCCGGGCACCGGATCCCACCCCGGCGGAACAGGC
TGAAGATTTGCGCGACATTATTGGCCAGGAGCAGGGAAAACGGGCGCTGGAGATAACCGCTGCGGGCGGCCACAATCTGC
TGTTCATCGGCCCACCTGGCACGGGGAAAACCATGCTGGCGACACGCCTTATCGGGCTGCTTCCCCCGCTAACCCTGAGC
GAAGCGCTGGAGAGCGCCGCCATAAAGAGCCTGGCGGATAACCGGGAGCTGGCCGCCACCTGGCGCCAGCGGCCCTTTCG
CGCCCCTCACCACAGTGCATCATTGAATGCGATGGTGGGTGGCGGGCGTATACCGCTACCGGGGGAGATTTCGCTGGCCC
ATAACGGCGTGCTGTTTCTTGATGAATTACCGCAATTTGAACGCCGGGCGCTGGATGCCCTGCGCGAACCCATTGAAACC
GGTGAAATTCATTTATCGCGCACCCGGGCAAAAATACATTATCCAGCCCGTTTTCAGCTGATAGCCGCCATGAATCCCAG
CCCCGGGGGCCATTATCAGGGTGAGCTGAACCGCTCTACCCCCGGGCAAATCCTGCGCTATCTCAGCCGGTTATCCGGGC
CCTTTCTCGACCGGTTTGATATTACCCTGGAAGTCCCGCTGCTCGCCCCGGGGATGCTCAGCCAGGGGCATACCGCAGGG
GAAACCAGCGCGACCGTAAGAGAGCGGGTTATTCAGGCCCGGGCGTTGCAGCTTGCCCGGCAGCACAAACTTAATGCACA
GCTCAGCAGCCAGGAGGTAAATGCCCTATGCCCGCTGGAAAAACCAGACGCCCGCTGGCTTGAAGAGGCAATAACGCAGC
TGGGGTTATCCATCAGAGCCTGGCAGCGAATACTGAAGGTGGCCCGGACGATTGCCGATCTCGCCGGTGCTGAGCAGTTG
GAGAAAAAACACCTTCAGGAGGCGCTCAGCTACCGGGCTATGGACAGGCTGTTAATCCATCTGCAAAAACAACTGGCATA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB I2BE97

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

59.406

99.802

0.593

  comM Glaesserella parasuis strain SC1401

58.858

100

0.591

  comM Vibrio campbellii strain DS40M4

58.02

99.802

0.579

  comM Haemophilus influenzae Rd KW20

57.905

100

0.579

  comM Legionella pneumophila str. Paris

49.2

98.814

0.486

  comM Legionella pneumophila strain ERS1305867

49.2

98.814

0.486

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

42.604

100

0.427


Multiple sequence alignment