Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   D0B88_RS02750 Genome accession   NZ_CP031727
Coordinates   621293..622801 (+) Length   502 a.a.
NCBI ID   WP_007639547.1    Uniprot ID   -
Organism   Cellvibrio sp. KY-YJ-3     
Function   ssDNA binding (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 620960..656930 621293..622801 within 0


Gene organization within MGE regions


Location: 620960..656930
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  D0B88_RS02745 (D0B88_02765) - 620960..621184 (+) 225 WP_007639548.1 accessory factor UbiK family protein -
  D0B88_RS02750 (D0B88_02770) comM 621293..622801 (+) 1509 WP_007639547.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  D0B88_RS02755 (D0B88_02775) - 622829..624544 (-) 1716 WP_151054842.1 bifunctional diguanylate cyclase/phosphodiesterase -
  D0B88_RS02760 (D0B88_02780) rep 624772..626793 (+) 2022 WP_151059201.1 DNA helicase Rep -
  D0B88_RS02765 (D0B88_02785) - 627134..630442 (+) 3309 WP_225318508.1 TonB-dependent receptor -
  D0B88_RS02770 (D0B88_02790) - 630551..630955 (-) 405 WP_191966500.1 cytochrome c5 family protein -
  D0B88_RS02775 (D0B88_02795) prsK 631129..633255 (+) 2127 WP_151054846.1 XrtA/PEP-CTERM system histidine kinase PrsK -
  D0B88_RS02780 (D0B88_02800) prsR 633265..634662 (+) 1398 WP_007639510.1 PEP-CTERM-box response regulator transcription factor -
  D0B88_RS02785 - 634903..636021 (+) 1119 WP_151054848.1 fibronectin type III domain-containing protein -
  D0B88_RS02790 (D0B88_02810) - 636095..637135 (+) 1041 WP_151054850.1 acyltransferase -
  D0B88_RS02795 (D0B88_02815) - 637322..638014 (+) 693 WP_191966501.1 PEP-CTERM sorting domain-containing protein -
  D0B88_RS02800 (D0B88_02820) - 638075..638941 (-) 867 WP_151054854.1 glycosyltransferase -
  D0B88_RS02805 (D0B88_02825) - 638983..639990 (-) 1008 WP_151054857.1 glycosyltransferase -
  D0B88_RS02810 (D0B88_02830) - 639980..642184 (-) 2205 WP_191966503.1 GNAT family N-acetyltransferase -
  D0B88_RS02815 (D0B88_02835) asnB 642187..644091 (-) 1905 WP_151054861.1 asparagine synthase (glutamine-hydrolyzing) -
  D0B88_RS02820 (D0B88_02840) - 644104..645720 (-) 1617 WP_151054863.1 lipopolysaccharide biosynthesis protein -
  D0B88_RS02825 (D0B88_02845) - 645605..646723 (-) 1119 WP_151054865.1 glycosyltransferase family 4 protein -
  D0B88_RS02830 (D0B88_02850) - 646720..648003 (-) 1284 WP_151054867.1 O-antigen ligase -
  D0B88_RS02835 (D0B88_02855) - 648012..650081 (-) 2070 WP_191966505.1 HAD family hydrolase -
  D0B88_RS02840 (D0B88_02860) - 650113..651108 (-) 996 WP_151054871.1 polysaccharide deacetylase family protein -
  D0B88_RS02845 (D0B88_02865) - 651105..652205 (-) 1101 WP_151054873.1 glycosyltransferase -
  D0B88_RS02850 (D0B88_02870) - 652202..653095 (-) 894 WP_191966506.1 glycosyltransferase -
  D0B88_RS02855 (D0B88_02875) - 653229..654503 (-) 1275 WP_151054877.1 hypothetical protein -
  D0B88_RS02860 (D0B88_02880) - 654596..655573 (-) 978 WP_151054878.1 glycosyltransferase family 2 protein -
  D0B88_RS02865 (D0B88_02885) - 655590..656930 (-) 1341 WP_151054880.1 phenylacetate--CoA ligase family protein -

Sequence


Protein


Download         Length: 502 a.a.        Molecular weight: 54557.40 Da        Isoelectric Point: 7.9906

>NTDB_id=310755 D0B88_RS02750 WP_007639547.1 621293..622801(+) (comM) [Cellvibrio sp. KY-YJ-3]
MSLAIVHSRAKLGIHAPQVTVEVHISNGLPGLSIVGLPETAVKESKDRVRSAIINSHLEFPAQRITVNLAPADLPKEGGR
YDLPIALGILAASGQIPLEALEQSEFLGELALSGELRPISAALPAALAAGDAQRNLIISGTNANEAAFSSITRVFGAENL
LQVCAHLHGRELLPRAEAIRDQQDKLLHALDILDVKGQSQAKRALEIAASGGHNLLFYGPPGTGKTMLASRLPGILPRLN
EREMLDVAAIYSVASQSKDYQWQQRPFRAPHHTASAIALVGGGSNPKPGEISLAHAGVLFLDELPEFSRQVLEVLREPLE
SGEVRISRARSQACFPARFQLVAAMNPCPCGYHGSDANRCRCTPDQVKRYRDKISGPLLDRIDMHVPVRALRQGELQTKT
LGDGSEAIRQRVEAARNLQLARQGKANHQLSAPELETYCELTQADKNLLEQAIEKLGLSTRAYHRVLKLARTLADMAARE
HLTTVDISEALSYRTLDRQLSQ

Nucleotide


Download         Length: 1509 bp        

>NTDB_id=310755 D0B88_RS02750 WP_007639547.1 621293..622801(+) (comM) [Cellvibrio sp. KY-YJ-3]
ATGTCTCTCGCCATAGTTCATTCACGCGCCAAATTAGGCATTCATGCCCCTCAGGTTACGGTGGAAGTCCATATTTCCAA
TGGTTTACCGGGCTTATCCATAGTTGGCCTGCCCGAAACAGCCGTGAAAGAAAGTAAAGACCGGGTGCGCAGCGCGATCA
TCAATAGCCACCTCGAATTCCCCGCCCAGCGGATTACCGTCAACCTTGCCCCTGCCGACCTGCCCAAAGAAGGTGGTCGC
TACGATTTACCTATAGCCCTGGGAATACTGGCCGCCTCCGGACAAATTCCACTGGAAGCCCTGGAACAAAGCGAATTTTT
GGGAGAGCTGGCGCTGTCCGGTGAATTGCGGCCGATAAGCGCCGCATTACCCGCAGCGTTGGCAGCAGGTGATGCGCAAC
GAAACCTGATTATCAGTGGAACCAATGCCAACGAAGCCGCCTTCAGCAGTATTACGCGCGTTTTTGGTGCAGAAAACCTG
TTGCAGGTTTGCGCCCACCTGCATGGGCGTGAACTTTTACCGCGCGCCGAAGCAATCCGCGACCAACAGGACAAGCTGCT
GCATGCACTGGATATTCTGGATGTTAAAGGACAATCGCAAGCCAAGCGGGCATTGGAGATAGCCGCCAGCGGTGGTCATA
ACCTGTTGTTTTACGGCCCGCCGGGCACGGGTAAAACCATGCTCGCCAGCCGCTTGCCGGGAATTCTGCCGCGCTTGAAT
GAACGCGAGATGCTCGATGTCGCCGCAATTTATTCAGTTGCCTCCCAGAGCAAAGACTATCAATGGCAGCAGCGCCCCTT
TCGCGCACCGCACCATACCGCCTCCGCTATTGCGCTGGTAGGTGGTGGCTCCAATCCCAAACCGGGCGAGATTTCCCTCG
CCCATGCCGGGGTATTATTTTTGGATGAGCTGCCCGAATTTTCGCGCCAAGTGTTGGAAGTGCTGCGCGAACCGCTGGAA
AGCGGCGAGGTGCGCATCTCCCGCGCACGCAGTCAGGCCTGTTTTCCCGCGCGCTTCCAGCTGGTCGCGGCGATGAACCC
CTGCCCCTGTGGCTACCACGGCAGCGATGCCAATCGCTGCCGCTGCACCCCGGATCAGGTAAAACGCTATCGCGATAAGA
TCTCGGGGCCATTATTGGATCGTATCGACATGCATGTGCCGGTGCGCGCCCTGCGGCAGGGGGAATTGCAAACCAAAACC
CTGGGCGATGGCAGTGAGGCGATTCGCCAGCGGGTAGAGGCGGCGCGCAATCTGCAGTTGGCGCGTCAGGGCAAAGCCAA
CCACCAACTGAGTGCACCGGAACTGGAAACTTACTGCGAGTTGACGCAAGCGGATAAGAACTTGCTGGAGCAGGCAATTG
AAAAGCTGGGCTTGTCGACGCGGGCTTACCACCGGGTATTAAAACTGGCGCGCACCCTGGCGGACATGGCCGCGCGCGAA
CACTTAACTACGGTGGATATCAGTGAAGCGCTGAGCTATCGCACGTTGGATCGACAACTCAGCCAATAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

56.349

100

0.566

  comM Glaesserella parasuis strain SC1401

55.248

100

0.556

  comM Vibrio campbellii strain DS40M4

54.96

100

0.552

  comM Legionella pneumophila strain ERS1305867

55

99.602

0.548

  comM Legionella pneumophila str. Paris

55

99.602

0.548

  comM Haemophilus influenzae Rd KW20

54.043

100

0.546

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.654

100

0.472


Multiple sequence alignment