Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: NTU39_RS02805
Genome accession: NZ_CP102780
Coordinates: 530085..530750 (-)
Length: 221 a.a.
NCBI ID: WP_308340642.1
UniProt ID: -
Organism: Pandoraea commovens strain LB-19-202-79
Function: DNA uptake (predicted from homology)
DNA binding and uptake
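
The coordinates and the protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank-style convention, not stated on this page) and that the coding sequence ends with one stop codon; the variable names are illustrative only.

```python
# Values copied from the Overview record.
start, end = 530085, 530750
protein_length_aa = 221

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end - start + 1

# Assumption: one codon per residue plus a single terminal stop codon.
expected_bp = (protein_length_aa + 1) * 3

print(gene_length_bp, expected_bp)
```

Under these assumptions both numbers come out to 666, which agrees with the nucleotide length given in the Sequence section.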

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type        MGE coordinates  Gene coordinates  Relative position  Distance (bp)
Genomic island  531461..550089   530085..530750    flank              711
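
The distance in the summary row can be reproduced from the two coordinate pairs. This is a minimal sketch: the function `flank_distance` is hypothetical, and the rule it applies (MGE start minus gene end when the gene lies upstream) is an assumption that happens to match the listed value, not a documented definition.

```python
def flank_distance(gene, mge):
    """Distance between a gene and an MGE, each given as (start, end).

    Assumed rule: difference between the nearest boundaries, 0 on overlap.
    """
    gene_start, gene_end = gene
    mge_start, mge_end = mge
    if gene_end < mge_start:      # gene lies entirely before the MGE
        return mge_start - gene_end
    if mge_end < gene_start:      # gene lies entirely after the MGE
        return gene_start - mge_end
    return 0                      # intervals overlap


comM = (530085, 530750)
island = (531461, 550089)
distance = flank_distance(comM, island)
print(distance)
```

With the coordinates above this gives 711, the same figure as in the table.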


Gene organization within MGE regions


Location: 530085..550089
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                                 Description
NTU39_RS02805 (NTU39_02805)  comM       530085..530750 (-)    666        WP_308340642.1  ATP-binding protein                                     Machinery gene
NTU39_RS02810 (NTU39_02810)  -          530794..531459 (+)    666        WP_257959152.1  TnsA endonuclease N-terminal domain-containing protein  -
NTU39_RS02815 (NTU39_02815)  -          531461..533332 (+)    1872       WP_257959153.1  Mu transposase C-terminal domain-containing protein     -
NTU39_RS02820 (NTU39_02820)  -          533341..534231 (+)    891        WP_257959154.1  TniB family NTP-binding protein                         -
NTU39_RS26975                -          534285..535403 (+)    1119       WP_373275388.1  TniQ family protein                                     -
NTU39_RS02825 (NTU39_02825)  -          535513..536028 (+)    516        WP_257959155.1  3'-5' exoribonuclease                                   -
NTU39_RS02830 (NTU39_02830)  -          536876..537895 (+)    1020       WP_257959156.1  DUF4917 family protein                                  -
NTU39_RS02835 (NTU39_02835)  -          538113..538931 (+)    819        WP_257959157.1  hypothetical protein                                    -
NTU39_RS02840 (NTU39_02840)  -          539332..540129 (+)    798        WP_257959158.1  hypothetical protein                                    -
NTU39_RS02845 (NTU39_02845)  -          540492..541304 (-)    813        WP_257959159.1  hypothetical protein                                    -
NTU39_RS02850 (NTU39_02850)  -          541291..544299 (-)    3009       WP_257959161.1  class I SAM-dependent DNA methyltransferase             -
NTU39_RS02855 (NTU39_02855)  -          544292..544654 (-)    363        WP_257959162.1  hypothetical protein                                    -
NTU39_RS02860 (NTU39_02860)  -          544831..545169 (+)    339        WP_257959163.1  hypothetical protein                                    -
NTU39_RS02865 (NTU39_02865)  -          545238..545531 (-)    294        WP_257959164.1  hypothetical protein                                    -
NTU39_RS02870 (NTU39_02870)  -          545639..545830 (-)    192        WP_257959165.1  hypothetical protein                                    -
NTU39_RS02885 (NTU39_02885)  -          546456..546986 (-)    531        WP_257959166.1  helix-turn-helix domain-containing protein              -
NTU39_RS02890 (NTU39_02890)  -          547572..548159 (-)    588        WP_257959167.1  HAD family hydrolase                                    -
NTU39_RS02895 (NTU39_02895)  -          548194..549348 (-)    1155       WP_257959168.1  hypothetical protein                                    -

Sequence


Protein


Length: 221 a.a.   Molecular weight: 24298.63 Da   Isoelectric point: 7.4666

>NTDB_id=718986 NTU39_RS02805 WP_308340642.1 530085..530750(-) (comM) [Pandoraea commovens strain LB-19-202-79]
MEITRGSTPRPGEISLAHRGVLFLDELPEFQRRVLEVLREPLELGHVTISRAGGHATFPAAFQLVAAMNPCPCGDLGHPS
RNCRCTSDAVARYRNRLSGPLLDRIDLHVEVPALSAETFAAPATGEHSEVVAMRVRQAQTRQLARQGQLNCGLSNRALEA
QCPLQPEAQEVLHRAVTQHHWSARTYHRVLRVARTIADLAGCDAIDAFHIAEAVQYREAPG

Nucleotide


Length: 666 bp

>NTDB_id=718986 NTU39_RS02805 WP_308340642.1 530085..530750(-) (comM) [Pandoraea commovens strain LB-19-202-79]
GTGGAAATTACACGCGGAAGCACGCCGCGTCCGGGAGAGATCAGTCTGGCGCATCGCGGCGTGTTGTTCCTCGACGAATT
GCCGGAGTTTCAGCGTCGTGTCCTGGAAGTGCTGCGCGAACCGCTGGAACTGGGGCACGTCACGATTTCTCGCGCAGGGG
GACACGCCACGTTTCCCGCAGCCTTTCAACTCGTGGCGGCGATGAACCCGTGTCCATGCGGGGATCTGGGGCACCCGTCG
CGCAACTGCCGATGCACGAGCGACGCGGTGGCCCGTTACCGCAATCGCCTGTCGGGCCCACTGCTCGATCGCATCGACCT
GCATGTGGAAGTACCCGCGCTTAGCGCGGAGACGTTCGCCGCCCCGGCCACCGGCGAGCACAGCGAAGTCGTGGCGATGC
GCGTGAGGCAAGCGCAGACGCGCCAGCTGGCGCGTCAGGGTCAGTTGAACTGCGGTCTGTCGAATCGAGCGCTTGAAGCG
CAATGTCCATTGCAACCGGAAGCGCAGGAGGTTCTGCACCGCGCGGTGACGCAACATCACTGGTCGGCACGCACTTACCA
CCGGGTGTTGCGCGTGGCGCGCACCATTGCGGACCTCGCCGGGTGCGACGCGATCGATGCGTTTCACATCGCCGAGGCGG
TGCAATATCGAGAGGCACCGGGGTGA
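
The nucleotide and protein records can be compared by translating the coding sequence. The sketch below translates only the first line of the nucleotide record with the standard codon table; `translate_cds` is a hypothetical helper, and it assumes the sequence is given 5'→3' on the coding strand. The first codon here is GTG, an alternative bacterial start codon, which is read as methionine at the initiation position.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(seq):
    """Translate a coding sequence; trailing bases that do not fill a codon are ignored."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # Bacterial start codons (ATG, GTG, TTG) are read as Met at position 1.
    if codons and codons[0] in {"ATG", "GTG", "TTG"}:
        residues[0] = "M"
    return "".join(residues)


# First 80 nt of the nucleotide record (26 complete codons plus 2 spare bases).
first_line = (
    "GTGGAAATTACACGCGGAAGCACGCCGCGTCCGGGAGAGATCAGTCTGGCGCATCGCGGCGTGTTGTTCCTCGACGAATT"
)
prefix = translate_cds(first_line)
print(prefix)
```

The printed prefix should match the first 26 residues of the protein record (MEITRGSTPRPGEISLAHRGVLFLDE). Passing the full 666 bp sequence would also translate the terminal TGA as `*`.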


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein       Organism                                         Identities (%)  Coverage (%)  Ha-value
comM          Vibrio campbellii strain DS40M4                  51.643          96.38         0.498
comM          Legionella pneumophila str. Paris                50.23           98.19         0.493
comM          Legionella pneumophila strain ERS1305867         50.23           98.19         0.493
comM          Vibrio cholerae strain A1552                     50.704          96.38         0.489
comM          Glaesserella parasuis strain SC1401              49.533          96.833        0.48
RA0C_RS07335  Riemerella anatipestifer ATCC 11845 = DSM 15868  47.222          97.738        0.462
comM          Haemophilus influenzae Rd KW20                   47.418          96.38         0.457
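
This record does not define the Ha-value. The listed numbers are, however, consistent with the product of identity and coverage expressed as fractions. The sketch below checks that relationship against the table; the formula in the hypothetical `ha_value` function is an assumption inferred from these values, not a documented definition.

```python
# (protein, organism, identities %, coverage %, listed Ha-value), copied from the table.
rows = [
    ("comM", "Vibrio campbellii strain DS40M4", 51.643, 96.38, 0.498),
    ("comM", "Legionella pneumophila str. Paris", 50.23, 98.19, 0.493),
    ("comM", "Legionella pneumophila strain ERS1305867", 50.23, 98.19, 0.493),
    ("comM", "Vibrio cholerae strain A1552", 50.704, 96.38, 0.489),
    ("comM", "Glaesserella parasuis strain SC1401", 49.533, 96.833, 0.48),
    ("RA0C_RS07335", "Riemerella anatipestifer ATCC 11845 = DSM 15868", 47.222, 97.738, 0.462),
    ("comM", "Haemophilus influenzae Rd KW20", 47.418, 96.38, 0.457),
]


def ha_value(identity_pct, coverage_pct):
    """Assumed formula: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100) * (coverage_pct / 100)


for protein, organism, identity, coverage, listed in rows:
    computed = ha_value(identity, coverage)
    print(f"{protein:<13} {organism:<48} computed={computed:.3f} listed={listed}")
```

Each computed value agrees with the listed one to three decimal places, which supports the assumed formula for this table but does not confirm it in general.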