Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   RRX38_RS16920 Genome accession   NZ_CP135971
Coordinates   3913884..3915377 (-) Length   497 a.a.
NCBI ID   WP_315960005.1    Uniprot ID   A0AA96U651
Organism   Pseudomonas sp. DTU_2021_1001937_2_SI_NGA_ILE_001     
Function   ssDNA binding (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 3905112..3919076 3913884..3915377 within 0


Gene organization within MGE regions


Location: 3905112..3919076
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  RRX38_RS16870 (RRX38_16870) - 3905112..3905435 (+) 324 WP_315959996.1 helix-turn-helix domain-containing protein -
  RRX38_RS16875 (RRX38_16875) - 3905435..3905653 (+) 219 WP_315959997.1 hypothetical protein -
  RRX38_RS16880 (RRX38_16880) - 3905836..3906177 (-) 342 Protein_3333 MerR family DNA-binding transcriptional regulator -
  RRX38_RS16885 (RRX38_16885) - 3906262..3906684 (+) 423 WP_315959998.1 hypothetical protein -
  RRX38_RS16890 (RRX38_16890) - 3906815..3907723 (+) 909 WP_315959999.1 LysR family transcriptional regulator -
  RRX38_RS16895 (RRX38_16895) - 3907831..3909021 (+) 1191 WP_315960000.1 CaiB/BaiF CoA-transferase family protein -
  RRX38_RS16900 (RRX38_16900) - 3909018..3909953 (+) 936 WP_315960001.1 hydroxymethylglutaryl-CoA lyase -
  RRX38_RS16905 (RRX38_16905) - 3910037..3911368 (+) 1332 WP_315960002.1 MFS transporter -
  RRX38_RS16910 (RRX38_16910) leuC 3911587..3912990 (+) 1404 WP_315960003.1 3-isopropylmalate dehydratase large subunit -
  RRX38_RS16915 (RRX38_16915) leuD 3912997..3913650 (+) 654 WP_315960004.1 3-isopropylmalate dehydratase small subunit -
  RRX38_RS16920 (RRX38_16920) comM 3913884..3915377 (-) 1494 WP_315960005.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  RRX38_RS16925 (RRX38_16925) - 3915750..3916553 (+) 804 WP_315960006.1 glycine betaine ABC transporter substrate-binding protein -
  RRX38_RS16930 (RRX38_16930) - 3916562..3917464 (-) 903 WP_410524915.1 LysR substrate-binding domain-containing protein -
  RRX38_RS16935 (RRX38_16935) - 3917560..3918645 (+) 1086 WP_315960008.1 L-dopachrome tautomerase-related protein -
  RRX38_RS16940 (RRX38_16940) - 3918726..3919076 (-) 351 WP_315960009.1 putative quinol monooxygenase -

Sequence


Protein


Download         Length: 497 a.a.        Molecular weight: 52955.84 Da        Isoelectric Point: 8.4287

>NTDB_id=887975 RRX38_RS16920 WP_315960005.1 3913884..3915377(-) (comM) [Pseudomonas sp. DTU_2021_1001937_2_SI_NGA_ILE_001]
MSLAIVHSRAQVGVQAPAVTVETHLANGLPALTLVGLPEGAVKESKDRVRSAIVNSGLKFPDRRITLNLAPADLPKDGGR
FDLAIALGLLGANGQIPLTALQDIECLGELALSGALRPVQGVLPAALAARQAQRTLIVPAANAEEACLASGLTVIAVNHL
RELVAHFNGQTPMTPYRANGLLPQAQPYPDLSEVQGQLAAKRALVVAAAGGHNLLFSGPPGTGKTLLASRLPGLLPPLDE
QEALEVAAIHSVASQVPLSSWPLRPFRQPHHSASGPALVGGGSRPQPGEITLAHHGVLFLDELPEFDRRVLEVLREPLES
GHIVVSRARDRVRFPARFQLVAAMNPCPCGYLGEPSGRCRCSTEQIQRYRNKLSGPLLDRIDLHLTVAREATSLSALPGG
GETTASAAAKVAEARELQQRRQGCANAFLDLPGLRQHCKLSPTDESWLETACERLNLSLRAAHRLLKVARTLADLERQAE
ISRGHLAEALQYRPASQ

Nucleotide


Download         Length: 1494 bp        

>NTDB_id=887975 RRX38_RS16920 WP_315960005.1 3913884..3915377(-) (comM) [Pseudomonas sp. DTU_2021_1001937_2_SI_NGA_ILE_001]
ATGTCCCTCGCCATCGTCCACAGCCGCGCCCAGGTCGGGGTGCAGGCGCCTGCCGTCACGGTCGAAACCCACCTCGCCAA
CGGCCTGCCGGCCCTCACTCTGGTGGGCCTGCCTGAAGGTGCAGTGAAGGAAAGCAAAGACCGGGTGCGCAGTGCCATCG
TCAATTCCGGGCTGAAATTCCCCGACCGGCGCATCACCCTCAACCTGGCGCCCGCCGACCTGCCCAAGGACGGCGGGCGC
TTCGACCTGGCGATTGCCTTGGGCCTGCTTGGCGCCAATGGGCAGATTCCGCTGACGGCCCTGCAGGACATCGAATGCCT
GGGCGAGCTGGCACTGTCTGGCGCGTTGCGCCCGGTGCAGGGCGTGTTGCCGGCGGCGCTGGCGGCACGCCAGGCACAGC
GCACCTTGATCGTACCCGCCGCCAATGCCGAAGAAGCCTGCCTGGCGTCGGGGCTGACGGTGATTGCGGTGAATCACCTG
CGCGAGCTGGTCGCCCATTTCAACGGCCAGACCCCCATGACGCCCTACCGCGCCAACGGCCTGCTGCCCCAGGCCCAACC
CTACCCGGACCTCAGCGAAGTGCAGGGCCAGCTGGCGGCCAAGCGGGCGCTGGTGGTGGCAGCAGCCGGAGGCCACAACC
TGCTGTTCAGCGGCCCGCCGGGCACCGGCAAGACCCTGCTGGCCAGCCGCCTGCCCGGCCTGCTGCCGCCGCTGGATGAA
CAGGAAGCGCTGGAAGTGGCGGCAATTCATTCGGTCGCCAGCCAGGTGCCGCTGAGCAGTTGGCCGCTGCGACCGTTCCG
TCAACCGCACCATTCGGCGTCCGGGCCGGCGCTGGTGGGGGGTGGCAGCCGCCCGCAACCCGGAGAAATTACCCTGGCGC
ATCACGGCGTATTGTTTCTCGATGAGTTGCCGGAGTTCGACCGTCGGGTACTGGAAGTGCTGCGCGAGCCGCTGGAGTCG
GGTCATATCGTGGTGTCGCGGGCCCGCGACCGGGTGCGCTTTCCGGCGCGGTTCCAGCTGGTGGCGGCGATGAACCCCTG
CCCTTGTGGCTATCTCGGCGAGCCCAGTGGTCGCTGCCGCTGCTCCACCGAACAGATTCAACGCTATCGCAACAAGCTGT
CCGGGCCGCTGCTGGACCGCATCGACCTGCACCTCACCGTGGCCCGCGAAGCCACCAGCCTTTCGGCCTTGCCGGGCGGT
GGCGAAACCACCGCCAGCGCCGCCGCCAAGGTCGCCGAAGCCCGCGAGCTGCAACAGCGCCGCCAAGGCTGTGCCAACGC
CTTTCTCGACCTGCCCGGCCTGCGCCAGCACTGCAAGTTATCCCCAACCGACGAAAGCTGGCTGGAAACCGCGTGCGAAC
GCCTCAACCTGTCGCTGCGTGCCGCACACCGCTTGCTCAAGGTGGCGCGCACCCTGGCAGATCTTGAGCGGCAGGCAGAG
ATAAGTCGCGGGCACCTGGCCGAGGCGTTGCAGTATCGACCGGCGAGTCAGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

54.183

100

0.547

  comM Vibrio campbellii strain DS40M4

54.949

99.598

0.547

  comM Glaesserella parasuis strain SC1401

53.242

100

0.545

  comM Haemophilus influenzae Rd KW20

53.892

100

0.543

  comM Legionella pneumophila str. Paris

50.602

100

0.507

  comM Legionella pneumophila strain ERS1305867

50.602

100

0.507

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.012

100

0.475