Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   E613_RS32720   Genome accession   NZ_CP020603
Coordinates   6719732..6721054 (+) Length   440 a.a.
NCBI ID   WP_023098906.1    Uniprot ID   -
Organism   Pseudomonas aeruginosa strain E6130952     
Function   required for natural transformation (predicted from homology)
Unclear

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Genomic island   6667475..6719654   6719732..6721054   flank   78
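
The 78 bp distance is consistent with the gap between the last coordinate of the genomic island and the first coordinate of comM. The following is a minimal sketch of that arithmetic, assuming the distance is simply the difference between the nearest boundaries; the function names `parse_range` and `flank_distance` are hypothetical and are not taken from VRprofile2.

```python
def parse_range(coords: str) -> tuple[int, int]:
    """Parse a 'start..end' coordinate string into two integers."""
    start, end = coords.split("..")
    return int(start), int(end)


def flank_distance(mge: str, gene: str) -> int:
    """Gap between a gene and an MGE, as a difference of the nearest
    boundary coordinates (assumed convention); 0 if they overlap."""
    mge_start, mge_end = parse_range(mge)
    gene_start, gene_end = parse_range(gene)
    if gene_start > mge_end:       # gene lies downstream of the MGE
        return gene_start - mge_end
    if gene_end < mge_start:       # gene lies upstream of the MGE
        return mge_start - gene_end
    return 0                       # gene and MGE overlap


# Coordinates from the association summary for comM
print(flank_distance("6667475..6719654", "6719732..6721054"))  # 78
```

Under this assumed convention the value matches the table, which supports reading "flank" as "adjacent to, but outside, the predicted island".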


Gene organization within MGE regions


Location: 6667475..6721054
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  E613_RS34590 (E613_63310) - 6667475..6669277 (+) 1803 WP_023098875.1 hypothetical protein -
  E613_RS32440 (E613_63320) - 6669350..6669577 (-) 228 WP_031630401.1 BPSL0761 family protein -
  E613_RS32445 - 6669648..6670241 (-) 594 WP_031630404.1 DUF6088 family protein -
  E613_RS32450 (E613_63330) - 6670775..6671728 (-) 954 WP_023098876.1 toll/interleukin-1 receptor domain-containing protein -
  E613_RS32455 (E613_63340) - 6671882..6672238 (+) 357 WP_023098877.1 hypothetical protein -
  E613_RS32460 (E613_63350) - 6672456..6673031 (+) 576 WP_235440671.1 DNA adenine methylase -
  E613_RS32465 (E613_63360) - 6673031..6673864 (+) 834 WP_023098879.1 hypothetical protein -
  E613_RS32470 (E613_63370) - 6673799..6674479 (-) 681 WP_023098880.1 nitroreductase family protein -
  E613_RS32475 (E613_63380) - 6674457..6675062 (-) 606 WP_023098881.1 7-cyano-7-deazaguanine synthase -
  E613_RS32480 (E613_63390) - 6675059..6676225 (-) 1167 WP_232342060.1 PfkB family carbohydrate kinase -
  E613_RS34595 - 6676296..6676613 (-) 318 WP_071534605.1 hypothetical protein -
  E613_RS32490 (E613_63410) - 6677418..6677618 (+) 201 WP_023098884.1 hypothetical protein -
  E613_RS32495 (E613_63420) - 6677951..6678343 (+) 393 WP_023098885.1 hypothetical protein -
  E613_RS32500 (E613_63430) - 6678641..6679003 (-) 363 WP_023098886.1 histone-like nucleoid-structuring protein, MvaT/MvaU family -
  E613_RS35160 - 6679222..6679416 (-) 195 WP_071534606.1 AbrB/MazE/SpoVT family DNA-binding domain-containing protein -
  E613_RS32510 (E613_63440) - 6679586..6680068 (+) 483 WP_023098887.1 hypothetical protein -
  E613_RS35165 - 6680075..6680248 (+) 174 Protein_6283 hypothetical protein -
  E613_RS32520 (E613_63450) - 6680850..6681884 (-) 1035 WP_031759186.1 sensor domain-containing diguanylate cyclase -
  E613_RS32525 - 6682211..6682648 (-) 438 WP_175606544.1 VOC family protein -
  E613_RS32530 (E613_63460) - 6682672..6683541 (-) 870 WP_023098889.1 EamA family transporter -
  E613_RS32535 (E613_63470) - 6683554..6684681 (-) 1128 WP_023127453.1 hypothetical protein -
  E613_RS32540 (E613_63480) - 6684708..6685520 (-) 813 WP_023098891.1 DUF3865 domain-containing protein -
  E613_RS32545 (E613_63490) - 6685554..6687020 (-) 1467 WP_023098892.1 B12-binding domain-containing radical SAM protein -
  E613_RS32550 (E613_63500) - 6687305..6688513 (+) 1209 WP_023098893.1 winged helix-turn-helix domain-containing protein -
  E613_RS32555 (E613_63510) - 6688928..6689782 (-) 855 WP_031630415.1 Wadjet anti-phage system protein JetD domain-containing protein -
  E613_RS32560 (E613_63520) - 6690063..6691007 (-) 945 WP_023098895.1 hypothetical protein -
  E613_RS32565 - 6690910..6691689 (-) 780 WP_123809932.1 hypothetical protein -
  E613_RS32570 (E613_63530) - 6691989..6692228 (+) 240 WP_003089107.1 ribbon-helix-helix domain-containing protein -
  E613_RS32575 (E613_63540) - 6692228..6692638 (+) 411 WP_003150546.1 putative toxin-antitoxin system toxin component, PIN family -
  E613_RS32580 (E613_63550) - 6692642..6695635 (+) 2994 WP_003299771.1 Tn3 family transposase -
  E613_RS32585 (E613_63560) - 6695648..6695860 (-) 213 WP_003089113.1 GDCCVxC domain-containing (seleno)protein -
  E613_RS32590 (E613_63570) merP 6695868..6696143 (-) 276 WP_003150552.1 mercury resistance system periplasmic binding protein MerP -
  E613_RS32595 (E613_63580) - 6696156..6696506 (-) 351 WP_003089115.1 mercuric transporter MerT family protein -
  E613_RS32600 (E613_63590) merR 6696581..6696979 (+) 399 WP_003089120.1 Hg(II)-responsive transcriptional regulator -
  E613_RS32605 (E613_63600) - 6697282..6698127 (+) 846 WP_003464995.1 AraC family transcriptional regulator -
  E613_RS32610 (E613_63610) - 6698157..6698675 (-) 519 WP_003464991.1 YkgB family protein -
  E613_RS32615 (E613_63620) - 6699182..6699739 (-) 558 WP_003100847.1 recombinase family protein -
  E613_RS32620 (E613_63630) - 6699733..6700104 (-) 372 WP_003100853.1 hypothetical protein -
  E613_RS32625 (E613_63640) - 6700101..6700601 (-) 501 WP_003100856.1 hypothetical protein -
  E613_RS32630 (E613_63650) - 6700598..6700924 (-) 327 WP_003100858.1 hypothetical protein -
  E613_RS32635 - 6701179..6701535 (-) 357 WP_003465043.1 cupin domain-containing protein -
  E613_RS32640 (E613_63670) - 6701772..6702158 (-) 387 WP_003100872.1 DUF86 domain-containing protein -
  E613_RS32645 (E613_63680) - 6702155..6702445 (-) 291 WP_001247892.1 nucleotidyltransferase family protein -
  E613_RS35170 (E613_63690) - 6702734..6703504 (-) 771 WP_232342061.1 TolC family protein -
  E613_RS35175 (E613_63700) - 6703494..6704060 (-) 567 WP_232342063.1 TolC family protein -
  E613_RS32655 (E613_63710) - 6704245..6705375 (-) 1131 WP_021250127.1 ABC transporter permease -
  E613_RS32660 (E613_63720) - 6705372..6706532 (-) 1161 WP_021250126.1 ABC transporter permease -
  E613_RS32665 (E613_63730) - 6706532..6708301 (-) 1770 WP_023098898.1 ATP-binding cassette domain-containing protein -
  E613_RS32670 (E613_63740) - 6708326..6709318 (-) 993 WP_021250124.1 efflux RND transporter periplasmic adaptor subunit -
  E613_RS32675 - 6709424..6710080 (+) 657 WP_021250123.1 TetR/AcrR family transcriptional regulator -
  E613_RS32680 (E613_63760) - 6710205..6710828 (-) 624 WP_011222094.1 recombinase family protein -
  E613_RS32685 (E613_63770) - 6710890..6712107 (-) 1218 WP_023657929.1 TniQ family protein -
  E613_RS32690 (E613_63780) - 6712104..6713012 (-) 909 WP_023098901.1 TniB family NTP-binding protein -
  E613_RS32695 (E613_63790) - 6713015..6714694 (-) 1680 WP_023098902.1 Mu transposase C-terminal domain-containing protein -
  E613_RS35405 (E613_63800) - 6714928..6716088 (-) 1161 WP_077575469.1 TniQ family protein -
  E613_RS32705 (E613_63810) - 6716107..6717012 (-) 906 WP_023098904.1 TniB family NTP-binding protein -
  E613_RS32710 - 6717019..6718953 (-) 1935 WP_023098905.1 Mu transposase C-terminal domain-containing protein -
  E613_RS32715 (E613_63830) - 6718950..6719738 (-) 789 WP_049878009.1 heteromeric transposase endonuclease subunit TnsA -
  E613_RS32720 (E613_63840) comM 6719732..6721054 (+) 1323 WP_023098906.1 YifB family Mg chelatase-like AAA ATPase Machinery gene

Sequence


Protein


Length: 440 a.a.   Molecular weight: 47310.35 Da   Isoelectric point: 7.7787

>NTDB_id=224893 E613_RS32720 WP_023098906.1 6719732..6721054(+) (comM) [Pseudomonas aeruginosa strain E6130952]
MSFANTILVFLIYHNISDKDGGRFDLAIALGILAASGQLPGTTLDGLECLGELALSGAIRPVRGVLPAALAARDARRVLV
VPKENAEEASLASGLTVFAVDHLLEIAGHLSGQAPLLPYQARGLLRAPFPYPDLAEVQGQAAAKRALLVAAAGAHNLLLS
GPPGTGKTLLASRLPGLLPALDEDEALEVAAIHSVASHVPLRHWPQRPFRQPHHSASAPALVGGGSRPQPGEITLAHQGV
LFLDELPEFERKVLEVLREPLESGEIVIARANGRVRFPARFQLVAAMNPCPCGYLGDPSGRCRCTPEQVQRYRGKLSGPL
LDRIDLHVSVLRESTSLQPGHGETATAEISERVGAARQRQLARQGCANAHLDLQAMHRNCALAEADRRWLEAAGERLELS
LRALHRILKVARTLADLERIDAIERRHLAEALQYRATTST

Nucleotide


Length: 1323 bp

>NTDB_id=224893 E613_RS32720 WP_023098906.1 6719732..6721054(+) (comM) [Pseudomonas aeruginosa strain E6130952]
TTGTCATTCGCAAATACCATACTTGTCTTTCTCATATATCATAATATCTCAGACAAGGACGGCGGTCGCTTCGACCTGGC
CATCGCACTCGGCATCCTCGCCGCCAGCGGCCAGTTGCCCGGCACCACCCTCGACGGCCTGGAGTGCCTTGGCGAACTGG
CCCTGTCCGGGGCGATCCGGCCAGTGCGAGGCGTATTGCCGGCCGCGCTGGCGGCGCGCGACGCAAGGCGCGTTCTGGTG
GTACCGAAGGAAAATGCCGAAGAGGCCAGCCTGGCCAGCGGGCTGACGGTGTTCGCCGTGGACCACCTGCTGGAGATCGC
CGGACACCTCTCCGGCCAGGCCCCGCTGCTGCCCTACCAGGCCCGCGGCCTGCTCCGCGCGCCCTTCCCTTATCCAGACC
TGGCCGAGGTCCAGGGCCAGGCCGCCGCCAAGCGCGCCCTGCTGGTGGCCGCCGCCGGCGCGCACAACCTGTTGCTCAGC
GGCCCGCCGGGCACCGGCAAGACCCTCCTGGCCAGCCGCCTGCCCGGCCTGCTGCCGGCGCTCGACGAGGACGAGGCCCT
GGAGGTCGCAGCGATCCATTCGGTGGCCAGCCACGTCCCCCTCAGGCACTGGCCGCAGCGACCGTTCCGCCAGCCGCACC
ACTCCGCCTCCGCGCCGGCCCTGGTCGGCGGCGGCAGCCGCCCGCAGCCGGGCGAGATCACCCTGGCGCACCAGGGCGTG
CTGTTCCTCGACGAACTGCCGGAGTTCGAGCGCAAGGTCCTGGAGGTCCTGCGCGAGCCGCTGGAAAGCGGCGAGATCGT
CATTGCCCGGGCCAACGGCCGGGTACGTTTCCCGGCGCGCTTCCAACTGGTGGCGGCGATGAATCCCTGTCCCTGTGGCT
ACCTCGGCGATCCCAGCGGCCGCTGCCGCTGCACCCCGGAACAGGTCCAGCGCTACCGGGGCAAGCTGTCCGGACCGCTG
CTCGATCGCATCGACCTGCACGTCAGCGTGCTCCGCGAAAGCACCAGCCTGCAGCCAGGACACGGCGAAACCGCTACCGC
CGAGATCAGCGAACGGGTTGGCGCCGCACGGCAACGGCAACTGGCCCGCCAGGGCTGCGCCAATGCCCATCTCGACCTCC
AGGCGATGCACCGCAATTGTGCACTCGCCGAAGCGGACCGCCGCTGGCTGGAGGCTGCCGGAGAGCGCCTGGAACTTTCC
TTGCGCGCCTTGCATCGCATACTCAAGGTGGCCCGGACGCTGGCCGACCTGGAGCGCATCGATGCCATCGAACGCCGGCA
CCTGGCGGAAGCCCTGCAGTATCGGGCAACGACCTCCACGTGA
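
The reported lengths agree with one another: the coordinates 6719732..6721054 span 1323 bp, which is 441 codons, i.e. 440 amino acids plus the terminal stop codon (TGA). A short sketch of that check follows; the helper name `expected_protein_length` is hypothetical, and the code assumes a single uninterrupted coding sequence that ends in a stop codon.

```python
def expected_protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS that includes its stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Coordinates are 1-based and inclusive, so add 1 to the difference.
start, end = 6719732, 6721054
cds_length = end - start + 1

print(cds_length)                           # 1323
print(expected_protein_length(cds_length))  # 440
```

The nucleotide record begins with TTG rather than ATG; TTG is an accepted alternative start codon in bacteria and is read as methionine at the initiation position, which matches the leading M of the protein sequence.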


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comM   Haemophilus influenzae Rd KW20   53.037   97.273   0.516
comM   Vibrio cholerae strain A1552   53.682   95.682   0.514
comM   Vibrio campbellii strain DS40M4   53.444   95.682   0.511
comM   Glaesserella parasuis strain SC1401   52.225   97.045   0.507
RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   47.005   98.636   0.464
comM   Legionella pneumophila str. Paris   47.442   97.727   0.464
comM   Legionella pneumophila strain ERS1305867   47.442   97.727   0.464
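
The Ha-values listed for the similar proteins are numerically consistent with the product of identity and coverage, each expressed as a fraction. That relationship is inferred from the listed figures and is not stated by the source, so the sketch below should be read as an assumption; the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100), 3 decimals."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (protein, organism, identity %, coverage %, listed Ha-value)
hits = [
    ("comM", "Haemophilus influenzae Rd KW20", 53.037, 97.273, 0.516),
    ("comM", "Vibrio cholerae strain A1552", 53.682, 95.682, 0.514),
    ("comM", "Vibrio campbellii strain DS40M4", 53.444, 95.682, 0.511),
    ("comM", "Glaesserella parasuis strain SC1401", 52.225, 97.045, 0.507),
    ("RA0C_RS07335", "Riemerella anatipestifer ATCC 11845 = DSM 15868",
     47.005, 98.636, 0.464),
    ("comM", "Legionella pneumophila str. Paris", 47.442, 97.727, 0.464),
]

for protein, organism, identity, coverage, listed in hits:
    computed = ha_value(identity, coverage)
    print(f"{protein}\t{organism}\t{computed}\t{listed}")
```

Under this assumption the computed values reproduce the listed ones, so a hit is ranked higher when it is both more similar and aligned over more of the query.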


Multiple sequence alignment