Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   ACJA3S_RS24990 Genome accession   NZ_CP174199
Coordinates   5424566..5426056 (+) Length   496 a.a.
NCBI ID   WP_406820348.1    Uniprot ID   -
Organism   Pseudomonas sp. KnCO4     
Function   require for natural transformation (predicted from homology)   
Unclear

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 5426553..5442966 5424566..5426056 flank 497


Gene organization within MGE regions


Location: 5424566..5442966
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACJA3S_RS24990 (ACJA3S_24990) comM 5424566..5426056 (+) 1491 WP_406820348.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ACJA3S_RS24995 (ACJA3S_24995) - 5426553..5427023 (-) 471 WP_406820349.1 hypothetical protein -
  ACJA3S_RS25000 (ACJA3S_25000) - 5427690..5428958 (+) 1269 WP_406820350.1 hypothetical protein -
  ACJA3S_RS25005 (ACJA3S_25005) - 5429073..5430263 (-) 1191 WP_406820351.1 hypothetical protein -
  ACJA3S_RS25010 (ACJA3S_25010) - 5430260..5431237 (-) 978 WP_406820352.1 S1 family serine peptidase -
  ACJA3S_RS25015 (ACJA3S_25015) - 5431234..5432904 (-) 1671 WP_406820353.1 caspase family protein -
  ACJA3S_RS25020 (ACJA3S_25020) - 5433236..5433409 (-) 174 Protein_4904 nucleoid-associated protein -
  ACJA3S_RS25025 (ACJA3S_25025) - 5434004..5434285 (+) 282 WP_050703256.1 type II toxin-antitoxin system Phd/YefM family antitoxin -
  ACJA3S_RS25030 (ACJA3S_25030) - 5434254..5434559 (+) 306 WP_406820354.1 Txe/YoeB family addiction module toxin -
  ACJA3S_RS25035 (ACJA3S_25035) - 5434956..5435153 (+) 198 WP_152993855.1 hypothetical protein -
  ACJA3S_RS25040 (ACJA3S_25040) - 5435373..5436650 (-) 1278 WP_406820355.1 hypothetical protein -
  ACJA3S_RS25045 (ACJA3S_25045) - 5437257..5438060 (+) 804 WP_406820356.1 hypothetical protein -
  ACJA3S_RS25050 (ACJA3S_25050) - 5438742..5441156 (-) 2415 WP_406820357.1 S8 family peptidase -
  ACJA3S_RS25055 (ACJA3S_25055) - 5441181..5442164 (-) 984 WP_050703253.1 AAA family ATPase -

Sequence


Protein


Download         Length: 496 a.a.        Molecular weight: 52750.52 Da        Isoelectric Point: 7.9061

>NTDB_id=1074974 ACJA3S_RS24990 WP_406820348.1 5424566..5426056(+) (comM) [Pseudomonas sp. KnCO4]
MSLALVHSRAQVGVQAPAVSVETHLANGLPHLTLVGLPETTVKESKDRVRSAIVNSGLNYPPRRITQNLAPADLPKDGGR
YDLAIALGILAADGQVPIAPLTELECLGELALSGKLRPVQGVLPAALAARDAGRALVVPRENAEEASLAGGLVVYAVGHL
LELVAHLNGQVPLPPYAANGLILQQRPYPDLSEVQGQLAAKRALLLAAAGAHNLLFTGPPGTGKTLLASRLPGLLPPLDE
HEALEVAAIRSVSGHTPLSSWPQRPFRHPHHSASGPALVGGGSRPQPGEITLAHHGVLFLDELPEFERRVLEVLREPLES
GEIVIARARDKVRFPARFQLVAAMNPCPCGYLGDPTGRCRCSTEQIARYRNKLSGPLLDRIDLHLTVARESTTLNNQPCG
ETSADVAAKVAEARDVQQKRQGCANAFLDLEGLRRNCGLAAADQAWLESACERLTLSLRAAHRLLKVARTLADLDGSQAI
GRAHLAEALQYRPGSS

Nucleotide


Download         Length: 1491 bp        

>NTDB_id=1074974 ACJA3S_RS24990 WP_406820348.1 5424566..5426056(+) (comM) [Pseudomonas sp. KnCO4]
ATGTCCCTAGCCCTCGTCCATAGCCGCGCCCAGGTGGGCGTACAGGCCCCAGCGGTCAGCGTCGAAACCCACCTGGCCAA
TGGCTTGCCCCATCTCACCCTGGTCGGCCTGCCGGAAACCACGGTCAAGGAAAGCAAGGACCGGGTGCGCAGCGCCATCG
TCAATTCCGGGCTGAACTACCCGCCACGGCGCATCACCCAGAACCTCGCACCCGCCGACCTGCCCAAGGATGGCGGGCGT
TACGACCTGGCCATCGCCCTGGGCATCCTGGCTGCCGATGGCCAGGTACCAATCGCTCCGCTAACCGAACTTGAATGCCT
GGGTGAACTGGCTTTGTCTGGCAAGCTGCGCCCGGTCCAGGGCGTGCTGCCCGCAGCGCTGGCAGCACGCGACGCAGGCA
GGGCGCTGGTGGTGCCGCGGGAAAACGCCGAGGAAGCCAGCCTGGCTGGCGGGCTGGTGGTGTATGCGGTGGGGCATCTG
CTGGAACTGGTCGCCCACCTGAACGGCCAGGTACCACTGCCGCCCTATGCCGCCAACGGCCTGATACTGCAGCAACGCCC
TTACCCGGACCTCAGCGAGGTGCAAGGCCAACTGGCCGCCAAGCGTGCATTGCTGCTGGCCGCGGCCGGGGCGCATAACC
TGTTGTTCACCGGGCCACCCGGCACCGGCAAGACCTTGCTCGCCAGCCGCCTGCCGGGGCTGCTGCCGCCGCTGGACGAG
CACGAGGCGCTGGAAGTGGCTGCGATCCGCTCGGTGAGTGGCCATACACCGCTGAGCAGTTGGCCGCAGCGGCCCTTTCG
CCATCCGCACCACTCGGCCTCCGGCCCGGCGTTGGTCGGTGGCGGCAGCCGACCGCAGCCGGGCGAAATCACCCTTGCCC
ACCATGGTGTGCTGTTTCTGGATGAGTTGCCGGAATTCGAGCGGCGGGTACTGGAGGTGCTGCGCGAGCCCCTGGAATCC
GGCGAGATCGTGATTGCCCGGGCCCGCGACAAGGTGCGCTTCCCCGCCCGGTTCCAGTTGGTGGCGGCAATGAATCCGTG
CCCTTGCGGCTACCTGGGGGATCCCACTGGGCGCTGTCGCTGCAGCACCGAGCAGATCGCGCGGTACCGCAACAAGCTGT
CCGGGCCGTTGCTGGACCGTATCGACCTGCACCTGACCGTGGCCCGCGAGAGCACCACGCTGAATAACCAGCCTTGTGGT
GAAACCAGTGCCGACGTCGCCGCCAAGGTTGCCGAGGCACGGGATGTCCAGCAAAAACGGCAGGGATGCGCCAATGCGTT
TCTCGACCTTGAGGGGCTGCGCCGCAATTGCGGACTGGCAGCGGCAGACCAGGCCTGGCTGGAGAGTGCGTGTGAACGGC
TGACCCTGTCGTTGCGCGCGGCGCACCGCTTGCTGAAGGTGGCGCGAACCCTGGCCGATCTGGATGGTAGCCAGGCAATT
GGCCGGGCGCACCTGGCCGAGGCCCTGCAGTACCGGCCGGGGAGCAGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

55.6

100

0.56

  comM Vibrio campbellii strain DS40M4

55.758

99.798

0.556

  comM Vibrio cholerae strain A1552

55.354

99.798

0.552

  comM Glaesserella parasuis strain SC1401

53.892

100

0.544

  comM Legionella pneumophila str. Paris

49.194

100

0.492

  comM Legionella pneumophila strain ERS1305867

49.194

100

0.492

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.117

100

0.478


Multiple sequence alignment