Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   APZ20_RS00930 Genome accession   NZ_CP012947
Coordinates   203350..204870 (-) Length   506 a.a.
NCBI ID   WP_012443690.1    Uniprot ID   A0A0K0GFT4
Organism   Xanthomonas oryzae pv. oryzae strain PXO83     
Function   ssDNA binding (predicted from homology)   
DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
IScluster/Tn 193074..203256 203350..204870 flank 94


Gene organization within MGE regions


Location: 193074..204870
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  APZ20_RS00885 (APZ20_00865) - 193074..193900 (-) 827 Protein_178 IS3 family transposase -
  APZ20_RS26155 - 194031..194177 (+) 147 Protein_179 transposase -
  APZ20_RS00890 (APZ20_00870) - 194241..195275 (+) 1035 WP_011407587.1 IS630 family transposase -
  APZ20_RS00895 (APZ20_00875) - 195382..196347 (+) 966 Protein_181 IS1595-like element ISXo5 family transposase -
  APZ20_RS00900 (APZ20_00880) - 196354..197355 (+) 1002 Protein_182 IS256-like element IS1113 family transposase -
  APZ20_RS00905 (APZ20_00885) - 197385..197834 (-) 450 Protein_183 IS1595-like element ISXo5 family transposase -
  APZ20_RS23810 imm45 198155..198481 (+) 327 WP_080494036.1 Imm45 family immunity protein -
  APZ20_RS23815 - 198551..198664 (+) 114 WP_012443686.1 hemagglutinin repeat-containing protein -
  APZ20_RS00910 (APZ20_00890) - 198747..200222 (+) 1476 WP_044756212.1 IS5-like element ISXoo5 family transposase -
  APZ20_RS00915 (APZ20_00895) - 200361..201737 (+) 1377 WP_044749647.1 IS5-like element ISXo4 family transposase -
  APZ20_RS23820 - 201880..203256 (+) 1377 Protein_188 IS5 family transposase -
  APZ20_RS00930 (APZ20_00910) comM 203350..204870 (-) 1521 WP_012443690.1 YifB family Mg chelatase-like AAA ATPase Machinery gene

Sequence


Protein


Download         Length: 506 a.a.        Molecular weight: 54081.88 Da        Isoelectric Point: 8.2739

>NTDB_id=158423 APZ20_RS00930 WP_012443690.1 203350..204870(-) (comM) [Xanthomonas oryzae pv. oryzae strain PXO83]
MSLALVHSRARVGVHAPEVRVEVHLSGGLPSTQMVGLPEAAVRESRERVRAALLCAQFEFPARRITINLAPADLPKEGGR
FDLPIALGILAASGQIDRQALGDYEFLGELALTGELRGIDGVLPAALAAAQAGRRLIVPLANGAEAAIAEHVEAFTARTL
LEVCATLNGSQKAPAAELAVQALGARALPDMADVRGQPHARRALEIAAAGGHHLLLVGSPGCGKTLLASRLPGLLPEASE
AEALETAAITSISGRGLDLARWRQRPYRAPHHTASAVALVGGGTHPRPGEISLAHNGVLFLDELPEWQRQTLEVLREPLE
SGLVTISRAARSVDFPARFQLVAAMNPCPCGWAGDGSGRCRCSSDSIRRYRSRISGPLLDRIDLHVEVPRLPPQALRSGN
LGEDSASVRCRVVAARQRQLARGALPNAQLDQPDTDRHCRLQHDDQVLLERAIEHLQLSARSMHRILRVARTIADLQDSA
DIATRHLTEAIGYRKLDRALSAASAA

Nucleotide


Download         Length: 1521 bp        

>NTDB_id=158423 APZ20_RS00930 WP_012443690.1 203350..204870(-) (comM) [Xanthomonas oryzae pv. oryzae strain PXO83]
ATGAGTCTGGCGTTGGTGCACAGCCGTGCCCGCGTGGGGGTGCACGCGCCCGAAGTTCGGGTGGAAGTGCATCTCTCCGG
CGGTCTCCCCTCCACCCAGATGGTGGGCCTGCCCGAAGCGGCAGTGCGCGAATCGCGCGAACGCGTACGTGCCGCGCTGC
TTTGCGCGCAGTTCGAATTCCCCGCACGGCGCATTACCATCAATCTGGCGCCGGCCGATCTGCCTAAGGAAGGCGGACGG
TTCGATTTGCCGATCGCCCTCGGCATCCTGGCTGCCAGCGGGCAAATCGACCGCCAGGCCCTGGGCGATTACGAATTCCT
CGGCGAGCTTGCGCTTACCGGCGAGCTGCGCGGCATCGATGGCGTGCTGCCCGCGGCGCTGGCGGCCGCGCAGGCAGGGC
GACGGCTGATCGTGCCGCTTGCCAACGGTGCCGAAGCGGCGATTGCCGAGCACGTCGAAGCCTTCACCGCACGCACGCTG
CTTGAGGTGTGCGCGACGCTCAACGGCAGCCAGAAAGCACCTGCCGCCGAATTGGCGGTGCAGGCGCTCGGGGCCCGTGC
CCTGCCCGACATGGCCGATGTGCGCGGGCAACCGCACGCCCGCCGCGCGCTGGAGATCGCCGCTGCCGGTGGGCATCATC
TCCTTCTGGTCGGCAGCCCTGGCTGCGGCAAGACCCTGTTGGCCTCGCGCCTGCCTGGGCTATTGCCCGAAGCCAGCGAA
GCCGAAGCGCTGGAAACCGCGGCCATTACCTCCATCAGCGGCCGCGGACTGGATCTGGCCCGCTGGCGGCAGCGGCCCTA
CCGGGCTCCTCACCACACCGCCAGCGCAGTCGCCTTGGTTGGCGGTGGCACGCATCCGCGCCCCGGCGAAATCTCGCTGG
CCCACAACGGTGTCTTGTTTCTGGACGAGTTGCCCGAGTGGCAACGGCAGACACTCGAGGTGCTGCGCGAACCGTTGGAA
TCGGGCCTGGTCACGATCTCACGCGCGGCGCGCAGCGTCGACTTCCCTGCACGCTTCCAGCTGGTCGCTGCGATGAACCC
ATGCCCATGCGGTTGGGCAGGCGACGGCAGCGGGCGCTGCCGCTGCAGCAGCGACAGCATCCGCCGCTATCGCAGCCGTA
TCTCCGGCCCCTTGCTGGACCGCATCGATCTGCATGTCGAAGTGCCACGCCTACCACCGCAAGCGCTGCGCAGCGGCAAC
CTCGGCGAGGACAGCGCCAGCGTGCGTTGCCGCGTGGTCGCCGCGCGGCAACGCCAGCTTGCGCGCGGAGCGCTGCCCAA
TGCGCAACTGGATCAGCCCGACACCGACCGCCATTGCCGTCTGCAGCACGACGACCAAGTGCTGCTCGAGCGCGCCATCG
AACACCTGCAGCTGTCTGCACGCTCGATGCATCGCATACTGCGCGTGGCACGCACCATCGCCGATCTCCAGGACAGCGCG
GACATCGCCACGCGCCATCTCACCGAAGCGATCGGCTATCGCAAACTGGATCGCGCACTGAGTGCCGCCAGCGCGGCGTA
G


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0K0GFT4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

56.719

100

0.567

  comM Haemophilus influenzae Rd KW20

55.382

100

0.559

  comM Glaesserella parasuis strain SC1401

54.241

100

0.543

  comM Vibrio campbellii strain DS40M4

53.953

100

0.54

  comM Legionella pneumophila str. Paris

51.71

98.221

0.508

  comM Legionella pneumophila strain ERS1305867

51.71

98.221

0.508

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.063

100

0.462


Multiple sequence alignment