Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: BHL62_RS22275
Genome accession: NZ_CP019228
Coordinates: 4765030..4766550 (+)
Length: 506 a.a.
NCBI ID: WP_075243758.1
UniProt ID: -
Organism: Xanthomonas oryzae pv. oryzae strain IX-221
Function: ssDNA binding (predicted from homology); DNA processing

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicts in the genome, as detailed below.

Gene-MGE association summary

MGE type      MGE coordinates   Gene coordinates  Relative position  Distance (bp)
IScluster/Tn  4757504..4776048  4765030..4766550  within             0
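The relative position and distance in the summary above can be reproduced from the two coordinate ranges. The following is a minimal sketch, assuming 1-based inclusive coordinates and a distance of 0 whenever the gene lies inside the MGE. The function name `relate` and every label other than "within" are hypothetical and are not VRprofile2 output.

```python
def relate(gene, mge):
    """Return (relative_position, distance_bp) of a gene versus an MGE.

    Both arguments are (start, end) tuples, assumed 1-based and inclusive.
    Only the "within" label appears in the record; the others are illustrative.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "before", m_start - g_end
    if g_start > m_end:
        return "after", g_start - m_end
    return "overlapping", 0


# comM versus the IScluster/Tn region listed in the summary
print(relate((4765030, 4766550), (4757504, 4776048)))  # ('within', 0)
```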


Gene organization within MGE regions


Location: 4757504..4776048
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product  Description
BHL62_RS22225 (BHL62_23905)  -          4757504..4758688 (-)  1185       WP_075243401.1  IS256-like element IS1113 family transposase  -
BHL62_RS22230 (BHL62_23910)  -          4758742..4758864 (-)  123        WP_260815732.1  hypothetical protein  -
BHL62_RS22235 (BHL62_23915)  -          4759151..4759387 (+)  237        Protein_4256    hypothetical protein  -
BHL62_RS22245 (BHL62_23925)  -          4759513..4761498 (-)  1986       WP_011407279.1  beta-N-acetylglucosaminidase domain-containing protein  -
BHL62_RS22250 (BHL62_23935)  -          4762052..4762798 (+)  747        Protein_4258    IS701 family transposase  -
BHL62_RS22255                -          4763333..4763334 (-)  2          WP_109181977.1  IS5 family transposase  -
BHL62_RS22260 (BHL62_23950)  -          4763648..4763935 (+)  288        Protein_4260    IS701 family transposase  -
BHL62_RS22265 (BHL62_23960)  -          4764207..4764581 (-)  375        WP_269468962.1  P-II family nitrogen regulator  -
BHL62_RS22270 (BHL62_23965)  -          4764735..4765013 (+)  279        WP_075243756.1  accessory factor UbiK family protein  -
BHL62_RS22275 (BHL62_23970)  comM       4765030..4766550 (+)  1521       WP_075243758.1  YifB family Mg chelatase-like AAA ATPase  Machinery gene
BHL62_RS22280 (BHL62_23975)  -          4766707..4768026 (-)  1320       WP_115862276.1  IS701-like element ISXo15 family transposase  -
BHL62_RS22285 (BHL62_23980)  -          4768115..4769329 (-)  1215       WP_151420414.1  IS4 family transposase  -
BHL62_RS22290 (BHL62_23990)  -          4769559..4770527 (+)  969        WP_260815733.1  IS5-like element ISXo1 family transposase  -
BHL62_RS22295 (BHL62_23995)  -          4770740..4772059 (-)  1320       WP_260815734.1  IS701 family transposase  -
BHL62_RS22300 (BHL62_24000)  -          4772130..4772408 (+)  279        Protein_4268    transposase  -
BHL62_RS22305                -          4772483..4773245 (+)  763        WP_109181916.1  IS5 family transposase  -
BHL62_RS22310 (BHL62_24015)  -          4773281..4774237 (-)  957        WP_151420365.1  IS30-like element IS1112 family transposase  -
BHL62_RS22315 (BHL62_24020)  -          4774326..4774994 (+)  669        Protein_4271    IS630 family transposase  -
BHL62_RS22320 (BHL62_24025)  -          4775083..4776048 (+)  966        WP_151420364.1  IS1595-like element ISXo5 family transposase  -
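The Size (bp) column in the listing above follows from the coordinates if both ends are counted, which matches the comM row (4765030..4766550 gives 1521 bp). A small sketch of that arithmetic, assuming 1-based inclusive coordinates in the `start..end (strand)` form used here; the helper name `feature_size` is hypothetical.

```python
import re


def feature_size(coords):
    """Size in bp of a 'start..end (strand)' string, counting both ends."""
    m = re.match(r"(\d+)\.\.(\d+)", coords.strip())
    if m is None:
        raise ValueError(f"unrecognized coordinates: {coords!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


size = feature_size("4765030..4766550 (+)")
# Codon count minus the stop codon gives the protein length in residues.
print(size, size // 3 - 1)  # 1521 506
```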

Sequence


Protein


Length: 506 a.a.        Molecular weight: 54018.82 Da        Isoelectric point: 8.4419

>NTDB_id=214255 BHL62_RS22275 WP_075243758.1 4765030..4766550(+) (comM) [Xanthomonas oryzae pv. oryzae strain IX-221]
MSLALVHSRARVGVHAPEVRVEVHLSGGLPSTQMVGLPEAAVRESRERVRAALLCAQFEFPARRITINLAPADLPKEGGR
FDLPIALGILAASGQIDRQALGDYEFLGELALTGELRGIDGVLPAALAAAQAGRRLIVPLANGAEAAIAGHVEAFTARTL
LEVCATLNGSQKAPAAELAVQALGARALPDMADVRGQPHARRALEIAAAGGHHLLLVGSPGCGKTLLASRLPGLLPEASE
AEALETAAITSISGRGLDLARWRQRPYRAPHHTASAVALVGGGTHPRPGEISLAHNGVLFLDELPEWQRQTLEVLREPLE
SGLVTISRAARSVDFPARFQLVAAMNPCPCGWAGDGSGRCRCSSDSIRRYRSRISGPLLDRIDLHVEVPRLPPQALRSGN
LGEDSASVRCRVVAARQRQLARGALPNAQLDQPDTDRHCRLQHDDQVLLERAIEHLQLSARSMHRILRVARTIADLHDSA
DIATRHLTEAIGYRKLDRALSAASAA
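The FASTA header of this record packs several fields into one line. The sketch below splits it into named parts; the layout is inferred only from the headers shown in this entry, so the regular expression and the field names (`ntdb_id`, `locus_tag`, and so on) are assumptions rather than a documented format.

```python
import re

# Layout inferred from this record's headers (an assumption):
# >NTDB_id=<id> <locus tag> <protein ID> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r">NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>.+)\]"
)


def parse_header(line):
    """Return the header fields as a dict, with coordinates as integers."""
    m = HEADER_RE.match(line.strip())
    if m is None:
        raise ValueError("header does not match the assumed layout")
    fields = m.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=214255 BHL62_RS22275 WP_075243758.1 4765030..4766550(+) "
          "(comM) [Xanthomonas oryzae pv. oryzae strain IX-221]")
info = parse_header(header)
print(info["gene"], info["strand"], info["end"] - info["start"] + 1)  # comM + 1521
```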

Nucleotide


Length: 1521 bp

>NTDB_id=214255 BHL62_RS22275 WP_075243758.1 4765030..4766550(+) (comM) [Xanthomonas oryzae pv. oryzae strain IX-221]
ATGAGTCTGGCGTTGGTGCACAGCCGTGCCCGCGTGGGGGTGCACGCGCCCGAAGTTCGGGTGGAAGTGCATCTCTCCGG
CGGTCTCCCCTCCACCCAGATGGTGGGCCTGCCCGAAGCGGCAGTGCGCGAATCGCGCGAACGCGTACGTGCCGCGCTGC
TGTGCGCGCAGTTCGAATTCCCCGCACGGCGCATTACCATCAATCTGGCGCCGGCCGATCTGCCTAAGGAAGGCGGACGG
TTCGATTTGCCGATCGCCCTCGGCATCCTGGCTGCCAGCGGGCAAATCGACCGCCAGGCCCTGGGCGATTACGAATTCCT
CGGCGAACTTGCGCTTACCGGCGAGCTGCGCGGCATCGATGGCGTGCTGCCCGCGGCGCTGGCGGCCGCGCAGGCAGGGC
GACGGCTGATCGTGCCGCTTGCCAACGGTGCCGAAGCGGCGATTGCCGGGCACGTCGAAGCCTTCACCGCACGCACGCTG
CTTGAGGTGTGCGCGACGCTCAACGGCAGCCAGAAAGCACCTGCCGCCGAATTGGCGGTGCAGGCGCTCGGGGCCCGTGC
CCTGCCCGACATGGCCGATGTGCGCGGGCAACCGCACGCCCGCCGCGCGCTGGAGATCGCCGCTGCCGGTGGGCATCATC
TCCTTCTGGTCGGCAGCCCTGGCTGCGGCAAGACCCTGTTGGCCTCGCGCCTGCCTGGGCTATTGCCCGAAGCCAGCGAA
GCCGAAGCGCTGGAAACCGCGGCCATTACCTCCATCAGCGGCCGCGGACTGGATCTGGCCCGCTGGCGGCAGCGGCCCTA
CCGGGCTCCTCACCACACCGCCAGCGCAGTCGCCTTGGTTGGCGGTGGCACGCATCCGCGCCCCGGCGAAATCTCGCTGG
CCCACAACGGTGTCTTGTTTCTGGACGAGTTGCCCGAGTGGCAACGGCAGACACTCGAGGTGCTGCGCGAACCGTTGGAA
TCGGGCCTGGTCACGATCTCACGCGCGGCGCGCAGCGTCGACTTCCCTGCACGCTTCCAGCTGGTCGCTGCGATGAACCC
ATGCCCATGCGGTTGGGCAGGCGACGGCAGCGGGCGCTGCCGCTGCAGCAGCGACAGCATCCGCCGCTATCGCAGCCGTA
TCTCCGGCCCCTTGCTGGACCGCATCGATCTGCATGTCGAAGTGCCACGCCTACCACCGCAAGCGCTGCGCAGCGGCAAC
CTCGGCGAGGACAGCGCCAGCGTGCGTTGCCGCGTGGTCGCCGCGCGGCAACGCCAGCTTGCGCGCGGAGCGCTGCCCAA
TGCGCAACTGGATCAGCCCGACACCGACCGCCATTGCCGTCTGCAGCACGACGACCAAGTGCTGCTCGAGCGCGCTATCG
AACACCTGCAGCTGTCTGCACGCTCGATGCATCGCATACTGCGCGTGGCACGCACCATCGCCGATCTCCACGACAGCGCG
GACATCGCCACGCGCCATCTCACCGAAGCGATCGGCTATCGCAAACTGGATCGCGCACTGAGTGCCGCCAGCGCGGCGTA
G
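The nucleotide and protein records above can be cross-checked by translating the coding sequence. This is a minimal sketch using the standard codon assignments (the bacterial table uses the same assignments for these codons); the function name `translate` is hypothetical, and the example input is the first 24 bases of the sequence above.

```python
from itertools import product

# Standard codon assignments, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGTCTGGCGTTGGTGCACAGC"))  # MSLALVHS
```

Applied to the full 1521 bp sequence, the same function should yield 506 residues followed by the stop codon, in line with the stated protein length.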


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein       Organism                                         Identities (%)  Coverage (%)  Ha-value
comM          Vibrio cholerae strain A1552                     56.719          100           0.567
comM          Haemophilus influenzae Rd KW20                   55.186          100           0.557
comM          Glaesserella parasuis strain SC1401              54.241          100           0.543
comM          Vibrio campbellii strain DS40M4                  54.15           100           0.542
comM          Legionella pneumophila str. Paris                51.911          98.221        0.51
comM          Legionella pneumophila strain ERS1305867         51.911          98.221        0.51
RA0C_RS07335  Riemerella anatipestifer ATCC 11845 = DSM 15868  46.063          100           0.462


Multiple sequence alignment