Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   WMO35_RS04820 Genome accession   NZ_CP150119
Coordinates   1048852..1049910 (+) Length   352 a.a.
NCBI ID   WP_041183432.1    Uniprot ID   -
Organism   Xanthomonas oryzae pv. oryzicola strain MD-Ivory-1-B     
Function   assembly of type IV pilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 1029104..1071914 1048852..1049910 within 0


Gene organization within MGE regions


Location: 1029104..1071914
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  WMO35_RS04730 (WMO35_04730) - 1029319..1031700 (-) 2382 WP_048485368.1 DUF1631 domain-containing protein -
  WMO35_RS04735 (WMO35_04735) hemW 1031867..1033024 (-) 1158 WP_014504451.1 radical SAM family heme chaperone HemW -
  WMO35_RS04740 (WMO35_04740) rdgB 1033040..1033639 (-) 600 WP_024711477.1 RdgB/HAM1 family non-canonical purine NTP pyrophosphatase -
  WMO35_RS04745 (WMO35_04745) - 1033636..1034031 (-) 396 WP_047340130.1 VOC family protein -
  WMO35_RS04750 (WMO35_04750) rph 1034028..1034753 (-) 726 WP_047340129.1 ribonuclease PH -
  WMO35_RS04755 (WMO35_04755) - 1034863..1035723 (+) 861 WP_024711475.1 YicC/YloC family endoribonuclease -
  WMO35_RS04760 (WMO35_04760) gmk 1035839..1036450 (+) 612 WP_011038351.1 guanylate kinase -
  WMO35_RS04770 (WMO35_04770) rpoZ 1036719..1037018 (+) 300 WP_002812428.1 DNA-directed RNA polymerase subunit omega -
  WMO35_RS04775 (WMO35_04775) - 1037147..1039318 (+) 2172 WP_024711474.1 bifunctional (p)ppGpp synthetase/guanosine-3',5'-bis(diphosphate) 3'-pyrophosphohydrolase -
  WMO35_RS04780 (WMO35_04780) - 1039398..1039778 (+) 381 WP_011257987.1 RidA family protein -
  WMO35_RS04785 (WMO35_04785) recG 1039799..1041952 (+) 2154 WP_047340128.1 ATP-dependent DNA helicase RecG -
  WMO35_RS04790 (WMO35_04790) - 1042085..1043023 (+) 939 WP_024711472.1 nucleoside hydrolase -
  WMO35_RS04795 (WMO35_04795) - 1043095..1043337 (+) 243 WP_005911911.1 type B 50S ribosomal protein L31 -
  WMO35_RS04800 (WMO35_04800) - 1043568..1044857 (+) 1290 WP_024711471.1 citrate synthase -
  WMO35_RS04810 (WMO35_04810) - 1045241..1045891 (-) 651 WP_011407794.1 hypothetical protein -
  WMO35_RS04815 (WMO35_04815) - 1046207..1048636 (-) 2430 WP_014504440.1 penicillin-binding protein 1A -
  WMO35_RS04820 (WMO35_04820) comM 1048852..1049910 (+) 1059 WP_041183432.1 pilus assembly protein PilM Machinery gene
  WMO35_RS04825 (WMO35_04825) - 1049910..1050692 (+) 783 WP_044750904.1 PilN domain-containing protein -
  WMO35_RS04830 (WMO35_04830) - 1050689..1051354 (+) 666 WP_014504437.1 type 4a pilus biogenesis protein PilO -
  WMO35_RS04835 (WMO35_04835) - 1051351..1051884 (+) 534 WP_024712440.1 pilus assembly protein PilP -
  WMO35_RS04840 (WMO35_04840) - 1051904..1053850 (+) 1947 WP_047340127.1 type IV pilus secretin PilQ -
  WMO35_RS04850 (WMO35_04850) - 1054427..1055446 (+) 1020 WP_014504434.1 MoxR family ATPase -
  WMO35_RS04855 (WMO35_04855) - 1055467..1056429 (+) 963 WP_024712442.1 DUF58 domain-containing protein -
  WMO35_RS04860 (WMO35_04860) - 1056432..1056893 (+) 462 WP_011258002.1 DUF4381 domain-containing protein -
  WMO35_RS04865 (WMO35_04865) - 1056890..1057903 (+) 1014 WP_047340126.1 VWA domain-containing protein -
  WMO35_RS04870 (WMO35_04870) - 1057900..1059717 (+) 1818 WP_047340125.1 VWA domain-containing protein -
  WMO35_RS04875 (WMO35_04875) - 1059714..1061480 (+) 1767 WP_047340124.1 BatD family protein -
  WMO35_RS04885 (WMO35_04885) - 1061776..1062093 (+) 318 WP_024712445.1 DUF3658 domain-containing protein -
  WMO35_RS04890 (WMO35_04890) - 1062318..1062815 (+) 498 Protein_927 transposase -
  WMO35_RS04900 (WMO35_04900) - 1063986..1064450 (+) 465 Protein_929 dicarboxylate/amino acid:cation symporter -
  WMO35_RS04905 (WMO35_04905) tkt 1064683..1066683 (+) 2001 WP_047340123.1 transketolase -
  WMO35_RS04910 (WMO35_04915) - 1066910..1068308 (+) 1399 Protein_931 TonB-dependent receptor domain-containing protein -
  WMO35_RS04915 (WMO35_04920) - 1068627..1068746 (-) 120 Protein_932 M23 family peptidase -
  WMO35_RS04920 (WMO35_04925) - 1068749..1069126 (-) 378 Protein_933 L,D-transpeptidase -
  WMO35_RS04925 (WMO35_04930) - 1069347..1069890 (-) 544 Protein_934 peptidoglycan-binding protein -
  WMO35_RS04930 (WMO35_04935) - 1069985..1071330 (-) 1346 Protein_935 IS5 family transposase -
  WMO35_RS04935 (WMO35_04940) - 1071456..1071695 (+) 240 WP_143703532.1 hypothetical protein -

Sequence


Protein


Download         Length: 352 a.a.        Molecular weight: 37636.30 Da        Isoelectric Point: 4.3882

>NTDB_id=962885 WMO35_RS04820 WP_041183432.1 1048852..1049910(+) (comM) [Xanthomonas oryzae pv. oryzicola strain MD-Ivory-1-B]
MGLLPKSQSPLIGVDISSTAVKLLQLSRSGNRFRVEHYAVEPLPPNAVVEKNIVEVDAVGEAIRRAINRSGSKAKNAAAA
VAGSAVITKLIPMPADLDDSDMEAQVELEATNYIPYPIEEVNLDFEVLGPMPNIPDMVQVLLAASRSENVELRQSALELG
GLTAKVMDVEAFAVENAFALVASELPVAADAVVALVDIGATMTTLSVLRSGRSLYSREQVFGGKQLTDEVMRRYGLTYEE
AGLAKRQGGLPESYEVEVLEPFKEATVQQISRLLQFFYAGSEFNRVDCIVLAGGCAALARLPEMVEEQLGVTTVVANPLA
QMTLGPKVQAHALALDAPALMIATGLALRSFD

Nucleotide


Download         Length: 1059 bp        

>NTDB_id=962885 WMO35_RS04820 WP_041183432.1 1048852..1049910(+) (comM) [Xanthomonas oryzae pv. oryzicola strain MD-Ivory-1-B]
GTGGGGCTTTTACCCAAGAGTCAGTCGCCACTGATTGGTGTCGACATTAGTTCTACTGCCGTGAAGCTCTTGCAGCTATC
GCGCAGCGGGAACCGTTTTCGCGTGGAACATTACGCCGTGGAACCGTTGCCGCCGAACGCGGTGGTGGAAAAGAACATCG
TCGAAGTGGATGCGGTGGGCGAAGCCATTCGCCGCGCCATCAACCGGTCCGGCAGCAAGGCCAAGAACGCGGCCGCGGCC
GTGGCTGGGTCGGCGGTGATCACCAAGTTGATCCCGATGCCGGCCGATCTGGACGATAGCGACATGGAGGCCCAGGTCGA
ACTGGAAGCCACCAATTACATTCCGTACCCGATCGAGGAAGTGAATCTCGATTTCGAGGTGCTGGGCCCGATGCCCAACA
TCCCGGATATGGTCCAGGTGCTGCTGGCCGCATCGCGTTCGGAGAATGTGGAACTGCGCCAGTCGGCGCTGGAACTCGGT
GGTCTGACTGCCAAAGTGATGGACGTGGAGGCCTTCGCGGTCGAAAACGCCTTCGCCCTGGTCGCCAGCGAATTGCCGGT
GGCCGCTGACGCAGTCGTGGCGCTGGTGGATATCGGCGCCACCATGACCACGCTGAGCGTGCTGCGTTCCGGTCGCAGTC
TGTATAGCCGCGAACAGGTGTTCGGCGGCAAGCAGCTCACCGATGAAGTGATGCGCCGTTATGGACTCACCTACGAAGAA
GCTGGCCTGGCCAAGCGTCAGGGCGGTCTGCCGGAAAGCTACGAGGTCGAGGTGCTGGAGCCGTTCAAGGAAGCCACGGT
GCAGCAGATCAGCCGGTTGCTGCAGTTCTTCTATGCAGGCAGCGAGTTCAATCGCGTCGACTGCATCGTGCTGGCCGGCG
GTTGCGCCGCGCTGGCGCGCCTGCCGGAGATGGTGGAAGAGCAACTGGGGGTCACCACAGTGGTCGCCAACCCGCTTGCC
CAGATGACGCTGGGTCCGAAAGTCCAGGCCCATGCGCTGGCGCTGGATGCGCCTGCATTGATGATCGCCACCGGCCTGGC
CCTGAGGAGCTTCGACTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Acinetobacter baylyi ADP1

50.997

99.716

0.509

  comM Acinetobacter nosocomialis M2

50.852

100

0.509

  pilM Acinetobacter baumannii D1279779

50.568

100

0.506

  pilM Legionella pneumophila strain ERS1305867

47.578

99.716

0.474