Detailed information    

In silico: bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   NH8B_RS17140 Genome accession   NC_016002
Coordinates   3696799..3698301 (-) Length   500 a.a.
NCBI ID   WP_014088362.1   UniProt ID   G2IZG7
Organism   Pseudogulbenkiania sp. NH8B     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake
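
The coordinates and lengths in the overview can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon; the helper names are hypothetical and not part of the database.

```python
def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


def gene_length_bp(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


def protein_length_aa(length_bp):
    # Assumption: the coding sequence ends with one stop codon,
    # which is not counted as an amino acid.
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1


start, end, strand = parse_coordinates("3696799..3698301 (-)")
length_bp = gene_length_bp(start, end)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa, strand)
```

Under these assumptions the gene spans 1503 bp and encodes 500 amino acids, in agreement with the lengths listed on this page.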

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Genomic island   3696799..3735840   3696799..3698301   within   0
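
The "Relative position" and "Distance (bp)" columns follow from comparing the two coordinate ranges. Below is a minimal sketch of one way to derive them, not the database's own implementation. Only the label "within" appears in the source; the labels "upstream", "downstream" and "overlapping" and the function name are assumptions made for illustration.

```python
def relative_position(gene, mge):
    """Return (label, distance_bp) for a gene range relative to an MGE range.

    Both arguments are (start, end) tuples with inclusive coordinates.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        # Hypothetical label: gene lies entirely before the MGE.
        return "upstream", m_start - g_end
    if g_start > m_end:
        # Hypothetical label: gene lies entirely after the MGE.
        return "downstream", g_start - m_end
    # Hypothetical label: partial overlap with an MGE boundary.
    return "overlapping", 0


comM = (3696799, 3698301)
genomic_island = (3696799, 3735840)
print(relative_position(comM, genomic_island))
```

For comM the gene start coincides with the start of the genomic island, so the gene counts as lying within the island at distance 0.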


Gene organization within MGE regions


Location: 3696799..3735840
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
NH8B_RS17140 (NH8B_3525)   comM   3696799..3698301 (-)   1503   WP_014088362.1   YifB family Mg chelatase-like AAA ATPase   Machinery gene
NH8B_RS17145 (NH8B_3526)   -   3698312..3698599 (-)   288   WP_008952463.1   accessory factor UbiK family protein   -
NH8B_RS17150 (NH8B_3527)   glnK   3698870..3699208 (+)   339   WP_008952462.1   P-II family nitrogen regulator   -
NH8B_RS17155 (NH8B_3528)   -   3699240..3700544 (+)   1305   WP_014088364.1   ammonium transporter   -
NH8B_RS17160 (NH8B_3529)   -   3700692..3701135 (-)   444   WP_014088365.1   ClpXP protease specificity-enhancing factor   -
NH8B_RS17165 (NH8B_3530)   -   3701144..3701746 (-)   603   WP_040652169.1   glutathione S-transferase N-terminal domain-containing protein   -
NH8B_RS17170 (NH8B_3531)   -   3701893..3702654 (-)   762   WP_008952458.1   cytochrome c1   -
NH8B_RS17175 (NH8B_3532)   -   3702672..3703991 (-)   1320   WP_008952457.1   cytochrome bc complex cytochrome b subunit   -
NH8B_RS17180 (NH8B_3533)   petA   3704006..3704590 (-)   585   WP_008952456.1   ubiquinol-cytochrome c reductase iron-sulfur subunit   -
NH8B_RS17185 (NH8B_3534)   -   3704706..3705452 (-)   747   WP_014088368.1   Nif3-like dinuclear metal center hexameric protein   -
NH8B_RS17190 (NH8B_3535)   groL   3705628..3707265 (-)   1638   WP_014088369.1   chaperonin GroEL   -
NH8B_RS17195 (NH8B_3536)   groES   3707310..3707597 (-)   288   WP_008952453.1   co-chaperone GroES   -
NH8B_RS17200 (NH8B_3537)   -   3707746..3708888 (-)   1143   WP_014088370.1   glycosyltransferase   -
NH8B_RS17205 (NH8B_3538)   rfbB   3709169..3710233 (+)   1065   WP_014088371.1   dTDP-glucose 4,6-dehydratase   -
NH8B_RS17210 (NH8B_3539)   rfbD   3710230..3711141 (+)   912   WP_014088372.1   dTDP-4-dehydrorhamnose reductase   -
NH8B_RS17215 (NH8B_3540)   rfbA   3711138..3712019 (+)   882   WP_014088373.1   glucose-1-phosphate thymidylyltransferase RfbA   -
NH8B_RS17220 (NH8B_3541)   rfbC   3712019..3712567 (+)   549   WP_014088374.1   dTDP-4-dehydrorhamnose 3,5-epimerase   -
NH8B_RS17225 (NH8B_3543)   pseB   3713041..3714039 (+)   999   WP_014088376.1   UDP-N-acetylglucosamine 4,6-dehydratase (inverting)   -
NH8B_RS17230 (NH8B_3544)   pseC   3714039..3715187 (+)   1149   WP_014088377.1   UDP-4-amino-4,6-dideoxy-N-acetyl-beta-L-altrosamine transaminase   -
NH8B_RS17235 (NH8B_3545)   pseF   3715184..3715876 (+)   693   WP_014088378.1   pseudaminic acid cytidylyltransferase   -
NH8B_RS20985 (NH8B_3546)   -   3715873..3716829 (+)   957   WP_014088379.1   PseG/SpsG family protein   -
NH8B_RS20575 (NH8B_3547)   -   3716819..3717379 (+)   561   WP_014088380.1   GNAT family N-acetyltransferase   -
NH8B_RS20580 (NH8B_3548)   -   3717376..3718071 (+)   696   WP_014088381.1   PIG-L deacetylase family protein   -
NH8B_RS20585   -   3718078..3718764 (+)   687   WP_083844281.1   methionyl-tRNA formyltransferase   -
NH8B_RS17240 (NH8B_3550)   pseI   3718766..3719779 (+)   1014   WP_014088383.1   pseudaminic acid synthase   -
NH8B_RS20990   -   3719831..3721567 (+)   1737   WP_148282996.1   hypothetical protein   -
NH8B_RS20590   -   3721534..3722592 (+)   1059   WP_083844264.1   phytanoyl-CoA dioxygenase family protein   -
NH8B_RS20595   -   3722687..3723586 (+)   900   WP_232503494.1   DapH/DapD/GlmU-related protein   -
NH8B_RS20220 (NH8B_3553)   -   3723583..3725325 (+)   1743   WP_158453625.1   ABC transporter ATP-binding protein   -
NH8B_RS20600 (NH8B_3554)   -   3725462..3726436 (+)   975   WP_014088387.1   sugar transferase   -
NH8B_RS17265 (NH8B_3555)   -   3726423..3727415 (+)   993   WP_014088388.1   NAD-dependent epimerase/dehydratase family protein   -
NH8B_RS20605 (NH8B_3556)   -   3727430..3728524 (+)   1095   WP_014088389.1   DegT/DnrJ/EryC1/StrS aminotransferase family protein   -
NH8B_RS20610 (NH8B_3557)   -   3728524..3729468 (+)   945   WP_014088390.1   hypothetical protein   -
NH8B_RS17270 (NH8B_3558)   gmd   3729469..3730593 (+)   1125   WP_014088391.1   GDP-mannose 4,6-dehydratase   -
NH8B_RS17275 (NH8B_3559)   -   3730701..3731678 (+)   978   WP_014088392.1   GDP-L-fucose synthase   -
NH8B_RS20615 (NH8B_3560)   -   3731675..3732502 (+)   828   WP_014088393.1   glycosyltransferase family 2 protein   -
NH8B_RS20620   -   3732524..3733441 (+)   918   WP_158453626.1   glycosyltransferase family 4 protein   -
NH8B_RS17280   -   3733540..3734358 (+)   819   WP_041703108.1   glycosyltransferase   -
NH8B_RS17285 (NH8B_3563)   -   3734407..3735840 (+)   1434   WP_041703110.1   mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase   -

Sequence


Protein


Length: 500 a.a.   Molecular weight: 53244.06 Da   Isoelectric point: 8.8150

>NTDB_id=42376 NH8B_RS17140 WP_014088362.1 3696799..3698301(-) (comM) [Pseudogulbenkiania sp. NH8B]
MTLAIVHSRALSGMEASEVSVEVHMANGLPAFAIVGLPDTEVKESRDRVRAAILTSGFTFPARKITVNLAPADLPKDSGR
FDLPIALGILVASGQVKGDALTRYEFAGELALSGQLRAVRGGLAMTCHARRAGRAFVLPPQSAAEAALVADATVYSAVTL
LEVCAHLNGLQLLPRAQAESARTTFCYPDLADVKGQGAARGALEIAAAGGHSLLLIGPPGTGKSMLASRLPGLLPPMQDD
EALEAAAVQSLGSQGFSVEAWRARPFRAPHHTASAVAMVGGGSEPRPGEVSLAHRGVLFLDELPEFDRRVLEVLREPLEN
GVIHISRAARQATFPARFQLVAAMNPCPCGYLGHRSGRCQCTPEQVARYRGKISGPLLDRIDMHIEMPTLSPDELSGQGQ
GETSAAVRERVISARRIQLARQGKANADLSGAELDRHGLVEASAQQTLLQAVERLHLSARSYHRILRVARTIADLKGAER
ISRQEVLQAVQLRRAGLPNS

Nucleotide


Length: 1503 bp

>NTDB_id=42376 NH8B_RS17140 WP_014088362.1 3696799..3698301(-) (comM) [Pseudogulbenkiania sp. NH8B]
ATGACGTTGGCCATCGTTCATAGCCGGGCCTTGTCCGGCATGGAGGCTTCCGAGGTCTCTGTCGAGGTGCACATGGCCAA
TGGCCTGCCGGCGTTCGCCATCGTGGGTCTGCCCGATACCGAGGTAAAGGAAAGTCGCGACCGGGTCCGGGCCGCGATCC
TGACCTCGGGGTTTACCTTCCCGGCGCGGAAGATCACAGTGAATCTGGCTCCCGCCGATTTGCCCAAAGATTCGGGGCGC
TTCGATTTGCCGATCGCACTGGGCATTCTGGTGGCGTCCGGGCAGGTGAAGGGGGACGCTCTCACCCGCTATGAATTCGC
GGGGGAGCTGGCGTTGAGTGGCCAACTGCGAGCCGTCCGTGGTGGCTTGGCCATGACCTGCCATGCCCGGCGTGCCGGGC
GCGCTTTTGTATTGCCGCCGCAAAGTGCCGCCGAGGCCGCCTTGGTGGCCGATGCCACGGTGTATTCGGCGGTGACGCTG
CTGGAGGTTTGTGCCCACCTCAATGGTTTGCAGCTCTTGCCACGCGCGCAGGCGGAGAGTGCCCGCACCACTTTCTGCTA
TCCCGATCTGGCCGATGTCAAAGGGCAGGGGGCAGCGCGCGGGGCGCTGGAGATCGCCGCCGCGGGCGGCCACAGCTTGC
TGTTGATTGGTCCTCCGGGAACAGGGAAGTCGATGCTGGCCAGCCGTTTGCCCGGGCTGCTGCCGCCAATGCAGGATGAC
GAGGCACTGGAGGCGGCGGCGGTTCAGTCACTCGGATCGCAAGGGTTCTCTGTCGAGGCCTGGCGCGCCAGGCCATTCAG
AGCACCACACCACACCGCTTCCGCCGTGGCCATGGTGGGAGGGGGCTCCGAGCCACGTCCGGGAGAGGTCAGCCTGGCGC
ACCGTGGGGTGCTCTTCCTGGACGAATTGCCGGAGTTTGATCGTCGCGTGCTGGAGGTGCTGCGTGAACCTTTGGAGAAT
GGCGTGATCCATATCTCCCGTGCGGCCCGCCAGGCGACGTTTCCAGCCCGATTTCAGCTCGTGGCGGCAATGAATCCCTG
CCCCTGCGGTTATCTCGGGCATCGTTCCGGGCGCTGCCAGTGCACGCCGGAGCAGGTTGCACGTTATCGCGGTAAAATCT
CCGGCCCTTTGCTGGACCGGATCGACATGCATATCGAGATGCCGACCCTGTCGCCGGACGAGCTATCAGGGCAGGGGCAA
GGCGAAACGAGTGCGGCGGTGCGCGAGAGGGTGATTTCGGCACGCCGGATCCAGTTGGCGCGGCAGGGGAAGGCGAATGC
GGACCTTTCCGGGGCCGAGCTGGATCGGCATGGGTTGGTCGAAGCGTCGGCGCAGCAGACACTGCTTCAGGCGGTCGAGC
GGCTGCATCTTTCCGCTCGCAGCTATCATCGCATTCTGCGTGTGGCCCGAACCATTGCGGACTTGAAGGGCGCGGAACGG
ATATCGCGCCAGGAAGTGTTGCAGGCGGTACAACTGCGACGTGCCGGGCTGCCAAATAGCTGA
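
The nucleotide record above is given in coding orientation (it starts with ATG even though the gene lies on the minus strand), so it can be translated directly to recover the protein record. The following is a minimal sketch using the standard genetic code, whose sense-codon assignments are the same as in the bacterial translation table 11. The function and variable names are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the comM nucleotide record.
print(translate("ATGACGTTGGCCATCGTTCATAGC"))
# Last four codons, ending in the TGA stop codon.
print(translate("CCAAATAGCTGA"))
```

The two fragments translate to MTLAIVHS and PNS, which match the beginning and end of the protein sequence listed above. Translating the full 1503 bp record after joining its lines works the same way.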


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
AlphaFold DB   G2IZG7

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comM   Vibrio campbellii strain DS40M4   54.545   99   0.54
comM   Vibrio cholerae strain A1552   54.141   99   0.536
comM   Haemophilus influenzae Rd KW20   51.703   99.8   0.516
comM   Glaesserella parasuis strain SC1401   50.501   99.8   0.504
comM   Legionella pneumophila str. Paris   49.299   99.8   0.492
comM   Legionella pneumophila strain ERS1305867   49.299   99.8   0.492
RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   44.422   100   0.446
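
The page does not define the Ha-value. The listed numbers are, however, close to the product of the identity and coverage fractions, so the sketch below treats Ha-value as identity × coverage. This definition is an assumption inferred from the table, not something stated by the source, and the function name is hypothetical.

```python
def ha_value(identity_pct, coverage_pct):
    # Assumed definition: product of identity and coverage,
    # each converted from a percentage to a fraction.
    return (identity_pct / 100) * (coverage_pct / 100)


# (identity %, coverage %, Ha-value listed on the page)
rows = [
    (54.545, 99, 0.54),
    (54.141, 99, 0.536),
    (51.703, 99.8, 0.516),
    (50.501, 99.8, 0.504),
    (49.299, 99.8, 0.492),
    (44.422, 100, 0.446),
]
for identity, coverage, listed in rows:
    print(f"{ha_value(identity, coverage):.3f} (listed: {listed})")
```

Most rows agree with the listed value to three decimal places. The last row differs slightly (about 0.444 computed against 0.446 listed), possibly because the displayed coverage is rounded, so the formula should be read as an approximation.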


Multiple sequence alignment