Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   LL045_RS10920 Genome accession   NZ_CP086083
Coordinates   2234446..2235384 (-) Length   312 a.a.
NCBI ID   WP_012898520.1    Uniprot ID   A0A2A9ILQ4
Organism   Lactococcus lactis subsp. lactis strain EIP20A     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
ICE 2216904..2239347 2234446..2235384 within 0


Gene organization within MGE regions


Location: 2216904..2239347
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LL045_RS10815 (LL045_10785) - 2216904..2218085 (+) 1182 WP_127093632.1 site-specific integrase -
  LL045_RS10820 (LL045_10790) - 2218298..2218924 (+) 627 WP_146978268.1 copper homeostasis protein CutC -
  LL045_RS10825 (LL045_10795) - 2218960..2219979 (-) 1020 WP_058205026.1 ABC transporter permease -
  LL045_RS10830 (LL045_10800) - 2219972..2220742 (-) 771 WP_004254655.1 ABC transporter ATP-binding protein -
  LL045_RS10835 (LL045_10805) - 2220761..2222566 (-) 1806 WP_270224742.1 glycerophosphodiester phosphodiesterase -
  LL045_RS10840 (LL045_10810) - 2222845..2223246 (+) 402 WP_058205027.1 HIT family protein -
  LL045_RS10845 (LL045_10815) - 2223259..2223480 (+) 222 WP_021721766.1 hypothetical protein -
  LL045_RS10850 (LL045_10820) - 2223492..2223836 (-) 345 WP_015427109.1 hypothetical protein -
  LL045_RS10855 (LL045_10825) rplA 2223914..2224603 (-) 690 WP_004254644.1 50S ribosomal protein L1 -
  LL045_RS10860 (LL045_10830) rplK 2224900..2225325 (-) 426 WP_011677074.1 50S ribosomal protein L11 -
  LL045_RS10865 (LL045_10835) - 2225462..2226226 (-) 765 WP_058205028.1 ABC transporter permease -
  LL045_RS10870 (LL045_10840) - 2226226..2227104 (-) 879 WP_038602236.1 ATP-binding cassette domain-containing protein -
  LL045_RS10875 (LL045_10845) - 2227171..2227380 (-) 210 WP_010906259.1 YozE family protein -
  LL045_RS10880 (LL045_10850) msrA 2227415..2227933 (-) 519 WP_012898514.1 peptide-methionine (S)-S-oxide reductase MsrA -
  LL045_RS10885 (LL045_10855) - 2228108..2228965 (-) 858 WP_010906261.1 S1 RNA-binding domain-containing protein -
  LL045_RS10890 (LL045_10860) - 2229011..2229469 (-) 459 WP_195927835.1 GNAT family N-acetyltransferase -
  LL045_RS10895 (LL045_10865) frr 2229562..2230119 (-) 558 WP_004254618.1 ribosome recycling factor -
  LL045_RS10900 (LL045_10870) pyrH 2230332..2231048 (-) 717 WP_021464089.1 UMP kinase -
  LL045_RS10905 (LL045_10875) - 2231133..2231585 (-) 453 WP_195362782.1 hypothetical protein -
  LL045_RS10910 (LL045_10880) - 2231719..2232906 (-) 1188 WP_015427114.1 acetate kinase -
  LL045_RS10915 (LL045_10885) - 2233063..2234250 (-) 1188 WP_394530189.1 acetate kinase -
  LL045_RS10920 (LL045_10890) comYH 2234446..2235384 (-) 939 WP_012898520.1 class I SAM-dependent methyltransferase Machinery gene
  LL045_RS10925 (LL045_10895) - 2235472..2235723 (-) 252 WP_038602250.1 DUF3165 family protein -
  LL045_RS10930 (LL045_10900) typA 2235824..2237656 (-) 1833 WP_021723473.1 translational GTPase TypA -
  LL045_RS10935 - 2237844..2238302 (-) 459 WP_195927836.1 hypothetical protein -
  LL045_RS10940 (LL045_10910) - 2238304..2238714 (-) 411 WP_230315099.1 hypothetical protein -
  LL045_RS10945 (LL045_10915) - 2238721..2239347 (-) 627 WP_394530191.1 ABC transporter ATP-binding protein -

Sequence


Protein


Download         Length: 312 a.a.        Molecular weight: 34798.94 Da        Isoelectric Point: 4.3717

>NTDB_id=622395 LL045_RS10920 WP_012898520.1 2234446..2235384(-) (comYH) [Lactococcus lactis subsp. lactis strain EIP20A]
MDMEKVAQGFELVVANIMLLSEKLDTDFYDAFVEQNAAFLDDTDQGIVELSVNNDKLRQLNLSNKEWQKLFQFVLLKGSQ
VAPLQPNHAMTPDAIGLIFNFIIEHLNKNSELRLIEFGSGMGNLAETLLVNLNKKVDYVGFEVDDLLLDLSASMAEIMGS
QAEFMQIDAVQKRLMEPADVVVSDLPIGFYPDDEVAKNFEVATTDGHTFAHHLLIEQSFNYLKEGSFAIFLAPEDLLTSP
QGPLLKEWISKHGSVMAVITLPKSLFNADAKAIYVLKKGPAAHATFAHPLSSLTDRESLEVFMEEFTKIVKL

Nucleotide


Download         Length: 939 bp        

>NTDB_id=622395 LL045_RS10920 WP_012898520.1 2234446..2235384(-) (comYH) [Lactococcus lactis subsp. lactis strain EIP20A]
ATGGATATGGAAAAAGTGGCTCAAGGTTTTGAGCTAGTTGTTGCAAATATTATGTTACTATCTGAGAAATTGGATACAGA
TTTTTATGATGCTTTTGTAGAACAAAATGCTGCTTTTTTAGATGATACTGATCAAGGAATTGTTGAATTGTCAGTAAATA
ATGACAAACTTCGTCAGTTAAATTTATCAAATAAGGAGTGGCAAAAACTTTTTCAATTTGTTTTGTTGAAAGGTTCACAA
GTCGCACCACTTCAACCCAATCATGCTATGACTCCAGATGCAATTGGTTTAATTTTTAATTTCATTATTGAACATCTTAA
TAAAAATTCTGAACTTCGTTTAATTGAATTTGGGTCTGGTATGGGTAATCTTGCTGAAACGCTTTTGGTTAATCTTAATA
AAAAGGTTGATTATGTGGGCTTTGAAGTTGATGATTTACTTTTAGATTTGTCTGCTTCAATGGCTGAAATTATGGGGAGT
CAGGCTGAGTTCATGCAAATTGATGCAGTACAAAAGCGTTTGATGGAACCTGCTGATGTTGTTGTTAGTGATTTACCAAT
TGGTTTTTATCCAGATGATGAAGTTGCAAAGAATTTTGAAGTTGCGACTACTGACGGACATACTTTTGCTCATCATCTTT
TGATTGAGCAATCATTTAATTATTTAAAAGAGGGTTCTTTTGCCATTTTCTTAGCTCCAGAAGATTTACTGACAAGTCCT
CAAGGACCACTATTAAAGGAGTGGATTAGTAAACATGGGAGTGTGATGGCTGTCATCACTTTACCAAAATCACTTTTTAA
TGCTGATGCTAAGGCGATTTATGTTTTGAAAAAGGGACCGGCTGCACATGCAACTTTTGCTCACCCTTTGTCTTCACTGA
CAGACAGAGAAAGTTTAGAAGTCTTTATGGAAGAATTTACAAAAATTGTAAAATTATAA

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2A9ILQ4

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140

48.882

100

0.49

  comYH Streptococcus mutans UA159

48.562

100

0.487