Detailed information    

In silico: bioinformatically predicted

Overview


Name: comYH
Type: Machinery gene
Locus tag: FGL06_RS00660
Genome accession: NZ_LR594042
Coordinates: 99922..100878 (+)
Length: 318 a.a.
NCBI ID: WP_024344497.1
UniProt ID: A0A239RCI2
Organism: Streptococcus equinus strain NCTC8133
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 94922..105878
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FGL06_RS00620 (NCTC8133_00123) - 95816..96184 (+) 369 WP_021141429.1 DUF1033 family protein -
  FGL06_RS00625 (NCTC8133_00124) comYA 96253..97194 (+) 942 WP_027968018.1 competence type IV pilus ATPase ComGA Machinery gene
  FGL06_RS00630 (NCTC8133_00125) comYB 97118..98161 (+) 1044 WP_164554342.1 competence type IV pilus assembly protein ComGB Machinery gene
  FGL06_RS00635 (NCTC8133_00126) comYC 98161..98463 (+) 303 WP_021141433.1 competence type IV pilus major pilin ComGC Machinery gene
  FGL06_RS00640 (NCTC8133_00127) comGD 98438..98878 (+) 441 WP_126437530.1 competence type IV pilus minor pilin ComGD -
  FGL06_RS00645 (NCTC8133_00128) comGE 98832..99125 (+) 294 WP_039697664.1 competence type IV pilus minor pilin ComGE -
  FGL06_RS00650 (NCTC8133_00129) comYF 99109..99546 (+) 438 WP_126437531.1 competence type IV pilus minor pilin ComGF Machinery gene
  FGL06_RS00655 (NCTC8133_00130) comGG 99533..99868 (+) 336 WP_003067902.1 competence type IV pilus minor pilin ComGG -
  FGL06_RS00660 (NCTC8133_00131) comYH 99922..100878 (+) 957 WP_024344497.1 class I SAM-dependent methyltransferase Machinery gene
  FGL06_RS00665 (NCTC8133_00132) - 100932..102128 (+) 1197 WP_138083561.1 acetate kinase -
  FGL06_RS00670 (NCTC8133_00133) - 102286..102483 (+) 198 WP_021141442.1 helix-turn-helix transcriptional regulator -
  FGL06_RS00675 (NCTC8133_00134) - 102538..102990 (+) 453 WP_126437532.1 ABC transporter permease -
  FGL06_RS00680 (NCTC8133_00135) - 103002..103637 (+) 636 WP_027968028.1 CPBP family intramembrane glutamic endopeptidase -
  FGL06_RS00685 (NCTC8133_00136) proC 103689..104459 (-) 771 WP_024344501.1 pyrroline-5-carboxylate reductase -
  FGL06_RS00690 (NCTC8133_00137) pepA 104521..105588 (-) 1068 WP_024344502.1 glutamyl aminopeptidase -
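
The Size (bp) column follows from the coordinates if they are read as 1-based and inclusive, and the same arithmetic shows which neighbouring competence genes overlap. The sketch below is only an illustration: the rows are copied from the table above, while the helper names `span_length` and `neighbour_gaps` are hypothetical and not part of the database.

```python
# Rows copied from the Genomic Context table: (gene, start, end, listed size in bp).
GENES = [
    ("comYA", 96253, 97194, 942),
    ("comYB", 97118, 98161, 1044),
    ("comYC", 98161, 98463, 303),
    ("comGD", 98438, 98878, 441),
    ("comGE", 98832, 99125, 294),
    ("comYF", 99109, 99546, 438),
    ("comGG", 99533, 99868, 336),
    ("comYH", 99922, 100878, 957),
]


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def neighbour_gaps(genes):
    """Return (upstream, downstream, gap) per adjacent pair; a negative gap is an overlap."""
    return [(a[0], b[0], b[1] - a[2] - 1) for a, b in zip(genes, genes[1:])]


# Check every listed size against its coordinates.
for name, start, end, size in GENES:
    assert span_length(start, end) == size, name

# Report the spacing between consecutive genes on the (+) strand.
for up, down, gap in neighbour_gaps(GENES):
    kind = "gap" if gap >= 0 else "overlap"
    print(f"{up} -> {down}: {kind} of {abs(gap)} bp")
```

Under this reading, comYA through comGG overlap one another, and comYH starts 53 bp downstream of comGG.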

Sequence


Protein


Length: 318 a.a.        Molecular weight: 35783.86 Da        Isoelectric Point: 4.4780

>NTDB_id=1127550 FGL06_RS00660 WP_024344497.1 99922..100878(+) (comYH) [Streptococcus equinus strain NCTC8133]
MNFENIETAYGLILENIQLIENELKTHIYDALIEQNSFYLGAEGASEVVAANNEKLRQLNLTKEEWRRAFQFIFIKAAQT
EALQANHQFTPDAIGFILMFLIENLTASKELDVLEIGSGTGNLAQTLLNNSSKDLNYLGIEVDDLLIDLSASIAEVMDSK
AQFVQEDAVRPQILKESDVIISDLPVGFYPNDEIAKRYKVASSEGHTYAHHLLMEQSLKYLKKDGIAVFLAPVSLLTSKQ
SDLLKAWLKDYADVIAVITLPESIFGNAANAKSIFVLKKQAEHTPETFVYPLADLQSREVLTDFIDKFKKWNVENMIF

Nucleotide


Length: 957 bp

>NTDB_id=1127550 FGL06_RS00660 WP_024344497.1 99922..100878(+) (comYH) [Streptococcus equinus strain NCTC8133]
ATGAATTTTGAAAATATCGAAACAGCCTATGGGTTGATTCTTGAAAATATACAATTAATCGAAAATGAGTTGAAAACACA
CATTTACGATGCACTTATTGAACAAAACTCTTTTTATCTTGGTGCTGAGGGTGCCAGTGAAGTTGTAGCTGCAAATAATG
AGAAACTACGCCAACTTAACTTAACTAAGGAAGAATGGCGCCGTGCTTTTCAGTTTATCTTTATTAAAGCTGCGCAAACA
GAAGCTCTTCAGGCAAATCACCAATTTACACCTGATGCTATTGGCTTCATTTTAATGTTCCTCATTGAGAATTTGACAGC
TTCTAAGGAACTTGATGTTTTGGAAATCGGTAGCGGAACAGGTAACCTTGCCCAAACGTTGTTGAACAACTCATCTAAAG
ACCTAAACTATCTAGGGATTGAAGTTGATGATTTGTTGATTGACTTGTCAGCAAGTATCGCTGAAGTTATGGATTCTAAA
GCTCAATTCGTTCAAGAAGATGCTGTACGCCCACAGATTCTTAAGGAAAGTGATGTCATCATCAGTGACCTTCCAGTCGG
ATTCTATCCAAATGATGAAATTGCAAAACGTTACAAAGTAGCTAGCAGTGAAGGGCACACTTATGCGCATCATTTGTTGA
TGGAACAATCTCTTAAATATCTCAAAAAAGACGGGATTGCTGTCTTTTTAGCACCAGTTAGTCTTTTGACAAGTAAGCAA
AGTGACCTGTTGAAAGCATGGTTGAAGGATTATGCTGATGTGATTGCTGTAATTACTTTGCCAGAATCTATCTTTGGAAA
TGCAGCCAACGCGAAATCAATTTTCGTTTTGAAAAAACAAGCTGAACATACTCCAGAAACCTTTGTTTATCCACTTGCTG
ACTTGCAAAGTCGAGAAGTGTTGACAGACTTCATTGATAAATTTAAAAAATGGAATGTTGAAAATATGATTTTTTAA
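
The 957 bp nucleotide record and the 318 a.a. protein record are related by 957 = 3 × (318 + 1), with the extra codon being the terminal TAA stop. A minimal sketch of that relationship follows, assuming the standard codon assignments (NCBI translation table 1, which the bacterial table 11 shares for elongation codons). Only the first 78 nt and the last 42 nt of the sequence above are used, and `translate` is a hypothetical helper written for this illustration.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon; '*' marks a stop, a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 78 nt and last 42 nt of the comYH nucleotide sequence shown above.
CDS_START = (
    "ATGAATTTTGAAAATATCGAAACAGCCTATGGGTTGATT"
    "CTTGAAAATATACAATTAATCGAAAATGAGTTGAAAACA"
)
CDS_END = "GATAAATTTAAAAAATGGAATGTTGAAAATATGATTTTTTAA"

print(translate(CDS_START))
print(translate(CDS_END))

# Length bookkeeping: 318 residues plus one stop codon account for all 957 bp.
assert 3 * (318 + 1) == 957
```

The two translated fragments can be compared by eye with the beginning and end of the protein sequence listed under Protein.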

Domains


Predicted by InterProScan.

Domain position: residues 69-291


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A239RCI2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA140 68.987 99.371 0.686
  comYH Streptococcus mutans UA159 68.671 99.371 0.682
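
The Ha-value column is not defined on this page. The two rows are numerically consistent with the identity fraction multiplied by the coverage fraction, rounded to three decimals. The sketch below checks only that observation; the formula is an assumption, and `ha_value` is a hypothetical helper rather than the database's own calculation.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed reading of Ha-value: identity fraction times coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Rows copied from the Similar proteins table:
# (organism, identities %, coverage %, listed Ha-value).
ROWS = [
    ("Streptococcus mutans UA140", 68.987, 99.371, 0.686),
    ("Streptococcus mutans UA159", 68.671, 99.371, 0.682),
]

for organism, identity, coverage, listed in ROWS:
    computed = round(ha_value(identity, coverage), 3)
    print(f"{organism}: computed {computed:.3f}, listed {listed:.3f}")
```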


Multiple sequence alignment