Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   FGL04_RS00660 Genome accession   NZ_LR594035
Coordinates   103508..104464 (+) Length   318 a.a.
NCBI ID   WP_138067850.1    Uniprot ID   A0A4U9XLG1
Organism   Streptococcus pseudoporcinus strain NCTC5385     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 98508..109464
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FGL04_RS00630 (NCTC5385_00128) - 101434..101805 (+) 372 WP_225247737.1 DUF1033 family protein -
  FGL04_RS00635 (NCTC5385_00129) comYC 101744..102010 (+) 267 WP_138067846.1 competence type IV pilus major pilin ComGC Machinery gene
  FGL04_RS00640 (NCTC5385_00130) comGD 101970..102398 (+) 429 WP_037598912.1 competence type IV pilus minor pilin ComGD -
  FGL04_RS00645 comGE 102451..102663 (+) 213 WP_229296016.1 competence system putative prepilin ComGE -
  FGL04_RS00650 (NCTC5385_00131) comGF 102683..103084 (+) 402 WP_171011213.1 competence type IV pilus minor pilin ComGF -
  FGL04_RS00655 (NCTC5385_00132) comGG 103062..103463 (+) 402 WP_138067849.1 competence type IV pilus minor pilin ComGG -
  FGL04_RS00660 (NCTC5385_00133) comYH 103508..104464 (+) 957 WP_138067850.1 class I SAM-dependent methyltransferase Machinery gene
  FGL04_RS00665 (NCTC5385_00134) - 104520..105713 (+) 1194 WP_138067851.1 acetate kinase -
  FGL04_RS00670 (NCTC5385_00135) - 106076..106285 (+) 210 WP_007895522.1 helix-turn-helix transcriptional regulator -
  FGL04_RS00675 (NCTC5385_00136) - 106282..106734 (+) 453 WP_138067852.1 hypothetical protein -
  FGL04_RS00680 (NCTC5385_00137) - 106746..107393 (+) 648 WP_138067853.1 CPBP family intramembrane glutamic endopeptidase -
  FGL04_RS00685 (NCTC5385_00138) proC 107571..108341 (-) 771 WP_138067854.1 pyrroline-5-carboxylate reductase -
  FGL04_RS00690 (NCTC5385_00139) pepA 108360..109427 (-) 1068 WP_138067855.1 glutamyl aminopeptidase -

Sequence


Protein


Download         Length: 318 a.a.        Molecular weight: 36017.10 Da        Isoelectric Point: 4.8716

>NTDB_id=1127276 FGL04_RS00660 WP_138067850.1 103508..104464(+) (comYH) [Streptococcus pseudoporcinus strain NCTC5385]
MNFEKIEKAYELILENSQLIENELKTHIYDALIEQNAYYLGADGAIEQVLKNNGDLHALNLSKEEWRRAFQFVFIKAGQT
AKLQANHQFTPDSIAFIILFLMEQLHSGDSLDVIEIGSGTGNLAQTLLNNSLKKINYLGLELDDLLIDLSASIAEVMKSS
AIFLQEDAVRPQVLKESDIIISDLPIGYYPNDEIASRYQVASAEGHTYAHHLLMEQSLKYLKKNGFAIFLAPSNLLNSPQ
SDLLKKWLKDYAQLRVVVTLPESIFGNQANAKSIIVLQKNTEKNGETFVYPLTDLQSPQALQRFMEEFKKWKQDNVFS

Nucleotide


Download         Length: 957 bp        

>NTDB_id=1127276 FGL04_RS00660 WP_138067850.1 103508..104464(+) (comYH) [Streptococcus pseudoporcinus strain NCTC5385]
ATGAACTTTGAAAAAATTGAGAAAGCCTACGAGCTTATTTTGGAGAATAGTCAACTCATTGAGAATGAGCTTAAGACGCA
TATTTATGATGCTCTAATTGAGCAGAATGCCTATTATTTAGGAGCTGATGGAGCCATTGAGCAGGTCTTGAAAAACAATG
GCGACTTGCATGCTTTAAACTTAAGTAAGGAAGAGTGGCGTCGCGCCTTTCAGTTTGTTTTTATAAAGGCTGGCCAGACG
GCAAAATTACAAGCTAATCACCAATTTACCCCCGACAGTATTGCTTTTATCATACTTTTTTTGATGGAGCAGTTGCATAG
CGGAGATAGCTTGGACGTGATTGAGATTGGTAGTGGTACAGGAAACCTTGCTCAGACGCTATTAAATAATAGTCTTAAAA
AGATCAACTATCTGGGTTTAGAACTAGATGATTTATTAATTGATTTGTCAGCCAGTATTGCTGAGGTGATGAAATCCTCT
GCAATCTTTCTCCAAGAAGATGCTGTTCGCCCACAAGTGCTAAAAGAAAGTGATATTATTATTAGTGATTTACCTATTGG
TTATTATCCTAATGATGAGATTGCAAGTCGCTATCAGGTGGCAAGTGCTGAAGGTCATACGTATGCCCATCATTTATTGA
TGGAGCAATCATTGAAGTATCTCAAAAAGAATGGCTTTGCGATTTTCCTAGCACCGAGCAATCTTTTAAACAGTCCGCAA
AGTGATTTGTTGAAAAAATGGTTAAAAGACTATGCTCAGCTGAGAGTGGTCGTGACCCTGCCAGAATCTATTTTTGGCAA
TCAAGCAAATGCAAAATCCATTATTGTCCTACAAAAGAACACTGAAAAAAATGGTGAAACCTTTGTTTACCCACTGACAG
ATTTACAATCACCACAGGCTCTTCAAAGGTTTATGGAAGAGTTTAAAAAATGGAAACAAGATAATGTTTTTTCATAA

Domains


Predicted by InterproScan.

(76-295)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A4U9XLG1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

67.508

99.686

0.673

  comYH Streptococcus mutans UA140

67.192

99.686

0.67


Multiple sequence alignment