Detailed information    

In silico: bioinformatically predicted

Overview


Name: comYH
Type: Machinery gene
Locus tag: FY406_RS04175
Genome accession: NZ_CP043405
Coordinates: 853215..854168 (+)
Length: 317 a.a.
NCBI ID: WP_003086833.1
UniProt ID: -
Organism: Streptococcus ratti strain ATCC 31377
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
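
The coordinates and the protein length listed for this entry can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the final codon is a stop codon; the helper name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


# Coordinates of comYH (FY406_RS04175) as listed in this entry.
gene_bp = span_length(853215, 854168)

# Assumption: the coding sequence is a whole number of codons
# and the last codon is a stop codon that is not translated.
codons = gene_bp // 3
protein_aa = codons - 1

print(gene_bp, codons, protein_aa)  # 954 318 317
```

The result agrees with the 954 bp nucleotide length and the 317 a.a. protein length reported for this gene.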

Genomic Context


Location: 848215..859168
Locus tag (old locus tag) | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
FY406_RS04140 (FY406_04140) | comYA | 849479..850420 (+) | 942 | WP_003086841.1 | competence type IV pilus ATPase ComGA | Machinery gene
FY406_RS04145 (FY406_04145) | comYB | 850344..851387 (+) | 1044 | WP_404827213.1 | competence type IV pilus assembly protein ComGB | Machinery gene
FY406_RS04150 (FY406_04150) | comYC | 851387..851701 (+) | 315 | WP_003086839.1 | competence type IV pilus major pilin ComGC | Machinery gene
FY406_RS04155 (FY406_04155) | comYD | 851688..852095 (+) | 408 | WP_003086838.1 | competence type IV pilus minor pilin ComGD | Machinery gene
FY406_RS04160 (FY406_04160) | comYE | 852067..852360 (+) | 294 | WP_003086837.1 | competence type IV pilus minor pilin ComGE | Machinery gene
FY406_RS04165 (FY406_04165) | comYF | 852347..852781 (+) | 435 | WP_003086836.1 | competence type IV pilus minor pilin ComGF | Machinery gene
FY406_RS04170 (FY406_04170) | comYG | 852759..853166 (+) | 408 | WP_003091335.1 | competence type IV pilus minor pilin ComGG | Machinery gene
FY406_RS04175 (FY406_04175) | comYH | 853215..854168 (+) | 954 | WP_003086833.1 | class I SAM-dependent methyltransferase | Machinery gene
FY406_RS04180 (FY406_04180) | - | 854228..855427 (+) | 1200 | WP_003086832.1 | acetate kinase | -
FY406_RS04185 (FY406_04185) | - | 855630..856307 (+) | 678 | WP_003086831.1 | CPBP family intramembrane glutamic endopeptidase | -
FY406_RS04190 (FY406_04190) | proC | 856423..857193 (-) | 771 | WP_003086830.1 | pyrroline-5-carboxylate reductase | -
FY406_RS04195 (FY406_04195) | pepA | 857285..858352 (-) | 1068 | WP_003086829.1 | glutamyl aminopeptidase | -
FY406_RS04200 (FY406_04200) | - | 858471..858758 (+) | 288 | WP_003086826.1 | DUF4651 domain-containing protein | -
FY406_RS04205 (FY406_04205) | - | 858755..859084 (+) | 330 | WP_003086824.1 | thioredoxin family protein | -
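
The coordinates in the table above show how tightly the comYA to comYH genes are packed on the forward strand. The following sketch computes the distance between each pair of neighbouring genes, assuming 1-based inclusive coordinates; a negative value means the two reading frames overlap by that many bases. The function name `adjacent_gaps` is hypothetical, and the numbers say nothing on their own about how the genes are transcribed.

```python
# (gene name, start, end) for the comY genes, copied from the table above.
genes = [
    ("comYA", 849479, 850420),
    ("comYB", 850344, 851387),
    ("comYC", 851387, 851701),
    ("comYD", 851688, 852095),
    ("comYE", 852067, 852360),
    ("comYF", 852347, 852781),
    ("comYG", 852759, 853166),
    ("comYH", 853215, 854168),
]


def adjacent_gaps(genes):
    """Return (upstream, downstream, gap) for each neighbouring pair.

    gap > 0: bases between the two genes; gap < 0: overlap in bases.
    """
    result = []
    for (name_a, _, end_a), (name_b, start_b, _) in zip(genes, genes[1:]):
        result.append((name_a, name_b, start_b - end_a - 1))
    return result


gaps = adjacent_gaps(genes)
for upstream, downstream, gap in gaps:
    print(f"{upstream} -> {downstream}: {gap:+d} bp")
```

With these coordinates, every pair from comYA through comYG overlaps, while comYH starts 48 bp after the end of comYG.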

Sequence


Protein


Length: 317 a.a.
Molecular weight: 36237.15 Da
Isoelectric point: 4.3581

>NTDB_id=383573 FY406_RS04175 WP_003086833.1 853215..854168(+) (comYH) [Streptococcus ratti strain ATCC 31377]
MDFEKIETAYNLLLDNCQQLEAKIQTNLYDALIEQNAYYLGADGADDMIRQNNYQLRQLNLSKEEWRRTFQFLFIKAAQD
SQLQANHQFTPDSIGFILLYLLEELTDSDQLDVLEIGSGTGNLAETLVNNSSKELDYMGIEVDDLLIDLSASVADVLDSP
VHFIQEDAVRPQILKESDVIISDLPVGFYPNNDIAKRYEVAAAEGHTYAHHLLMEQSFKYLKKNGLAIFLAPVDLLTSEQ
SPLLKAWLQEKAAVLSVISLPEKVFSHKNNMKSIFILKKQETQDCETFVYPLTDLQNPDILRSFMKNFKKWKEDNVI

Nucleotide


Length: 954 bp

>NTDB_id=383573 FY406_RS04175 WP_003086833.1 853215..854168(+) (comYH) [Streptococcus ratti strain ATCC 31377]
ATGGATTTTGAAAAAATAGAAACAGCCTACAATTTACTATTAGATAATTGTCAGCAACTGGAAGCGAAAATTCAGACAAA
TCTTTACGATGCTTTAATTGAGCAAAATGCTTATTATCTGGGAGCCGACGGAGCCGATGACATGATTAGACAGAACAATT
ATCAGTTACGGCAGCTGAATCTCAGTAAAGAAGAGTGGCGCAGAACCTTTCAATTTCTGTTTATTAAGGCAGCTCAAGAC
TCTCAGCTGCAAGCAAACCATCAGTTTACTCCAGACAGTATCGGCTTTATACTTCTTTATTTACTGGAAGAATTGACTGA
CAGTGATCAGTTAGATGTTCTTGAAATTGGTTCAGGAACAGGAAATCTCGCTGAAACGCTGGTTAATAATAGTTCGAAAG
AGCTAGACTATATGGGTATTGAGGTGGATGACCTTTTGATTGATTTATCCGCAAGTGTTGCAGATGTCTTAGATTCTCCT
GTTCATTTTATTCAAGAAGATGCTGTGCGGCCGCAAATTTTAAAGGAAAGTGATGTTATTATCAGTGATCTGCCTGTTGG
TTTTTATCCTAACAATGACATTGCTAAACGCTACGAAGTTGCAGCGGCAGAAGGTCACACCTATGCTCACCATCTTTTAA
TGGAGCAGTCGTTTAAGTATTTAAAGAAAAATGGACTGGCTATTTTTCTAGCACCTGTCGACTTGCTGACCAGCGAACAG
AGCCCTTTATTAAAGGCGTGGTTGCAGGAAAAAGCTGCTGTTTTATCGGTTATCTCTTTACCGGAGAAAGTTTTTAGTCA
TAAAAATAATATGAAATCTATTTTTATCTTAAAGAAGCAGGAAACACAAGATTGCGAGACCTTTGTTTACCCGCTGACGG
ATTTACAGAATCCTGATATTTTACGAAGTTTCATGAAAAATTTTAAAAAATGGAAGGAAGATAATGTCATTTAA
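
The nucleotide record can be checked against the protein record by translating it. The sketch below is a minimal translator built on the standard codon assignments (assumed here to apply to this gene); `translate` and the two variable names are hypothetical. For brevity it translates only the first 30 and last 18 bases of the sequence above rather than the full 954 bp.

```python
from itertools import product

# Standard codon table, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the comYH nucleotide sequence above.
first_codons = "ATGGATTTTGAAAAAATAGAAACAGCCTAC"
last_codons = "GAAGATAATGTCATTTAA"

print(translate(first_codons))  # MDFEKIETAY
print(translate(last_codons))   # EDNVI
```

Both fragments match the start (MDFEKIETAY) and end (EDNVI) of the protein sequence listed for this entry, and the final codon TAA is a stop codon.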

Domains


Predicted by InterProScan.

(70-290)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comYH | Streptococcus mutans UA159 | 79.43 | 99.685 | 0.792
comYH | Streptococcus mutans UA140 | 79.43 | 99.685 | 0.792


Multiple sequence alignment