Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYH   Type   Machinery gene
Locus tag   AB4X21_RS01170 Genome accession   NZ_CP163380
Coordinates   205182..206135 (+) Length   317 a.a.
NCBI ID   WP_369088029.1    Uniprot ID   A0AB39LBF5
Organism   Streptococcus sp. CP1998     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 200182..211135
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AB4X21_RS01130 (AB4X21_01130) - 200943..201332 (+) 390 WP_037607163.1 DUF1033 family protein -
  AB4X21_RS01135 (AB4X21_01135) comYA 201419..202360 (+) 942 WP_369088024.1 competence type IV pilus ATPase ComGA Machinery gene
  AB4X21_RS01140 (AB4X21_01140) comGB/cglB 202293..203324 (+) 1032 WP_369088354.1 competence type IV pilus assembly protein ComGB Machinery gene
  AB4X21_RS01145 (AB4X21_01145) comYC 203321..203638 (+) 318 WP_369088025.1 competence type IV pilus major pilin ComGC Machinery gene
  AB4X21_RS01150 (AB4X21_01150) comYD 203628..204032 (+) 405 WP_003004513.1 competence type IV pilus minor pilin ComGD Machinery gene
  AB4X21_RS01155 (AB4X21_01155) comGE 203998..204285 (+) 288 WP_369088026.1 competence type IV pilus minor pilin ComGE -
  AB4X21_RS01160 (AB4X21_01160) comGF/cglF 204275..204712 (+) 438 WP_369088027.1 competence type IV pilus minor pilin ComGF Machinery gene
  AB4X21_RS01165 (AB4X21_01165) comGG 204714..205151 (+) 438 WP_369088028.1 competence type IV pilus minor pilin ComGG -
  AB4X21_RS01170 (AB4X21_01170) comYH 205182..206135 (+) 954 WP_369088029.1 class I SAM-dependent methyltransferase Machinery gene
  AB4X21_RS01175 (AB4X21_01175) - 206187..207380 (+) 1194 WP_003004202.1 acetate kinase -
  AB4X21_RS01180 (AB4X21_01180) - 207453..208184 (+) 732 WP_369088030.1 CPBP family intramembrane glutamic endopeptidase -
  AB4X21_RS01185 (AB4X21_01185) folP 208347..209300 (+) 954 WP_369088031.1 dihydropteroate synthase -
  AB4X21_RS01190 (AB4X21_01190) - 209301..210611 (+) 1311 WP_369088032.1 folylpolyglutamate synthase/dihydrofolate synthase family protein -

Sequence


Protein


Download         Length: 317 a.a.        Molecular weight: 36017.90 Da        Isoelectric Point: 4.2657

>NTDB_id=1029402 AB4X21_RS01170 WP_369088029.1 205182..206135(+) (comYH) [Streptococcus sp. CP1998]
MNFEKIEQAYTYLLENTQSIQNELSTNFYDALIEQNVMYLQGKTDLDIVKNNSQKLKELGLSKEEWRRAYQFLFMKAAQT
EPLQANHQFTPDAIGFIITFLIDQLAKSDQLDVLEVGSGTGNLAETIVNNSRLKIDYLGLEVDDLLIDLSASIADVMESS
VVFAQGDAVRPQVLKESDLIVSDLPIGYYPDDAIAQRYQVASSEGHTYAHHLMMEQALKYLKPQGVAIFLAPNNLLTSPQ
SDLLKVWLKDKAQILAMLTLPESLFSNPAYAKTIFVLRKQEEESVQPFVYPFTDLQDQDQVVHFMESFQNWLKDSEI

Nucleotide


Download         Length: 954 bp        

>NTDB_id=1029402 AB4X21_RS01170 WP_369088029.1 205182..206135(+) (comYH) [Streptococcus sp. CP1998]
ATGAATTTCGAAAAAATTGAACAAGCCTATACCTATCTATTGGAAAACACTCAAAGTATTCAAAATGAATTGTCGACCAA
CTTTTATGACGCCTTGATTGAACAAAATGTCATGTATTTGCAGGGCAAGACAGACCTAGACATTGTCAAAAATAATAGCC
AAAAATTAAAAGAATTAGGTTTAAGTAAGGAAGAATGGCGCAGAGCCTACCAATTTCTATTTATGAAAGCTGCTCAGACA
GAACCTTTGCAAGCGAATCACCAGTTCACACCGGATGCGATTGGTTTTATCATTACATTTTTGATCGATCAGTTGGCTAA
AAGCGACCAACTGGATGTCTTAGAAGTGGGAAGTGGGACCGGAAATCTAGCTGAGACCATTGTCAATAATAGTCGTCTCA
AGATCGATTACTTAGGGTTGGAAGTGGATGATCTCTTGATTGACTTATCTGCTAGTATCGCAGATGTGATGGAGTCTAGC
GTTGTCTTTGCACAAGGAGATGCGGTGCGTCCGCAAGTGTTGAAAGAAAGTGACTTGATCGTTAGTGATCTTCCGATTGG
TTACTATCCAGATGATGCGATTGCACAGCGCTATCAGGTAGCGAGCTCCGAAGGCCATACCTATGCCCATCACCTCATGA
TGGAACAGGCTTTGAAATACCTGAAACCTCAAGGAGTAGCTATCTTTTTAGCCCCCAATAACCTCTTGACGAGCCCTCAG
AGTGATCTTTTAAAAGTTTGGTTGAAAGACAAGGCTCAAATTCTTGCCATGTTGACCTTGCCAGAATCTCTCTTTTCAAA
TCCAGCCTATGCTAAGACGATTTTCGTCCTACGAAAACAAGAAGAAGAGTCTGTTCAGCCCTTTGTCTATCCGTTTACCG
ATCTCCAGGATCAAGATCAGGTGGTTCACTTTATGGAAAGTTTCCAAAACTGGTTAAAGGATAGTGAAATTTGA

Domains


Predicted by InterproScan.

(68-286)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159

61.905

99.369

0.615

  comYH Streptococcus mutans UA140

61.905

99.369

0.615


Multiple sequence alignment