Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: WH25_RS10350
Genome accession: NZ_CP017295
Coordinates: 2160332..2161285 (-)
Length: 317 a.a.
NCBI ID: WP_046164731.1
UniProt ID: -
Organism: Streptococcus gordonii strain IE35
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
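
The coordinates and the protein length in this record can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates (the usual GenBank-style convention) and that the reported protein length excludes the stop codon; the function name `span_lengths` is illustrative and not part of the database.

```python
from typing import Tuple


def span_lengths(start: int, end: int) -> Tuple[int, int]:
    """Return (nucleotide length, protein length) for a coding sequence.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is not counted in the protein length.
    """
    nt_len = end - start + 1
    if nt_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nt_len, nt_len // 3 - 1


# comYH (WH25_RS10350): 2160332..2161285 on the minus strand
nt_len, aa_len = span_lengths(2160332, 2161285)
print(nt_len, aa_len)
```

Under those assumptions the span gives 954 bp and 317 a.a., matching the lengths reported in the Sequence section.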

Genomic Context


Location: 2155332..2166285

Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
WH25_RS10330 (WH25_10330) | - | 2156001..2157332 (-) | 1332 | WP_046164763.1 | bifunctional folylpolyglutamate synthase/dihydrofolate synthase | -
WH25_RS10335 (WH25_10335) | folP | 2157337..2158302 (-) | 966 | WP_046164729.1 | dihydropteroate synthase | -
WH25_RS10340 (WH25_10340) | - | 2158379..2159044 (-) | 666 | WP_046164730.1 | CPBP family intramembrane glutamic endopeptidase | -
WH25_RS10345 (WH25_10345) | - | 2159092..2160282 (-) | 1191 | WP_045634156.1 | acetate kinase | -
WH25_RS10350 (WH25_10350) | comYH | 2160332..2161285 (-) | 954 | WP_046164731.1 | class I SAM-dependent methyltransferase | Machinery gene
WH25_RS10355 (WH25_10355) | comGG | 2161316..2161747 (-) | 432 | WP_231108792.1 | competence type IV pilus minor pilin ComGG | -
WH25_RS10360 (WH25_10360) | comGF/cglF | 2161728..2162165 (-) | 438 | WP_046164732.1 | competence type IV pilus minor pilin ComGF | Machinery gene
WH25_RS10365 (WH25_10365) | comGE/cglE | 2162149..2162442 (-) | 294 | WP_046164733.1 | competence type IV pilus minor pilin ComGE | Machinery gene
WH25_RS10370 (WH25_10370) | comYD | 2162414..2162842 (-) | 429 | WP_069097080.1 | competence type IV pilus minor pilin ComGD | Machinery gene
WH25_RS10375 (WH25_10375) | comYC | 2162802..2163119 (-) | 318 | WP_012130940.1 | competence type IV pilus major pilin ComGC | Machinery gene
WH25_RS10380 (WH25_10380) | comYB | 2163116..2164150 (-) | 1035 | WP_046164735.1 | competence type IV pilus assembly protein ComGB | Machinery gene
WH25_RS10385 (WH25_10385) | comYA | 2164083..2165021 (-) | 939 | WP_046164736.1 | competence type IV pilus ATPase ComGA | Machinery gene
WH25_RS10390 (WH25_10390) | - | 2165123..2165491 (-) | 369 | WP_046164737.1 | DUF1033 family protein | -
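
Several neighbouring competence genes in this context have overlapping coordinates. The following sketch computes the gap or overlap between consecutive genes from the coordinates listed above, assuming 1-based inclusive coordinates; the list `genes` and the helper `spacing` are illustrative names.

```python
# (locus tag, start, end) for comYH through comYA, in genome order
genes = [
    ("WH25_RS10350", 2160332, 2161285),  # comYH
    ("WH25_RS10355", 2161316, 2161747),  # comGG
    ("WH25_RS10360", 2161728, 2162165),  # comGF/cglF
    ("WH25_RS10365", 2162149, 2162442),  # comGE/cglE
    ("WH25_RS10370", 2162414, 2162842),  # comYD
    ("WH25_RS10375", 2162802, 2163119),  # comYC
    ("WH25_RS10380", 2163116, 2164150),  # comYB
    ("WH25_RS10385", 2164083, 2165021),  # comYA
]


def spacing(gene_list):
    """Return (left tag, right tag, bp) for each consecutive pair.

    A positive value is an intergenic gap; a negative value is the
    number of overlapping base pairs.
    """
    return [
        (left[0], right[0], right[1] - left[2] - 1)
        for left, right in zip(gene_list, gene_list[1:])
    ]


for left_tag, right_tag, bp in spacing(genes):
    kind = "gap" if bp >= 0 else "overlap"
    print(f"{left_tag} / {right_tag}: {kind} of {abs(bp)} bp")
```

With these coordinates, comYH is separated from comGG by a 30 bp gap, while each of the remaining consecutive pairs overlaps by 4 to 68 bp.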

Sequence


Protein


Length: 317 a.a.; Molecular weight: 36006.21 Da; Isoelectric point: 4.9649

>NTDB_id=197447 WH25_RS10350 WP_046164731.1 2160332..2161285(-) (comYH) [Streptococcus gordonii strain IE35]
MNFEKIEEAYTLLLENVQVIQNKLSTNFYDALIEQNGIYLDGQTDLEIVKKNHQTLKSLKLSKEEWRRAYQFILMKGAQT
EPLQANHQFTPDAIGFLLIFIIDQLMVASDITLLEMGSGTGNLAETILNNSQKEIDYLGLEIDDLLIDLSASIAEVMCSK
AHFAQGDAVRPQVLKESDLIISDLPVGYYPDDQIASRYQVASQTEHTYAHHLLMEQALKYLKADGYAIFLAPNHLLTSPQ
SDLLKSWLKYNASLVAMIALPEKLFASASQAKTVFVLQKQKNIKAEPFVYALADLQNHEEITRFRESFQKWRKVSEN
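
The FASTA header used in this record packs several fields into one line. A minimal parsing sketch follows; the field names (`ntdb_id`, `locus_tag`, and so on) are assumptions chosen for illustration, and the pattern is inferred only from the header shown here, so other records may need adjustments.

```python
import re

# Pattern inferred from the header in this record; field names are illustrative.
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line of the form shown above into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    # Convert numeric fields from strings to integers
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=197447 WH25_RS10350 WP_046164731.1 "
    "2160332..2161285(-) (comYH) [Streptococcus gordonii strain IE35]"
)
print(parse_header(header))
```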

Nucleotide


Length: 954 bp

>NTDB_id=197447 WH25_RS10350 WP_046164731.1 2160332..2161285(-) (comYH) [Streptococcus gordonii strain IE35]
ATGAATTTTGAAAAAATCGAAGAAGCCTATACGCTCCTTTTAGAAAATGTCCAAGTCATTCAAAACAAGCTTTCAACTAA
TTTTTATGATGCCTTGATAGAGCAAAACGGTATCTATTTGGATGGGCAAACGGACTTAGAAATTGTCAAAAAGAATCATC
AGACCTTGAAGAGCTTAAAGCTAAGTAAAGAAGAATGGCGCAGAGCCTATCAGTTCATTCTGATGAAGGGTGCTCAGACA
GAGCCTTTGCAAGCCAATCATCAGTTTACACCAGATGCCATTGGTTTTCTGCTGATTTTCATCATTGACCAGCTGATGGT
AGCATCAGATATTACTCTTCTTGAGATGGGGAGCGGAACCGGAAATCTTGCAGAGACAATCCTGAACAATAGTCAGAAAG
AGATTGATTATCTAGGATTAGAGATCGATGATTTACTGATTGACCTATCTGCCAGTATTGCAGAAGTGATGTGTTCAAAG
GCACATTTTGCCCAGGGAGATGCTGTGCGCCCTCAAGTTCTAAAAGAGAGCGACTTGATTATCAGTGATCTGCCTGTCGG
TTACTATCCAGATGATCAGATTGCAAGTCGTTATCAAGTAGCCAGTCAGACAGAGCATACCTATGCTCATCATCTATTGA
TGGAGCAGGCATTAAAGTATTTGAAAGCTGATGGTTATGCTATCTTCTTAGCTCCTAATCACCTCTTGACCAGCCCGCAG
AGTGACTTGCTTAAATCCTGGCTCAAATACAATGCCAGCTTAGTAGCAATGATTGCTTTGCCAGAAAAACTCTTTGCATC
AGCCTCTCAGGCTAAGACGGTTTTTGTTTTGCAAAAACAAAAAAATATAAAAGCAGAGCCTTTTGTCTATGCATTGGCAG
ATCTACAAAATCATGAAGAAATTACTCGCTTCCGCGAAAGTTTTCAAAAATGGCGCAAAGTTAGTGAAAACTGA
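
Although the gene lies on the minus strand, the nucleotide sequence above is given in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly. The sketch below uses the standard codon assignments, which the bacterial translation table shares for non-initiator codons; `translate` is an illustrative helper, not a database function.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence and drop any trailing stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First seven and last five codons of the sequence above
print(translate("ATGAATTTTGAAAAAATCGAA"))  # MNFEKIE
print(translate("GTTAGTGAAAACTGA"))        # VSEN
```

The two fragments reproduce the start (MNFEKIE) and end (VSEN) of the protein sequence listed in this record.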

Domains


Predicted by InterProScan.

Residues 68-293


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comYH | Streptococcus mutans UA159 | 59.105 | 98.738 | 0.584
comYH | Streptococcus mutans UA140 | 59.105 | 98.738 | 0.584
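
This record does not define the Ha-value. The listed values are numerically consistent with the product of fractional identity and fractional coverage, so the sketch below uses that reading as an assumption; it is not a formula confirmed by this page, and `ha_value` is an illustrative name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: fractional identity times fractional coverage."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the comYH hits in Streptococcus mutans UA159 and UA140
print(round(ha_value(59.105, 98.738), 3))
```

Under this assumption the product rounds to 0.584, which agrees with the value reported for both hits.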


Multiple sequence alignment