Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYH
Type: Machinery gene
Locus tag: DQM67_RS09440
Genome accession: NZ_LS483383
Coordinates: 1850075..1851040 (-)
Length: 321 a.a.
NCBI ID: WP_005591540.1
UniProt ID: -
Organism: Streptococcus cristatus ATCC 51100 strain NCTC 12479
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
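The coordinates and the protein length in the overview can be cross-checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the overview above.
length_bp = gene_length_bp(1850075, 1851040)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Both numbers should agree with the lengths reported in the Sequence section of this entry.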

Genomic Context


Location: 1845075..1856040
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQM67_RS09425 (NCTC12479_01825) pbp2a 1845258..1847480 (-) 2223 WP_005591543.1 penicillin-binding protein PBP2A -
  DQM67_RS09430 (NCTC12479_01826) - 1847632..1848522 (+) 891 WP_005591542.1 RluA family pseudouridine synthase -
  DQM67_RS09435 (NCTC12479_01827) - 1848846..1850039 (-) 1194 WP_005591541.1 acetate kinase -
  DQM67_RS09440 (NCTC12479_01828) comYH 1850075..1851040 (-) 966 WP_005591540.1 class I SAM-dependent methyltransferase Machinery gene
  DQM67_RS09445 (NCTC12479_01829) comGG 1851121..1851519 (-) 399 WP_005591539.1 competence type IV pilus minor pilin ComGG -
  DQM67_RS09450 (NCTC12479_01830) comGF/cglF 1851500..1851937 (-) 438 WP_005591538.1 competence type IV pilus minor pilin ComGF Machinery gene
  DQM67_RS09455 (NCTC12479_01831) comGE 1851921..1852214 (-) 294 WP_174263560.1 competence type IV pilus minor pilin ComGE -
  DQM67_RS09460 (NCTC12479_01832) comYD 1852180..1852614 (-) 435 WP_037586086.1 competence type IV pilus minor pilin ComGD Machinery gene
  DQM67_RS09465 (NCTC12479_01833) comYC 1852574..1852891 (-) 318 WP_005591535.1 competence type IV pilus major pilin ComGC Machinery gene
  DQM67_RS09470 (NCTC12479_01834) comYB 1852888..1853904 (-) 1017 WP_255209852.1 competence type IV pilus assembly protein ComGB Machinery gene
  DQM67_RS09475 (NCTC12479_01835) comYA 1853852..1854793 (-) 942 WP_005591533.1 competence type IV pilus ATPase ComGA Machinery gene
  DQM67_RS09480 (NCTC12479_01836) - 1854879..1855250 (-) 372 WP_005591532.1 DUF1033 family protein -
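The table above can be checked for internal consistency and for overlaps between neighbouring genes. The following is a minimal sketch, assuming 1-based inclusive coordinates; the row tuples are transcribed by hand from the table and the helper names are hypothetical.

```python
# (locus tag, start, end, size in bp), transcribed from the table above.
ROWS = [
    ("DQM67_RS09425", 1845258, 1847480, 2223),
    ("DQM67_RS09430", 1847632, 1848522, 891),
    ("DQM67_RS09435", 1848846, 1850039, 1194),
    ("DQM67_RS09440", 1850075, 1851040, 966),
    ("DQM67_RS09445", 1851121, 1851519, 399),
    ("DQM67_RS09450", 1851500, 1851937, 438),
    ("DQM67_RS09455", 1851921, 1852214, 294),
    ("DQM67_RS09460", 1852180, 1852614, 435),
    ("DQM67_RS09465", 1852574, 1852891, 318),
    ("DQM67_RS09470", 1852888, 1853904, 1017),
    ("DQM67_RS09475", 1853852, 1854793, 942),
    ("DQM67_RS09480", 1854879, 1855250, 372),
]


def size_mismatches(rows):
    """Loci whose listed size differs from end - start + 1."""
    return [locus for locus, start, end, size in rows if end - start + 1 != size]


def neighbour_gaps(rows):
    """Gap in bp between consecutive genes; a negative value is an overlap."""
    ordered = sorted(rows, key=lambda r: r[1])
    return {
        (a[0], b[0]): b[1] - a[2] - 1
        for a, b in zip(ordered, ordered[1:])
    }


gaps = neighbour_gaps(ROWS)
for (left, right), gap in gaps.items():
    label = "overlap" if gap < 0 else "gap"
    print(f"{left} -> {right}: {label} of {abs(gap)} bp")
```

Short overlaps between adjacent genes on the same strand, as seen across the comYA to comGG run, are typical of tightly packed operons, although the table alone does not establish operon structure.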

Sequence


Protein


Length: 321 a.a.        Molecular weight: 36642.68 Da        Isoelectric point: 4.7292

>NTDB_id=1139533 DQM67_RS09440 WP_005591540.1 1850075..1851040(-) (comYH) [Streptococcus cristatus ATCC 51100 strain NCTC 12479]
MKFEKIERAFHLLLENVQNIQNVLGTNFYDALIEQNGIYLDGDTDLQEILKNNEKLRALHLTKEEWRRAYQFIFMKASQT
EPLQANHQFTPDSVGFLLSFLIDQLAQDERVDLLEIGSGTGNLAETLLNHTQKNMDYLGLEIDDLLIDLSASIAEVMNSK
AHFAQGDAVRPQVLKESDLIVSDLPVGYYPDDAVAARYEVASPDEHTYAHHLLMEQSLKYLKPGGYAIFLAPNNLLTSPQ
SHLLKKWLLSSAQLLAMISLPEKIFASRQNAKTIFVLRKQGESDIQPFIYPLQDLQSQDEILKFRESFQNWVKVSEIQTN
F
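The molecular weight reported for this protein can be approximated directly from the sequence. The sketch below sums standard average residue masses and adds one water molecule for the free termini. It is an illustration under that assumption, not the database's own method, and small differences from the listed value are possible depending on the mass table used.

```python
# Average residue masses in Da (amino acid minus one water molecule).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # added once for the N- and C-terminal groups

# Protein sequence copied from the FASTA record above.
PROTEIN = (
    "MKFEKIERAFHLLLENVQNIQNVLGTNFYDALIEQNGIYLDGDTDLQEILKNNEKLRALHLTKEEWRRAYQFIFMKASQT"
    "EPLQANHQFTPDSVGFLLSFLIDQLAQDERVDLLEIGSGTGNLAETLLNHTQKNMDYLGLEIDDLLIDLSASIAEVMNSK"
    "AHFAQGDAVRPQVLKESDLIVSDLPVGYYPDDAVAARYEVASPDEHTYAHHLLMEQSLKYLKPGGYAIFLAPNNLLTSPQ"
    "SHLLKKWLLSSAQLLAMISLPEKIFASRQNAKTIFVLRKQGESDIQPFIYPLQDLQSQDEILKFRESFQNWVKVSEIQTN"
    "F"
)


def molecular_weight(seq: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


mw = molecular_weight(PROTEIN)
print(f"{len(PROTEIN)} residues, approx. {mw:.2f} Da")
```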

Nucleotide


Length: 966 bp

>NTDB_id=1139533 DQM67_RS09440 WP_005591540.1 1850075..1851040(-) (comYH) [Streptococcus cristatus ATCC 51100 strain NCTC 12479]
ATGAAATTTGAAAAAATAGAACGAGCTTTTCACCTGCTTTTAGAAAATGTGCAAAATATCCAAAATGTGCTGGGGACTAA
TTTTTATGATGCCCTGATTGAGCAAAATGGGATTTATCTGGATGGAGATACAGACTTACAAGAAATTCTGAAAAATAACG
AGAAACTTCGTGCCTTGCACTTGACCAAGGAAGAGTGGCGTCGGGCTTATCAGTTTATTTTTATGAAGGCTTCTCAGACA
GAGCCGCTACAAGCCAATCATCAGTTCACACCGGATTCGGTCGGCTTCTTACTGAGTTTTTTGATTGACCAGTTGGCCCA
AGACGAGAGAGTCGATTTGCTAGAGATTGGCAGTGGAACAGGCAATTTGGCGGAGACTCTGCTCAACCACACGCAAAAGA
ATATGGATTATTTGGGCCTAGAGATTGACGACTTGCTGATTGATTTGTCTGCCAGCATTGCCGAAGTCATGAATTCTAAG
GCACACTTTGCTCAAGGAGATGCGGTGCGGCCTCAGGTTCTGAAAGAAAGCGACTTGATTGTCAGTGATTTGCCAGTGGG
CTATTACCCAGATGATGCCGTGGCGGCTCGTTATGAGGTTGCCAGTCCAGACGAGCATACTTATGCCCATCATCTCTTGA
TGGAACAATCGTTAAAATATTTAAAACCGGGGGGCTATGCTATCTTTCTCGCCCCAAATAATCTGCTGACGAGCCCACAA
AGTCATCTGCTGAAAAAATGGCTACTGTCTAGCGCTCAGCTCTTGGCAATGATTTCCCTGCCAGAAAAGATTTTCGCGAG
CCGGCAGAATGCTAAGACGATATTTGTCCTGAGAAAACAAGGCGAATCAGACATCCAGCCGTTTATCTACCCTTTGCAGG
ACTTGCAAAGTCAGGATGAAATCTTGAAGTTTCGTGAAAGTTTTCAAAACTGGGTCAAAGTTAGTGAAATTCAAACAAAT
TTTTGA
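To see how the nucleotide record relates to the protein record, the first ten codons and the final two codons can be translated with the standard genetic code. This is a minimal sketch: the codon table is built from the conventional TCAG ordering, and using the standard code for this organism is an assumption of the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 6 nt of the nucleotide record above.
n_terminus = translate("ATGAAATTTGAAAAAATAGAACGAGCTTTT")
c_terminus = translate("TTTTGA")
print(n_terminus, c_terminus)
```

The two results correspond to the start (MKFEKIERAF) and end (F, followed by a stop codon) of the protein sequence in this entry.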

Domains


Predicted by InterProScan.

(68-283)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYH Streptococcus mutans UA159 59.164 96.885 0.573
  comYH Streptococcus mutans UA140 59.164 96.885 0.573
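The Ha-value column is not defined on this page. The listed values are numerically consistent with the product of identity and coverage expressed as fractions, so the sketch below computes that product. Treating this as the definition is an assumption of the example, and the function name is hypothetical.

```python
def identity_coverage_product(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, each converted from percent to a fraction.

    Assumed here to correspond to the Ha-value column; the page does not say so.
    """
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Identity and coverage values listed for both S. mutans comYH homologs.
value = identity_coverage_product(59.164, 96.885)
print(round(value, 3))
```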


Multiple sequence alignment