Detailed information    

In silico: bioinformatically predicted

Overview


Name: comYA
Type: Machinery gene
Locus tag: SIR_RS18235
Genome accession: NC_022246
Coordinates: 1761207..1762148 (-)
Length: 313 a.a.
NCBI ID: WP_003073980.1
UniProt ID: -
Organism: Streptococcus intermedius B196
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Genomic Context


Location: 1756207..1767148
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SIR_RS18195 (SIR_1661) - 1756258..1757457 (-) 1200 WP_021003254.1 acetate kinase -
  SIR_RS18200 (SIR_1662) comYH 1757504..1758457 (-) 954 WP_021003255.1 class I SAM-dependent methyltransferase Machinery gene
  SIR_RS18205 (SIR_1663) comGG 1758548..1758874 (-) 327 WP_021003256.1 competence type IV pilus minor pilin ComGG -
  SIR_RS18210 (SIR_1664) comGF/cglF 1758855..1759292 (-) 438 WP_003073990.1 competence type IV pilus minor pilin ComGF Machinery gene
  SIR_RS18215 (SIR_1665) comGE/cglE 1759276..1759569 (-) 294 WP_003073987.1 competence type IV pilus minor pilin ComGE Machinery gene
  SIR_RS18220 (SIR_1666) comYD 1759541..1759969 (-) 429 WP_021003257.1 competence type IV pilus minor pilin ComGD Machinery gene
  SIR_RS18225 (SIR_1667) comYC 1759929..1760246 (-) 318 WP_003075998.1 competence type IV pilus major pilin ComGC Machinery gene
  SIR_RS18230 (SIR_1668) comYB 1760243..1761265 (-) 1023 WP_021003258.1 competence type IV pilus assembly protein ComGB Machinery gene
  SIR_RS18235 (SIR_1669) comYA 1761207..1762148 (-) 942 WP_003073980.1 competence type IV pilus ATPase ComGA Machinery gene
  SIR_RS18240 (SIR_1670) - 1762217..1762582 (-) 366 WP_021003259.1 DUF1033 family protein -
  SIR_RS18245 (SIR_1671) glnA 1762804..1764150 (-) 1347 WP_021003260.1 type I glutamate--ammonia ligase -
  SIR_RS18250 (SIR_1672) - 1764189..1764548 (-) 360 WP_003073972.1 MerR family transcriptional regulator -
  SIR_RS18255 (SIR_1673) - 1764625..1765155 (-) 531 WP_009569519.1 FUSC family protein -
  SIR_RS18260 (SIR_1674) - 1765330..1766010 (-) 681 WP_003075761.1 COG3942 and LysM peptidoglycan-binding domain-containing protein -
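
The coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based, inclusive coordinates and a protein length that excludes the stop codon. The 5,000 bp flank is inferred from the numbers shown under "Location" and is not stated by the database. The function names are hypothetical helpers, not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with a single stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


def context_window(start: int, end: int, flank: int) -> tuple[int, int]:
    """Gene range widened by `flank` bp on each side (flank size is an assumption)."""
    return (max(1, start - flank), end + flank)


# comYA coordinates as listed in this record (minus strand).
COMYA = (1761207, 1762148)

cds_bp = span_length(*COMYA)
print(cds_bp, protein_length(cds_bp))   # 942 313
print(context_window(*COMYA, 5000))     # (1756207, 1767148)
```

Both results agree with the values reported here: a 942 bp gene encoding 313 a.a., inside a genomic context window of 1756207..1767148.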

Sequence


Protein


Length: 313 a.a.; Molecular weight: 36048.41 Da; Isoelectric point: 7.8529

>NTDB_id=61917 SIR_RS18235 WP_003073980.1 1761207..1762148(-) (comYA) [Streptococcus intermedius B196]
MVQNIAQDIICQAKEKQAQDIYFIPKENSYELYMRIGDERRFIQQYAFEKMAAVISHFKFTAGMNVGEKRRSQLGSCDYQ
YRDKKTSLRLSTVGDYRGYESLVIRLLHDEKAELKFWFAQIPELKGRLKQRGLYLFSGPVGSGKTTLMHQLAQERFADEQ
IMSIEDPVEIKQDRMLQLQLNEAIGLTYESLIKLSLRHRPDLLIIGEIRDTETARAVVRASLTGVTVFSTIHAKSVRGVY
ERLLELGVTEAELKIVLQGICYQRLIAKGGIVDFVSERYQEHDPSEWNRQIDQLYAAGHINLEQAKAEKIIHS

Nucleotide


Length: 942 bp

>NTDB_id=61917 SIR_RS18235 WP_003073980.1 1761207..1762148(-) (comYA) [Streptococcus intermedius B196]
ATGGTTCAAAACATTGCACAAGATATTATTTGCCAAGCTAAGGAAAAACAGGCTCAAGATATTTATTTTATCCCAAAGGA
AAATAGTTATGAACTGTACATGAGAATTGGTGATGAGCGACGTTTTATACAACAGTATGCATTTGAAAAGATGGCAGCGG
TGATTAGTCATTTCAAATTTACTGCTGGGATGAATGTTGGTGAAAAACGTAGAAGCCAACTAGGGTCTTGTGATTATCAA
TACAGGGATAAGAAAACATCACTGCGGTTGTCAACAGTAGGGGATTATCGCGGATATGAAAGTCTTGTGATTCGTTTATT
GCATGATGAAAAAGCAGAGTTGAAGTTTTGGTTTGCTCAAATTCCAGAGTTGAAAGGACGACTCAAGCAACGTGGACTTT
ATCTTTTTTCAGGACCGGTAGGGAGTGGCAAAACCACTCTTATGCATCAGCTGGCTCAAGAACGTTTTGCCGATGAGCAA
ATCATGTCTATCGAAGATCCTGTTGAAATTAAGCAAGACAGAATGTTGCAATTGCAATTGAATGAAGCGATTGGCTTAAC
TTATGAAAGTTTGATTAAACTTTCTCTTCGACATCGTCCAGATTTATTGATTATTGGAGAAATTCGAGATACAGAAACAG
CACGGGCGGTTGTACGAGCCAGCTTGACAGGAGTGACGGTCTTTTCAACGATTCATGCTAAAAGTGTGCGAGGCGTTTAT
GAACGATTGCTAGAGTTGGGAGTGACAGAAGCAGAGCTGAAAATTGTCTTGCAAGGTATTTGTTACCAACGTCTAATTGC
GAAAGGAGGAATAGTGGATTTTGTCAGTGAAAGGTATCAAGAACACGATCCAAGCGAATGGAATAGGCAAATTGATCAAC
TTTATGCAGCAGGACATATCAATCTTGAACAGGCAAAAGCGGAAAAAATTATCCACAGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein          Organism                                         Identities (%)  Coverage (%)  Ha-value
comYA            Streptococcus gordonii str. Challis substr. CH1  80.645          99.042        0.799
comGA/cglA/cilD  Streptococcus mitis NCTC 12261                   75.08           100           0.751
comGA/cglA/cilD  Streptococcus pneumoniae Rx1                     74.76           100           0.748
comGA/cglA/cilD  Streptococcus pneumoniae D39                     74.76           100           0.748
comGA/cglA/cilD  Streptococcus pneumoniae R6                      74.76           100           0.748
comGA/cglA/cilD  Streptococcus pneumoniae TIGR4                   74.76           100           0.748
comYA            Streptococcus mutans UA159                       65.595          99.361        0.652
comYA            Streptococcus mutans UA140                       65.595          99.361        0.652
comGA/cglA       Streptococcus sobrinus strain NIDR 6715-7        63.226          99.042        0.626
comGA            Lactococcus lactis subsp. cremoris KW2           54.341          99.361        0.54
comGA            Latilactobacillus sakei subsp. sakei 23K         43.678          83.387        0.364
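
The page does not define the Ha-value. The listed values are numerically consistent with identity multiplied by coverage, with both expressed as fractions. The sketch below reproduces them under that assumption. The formula and the function name `ha_value` are inferred from the numbers in this table and are not taken from the database documentation.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity % x coverage %) / 10,000, rounded to 3 decimals."""
    return round(identity_pct * coverage_pct / 10_000, 3)


# (identity %, coverage %, reported Ha-value) copied from the table in this record.
rows = [
    (80.645, 99.042, 0.799),  # S. gordonii comYA
    (75.08, 100.0, 0.751),    # S. mitis comGA/cglA/cilD
    (74.76, 100.0, 0.748),    # S. pneumoniae comGA/cglA/cilD
    (65.595, 99.361, 0.652),  # S. mutans comYA
    (63.226, 99.042, 0.626),  # S. sobrinus comGA/cglA
    (54.341, 99.361, 0.54),   # L. lactis comGA
    (43.678, 83.387, 0.364),  # L. sakei comGA
]

for identity, coverage, reported in rows:
    computed = ha_value(identity, coverage)
    # Flag any row where the assumed formula disagrees with the reported value.
    status = "ok" if abs(computed - reported) < 1e-9 else "MISMATCH"
    print(f"{identity:7.3f} {coverage:7.3f} -> {computed:.3f} (reported {reported}) {status}")
```

If the assumption holds, the Ha-value penalizes hits that are highly identical but cover only part of the query, as with the Latilactobacillus sakei entry.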


Multiple sequence alignment