Detailed information    

in silico: bioinformatically predicted

Overview


Name   comYA
Type   Machinery gene
Locus tag   AB4X21_RS01135
Genome accession   NZ_CP163380
Coordinates   201419..202360 (+)
Length   313 a.a.
NCBI ID   WP_369088024.1
Uniprot ID   A0AB39LDN1
Organism   Streptococcus sp. CP1998
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
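
The nucleotide and protein lengths in this record follow from the coordinates by simple arithmetic. A minimal sketch, assuming 1-based inclusive coordinates and a single trailing stop codon that is not counted in the protein length (the function names are hypothetical and not part of the database):

```python
def span_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count of a CDS, assuming it ends with one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length_bp(201419, 202360)
print(cds_bp, protein_length_aa(cds_bp))  # 942 313
```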

Genomic Context


Location: 196419..207360
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AB4X21_RS01110 (AB4X21_01110) - 197368..198156 (+) 789 WP_003004116.1 hypothetical protein -
  AB4X21_RS01115 (AB4X21_01115) - 198166..198306 (+) 141 WP_003004308.1 hypothetical protein -
  AB4X21_RS01120 (AB4X21_01120) - 198299..199609 (+) 1311 WP_003004511.1 glycosyltransferase -
  AB4X21_RS01125 (AB4X21_01125) - 199618..200721 (+) 1104 WP_369088023.1 glycosyl hydrolase family 8 -
  AB4X21_RS01130 (AB4X21_01130) - 200943..201332 (+) 390 WP_037607163.1 DUF1033 family protein -
  AB4X21_RS01135 (AB4X21_01135) comYA 201419..202360 (+) 942 WP_369088024.1 competence type IV pilus ATPase ComGA Machinery gene
  AB4X21_RS01140 (AB4X21_01140) comGB/cglB 202293..203324 (+) 1032 WP_369088354.1 competence type IV pilus assembly protein ComGB Machinery gene
  AB4X21_RS01145 (AB4X21_01145) comYC 203321..203638 (+) 318 WP_369088025.1 competence type IV pilus major pilin ComGC Machinery gene
  AB4X21_RS01150 (AB4X21_01150) comYD 203628..204032 (+) 405 WP_003004513.1 competence type IV pilus minor pilin ComGD Machinery gene
  AB4X21_RS01155 (AB4X21_01155) comGE 203998..204285 (+) 288 WP_369088026.1 competence type IV pilus minor pilin ComGE -
  AB4X21_RS01160 (AB4X21_01160) comGF/cglF 204275..204712 (+) 438 WP_369088027.1 competence type IV pilus minor pilin ComGF Machinery gene
  AB4X21_RS01165 (AB4X21_01165) comGG 204714..205151 (+) 438 WP_369088028.1 competence type IV pilus minor pilin ComGG -
  AB4X21_RS01170 (AB4X21_01170) comYH 205182..206135 (+) 954 WP_369088029.1 class I SAM-dependent methyltransferase Machinery gene
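
The spacing between adjacent genes can be read directly off the coordinates in the table above. The sketch below is illustrative only: it copies the coordinates of the eight plus-strand genes from comYA to comYH, assumes 1-based inclusive positions, and uses a hypothetical helper name (`spacing`).

```python
# (gene name, start, end) for the plus-strand genes from comYA to comYH,
# copied from the table above (1-based, inclusive coordinates assumed).
GENES = [
    ("comYA", 201419, 202360),
    ("comGB/cglB", 202293, 203324),
    ("comYC", 203321, 203638),
    ("comYD", 203628, 204032),
    ("comGE", 203998, 204285),
    ("comGF/cglF", 204275, 204712),
    ("comGG", 204714, 205151),
    ("comYH", 205182, 206135),
]


def spacing(genes):
    """Return (upstream, downstream, gap_bp) for each adjacent pair.

    gap_bp is the number of bases between the two genes; a negative
    value means the genes overlap by that many bases.
    """
    pairs = []
    for (name_a, _, end_a), (name_b, start_b, _) in zip(genes, genes[1:]):
        pairs.append((name_a, name_b, start_b - end_a - 1))
    return pairs


for up, down, gap in spacing(GENES):
    label = f"overlap {-gap} bp" if gap < 0 else f"gap {gap} bp"
    print(f"{up} -> {down}: {label}")
```

With these coordinates, the first pair (comYA and comGB/cglB) is reported as overlapping by 68 bp.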

Sequence


Protein


Length: 313 a.a.        Molecular weight: 35853.12 Da        Isoelectric Point: 5.7290

>NTDB_id=1029397 AB4X21_RS01135 WP_369088024.1 201419..202360(+) (comYA) [Streptococcus sp. CP1998]
MVQEIAKKLIRLGKKEEAQDLYFIPRKEEYQVFMRVGDERRFVQSFPFEDMTAIISHFKFAAGMNVGEKRRDQLGSCDYP
LEDGVVSIRLSTVGDYRGYESLVIRLLHDEERELQFWFDQLPDLQKKLAGRGLYLFAGPVGSGKTTLMHALAQERFADQQ
VMSIEDPVEIKQENMLQLQLNDQIGMTYDNLIKLSLRHRPDLLIIGEIRDKETARAVVRASLTGVTVFSTIHAKSVRGVY
ERLLELGVSEEELKIVLQGICYQRLIAGGGVIDFATENYQEHSASRWNQQMDLLAQSGYIQLAQAQAEKIIYR

Nucleotide


Length: 942 bp

>NTDB_id=1029397 AB4X21_RS01135 WP_369088024.1 201419..202360(+) (comYA) [Streptococcus sp. CP1998]
ATGGTTCAAGAAATTGCAAAAAAATTGATCCGCCTGGGGAAAAAGGAAGAAGCTCAGGATCTTTACTTCATTCCTCGGAA
GGAGGAGTACCAGGTTTTTATGAGGGTGGGAGATGAGAGACGTTTTGTACAGTCCTTCCCTTTTGAGGACATGACAGCTA
TTATTAGCCACTTCAAATTTGCGGCAGGGATGAATGTAGGAGAAAAAAGGCGCGACCAGCTAGGATCGTGTGATTACCCA
TTAGAGGATGGAGTGGTTTCGATTCGCTTGTCGACAGTGGGGGATTATCGTGGCTATGAAAGCTTGGTCATTCGGCTCTT
GCATGATGAGGAACGTGAATTGCAGTTTTGGTTTGACCAGTTGCCGGACTTGCAGAAGAAATTGGCTGGTCGTGGGCTTT
ATCTTTTTGCGGGTCCCGTTGGCTCGGGAAAGACCACTCTCATGCATGCGTTGGCGCAAGAGCGCTTTGCGGACCAGCAA
GTCATGTCCATTGAAGATCCTGTCGAGATCAAGCAAGAGAATATGCTTCAGCTCCAGCTGAATGACCAGATCGGCATGAC
CTATGACAACCTGATCAAACTCTCCTTGCGCCATCGGCCGGACCTTCTCATTATTGGAGAAATCCGAGATAAGGAAACAG
CTCGTGCAGTCGTACGAGCTAGTTTGACAGGAGTCACCGTCTTCTCTACTATTCATGCCAAGAGCGTTCGAGGTGTTTAT
GAGAGGCTCTTAGAACTTGGAGTGAGCGAGGAAGAGCTCAAGATAGTTTTACAAGGTATTTGCTACCAACGATTAATTGC
AGGAGGAGGTGTGATCGATTTTGCGACGGAAAACTACCAAGAGCACTCAGCCAGCCGCTGGAACCAACAAATGGATCTCT
TGGCTCAATCGGGATATATCCAGCTGGCTCAGGCCCAAGCCGAAAAAATTATCTACCGCTAA
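
The nucleotide record can be checked against the protein record by translating it in frame 1. This is a minimal sketch, assuming the bacterial genetic code (NCBI translation table 11, whose codon assignments match the standard code); the `translate` helper is hypothetical, and only the first 30 and last 12 nucleotides of the sequence above are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the comYA sequence above.
print(translate("ATGGTTCAAGAAATTGCAAAAAAATTGATC"))  # MVQEIAKKLI
print(translate("ATCTACCGCTAA"))  # IYR
```

Both fragments agree with the start (MVQEIAKKLI) and end (IYR) of the protein sequence listed in this record.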


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein          Organism                                         Identities (%)  Coverage (%)  Ha-value
  comYA            Streptococcus gordonii str. Challis substr. CH1  74.194          99.042        0.735
  comGA/cglA/cilD  Streptococcus mitis NCTC 12261                   70.513          99.681        0.703
  comGA/cglA/cilD  Streptococcus pneumoniae D39                     70.192          99.681        0.7
  comGA/cglA/cilD  Streptococcus pneumoniae R6                      70.192          99.681        0.7
  comGA/cglA/cilD  Streptococcus pneumoniae TIGR4                   70.192          99.681        0.7
  comGA/cglA/cilD  Streptococcus pneumoniae Rx1                     70.192          99.681        0.7
  comYA            Streptococcus mutans UA159                       64.309          99.361        0.639
  comYA            Streptococcus mutans UA140                       64.309          99.361        0.639
  comGA/cglA       Streptococcus sobrinus strain NIDR 6715-7        62.581          99.042        0.62
  comGA            Lactococcus lactis subsp. cremoris KW2           54.808          99.681        0.546
  comGA            Latilactobacillus sakei subsp. sakei 23K         42.086          88.818        0.374
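
The Ha-values listed above are numerically consistent with the product of identity and coverage, each taken as a fraction and rounded to three decimals. That relationship is inferred from the listed numbers and is an assumption, not a definition stated in this record; the function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Product of identity and coverage, each expressed as a fraction.

    Assumed relationship, inferred from the values in the table above.
    """
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


print(ha_value(74.194, 99.042))  # 0.735
print(ha_value(42.086, 88.818))  # 0.374
```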


Multiple sequence alignment