Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGB/cglB   Type   Machinery gene
Locus tag   AB4X21_RS01140 Genome accession   NZ_CP163380
Coordinates   202293..203324 (+) Length   343 a.a.
NCBI ID   WP_369088354.1    Uniprot ID   A0AB39LFU2
Organism   Streptococcus sp. CP1998     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 197293..208324
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AB4X21_RS01110 (AB4X21_01110) - 197368..198156 (+) 789 WP_003004116.1 hypothetical protein -
  AB4X21_RS01115 (AB4X21_01115) - 198166..198306 (+) 141 WP_003004308.1 hypothetical protein -
  AB4X21_RS01120 (AB4X21_01120) - 198299..199609 (+) 1311 WP_003004511.1 glycosyltransferase -
  AB4X21_RS01125 (AB4X21_01125) - 199618..200721 (+) 1104 WP_369088023.1 glycosyl hydrolase family 8 -
  AB4X21_RS01130 (AB4X21_01130) - 200943..201332 (+) 390 WP_037607163.1 DUF1033 family protein -
  AB4X21_RS01135 (AB4X21_01135) comYA 201419..202360 (+) 942 WP_369088024.1 competence type IV pilus ATPase ComGA Machinery gene
  AB4X21_RS01140 (AB4X21_01140) comGB/cglB 202293..203324 (+) 1032 WP_369088354.1 competence type IV pilus assembly protein ComGB Machinery gene
  AB4X21_RS01145 (AB4X21_01145) comYC 203321..203638 (+) 318 WP_369088025.1 competence type IV pilus major pilin ComGC Machinery gene
  AB4X21_RS01150 (AB4X21_01150) comYD 203628..204032 (+) 405 WP_003004513.1 competence type IV pilus minor pilin ComGD Machinery gene
  AB4X21_RS01155 (AB4X21_01155) comGE 203998..204285 (+) 288 WP_369088026.1 competence type IV pilus minor pilin ComGE -
  AB4X21_RS01160 (AB4X21_01160) comGF/cglF 204275..204712 (+) 438 WP_369088027.1 competence type IV pilus minor pilin ComGF Machinery gene
  AB4X21_RS01165 (AB4X21_01165) comGG 204714..205151 (+) 438 WP_369088028.1 competence type IV pilus minor pilin ComGG -
  AB4X21_RS01170 (AB4X21_01170) comYH 205182..206135 (+) 954 WP_369088029.1 class I SAM-dependent methyltransferase Machinery gene
  AB4X21_RS01175 (AB4X21_01175) - 206187..207380 (+) 1194 WP_003004202.1 acetate kinase -
  AB4X21_RS01180 (AB4X21_01180) - 207453..208184 (+) 732 WP_369088030.1 CPBP family intramembrane glutamic endopeptidase -

Sequence


Protein


Download         Length: 343 a.a.        Molecular weight: 39278.06 Da        Isoelectric Point: 9.1983

>NTDB_id=1029398 AB4X21_RS01140 WP_369088354.1 202293..203324(+) (comGB/cglB) [Streptococcus sp. CP1998]
MSWLNRDISSWLRPKPKKLSTAKQKQIIELFLNLYSSGFHLSEVVDFLDRSHLVESRLVSQMREDLFRGRSFSEMMAGIG
FSDAVTTQLSLAELHGNLALSLEKISAYLENMRKVKKKLIEVSTYPLILLGFLVLIMLGLRNYLLPQMDAQNIGTQLISS
FPQLFLALGAGLVTFFLLGFLYYRKSGKINVFRTLSHLPFGKGMIQAYLTAYYAREWGNLIGQGLELSQIFSMMQDQKSQ
LFQEIGRDLALSLDRGQSFSETVGGYPFFKEELPLMIEYGEVKSKLGSELEIYAEKTWEDFFRRVHKAMNVIQPLVFIFV
ALVIVLLYATMLSPIYQNMEVHL

Nucleotide


Download         Length: 1032 bp        

>NTDB_id=1029398 AB4X21_RS01140 WP_369088354.1 202293..203324(+) (comGB/cglB) [Streptococcus sp. CP1998]
ATCTCTTGGCTCAATCGGGATATATCCAGCTGGCTCAGGCCCAAGCCGAAAAAATTATCTACCGCTAAACAAAAGCAAAT
CATTGAATTGTTTTTAAATCTTTATTCGAGTGGTTTTCATCTGTCTGAGGTTGTCGATTTTCTGGATCGCTCTCACCTAG
TGGAGAGTCGTTTGGTTTCCCAGATGCGAGAGGACCTTTTTCGGGGGCGGAGTTTTTCAGAGATGATGGCAGGGATCGGT
TTTTCAGATGCGGTGACGACACAGCTGTCTCTCGCTGAGCTTCATGGTAATCTTGCATTGAGCTTGGAGAAAATCAGTGC
TTACCTAGAAAACATGCGCAAGGTCAAGAAAAAGCTGATTGAGGTGAGCACCTATCCTCTTATCTTACTTGGATTTTTAG
TTCTGATTATGCTTGGCTTGCGTAATTATTTGCTCCCTCAAATGGATGCTCAAAATATTGGGACGCAATTGATCAGTTCC
TTCCCCCAACTCTTTTTGGCCTTAGGAGCGGGACTGGTGACCTTCTTCTTACTCGGCTTTCTCTATTATCGAAAGTCAGG
CAAGATCAACGTTTTTAGAACCTTGTCTCATCTGCCTTTTGGAAAAGGCATGATTCAAGCTTATTTGACAGCCTATTATG
CTAGAGAATGGGGCAATCTGATTGGGCAAGGATTGGAGTTGTCTCAGATTTTTTCCATGATGCAGGACCAAAAATCCCAG
CTTTTTCAAGAAATTGGAAGGGACTTAGCTCTTTCTTTAGACCGTGGCCAGTCTTTTTCAGAGACGGTCGGGGGGTATCC
TTTTTTCAAAGAAGAATTGCCCCTTATGATTGAATATGGTGAAGTCAAATCAAAGCTTGGAAGTGAACTAGAGATCTACG
CTGAAAAAACATGGGAAGATTTCTTTCGTCGGGTTCACAAGGCCATGAATGTGATACAACCCTTGGTGTTTATCTTTGTG
GCTCTTGTGATTGTGTTACTCTATGCGACCATGTTGTCGCCGATTTATCAAAATATGGAGGTTCATTTATGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB/cglB Streptococcus mitis SK321

65.774

97.959

0.644

  comYB Streptococcus gordonii str. Challis substr. CH1

64.431

100

0.644

  comGB/cglB Streptococcus mitis NCTC 12261

65.476

97.959

0.641

  comGB/cglB Streptococcus pneumoniae Rx1

65.179

97.959

0.638

  comGB/cglB Streptococcus pneumoniae D39

65.179

97.959

0.638

  comGB/cglB Streptococcus pneumoniae R6

65.179

97.959

0.638

  comGB/cglB Streptococcus pneumoniae TIGR4

65.179

97.959

0.638

  comYB Streptococcus mutans UA159

56.734

100

0.577

  comYB Streptococcus mutans UA140

56.734

100

0.577

  comGB Lactococcus lactis subsp. cremoris KW2

48.665

98.251

0.478


Multiple sequence alignment