Detailed information    

In silico (bioinformatically predicted)

Overview


Name   comGB/cglB
Type   Machinery gene
Locus tag   ACD268_RS10425
Genome accession   NZ_CP168299
Coordinates   2002116..2003132 (-)
Length   338 a.a.
NCBI ID   WP_074017570.1
UniProt ID   -
Organism   Streptococcus pneumoniae strain FC1
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
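
The coordinates and protein length listed above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the span is a stop codon; the function name `cds_lengths` is illustrative and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (length in bp, protein length in a.a.) for a coding span.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    n_bp = end - start + 1
    if n_bp % 3 != 0:
        raise ValueError("span length is not a multiple of 3")
    # One codon is the stop codon and does not encode a residue.
    return n_bp, n_bp // 3 - 1


print(cds_lengths(2002116, 2003132))  # (1017, 338)
```

Under these assumptions the span 2002116..2003132 gives 1017 bp and 338 a.a., matching the lengths reported in this record.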

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1961219..2008954 2002116..2003132 within 0
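
The relative position and distance in the summary row can be reproduced from the two coordinate ranges. The sketch below is an assumption about how such a comparison might be made: only the label "within" appears in this record, so the other labels ("upstream", "downstream", "overlapping") and the distance definition are hypothetical.

```python
def relative_position(gene: tuple[int, int], mge: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene span relative to an MGE span (1-based, inclusive)."""
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        # Hypothetical label: gene lies entirely before the MGE.
        return "upstream", m_start - g_end
    if g_start > m_end:
        # Hypothetical label: gene lies entirely after the MGE.
        return "downstream", g_start - m_end
    # Hypothetical label: partial overlap with an MGE boundary.
    return "overlapping", 0


print(relative_position((2002116, 2003132), (1961219, 2008954)))  # ('within', 0)
```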


Gene organization within MGE regions


Location: 1961219..2008954
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACD268_RS10100 (ACD268_10100) lytA 1961219..1962175 (-) 957 WP_373381835.1 N-acetylmuramoyl-L-alanine amidase LytA -
  ACD268_RS10105 (ACD268_10105) - 1962179..1962511 (-) 333 WP_001186206.1 phage holin -
  ACD268_RS10110 (ACD268_10110) - 1962515..1962931 (-) 417 WP_001165344.1 phage holin family protein -
  ACD268_RS10115 (ACD268_10115) - 1962941..1963291 (-) 351 WP_000852249.1 hypothetical protein -
  ACD268_RS10120 (ACD268_10120) - 1963294..1963497 (-) 204 WP_001091107.1 hypothetical protein -
  ACD268_RS10125 (ACD268_10125) - 1963478..1963594 (-) 117 Protein_1970 dihydrodipicolinate reductase -
  ACD268_RS10130 (ACD268_10130) - 1963591..1970880 (-) 7290 WP_373381836.1 tail fiber domain-containing protein -
  ACD268_RS10135 (ACD268_10135) - 1970885..1971235 (-) 351 WP_000068025.1 DUF6711 family protein -
  ACD268_RS10140 (ACD268_10140) - 1971244..1974897 (-) 3654 WP_373381837.1 hypothetical protein -
  ACD268_RS10145 (ACD268_10145) - 1974884..1975234 (-) 351 WP_000478016.1 hypothetical protein -
  ACD268_RS10150 (ACD268_10150) - 1975273..1975653 (-) 381 WP_001185634.1 DUF6096 family protein -
  ACD268_RS10155 (ACD268_10155) - 1975658..1976071 (-) 414 WP_000880674.1 phage tail tube protein -
  ACD268_RS10160 (ACD268_10160) - 1976074..1976442 (-) 369 WP_000608235.1 hypothetical protein -
  ACD268_RS10165 (ACD268_10165) - 1976439..1976954 (-) 516 WP_000015941.1 HK97-gp10 family putative phage morphogenesis protein -
  ACD268_RS10170 (ACD268_10170) - 1976929..1977267 (-) 339 WP_050125325.1 hypothetical protein -
  ACD268_RS10175 (ACD268_10175) - 1977248..1977559 (-) 312 WP_050110663.1 phage head-tail connector protein -
  ACD268_RS10180 (ACD268_10180) - 1977561..1977749 (-) 189 WP_000669348.1 hypothetical protein -
  ACD268_RS10185 (ACD268_10185) - 1977739..1977921 (-) 183 WP_000054934.1 Rho termination factor N-terminal domain-containing protein -
  ACD268_RS10190 (ACD268_10190) - 1977933..1978778 (-) 846 WP_000123890.1 N4-gp56 family major capsid protein -
  ACD268_RS10195 (ACD268_10195) - 1978785..1979360 (-) 576 WP_050116223.1 DUF4355 domain-containing protein -
  ACD268_RS10200 (ACD268_10200) - 1979528..1979779 (-) 252 WP_050391397.1 DUF6275 family protein -
  ACD268_RS10205 (ACD268_10205) - 1979781..1980026 (-) 246 WP_000877357.1 hypothetical protein -
  ACD268_RS10210 (ACD268_10210) - 1980069..1980482 (-) 414 WP_000565276.1 HD domain-containing protein -
  ACD268_RS10215 (ACD268_10215) - 1980479..1980688 (-) 210 WP_000651747.1 hypothetical protein -
  ACD268_RS10220 (ACD268_10220) - 1980690..1982327 (-) 1638 WP_180681607.1 minor capsid protein -
  ACD268_RS10225 (ACD268_10225) - 1982236..1983705 (-) 1470 WP_373381838.1 phage portal protein -
  ACD268_RS10230 (ACD268_10230) - 1983717..1985015 (-) 1299 WP_050294909.1 PBSX family phage terminase large subunit -
  ACD268_RS10235 (ACD268_10235) - 1984993..1985433 (-) 441 WP_025173333.1 terminase small subunit -
  ACD268_RS10245 (ACD268_10245) - 1985901..1986305 (-) 405 WP_001030241.1 DUF1492 domain-containing protein -
  ACD268_RS10250 (ACD268_10250) - 1986377..1986748 (-) 372 WP_050119274.1 hypothetical protein -
  ACD268_RS10255 (ACD268_10255) - 1986745..1987182 (-) 438 WP_330783782.1 YopX family protein -
  ACD268_RS10260 (ACD268_10260) - 1987295..1987753 (-) 459 WP_000340952.1 hypothetical protein -
  ACD268_RS10265 (ACD268_10265) - 1987753..1988013 (-) 261 WP_001252173.1 DUF1372 family protein -
  ACD268_RS10270 (ACD268_10270) - 1988015..1988353 (-) 339 WP_000119455.1 hypothetical protein -
  ACD268_RS10275 (ACD268_10275) - 1988350..1988538 (-) 189 WP_001277861.1 hypothetical protein -
  ACD268_RS10280 (ACD268_10280) - 1988538..1989212 (-) 675 WP_330779091.1 DUF1642 domain-containing protein -
  ACD268_RS10285 (ACD268_10285) - 1989214..1989531 (-) 318 WP_001862972.1 hypothetical protein -
  ACD268_RS10290 (ACD268_10290) - 1989576..1989998 (-) 423 WP_000167804.1 hypothetical protein -
  ACD268_RS10295 (ACD268_10295) - 1990042..1990470 (-) 429 WP_000779142.1 RusA family crossover junction endodeoxyribonuclease -
  ACD268_RS10300 (ACD268_10300) - 1990467..1990796 (-) 330 WP_138033040.1 hypothetical protein -
  ACD268_RS10305 (ACD268_10305) - 1990810..1991019 (-) 210 WP_000455269.1 hypothetical protein -
  ACD268_RS10310 (ACD268_10310) - 1991021..1991716 (-) 696 WP_050099130.1 site-specific DNA-methyltransferase -
  ACD268_RS10315 (ACD268_10315) ssbA 1991729..1992145 (-) 417 WP_000609559.1 single-stranded DNA-binding protein Machinery gene
  ACD268_RS10320 (ACD268_10320) - 1992240..1992575 (-) 336 WP_000598346.1 sporulation protein Cse60 -
  ACD268_RS10325 (ACD268_10325) - 1992568..1992891 (-) 324 WP_373382344.1 hypothetical protein -
  ACD268_RS10330 (ACD268_10330) - 1993059..1994099 (-) 1041 WP_001157037.1 DUF1351 domain-containing protein -
  ACD268_RS10335 (ACD268_10335) bet 1994109..1994873 (-) 765 WP_000184008.1 phage recombination protein Bet -
  ACD268_RS10340 (ACD268_10340) - 1994885..1995115 (-) 231 WP_000192920.1 hypothetical protein -
  ACD268_RS10345 (ACD268_10345) - 1995235..1995378 (-) 144 WP_000161124.1 hypothetical protein -
  ACD268_RS10350 (ACD268_10350) - 1995365..1995625 (-) 261 WP_000471465.1 hypothetical protein -
  ACD268_RS10355 (ACD268_10355) - 1995638..1995892 (-) 255 WP_001866806.1 hypothetical protein -
  ACD268_RS10360 (ACD268_10360) - 1995893..1996114 (-) 222 WP_001862960.1 hypothetical protein -
  ACD268_RS10365 (ACD268_10365) - 1996114..1996275 (-) 162 WP_000823399.1 BOW99_gp33 family protein -
  ACD268_RS10370 (ACD268_10370) - 1996340..1996483 (-) 144 WP_000389589.1 hypothetical protein -
  ACD268_RS10375 (ACD268_10375) - 1996601..1996825 (+) 225 WP_001862956.1 hypothetical protein -
  ACD268_RS10380 (ACD268_10380) - 1997128..1997406 (-) 279 WP_000261154.1 HTH domain-containing protein -
  ACD268_RS10385 (ACD268_10385) - 1997573..1997809 (-) 237 WP_001157069.1 hypothetical protein -
  ACD268_RS10390 (ACD268_10390) - 1998001..1998291 (-) 291 WP_000167802.1 hypothetical protein -
  ACD268_RS10395 (ACD268_10395) - 1998288..1998434 (-) 147 WP_000389580.1 hypothetical protein -
  ACD268_RS10400 (ACD268_10400) - 1998829..1998951 (-) 123 WP_000343850.1 hypothetical protein -
  ACD268_RS10405 (ACD268_10405) - 1999025..1999300 (-) 276 WP_001094380.1 hypothetical protein -
  ACD268_RS10410 (ACD268_10410) - 1999461..2000213 (+) 753 WP_001023158.1 XRE family transcriptional regulator -
  ACD268_RS10415 (ACD268_10415) - 2000215..2000415 (+) 201 WP_000064302.1 hypothetical protein -
  ACD268_RS10420 (ACD268_10420) - 2000589..2002034 (+) 1446 WP_024478469.1 recombinase family protein -
  ACD268_RS10425 (ACD268_10425) comGB/cglB 2002116..2003132 (-) 1017 WP_074017570.1 competence type IV pilus assembly protein ComGB Machinery gene
  ACD268_RS10430 (ACD268_10430) comGA/cglA/cilD 2003080..2004021 (-) 942 WP_000249555.1 competence type IV pilus ATPase ComGA Machinery gene
  ACD268_RS10435 (ACD268_10435) - 2004096..2004461 (-) 366 WP_000286415.1 DUF1033 family protein -
  ACD268_RS10440 (ACD268_10440) - 2004612..2005670 (-) 1059 WP_000649466.1 zinc-dependent alcohol dehydrogenase family protein -
  ACD268_RS10445 (ACD268_10445) nagA 2005833..2006984 (-) 1152 WP_001134457.1 N-acetylglucosamine-6-phosphate deacetylase -
  ACD268_RS10450 (ACD268_10450) - 2007137..2008954 (-) 1818 WP_001220838.1 acyltransferase family protein -

Sequence


Protein


Length: 338 a.a.        Molecular weight: 38482.48 Da        Isoelectric Point: 9.6162

>NTDB_id=1043752 ACD268_RS10425 WP_074017570.1 2002116..2003132(-) (comGB/cglB) [Streptococcus pneumoniae strain FC1]
MDISQVFRLRRKKLATAKQKNIITLFNNLFSSGFHLVETISFLDRSSLLDKQCVTQMRTGLSQGKSFSEMMESLGCSSTI
VTQLSLAEVHGNLHLSLGKIEEYLDNLAKVKKKLIEVATYPLILLGFLLLIMLGLRNYLLPQLDSSNIATRIIGNLPQIF
LGMVGLVSVLALLALTFYKRSSKMSVFSILARLPFIGIFVQTYLTAYYAREWGNMISQGMELTQIFQMMQEQGSQLFKEI
GQDLAQTLKNGREFSQTIGTYPFFRKELSLIIEYGEVKSKLGSELEIYAEKTWEAFFTRVNRTMNLVQPLVFIFVALIIV
LLYAAMLMPMYQNMEVNF

Nucleotide


Length: 1017 bp

>NTDB_id=1043752 ACD268_RS10425 WP_074017570.1 2002116..2003132(-) (comGB/cglB) [Streptococcus pneumoniae strain FC1]
ATGGACATATCACAAGTCTTCAGGCTGAGACGGAAAAAATTAGCTACAGCTAAGCAAAAAAATATCATCACCCTATTTAA
CAATCTCTTTTCTAGCGGTTTTCATCTGGTGGAGACTATCTCCTTTTTAGATAGGAGTTCCTTGTTGGACAAGCAGTGTG
TGACCCAGATGCGTACAGGCTTGTCTCAGGGGAAATCATTCTCAGAAATGATGGAAAGTTTGGGATGTTCAAGTACCATT
GTCACTCAGTTATCTCTAGCTGAAGTTCATGGTAATCTCCACCTGAGTTTGGGAAAGATAGAAGAATATCTAGACAATCT
GGCCAAGGTCAAGAAAAAATTGATTGAAGTAGCGACCTATCCTTTGATTTTGCTGGGTTTTCTTCTCTTAATTATGCTGG
GGCTACGGAATTACCTGCTCCCACAACTGGATAGTAGCAATATTGCCACCCGAATTATCGGTAATCTGCCACAAATTTTT
CTAGGCATGGTAGGGCTTGTTTCCGTGCTTGCCCTTTTAGCACTAACTTTTTATAAAAGAAGTTCTAAGATGAGTGTCTT
TTCTATCTTAGCACGCCTTCCCTTTATTGGAATATTTGTGCAGACCTACTTGACAGCCTATTATGCACGTGAATGGGGGA
ATATGATTTCACAGGGAATGGAGCTGACGCAGATTTTTCAAATGATGCAGGAACAAGGTTCCCAGCTCTTTAAAGAAATC
GGTCAAGATCTGGCTCAAACCCTGAAAAATGGCCGTGAATTTTCTCAGACGATAGGAACCTATCCTTTCTTTAGGAAGGA
ATTGAGTCTCATCATAGAGTATGGGGAAGTTAAGTCCAAGCTGGGTAGTGAGTTGGAAATCTATGCTGAAAAAACTTGGG
AAGCCTTTTTTACCCGAGTCAACCGCACCATGAATTTGGTGCAGCCACTGGTTTTTATTTTTGTGGCACTGATTATCGTT
TTACTTTATGCGGCAATGCTCATGCCCATGTATCAAAATATGGAGGTAAATTTTTAA
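
The nucleotide record above is given in coding orientation (it starts with ATG and ends with TAA), so it can be translated directly to check it against the protein record. The following is a minimal sketch using the standard genetic code; the helper `translate` and the table construction are illustrative, and whether the database itself uses this exact code table is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the record above.
print(translate("ATGGACATATCACAAGTCTTCAGGCTGAGA"))  # MDISQVFRLR
```

The output matches the first ten residues of the protein sequence listed in this record. Passing the full 1017 bp sequence should yield the 338-residue protein if the two records are consistent.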


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGB/cglB Streptococcus pneumoniae Rx1 98.521 100 0.985
  comGB/cglB Streptococcus pneumoniae D39 98.521 100 0.985
  comGB/cglB Streptococcus pneumoniae R6 98.521 100 0.985
  comGB/cglB Streptococcus pneumoniae TIGR4 98.521 100 0.985
  comGB/cglB Streptococcus mitis SK321 94.97 100 0.95
  comGB/cglB Streptococcus mitis NCTC 12261 94.083 100 0.941
  comYB Streptococcus gordonii str. Challis substr. CH1 70.536 99.408 0.701
  comYB Streptococcus mutans UA140 58.176 94.083 0.547
  comYB Streptococcus mutans UA159 58.176 94.083 0.547
  comGB Lactococcus lactis subsp. cremoris KW2 50.898 98.817 0.503
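
The Ha-values listed above are numerically consistent with the product of fractional identity and fractional coverage (for example, 70.536% × 99.408% ≈ 0.701). This record does not define the Ha-value, so the formula in the sketch below is an assumption inferred from the listed numbers, and the function name `ha_value` is illustrative.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 places."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


# (identity %, coverage %, listed Ha-value) for the distinct rows above.
ROWS = [
    (98.521, 100, 0.985),
    (94.97, 100, 0.95),
    (94.083, 100, 0.941),
    (70.536, 99.408, 0.701),
    (58.176, 94.083, 0.547),
    (50.898, 98.817, 0.503),
]

# Check that the assumed formula reproduces every listed value.
print(all(ha_value(i, c) == h for i, c, h in ROWS))  # True
```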


Multiple sequence alignment