Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comYB
Type: Machinery gene
Locus tag: FGL12_RS09815
Genome accession: NZ_LR594041
Coordinates: 2029713..2030747 (-)
Length: 344 a.a.
NCBI ID: WP_012130941.1
UniProt ID: A8AZH9
Organism: Streptococcus gordonii strain NCTC9124
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
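
The coordinates and lengths listed in this overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the helper names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(2029713, 2030747)   # comYB coordinates from this record
print(cds_bp, protein_length(cds_bp))    # bp, then amino acids
```

Under these assumptions the coordinates give 1035 bp and 344 residues, which agrees with the lengths reported in the Sequence section.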

Genomic Context


Location: 2024713..2035747
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
FGL12_RS11060 | - | 2024975..2025406 (-) | 432 | WP_231069241.1 | CPBP family intramembrane glutamic endopeptidase | -
FGL12_RS09780 (NCTC9124_01971) | - | 2025688..2026878 (-) | 1191 | WP_012130934.1 | acetate kinase | -
FGL12_RS09785 (NCTC9124_01972) | comYH | 2026928..2027881 (-) | 954 | WP_012130935.1 | class I SAM-dependent methyltransferase | Machinery gene
FGL12_RS09790 (NCTC9124_01973) | comGG | 2027912..2028343 (-) | 432 | WP_223341947.1 | competence type IV pilus minor pilin ComGG | -
FGL12_RS09795 (NCTC9124_01974) | comGF/cglF | 2028324..2028713 (-) | 390 | WP_241974402.1 | competence type IV pilus minor pilin ComGF | Machinery gene
FGL12_RS09800 (NCTC9124_01975) | comGE/cglE | 2028740..2029039 (-) | 300 | WP_138101351.1 | competence type IV pilus minor pilin ComGE | Machinery gene
FGL12_RS09805 (NCTC9124_01976) | comYD | 2029011..2029439 (-) | 429 | WP_012130939.1 | competence type IV pilus minor pilin ComGD | Machinery gene
FGL12_RS09810 (NCTC9124_01977) | comYC | 2029399..2029716 (-) | 318 | WP_012130940.1 | competence type IV pilus major pilin ComGC | Machinery gene
FGL12_RS09815 (NCTC9124_01978) | comYB | 2029713..2030747 (-) | 1035 | WP_012130941.1 | competence type IV pilus assembly protein ComGB | Machinery gene
FGL12_RS09820 (NCTC9124_01979) | comYA | 2030659..2031618 (-) | 960 | WP_012130942.1 | competence type IV pilus ATPase ComGA | Machinery gene
FGL12_RS09825 (NCTC9124_01980) | - | 2031720..2032088 (-) | 369 | WP_012130943.1 | DUF1033 family protein | -
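
The Location window appears to be the comYB coordinates extended by 5,000 bp on each side. That is an inference from the numbers on this page, not a documented rule. A minimal sketch, with hypothetical function names, that reproduces the window and reports how many positions comYB shares with its two neighbours:

```python
FLANK = 5000  # assumed flank size, inferred from the coordinates shown here


def context_window(start: int, end: int, flank: int = FLANK) -> tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on both sides."""
    return max(1, start - flank), end + flank


def overlap_bp(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two 1-based inclusive ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


# Coordinates copied from the genomic context table
comYC = (2029399, 2029716)
comYB = (2029713, 2030747)
comYA = (2030659, 2031618)

print(context_window(*comYB))      # window around comYB
print(overlap_bp(comYC, comYB))    # shared positions with the downstream gene
print(overlap_bp(comYB, comYA))    # shared positions with the upstream gene
```

With the listed coordinates the window comes out as 2024713..2035747, and comYB shares 4 bp with comYC and 89 bp with comYA.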

Sequence


Protein


Length: 344 a.a.; Molecular weight: 39068.14 Da; Isoelectric point: 9.1924

>NTDB_id=1127526 FGL12_RS09815 WP_012130941.1 2029713..2030747(-) (comYB) [Streptococcus gordonii strain NCTC9124]
MISFLQQDISILSKQKQKKLGTSKQKQVIELFNNLFSSGFHLAETVDFLGRSALLEKNYVQQMRQGLANGQAFSEIMASL
GFSDAVVTQLSLAELHGNLSLALLKIEEYLDNLAKVKKKLIEVATYPMMLLGFLVLIMIGLRNYLLPQLSSQNFATQLIG
HLPTIFLLTVLMLLGLTGAIYLVFKGQKRIPVYSFLARLPFVGSFVRIYLTAYYAREWGNMIGQGLELSQIFQIMQEQRS
VLFQEIGQDLGQALQNGQEFSDKIASYPFFKKELSLIIEYGEVKSKLGSELEIYALKTWEEFFGRVNRTMNLIQPLVFVF
VALMIVLLYAAMLLPLYQNMEVHL

Nucleotide


Length: 1035 bp

>NTDB_id=1127526 FGL12_RS09815 WP_012130941.1 2029713..2030747(-) (comYB) [Streptococcus gordonii strain NCTC9124]
TTGATCAGCTTCTTGCAGCAGGACATATCCATCCTGAGCAAGCAGAAGCAGAAAAAATTAGGAACCAGCAAGCAAAAACA
AGTTATTGAACTGTTTAATAATTTATTCTCTAGTGGTTTTCATCTGGCTGAGACTGTAGATTTTTTAGGTCGAAGCGCTC
TTTTGGAGAAGAACTATGTCCAGCAAATGCGTCAGGGTCTGGCTAATGGACAAGCATTTTCAGAGATTATGGCTAGTCTA
GGATTTTCTGATGCAGTAGTAACCCAGCTGTCATTGGCTGAGTTACATGGGAATTTATCGCTCGCCTTGCTAAAAATAGA
GGAGTATTTGGATAATCTTGCGAAGGTAAAAAAGAAGTTAATTGAAGTAGCAACCTATCCCATGATGCTTCTTGGCTTTT
TGGTATTGATTATGATAGGATTGAGAAATTACCTTTTACCCCAACTCAGCAGTCAAAATTTTGCCACTCAACTCATTGGC
CATTTACCGACTATTTTTCTACTAACTGTCTTAATGTTACTTGGATTAACAGGAGCGATTTATCTGGTATTCAAAGGTCA
GAAACGGATTCCTGTATACTCTTTCTTGGCCCGCCTGCCTTTTGTTGGATCCTTTGTAAGGATTTACCTAACCGCCTATT
ATGCGCGTGAATGGGGCAATATGATTGGACAAGGTTTGGAGCTCAGTCAGATTTTCCAGATTATGCAAGAGCAACGCTCG
GTTTTATTTCAGGAAATTGGCCAGGATTTGGGTCAAGCCTTGCAGAATGGTCAAGAATTTTCAGATAAAATTGCTTCTTA
CCCTTTTTTCAAAAAAGAATTGTCTCTAATCATCGAATATGGGGAAGTCAAATCCAAGCTGGGAAGTGAACTAGAAATTT
ATGCCCTTAAGACTTGGGAAGAATTTTTTGGCAGGGTCAATCGGACCATGAATCTGATTCAGCCCCTAGTCTTTGTCTTT
GTAGCCTTGATGATTGTCTTACTGTATGCGGCAATGTTATTGCCCCTTTATCAAAATATGGAGGTTCATCTATGA
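
The nucleotide record starts with TTG rather than ATG, while the protein record starts with M. A small sketch of how the two can be reconciled, assuming the standard codon assignments (which the bacterial translation table shares) and assuming that the first codon is read as methionine when it acts as the start codon. `translate_cds` is an illustrative helper, not a function from any library.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds: str, force_met: bool = True) -> str:
    """Translate a coding sequence given 5'->3' on the coding strand."""
    cds = "".join(cds.split()).upper()          # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3            # ignore a trailing partial codon
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3)]
    if residues and force_met:
        residues[0] = "M"                       # assumption: first codon is the start codon
    return "".join(residues).split("*")[0]      # stop at the first stop codon


# First 30 nt and last 9 nt of the comYB record above
print(translate_cds("TTGATCAGCTTCTTGCAGCAGGACATATCC"))
print(translate_cds("CATCTATGA", force_met=False))
```

The first call yields MISFLQQDIS and the second yields HL, matching the start and end of the protein record. Pasting the full 1035 bp sequence into `translate_cds` is a quick way to check the two records against each other.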


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A8AZH9

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



[Figure: predicted transmembrane helix probability]


Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comYB | Streptococcus gordonii str. Challis substr. CH1 | 100 | 100 | 1
comGB/cglB | Streptococcus mitis NCTC 12261 | 72.434 | 99.128 | 0.718
comGB/cglB | Streptococcus mitis SK321 | 72.619 | 97.674 | 0.709
comGB/cglB | Streptococcus pneumoniae TIGR4 | 71.131 | 97.674 | 0.695
comGB/cglB | Streptococcus pneumoniae R6 | 71.131 | 97.674 | 0.695
comGB/cglB | Streptococcus pneumoniae Rx1 | 71.131 | 97.674 | 0.695
comGB/cglB | Streptococcus pneumoniae D39 | 71.131 | 97.674 | 0.695
comYB | Streptococcus mutans UA140 | 60.35 | 99.709 | 0.602
comYB | Streptococcus mutans UA159 | 60.058 | 99.709 | 0.599
comGB | Lactococcus lactis subsp. cremoris KW2 | 51.929 | 97.965 | 0.509
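
The page does not define the Ha-value. The listed values are consistent with the product of the identity and coverage fractions, rounded to three decimals; the sketch below treats that as an assumption rather than the database's stated formula, and `ha_value` is a hypothetical name.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed Ha-value: identity fraction times coverage fraction."""
    return round(identity_pct / 100 * coverage_pct / 100, 3)


# (identity %, coverage %, listed Ha-value) copied from the table above
rows = [
    (100, 100, 1),
    (72.434, 99.128, 0.718),
    (72.619, 97.674, 0.709),
    (71.131, 97.674, 0.695),
    (60.35, 99.709, 0.602),
    (60.058, 99.709, 0.599),
    (51.929, 97.965, 0.509),
]

for identity, coverage, listed in rows:
    print(identity, coverage, listed, ha_value(identity, coverage))
```

Every distinct row of the table reproduces under this assumption, which suggests the column combines identity and coverage into a single score, so a hit with high identity over a short aligned region ranks lower.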


Multiple sequence alignment