Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA/cglA
Type: Machinery gene
Locus tag: DQM56_RS00705
Genome accession: NZ_LS483368
Coordinates: 122914..123852 (+)
Length: 312 a.a.
NCBI ID: WP_111710107.1
UniProt ID: -
Organism: Streptococcus equi subsp. zooepidemicus strain NCTC6176
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
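
The coordinates and lengths in this overview are mutually consistent if the coordinates are read as 1-based and inclusive and the coding sequence is assumed to include its stop codon. The sketch below only illustrates that arithmetic; the function names are hypothetical and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


cds_bp = span_length(122914, 123852)   # 939 bp
aa_len = protein_length(cds_bp)        # 312 a.a.
```

With the values from this record, the span is 939 bp and the protein is 312 a.a., matching the lengths reported on the page.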

Genomic Context


Location: 117914..128852
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQM56_RS00685 (NCTC6176_00137) - 118004..118369 (+) 366 WP_111710102.1 DUF1033 family protein -
  DQM56_RS00690 (NCTC6176_00138) - 118902..119405 (+) 504 WP_111710103.1 hypothetical protein -
  DQM56_RS00695 (NCTC6176_00139) - 120085..122088 (+) 2004 WP_111710105.1 DUF1430 domain-containing protein -
  DQM56_RS00700 (NCTC6176_00140) - 122090..122758 (+) 669 WP_043024949.1 ATP-binding cassette domain-containing protein -
  DQM56_RS00705 (NCTC6176_00141) comGA/cglA 122914..123852 (+) 939 WP_111710107.1 competence type IV pilus ATPase ComGA Machinery gene
  DQM56_RS00710 (NCTC6176_00142) comGB 123785..124818 (+) 1034 Protein_105 competence type IV pilus assembly protein ComGB -
  DQM56_RS00715 (NCTC6176_00143) comGC/cglC 124819..125142 (+) 324 WP_012514782.1 competence type IV pilus major pilin ComGC Machinery gene
  DQM56_RS00720 (NCTC6176_00144) comGD 125120..125545 (+) 426 WP_012514783.1 competence type IV pilus minor pilin ComGD -
  DQM56_RS00725 (NCTC6176_00145) comGE 125550..125792 (+) 243 WP_207622334.1 competence type IV pilus minor pilin ComGE -
  DQM56_RS00730 (NCTC6176_00146) comYF 125779..126213 (+) 435 WP_014622144.1 competence type IV pilus minor pilin ComGF Machinery gene
  DQM56_RS00735 (NCTC6176_00147) comGG 126191..126553 (+) 363 WP_012677238.1 competence type IV pilus minor pilin ComGG -
  DQM56_RS00740 (NCTC6176_00148) comYH 126615..127568 (+) 954 WP_012514788.1 class I SAM-dependent methyltransferase Machinery gene
  DQM56_RS00745 (NCTC6176_00149) - 127628..128827 (+) 1200 WP_111677653.1 acetate kinase -
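
Several adjacent comG genes in the table above have overlapping coordinates. A minimal sketch for computing the spacing between neighbouring genes follows; the tuples are copied from the comG rows of the table, coordinates are assumed to be 1-based and inclusive, and `neighbour_spacing` is a hypothetical helper name.

```python
# (gene label, start, end) copied from the comG rows above; all on the + strand.
GENES = [
    ("comGA/cglA", 122914, 123852),
    ("comGB", 123785, 124818),
    ("comGC/cglC", 124819, 125142),
    ("comGD", 125120, 125545),
    ("comGE", 125550, 125792),
    ("comYF", 125779, 126213),
    ("comGG", 126191, 126553),
    ("comYH", 126615, 127568),
]


def neighbour_spacing(genes):
    """Return (upstream, downstream, spacing_bp) for consecutive genes.

    spacing_bp is the number of bases between the two genes;
    a negative value is the size of the overlap in bp.
    """
    ordered = sorted(genes, key=lambda g: g[1])
    return [(a[0], b[0], b[1] - a[2] - 1) for a, b in zip(ordered, ordered[1:])]


for upstream, downstream, spacing in neighbour_spacing(GENES):
    print(f"{upstream} -> {downstream}: {spacing} bp")
```

On these coordinates, comGA/cglA and comGB overlap by 68 bp, while comGB and comGC/cglC are directly adjacent (spacing 0).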

Sequence


Protein


Length: 312 a.a.        Molecular weight: 35313.57 Da        Isoelectric point: 7.1374

>NTDB_id=1138875 DQM56_RS00705 WP_111710107.1 122914..123852(+) (comGA/cglA) [Streptococcus equi subsp. zooepidemicus strain NCTC6176]
MVQELAKRLIRQAEELQAQDIYILPRGASYELLMRIGDDKRLIDVCESDRMANLISHFKFVAGMNVGEKRRCQLGACDYD
LDGTKVSLRLSAVGDYQGQESLVIRLLYHHQRQLRYWFDGLERVTSAIGARGLYLFSGPVGSGKTTLMYQLVSDYCKELQ
VISIEDPVEIKNDQLLQLQVNDSIGMTYDNLIKLSLRHRPDVLIIGEIRDTQTARAVIRASLTGAMVFSTVHAKSISGVY
ARLLELGISRAELDNSLAMVTYQRFISGGALIDCAQKEFEHHQANRWNQQIDQLLAEGHLNTKQARLEKIIQ

Nucleotide


Length: 939 bp

>NTDB_id=1138875 DQM56_RS00705 WP_111710107.1 122914..123852(+) (comGA/cglA) [Streptococcus equi subsp. zooepidemicus strain NCTC6176]
ATGGTTCAAGAATTAGCCAAGAGGCTTATCAGGCAAGCCGAAGAATTGCAGGCTCAGGATATTTATATACTGCCAAGAGG
GGCAAGCTATGAGCTTTTGATGAGGATAGGTGATGACAAGAGGCTAATAGATGTTTGTGAAAGTGATCGGATGGCTAATC
TTATTAGTCACTTTAAGTTCGTTGCAGGAATGAATGTGGGAGAAAAAAGACGCTGCCAGCTCGGTGCTTGTGACTACGAC
CTTGATGGCACTAAGGTATCTCTACGATTATCAGCTGTGGGTGATTATCAGGGACAGGAGAGCTTGGTTATTCGTCTCTT
GTATCATCATCAACGCCAGCTCAGGTATTGGTTTGATGGTTTAGAGCGAGTCACATCAGCTATTGGAGCTAGAGGCCTTT
ACCTCTTTTCTGGACCAGTCGGATCAGGTAAGACAACCCTGATGTATCAGCTAGTCTCTGACTATTGCAAGGAGCTTCAG
GTGATCAGTATTGAGGATCCTGTGGAAATAAAAAATGACCAGCTCCTGCAATTACAGGTTAACGACAGCATTGGCATGAC
ATACGATAATCTGATTAAGCTCTCCTTGCGTCATCGTCCAGATGTGTTGATTATCGGAGAAATCAGAGATACACAGACTG
CTAGAGCGGTTATTAGGGCTAGCCTAACAGGAGCAATGGTCTTTTCAACCGTCCATGCAAAGAGTATTTCAGGCGTTTAC
GCTAGGCTGCTAGAATTAGGCATTTCTAGGGCAGAATTGGATAATAGTCTAGCAATGGTTACTTACCAGCGATTTATTAG
TGGAGGTGCTTTAATTGATTGTGCACAAAAGGAATTTGAACATCATCAAGCCAACCGGTGGAATCAGCAGATTGATCAGC
TTCTTGCAGAGGGACATCTCAATACCAAGCAGGCAAGGCTTGAAAAAATTATCCAGTAA
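
The nucleotide and protein records above can be cross-checked by translating the coding sequence. The sketch below is a plain-Python illustration, not the database's own method. It assumes the standard codon-to-amino-acid assignments, which the bacterial genetic code shares, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; a trailing stop codon is dropped."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks from FASTA text
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First six codons and last five codons of the nucleotide record above.
print(translate("ATGGTTCAAGAATTAGCC"))  # MVQELA
print(translate("AAAATTATCCAGTAA"))     # KIIQ
```

The two fragments translate to the start (MVQELA) and the end (KIIQ) of the protein sequence listed above; passing the full 939 bp sequence should give the full protein in the same way.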


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA/cglA Streptococcus sobrinus strain NIDR 6715-7 68.065 99.359 0.676
  comYA Streptococcus mutans UA140 63.344 99.679 0.631
  comYA Streptococcus mutans UA159 63.344 99.679 0.631
  comYA Streptococcus gordonii str. Challis substr. CH1 61.29 99.359 0.609
  comGA/cglA/cilD Streptococcus mitis NCTC 12261 61.29 99.359 0.609
  comGA/cglA/cilD Streptococcus pneumoniae Rx1 60.968 99.359 0.606
  comGA/cglA/cilD Streptococcus pneumoniae D39 60.968 99.359 0.606
  comGA/cglA/cilD Streptococcus pneumoniae R6 60.968 99.359 0.606
  comGA/cglA/cilD Streptococcus pneumoniae TIGR4 60.968 99.359 0.606
  comGA Lactococcus lactis subsp. cremoris KW2 52.733 99.679 0.526
  comGA Latilactobacillus sakei subsp. sakei 23K 39.583 92.308 0.365
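
The Ha-values listed above are numerically consistent with the product of identity and coverage, each expressed as a fraction. This is an observation from the listed rows, not a definition stated on this page. A minimal sketch under that assumption, with `ha_value` as a hypothetical helper name:

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Identity fraction times coverage fraction, rounded to 3 decimals (assumed formula)."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (identity %, coverage %, listed Ha-value) copied from rows of the table above.
rows = [
    (68.065, 99.359, 0.676),  # Streptococcus sobrinus strain NIDR 6715-7
    (63.344, 99.679, 0.631),  # Streptococcus mutans UA140 / UA159
    (52.733, 99.679, 0.526),  # Lactococcus lactis subsp. cremoris KW2
    (39.583, 92.308, 0.365),  # Latilactobacillus sakei subsp. sakei 23K
]
for identity, coverage, listed in rows:
    print(identity, coverage, ha_value(identity, coverage), listed)
```

For each of these rows the computed value equals the listed one, so the score rewards hits that are both highly identical and cover nearly the whole query.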


Multiple sequence alignment