Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA/cglA   Type   Machinery gene
Locus tag   EL136_RS08700 Genome accession   NZ_LR134317
Coordinates   1836133..1837071 (-) Length   312 a.a.
NCBI ID   WP_154804229.1    Uniprot ID   A0A7Z9D3K9
Organism   Streptococcus equi subsp. zooepidemicus strain NCTC6180     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1831133..1842071
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EL136_RS08665 (NCTC6180_01778) - 1832054..1833396 (+) 1343 WP_165626751.1 IS3 family transposase -
  EL136_RS08670 (NCTC6180_01779) comGG 1833431..1833793 (-) 363 WP_154804226.1 competence type IV pilus minor pilin ComGG -
  EL136_RS08675 (NCTC6180_01780) comYF 1833771..1834205 (-) 435 WP_042669824.1 competence type IV pilus minor pilin ComGF Machinery gene
  EL136_RS08680 (NCTC6180_01781) comGE 1834192..1834482 (-) 291 WP_154804227.1 competence type IV pilus minor pilin ComGE -
  EL136_RS08685 (NCTC6180_01782) comGD 1834439..1834864 (-) 426 WP_012514783.1 competence type IV pilus minor pilin ComGD -
  EL136_RS08690 (NCTC6180_01783) comGC/cglC 1834842..1835165 (-) 324 WP_014622142.1 competence type IV pilus major pilin ComGC Machinery gene
  EL136_RS08695 (NCTC6180_01784) comYB 1835166..1836200 (-) 1035 WP_232013965.1 competence type IV pilus assembly protein ComGB Machinery gene
  EL136_RS08700 (NCTC6180_01785) comGA/cglA 1836133..1837071 (-) 939 WP_154804229.1 competence type IV pilus ATPase ComGA Machinery gene
  EL136_RS08705 (NCTC6180_01786) - 1837227..1837895 (-) 669 WP_043024949.1 ATP-binding cassette domain-containing protein -
  EL136_RS08710 (NCTC6180_01787) - 1837897..1839900 (-) 2004 WP_154804230.1 DUF1430 domain-containing protein -
  EL136_RS08715 (NCTC6180_01788) - 1840471..1841703 (-) 1233 WP_014622136.1 hypothetical protein -

Sequence


Protein


Download         Length: 312 a.a.        Molecular weight: 35284.53 Da        Isoelectric Point: 6.8988

>NTDB_id=1121293 EL136_RS08700 WP_154804229.1 1836133..1837071(-) (comGA/cglA) [Streptococcus equi subsp. zooepidemicus strain NCTC6180]
MVQELAKRLIRQAEELQAQDIYILPRGASYELLMRIGDDKRLIDVCESDRMANLISHFKFVAGMNVGEKRRCQLGACDYD
LDGTKVSLRLSAVGDYQGQESLVIRLLYHHQRQLRYWFDGLERVTSAIGARGLYLFSGPVGSGKTTLMYQLVSDYCKELQ
VISIEDPVEIKNDQLLQLQVNDSIGMTYDNLIKLSLRHRPDVLIIGEIRDTQTARAVIRASLTGAMVFSTVHAKSISGVY
ARLLELGISRVELDNSLAMVAYQRFISGGALIDCAQKEFEHHQASQWNQQIDQLLAEGHLNTRQARLEKIIQ

Nucleotide


Download         Length: 939 bp        

>NTDB_id=1121293 EL136_RS08700 WP_154804229.1 1836133..1837071(-) (comGA/cglA) [Streptococcus equi subsp. zooepidemicus strain NCTC6180]
ATGGTTCAAGAATTAGCCAAGAGGCTTATCAGGCAAGCCGAAGAATTGCAGGCTCAGGATATTTATATACTGCCAAGAGG
GGCAAGCTATGAGCTTTTGATGAGGATAGGTGATGACAAGAGGCTAATAGATGTTTGTGAAAGTGATCGGATGGCTAATC
TTATTAGTCACTTTAAGTTCGTTGCAGGAATGAATGTGGGAGAAAAAAGACGCTGCCAGCTCGGTGCTTGTGACTACGAC
CTTGATGGCACTAAGGTATCTCTACGATTATCAGCTGTGGGTGATTATCAGGGACAGGAGAGCTTGGTTATTCGTCTCTT
GTATCATCATCAACGCCAGCTCAGGTATTGGTTTGATGGTTTAGAGCGAGTCACATCAGCTATTGGAGCTAGAGGCCTTT
ACCTCTTTTCTGGACCAGTCGGATCAGGTAAGACAACCCTGATGTATCAGCTAGTCTCTGACTATTGCAAGGAGCTTCAG
GTGATCAGTATTGAGGATCCTGTGGAAATAAAAAATGACCAGCTCCTGCAATTACAGGTTAACGACAGCATTGGCATGAC
ATACGATAATCTGATTAAGCTCTCCTTGCGTCATCGTCCAGATGTGTTGATTATCGGAGAAATCAGAGATACACAGACTG
CTAGAGCGGTTATTAGGGCTAGCCTAACAGGAGCAATGGTCTTTTCAACCGTCCATGCAAAGAGTATTTCAGGCGTTTAC
GCTAGGCTGTTAGAATTAGGCATTTCTAGGGTAGAATTGGATAATAGTCTAGCAATGGTTGCTTACCAGCGATTTATTAG
TGGAGGTGCTTTAATTGATTGTGCGCAAAAGGAATTTGAACATCATCAAGCCAGCCAGTGGAATCAGCAGATTGATCAGC
TTCTTGCGGAGGGACATCTCAATACCAGGCAGGCAAGGCTTGAAAAAATTATCCAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7Z9D3K9

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA/cglA Streptococcus sobrinus strain NIDR 6715-7

68.387

99.359

0.679

  comYA Streptococcus mutans UA159

63.344

99.679

0.631

  comYA Streptococcus mutans UA140

63.344

99.679

0.631

  comYA Streptococcus gordonii str. Challis substr. CH1

61.29

99.359

0.609

  comGA/cglA/cilD Streptococcus mitis NCTC 12261

61.29

99.359

0.609

  comGA/cglA/cilD Streptococcus pneumoniae Rx1

60.968

99.359

0.606

  comGA/cglA/cilD Streptococcus pneumoniae D39

60.968

99.359

0.606

  comGA/cglA/cilD Streptococcus pneumoniae R6

60.968

99.359

0.606

  comGA/cglA/cilD Streptococcus pneumoniae TIGR4

60.968

99.359

0.606

  comGA Lactococcus lactis subsp. cremoris KW2

52.733

99.679

0.526

  comGA Latilactobacillus sakei subsp. sakei 23K

39.322

94.551

0.372


Multiple sequence alignment