Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA/cglA/cilD   Type   Machinery gene
Locus tag   M594_RS00890 Genome accession   NZ_CP047883
Coordinates   179423..180364 (+) Length   313 a.a.
NCBI ID   WP_173875726.1    Uniprot ID   A0A6M9EXA7
Organism   Streptococcus mitis strain S022-V3-A4     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 174423..185364
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  M594_RS00870 (M594_00890) tgt 174438..175580 (+) 1143 WP_001285237.1 tRNA guanosine(34) transglycosylase Tgt -
  M594_RS00875 (M594_00895) - 175681..177498 (+) 1818 WP_173875724.1 acyltransferase family protein -
  M594_RS00880 (M594_00900) nagA 177639..178790 (+) 1152 WP_033687416.1 N-acetylglucosamine-6-phosphate deacetylase -
  M594_RS00885 (M594_00905) - 178982..179347 (+) 366 WP_173875725.1 DUF1033 family protein -
  M594_RS00890 (M594_00910) comGA/cglA/cilD 179423..180364 (+) 942 WP_173875726.1 competence type IV pilus ATPase ComGA Machinery gene
  M594_RS00895 (M594_00915) comGB/cglB 180312..181328 (+) 1017 WP_153209728.1 competence type IV pilus assembly protein ComGB Machinery gene
  M594_RS00900 (M594_00920) comGC/cglC 181330..181653 (+) 324 WP_000738641.1 comG operon protein ComGC Machinery gene
  M594_RS00905 (M594_00925) comGD/cglD 181616..181996 (+) 381 WP_173875727.1 competence type IV pilus minor pilin ComGD Machinery gene
  M594_RS00910 (M594_00930) comGE/cglE 182013..182315 (+) 303 WP_153209718.1 competence type IV pilus minor pilin ComGE Machinery gene
  M594_RS00915 (M594_00935) comGF/cglF 182278..182739 (+) 462 WP_173875728.1 competence type IV pilus minor pilin ComGF Machinery gene
  M594_RS00920 (M594_00940) comGG/cglG 182717..183130 (+) 414 WP_173875729.1 competence type IV pilus minor pilin ComGG Machinery gene
  M594_RS00925 (M594_00945) - 183163..183750 (+) 588 WP_173875730.1 class I SAM-dependent methyltransferase -
  M594_RS00930 (M594_00950) comYH 183811..184764 (+) 954 WP_173875731.1 class I SAM-dependent methyltransferase Machinery gene

Sequence


Protein


Download         Length: 313 a.a.        Molecular weight: 35530.47 Da        Isoelectric Point: 5.8157

>NTDB_id=418224 M594_RS00890 WP_173875726.1 179423..180364(+) (comGA/cglA/cilD) [Streptococcus mitis strain S022-V3-A4]
MVQEIAQEIIRSARKKGAQDIYFVPKLDAYELHMRIGDERCKIGCYDFEKFAAVISHFKFVAGMNVGEKRRSQLGSCDYA
YDQKVASLRLSTVGDYRGHESLVIRLLHDEEQDLHFWFQDIDELGKQYRQRGLYLFAGPVGSGKTTLMHELAKSIFKGQQ
VMSIEDPVEIKQEDMLQLQLNEAIGLTYENLIKLSLRHRPDLLIIGEIRDSETARAVVRASLTGATVFSTIHAKSIRGVY
ERLLELGVSEEELAVVLQGVCYQRLIGGGGIVDFANKDYQEHQPTSWNEQIDQLLKDGHITSLQAETEKISYS

Nucleotide


Download         Length: 942 bp        

>NTDB_id=418224 M594_RS00890 WP_173875726.1 179423..180364(+) (comGA/cglA/cilD) [Streptococcus mitis strain S022-V3-A4]
ATGGTTCAAGAAATTGCACAAGAAATCATTCGTTCAGCTCGGAAAAAAGGAGCGCAAGACATTTATTTTGTTCCTAAGTT
AGATGCCTATGAGCTTCATATGAGGATAGGAGACGAGCGCTGTAAAATTGGTTGTTATGATTTTGAAAAGTTTGCGGCCG
TCATCAGTCACTTTAAGTTTGTGGCGGGTATGAATGTTGGAGAAAAGCGACGTAGTCAACTTGGTTCCTGTGATTATGCC
TATGACCAGAAGGTGGCGTCCTTACGTTTATCTACTGTAGGCGACTATCGAGGGCATGAGAGTTTAGTTATTCGCTTGTT
GCATGATGAGGAGCAGGACTTGCATTTTTGGTTTCAGGATATTGATGAATTGGGCAAGCAGTATAGGCAACGGGGACTTT
ATCTTTTTGCTGGTCCAGTCGGAAGTGGTAAGACAACCCTGATGCACGAATTGGCCAAGTCCATCTTTAAAGGACAGCAA
GTCATGTCTATCGAAGATCCTGTAGAAATCAAGCAGGAGGACATGCTCCAGTTGCAGTTGAACGAAGCAATCGGACTAAC
CTATGAAAATCTGATTAAGCTTTCCCTACGGCATCGTCCTGATCTCTTGATTATCGGAGAAATTCGTGACAGCGAAACGG
CGCGTGCAGTGGTCAGAGCTAGTTTGACAGGTGCGACAGTCTTTTCAACCATTCATGCCAAGAGTATTCGAGGTGTTTAT
GAGCGCCTGCTGGAGTTGGGTGTGAGTGAGGAGGAATTAGCAGTCGTTCTGCAAGGAGTTTGTTACCAGAGATTAATCGG
GGGAGGAGGAATTGTTGACTTTGCAAACAAAGACTATCAAGAACACCAGCCAACTAGCTGGAATGAACAGATTGACCAGC
TTCTTAAAGATGGACATATCACAAGTCTTCAGGCTGAAACGGAAAAAATTAGCTACAGCTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A6M9EXA7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA/cglA/cilD Streptococcus mitis NCTC 12261

97.125

100

0.971

  comGA/cglA/cilD Streptococcus pneumoniae Rx1

95.527

100

0.955

  comGA/cglA/cilD Streptococcus pneumoniae D39

95.527

100

0.955

  comGA/cglA/cilD Streptococcus pneumoniae R6

95.527

100

0.955

  comGA/cglA/cilD Streptococcus pneumoniae TIGR4

95.527

100

0.955

  comYA Streptococcus gordonii str. Challis substr. CH1

79.677

99.042

0.789

  comYA Streptococcus mutans UA159

66.238

99.361

0.658

  comYA Streptococcus mutans UA140

66.238

99.361

0.658

  comGA/cglA Streptococcus sobrinus strain NIDR 6715-7

62.258

99.042

0.617

  comGA Lactococcus lactis subsp. cremoris KW2

55.449

99.681

0.553

  comGA Latilactobacillus sakei subsp. sakei 23K

42.279

86.901

0.367


Multiple sequence alignment