Detailed information
Overview
| Name | comGA/cglA/cilD | Type | Machinery gene |
| Locus tag | H1W95_RS08860 | Genome accession | NZ_LR822030 |
| Coordinates | 1723120..1724061 (-) | Length | 313 a.a. |
| NCBI ID | WP_180483385.1 | Uniprot ID | A0AAU9H8P4 |
| Organism | Streptococcus thermophilus isolate STH_CIRM_1046 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| ICE | 1723788..1784241 | 1723120..1724061 | flank | -273 |
Gene organization within MGE regions
Location: 1723120..1784241
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| H1W95_RS08860 (STHERMO_2006) | comGA/cglA/cilD | 1723120..1724061 (-) | 942 | WP_180483385.1 | competence type IV pilus ATPase ComGA | Machinery gene |
| H1W95_RS08865 (STHERMO_2007) | - | 1724142..1724504 (-) | 363 | WP_014608746.1 | DUF1033 family protein | - |
| H1W95_RS08870 (STHERMO_2008) | rpoC | 1724663..1728301 (-) | 3639 | WP_011226631.1 | DNA-directed RNA polymerase subunit beta' | - |
| H1W95_RS08875 (STHERMO_2009) | rpoB | 1728402..1731983 (-) | 3582 | WP_023909983.1 | DNA-directed RNA polymerase subunit beta | - |
| H1W95_RS08880 (STHERMO_2010) | pbp1b | 1732344..1734767 (-) | 2424 | WP_180483387.1 | penicillin-binding protein PBP1B | - |
| H1W95_RS08885 (STHERMO_2011) | tyrS | 1734868..1736124 (+) | 1257 | WP_011681693.1 | tyrosine--tRNA ligase | - |
| H1W95_RS08890 (STHERMO_2012) | ilvC | 1736250..1737272 (-) | 1023 | WP_002952054.1 | ketol-acid reductoisomerase | - |
| H1W95_RS08895 (STHERMO_2013) | ilvN | 1737348..1737824 (-) | 477 | WP_011226635.1 | acetolactate synthase small subunit | - |
| H1W95_RS08900 (STHERMO_2014) | - | 1737817..1739517 (-) | 1701 | WP_014622011.1 | acetolactate synthase large subunit | - |
| H1W95_RS08905 (STHERMO_2015) | ilvD | 1739652..1741372 (-) | 1721 | Protein_1718 | dihydroxy-acid dehydratase | - |
| H1W95_RS08910 (STHERMO_2017) | - | 1741542..1743209 (-) | 1668 | WP_014608752.1 | DAK2 domain-containing protein | - |
| H1W95_RS08915 (STHERMO_2018) | - | 1743209..1743574 (-) | 366 | WP_011681698.1 | Asp23/Gls24 family envelope stress response protein | - |
| H1W95_RS08920 (STHERMO_2019) | - | 1743684..1744973 (-) | 1290 | WP_011681699.1 | MATE family efflux transporter | - |
| H1W95_RS08925 (STHERMO_2020) | thrC | 1745031..1746515 (-) | 1485 | WP_014608753.1 | threonine synthase | - |
| H1W95_RS10375 | adhE | 1747103..1749695 (-) | 2593 | Protein_1723 | bifunctional acetaldehyde-CoA/alcohol dehydrogenase | - |
| H1W95_RS08960 (STHERMO_2026) | - | 1749834..1751729 (-) | 1896 | WP_014608759.1 | M13 family metallopeptidase | - |
| H1W95_RS10380 | - | 1751810..1752962 (-) | 1153 | Protein_1725 | alpha-amylase family glycosyl hydrolase | - |
| H1W95_RS10750 (STHERMO_2031) | - | 1753029..1753345 (-) | 317 | Protein_1726 | PTS sugar transporter subunit IIA | - |
| H1W95_RS10755 (STHERMO_2032) | - | 1753400..1753765 (-) | 366 | WP_014608761.1 | PTS trehalose transporter subunit IIBC | - |
| H1W95_RS08990 (STHERMO_2035) | treR | 1754111..1754821 (+) | 711 | Protein_1728 | trehalose operon repressor | - |
| H1W95_RS08995 (STHERMO_2036) | - | 1754877..1755437 (-) | 561 | WP_002948266.1 | TIGR01440 family protein | - |
| H1W95_RS09000 (STHERMO_2037) | - | 1755450..1755788 (-) | 339 | WP_041829338.1 | hypothetical protein | - |
| H1W95_RS09005 (STHERMO_2038) | - | 1755821..1756177 (-) | 357 | WP_164135671.1 | hypothetical protein | - |
| H1W95_RS10385 (STHERMO_2039) | - | 1756441..1756728 (+) | 288 | WP_224102980.1 | helix-turn-helix domain-containing protein | - |
| H1W95_RS09010 (STHERMO_2040) | - | 1756745..1757431 (+) | 687 | WP_232087022.1 | IS30 family transposase | - |
| H1W95_RS09015 (STHERMO_2041) | - | 1757849..1758631 (-) | 783 | WP_180483390.1 | toll/interleukin-1 receptor domain-containing protein | - |
| H1W95_RS09860 (STHERMO_2042) | - | 1758791..1759558 (+) | 768 | WP_197926869.1 | DUF262 domain-containing protein | - |
| H1W95_RS09865 (STHERMO_2043) | - | 1759515..1759811 (+) | 297 | WP_197926870.1 | hypothetical protein | - |
| H1W95_RS10530 (STHERMO_2044) | - | 1759815..1760279 (+) | 465 | WP_197926871.1 | GmrSD restriction endonuclease domain-containing protein | - |
| H1W95_RS09875 (STHERMO_2045) | - | 1760263..1760628 (+) | 366 | WP_197926872.1 | GmrSD restriction endonuclease domain-containing protein | - |
| H1W95_RS09025 (STHERMO_2047) | - | 1761255..1761740 (+) | 486 | WP_232087023.1 | hypothetical protein | - |
| H1W95_RS09030 (STHERMO_2048) | - | 1761757..1762059 (+) | 303 | WP_167401585.1 | hypothetical protein | - |
| H1W95_RS10760 | - | 1762220..1762360 (+) | 141 | WP_180483615.1 | helix-turn-helix domain-containing protein | - |
| H1W95_RS09040 (STHERMO_2050) | - | 1762353..1763450 (+) | 1098 | WP_179959070.1 | DNA cytosine methyltransferase | - |
| H1W95_RS09045 (STHERMO_2051) | - | 1763437..1764663 (+) | 1227 | WP_232087024.1 | DNA cytosine methyltransferase | - |
| H1W95_RS09050 (STHERMO_2052) | - | 1764688..1765731 (-) | 1044 | WP_180483392.1 | restriction endonuclease PLD domain-containing protein | - |
| H1W95_RS09055 (STHERMO_2054) | - | 1765924..1766829 (-) | 906 | WP_180483618.1 | restriction endonuclease PLD domain-containing protein | - |
| H1W95_RS09060 | - | 1766933..1767720 (-) | 788 | Protein_1746 | ISL3 family transposase | - |
| H1W95_RS09065 | - | 1767848..1768105 (-) | 258 | Protein_1747 | hypothetical protein | - |
| H1W95_RS09070 (STHERMO_2057) | - | 1768255..1768941 (-) | 687 | WP_180483395.1 | helix-turn-helix domain-containing protein | - |
| H1W95_RS09075 (STHERMO_2058) | - | 1768950..1769810 (-) | 861 | WP_180483397.1 | ImmA/IrrE family metallo-endopeptidase | - |
| H1W95_RS09080 (STHERMO_2060) | - | 1770303..1770491 (-) | 189 | WP_180483399.1 | helix-turn-helix transcriptional regulator | - |
| H1W95_RS09085 (STHERMO_2061) | - | 1770497..1770838 (-) | 342 | WP_180483401.1 | helix-turn-helix domain-containing protein | - |
| H1W95_RS09090 (STHERMO_2063) | - | 1771701..1771997 (+) | 297 | WP_073945520.1 | cytoplasmic protein | - |
| H1W95_RS09095 (STHERMO_2064) | - | 1772019..1772480 (+) | 462 | WP_084831216.1 | conjugal transfer protein | - |
| H1W95_RS09100 (STHERMO_2065) | - | 1772494..1774182 (+) | 1689 | WP_180483403.1 | FtsK/SpoIIIE domain-containing protein | - |
| H1W95_RS09105 (STHERMO_2066) | - | 1774390..1775622 (+) | 1233 | WP_180483405.1 | replication initiation factor domain-containing protein | - |
| H1W95_RS09110 (STHERMO_2067) | - | 1775637..1775867 (+) | 231 | WP_021144867.1 | hypothetical protein | - |
| H1W95_RS09115 (STHERMO_2068) | - | 1775871..1776428 (+) | 558 | WP_045769331.1 | hypothetical protein | - |
| H1W95_RS09120 (STHERMO_2069) | - | 1776440..1777435 (+) | 996 | WP_180483407.1 | conjugal transfer protein | - |
| H1W95_RS09125 (STHERMO_2070) | - | 1777445..1777669 (+) | 225 | WP_084831221.1 | hypothetical protein | - |
| H1W95_RS09130 (STHERMO_2071) | - | 1777672..1778082 (+) | 411 | WP_180483409.1 | conjugal transfer protein | - |
| H1W95_RS09135 (STHERMO_2072) | - | 1778096..1780600 (+) | 2505 | WP_180483411.1 | ATP-binding protein | - |
| H1W95_RS09140 (STHERMO_2073) | - | 1780612..1782492 (+) | 1881 | WP_180483413.1 | conjugal transfer protein | - |
| H1W95_RS09145 (STHERMO_2074) | - | 1782494..1782718 (+) | 225 | WP_155222550.1 | hypothetical protein | - |
| H1W95_RS09150 (STHERMO_2075) | - | 1782744..1783856 (+) | 1113 | WP_180483415.1 | CHAP domain-containing protein | - |
| H1W95_RS09155 (STHERMO_2076) | - | 1783870..1784118 (+) | 249 | WP_037611155.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 313 a.a. Molecular weight: 35552.76 Da Isoelectric Point: 6.8365
>NTDB_id=1131492 H1W95_RS08860 WP_180483385.1 1723120..1724061(-) (comGA/cglA/cilD) [Streptococcus thermophilus isolate STH_CIRM_1046]
MVTEFAKEMIKNADSCGAQDIYVIPRQDNYELYMRVGQERRLIDVYRPDFMASLIGHFKFVARMMVGEKRRSQLGSCDYD
CGDGHLVSLRLSTVGDYRGLESLVIRILHSERRELVYWNQGIQPIKDALDYRGLYLFAGPVGSGKTTLMHELVQERFKGQ
QVISIEDPVEIKRDNVLQLQVNQAIDMTYDSLIKLSLRHRPDVLIIGEIRDKETARAVIRASLTGVTVLSTIHAKSIAGV
YERLLDLGVDKSELDNALQGISYMRLIKGGGVIDFARENFQNHSSTNWNQQLEGLVKQGCLTERDIQGEKIKD
MVTEFAKEMIKNADSCGAQDIYVIPRQDNYELYMRVGQERRLIDVYRPDFMASLIGHFKFVARMMVGEKRRSQLGSCDYD
CGDGHLVSLRLSTVGDYRGLESLVIRILHSERRELVYWNQGIQPIKDALDYRGLYLFAGPVGSGKTTLMHELVQERFKGQ
QVISIEDPVEIKRDNVLQLQVNQAIDMTYDSLIKLSLRHRPDVLIIGEIRDKETARAVIRASLTGVTVLSTIHAKSIAGV
YERLLDLGVDKSELDNALQGISYMRLIKGGGVIDFARENFQNHSSTNWNQQLEGLVKQGCLTERDIQGEKIKD
Nucleotide
Download Length: 942 bp
>NTDB_id=1131492 H1W95_RS08860 WP_180483385.1 1723120..1724061(-) (comGA/cglA/cilD) [Streptococcus thermophilus isolate STH_CIRM_1046]
ATGGTAACAGAATTTGCTAAGGAAATGATCAAAAATGCTGATAGTTGTGGGGCTCAAGACATCTATGTCATTCCACGTCA
GGATAATTATGAGCTCTATATGCGAGTCGGGCAAGAGAGGAGATTGATTGATGTATATCGGCCTGATTTCATGGCTAGTC
TTATTGGTCACTTTAAATTTGTGGCGAGAATGATGGTGGGTGAGAAGCGTAGGAGTCAACTGGGTTCATGTGACTATGAT
TGTGGTGACGGTCACTTGGTTTCTCTGCGTTTATCAACTGTTGGAGACTATCGAGGTTTGGAGAGCTTGGTTATCCGTAT
CCTACATTCCGAACGTCGAGAATTAGTGTATTGGAATCAAGGAATCCAGCCTATTAAGGATGCCTTGGATTATAGAGGAC
TGTATCTCTTTGCGGGTCCCGTAGGTTCTGGGAAGACCACACTTATGCATGAGTTGGTTCAGGAGCGTTTTAAGGGGCAG
CAGGTAATTTCGATTGAGGATCCTGTTGAAATCAAACGGGATAATGTCTTGCAACTCCAGGTTAATCAGGCGATTGATAT
GACCTATGATAGTTTGATTAAGCTATCTCTACGTCACCGTCCAGATGTTTTGATTATTGGGGAGATTCGGGATAAGGAGA
CTGCCCGGGCAGTTATTAGAGCTAGTCTAACAGGTGTAACTGTTCTTTCAACTATTCATGCTAAGAGTATTGCTGGCGTT
TACGAACGTCTCTTGGACCTTGGCGTAGATAAGTCTGAGTTGGATAATGCTCTACAAGGCATTTCCTACATGCGTTTGAT
CAAGGGAGGAGGTGTGATTGATTTTGCCAGAGAAAATTTCCAAAACCATTCGTCGACCAACTGGAATCAGCAGTTGGAAG
GTTTGGTTAAACAAGGATGTCTCACTGAGAGGGATATCCAAGGGGAAAAAATTAAAGATTAG
ATGGTAACAGAATTTGCTAAGGAAATGATCAAAAATGCTGATAGTTGTGGGGCTCAAGACATCTATGTCATTCCACGTCA
GGATAATTATGAGCTCTATATGCGAGTCGGGCAAGAGAGGAGATTGATTGATGTATATCGGCCTGATTTCATGGCTAGTC
TTATTGGTCACTTTAAATTTGTGGCGAGAATGATGGTGGGTGAGAAGCGTAGGAGTCAACTGGGTTCATGTGACTATGAT
TGTGGTGACGGTCACTTGGTTTCTCTGCGTTTATCAACTGTTGGAGACTATCGAGGTTTGGAGAGCTTGGTTATCCGTAT
CCTACATTCCGAACGTCGAGAATTAGTGTATTGGAATCAAGGAATCCAGCCTATTAAGGATGCCTTGGATTATAGAGGAC
TGTATCTCTTTGCGGGTCCCGTAGGTTCTGGGAAGACCACACTTATGCATGAGTTGGTTCAGGAGCGTTTTAAGGGGCAG
CAGGTAATTTCGATTGAGGATCCTGTTGAAATCAAACGGGATAATGTCTTGCAACTCCAGGTTAATCAGGCGATTGATAT
GACCTATGATAGTTTGATTAAGCTATCTCTACGTCACCGTCCAGATGTTTTGATTATTGGGGAGATTCGGGATAAGGAGA
CTGCCCGGGCAGTTATTAGAGCTAGTCTAACAGGTGTAACTGTTCTTTCAACTATTCATGCTAAGAGTATTGCTGGCGTT
TACGAACGTCTCTTGGACCTTGGCGTAGATAAGTCTGAGTTGGATAATGCTCTACAAGGCATTTCCTACATGCGTTTGAT
CAAGGGAGGAGGTGTGATTGATTTTGCCAGAGAAAATTTCCAAAACCATTCGTCGACCAACTGGAATCAGCAGTTGGAAG
GTTTGGTTAAACAAGGATGTCTCACTGAGAGGGATATCCAAGGGGAAAAAATTAAAGATTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGA/cglA/cilD | Streptococcus mitis NCTC 12261 |
64.952 |
99.361 |
0.645 |
| comYA | Streptococcus mutans UA159 |
64.217 |
100 |
0.642 |
| comYA | Streptococcus mutans UA140 |
64.217 |
100 |
0.642 |
| comGA/cglA/cilD | Streptococcus pneumoniae D39 |
63.666 |
99.361 |
0.633 |
| comGA/cglA/cilD | Streptococcus pneumoniae Rx1 |
63.666 |
99.361 |
0.633 |
| comGA/cglA/cilD | Streptococcus pneumoniae R6 |
63.666 |
99.361 |
0.633 |
| comGA/cglA/cilD | Streptococcus pneumoniae TIGR4 |
63.666 |
99.361 |
0.633 |
| comYA | Streptococcus gordonii str. Challis substr. CH1 |
62.62 |
100 |
0.626 |
| comGA/cglA | Streptococcus sobrinus strain NIDR 6715-7 |
60.577 |
99.681 |
0.604 |
| comGA | Lactococcus lactis subsp. cremoris KW2 |
50.804 |
99.361 |
0.505 |
| comGA | Latilactobacillus sakei subsp. sakei 23K |
38.158 |
97.125 |
0.371 |