Detailed information
Overview
| Name | comGA | Type | Machinery gene |
| Locus tag | RS401_RS21150 | Genome accession | NZ_CP135601 |
| Coordinates | 4057185..4058228 (-) | Length | 347 a.a. |
| NCBI ID | WP_001013200.1 | Uniprot ID | A0AA96SWA8 |
| Organism | Bacillus sp. SI2 | ||
| Function | dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 4052185..4063228
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| RS401_RS21105 (RS401_21105) | - | 4052612..4052812 (-) | 201 | WP_000106081.1 | YqzE family protein | - |
| RS401_RS21110 (RS401_21110) | aroK | 4052851..4053348 (-) | 498 | WP_153576739.1 | shikimate kinase AroK | - |
| RS401_RS21115 (RS401_21115) | - | 4053469..4054119 (-) | 651 | WP_000183879.1 | 2OG-Fe(II) oxygenase | - |
| RS401_RS21120 (RS401_21120) | comGG | 4054295..4054666 (-) | 372 | WP_001231193.1 | competence type IV pilus minor pilin ComGG | - |
| RS401_RS21125 (RS401_21125) | comGF | 4054663..4055133 (-) | 471 | WP_000923093.1 | competence type IV pilus minor pilin ComGF | - |
| RS401_RS21130 (RS401_21130) | comGE | 4055103..4055405 (-) | 303 | WP_000229990.1 | competence type IV pilus minor pilin ComGE | - |
| RS401_RS21135 (RS401_21135) | comGD | 4055398..4055853 (-) | 456 | WP_000810395.1 | comG operon protein ComGD | - |
| RS401_RS21140 (RS401_21140) | comGC | 4055850..4056149 (-) | 300 | WP_001178696.1 | comG operon protein ComGC | - |
| RS401_RS21145 (RS401_21145) | comGB | 4056161..4057192 (-) | 1032 | WP_000471911.1 | comG operon protein ComGB | - |
| RS401_RS21150 (RS401_21150) | comGA | 4057185..4058228 (-) | 1044 | WP_001013200.1 | competence protein ComGA | Machinery gene |
| RS401_RS21155 (RS401_21155) | - | 4058434..4059129 (+) | 696 | WP_000434117.1 | metalloregulator ArsR/SmtB family transcription factor | - |
| RS401_RS21160 (RS401_21160) | - | 4059255..4059497 (+) | 243 | WP_000440711.1 | DUF2626 domain-containing protein | - |
| RS401_RS21165 (RS401_21165) | - | 4059605..4060999 (+) | 1395 | WP_001094329.1 | L-cystine transporter | - |
| RS401_RS21170 (RS401_21170) | - | 4061174..4061581 (-) | 408 | WP_017560026.1 | hypothetical protein | - |
| RS401_RS21175 (RS401_21175) | - | 4061669..4061884 (-) | 216 | WP_001008320.1 | DUF3912 family protein | - |
| RS401_RS21180 (RS401_21180) | - | 4062138..4062449 (+) | 312 | WP_001093243.1 | hypothetical protein | - |
| RS401_RS21185 (RS401_21185) | - | 4062486..4062974 (-) | 489 | WP_000764252.1 | hypothetical protein | - |
Sequence
Protein
Download Length: 347 a.a. Molecular weight: 39412.97 Da Isoelectric Point: 9.2147
>NTDB_id=887121 RS401_RS21150 WP_001013200.1 4057185..4058228(-) (comGA) [Bacillus sp. SI2]
MNGIESFANMILKEACRVQASDLHIVPRQKDVVVQLRIGKDLMTKYCIEKEFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLRYSHGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIVTLEDPVEKRNDDVLQIQINEKAGITYEAGLKAILRHDPDVILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDARGAILRFMDFGITRQEIEQSLLAIAAQRLVELKCPFCKRKCSTLCKSMRQVRQASIYELLYGYELKQAIKEANGEC
VTYKHETLQSSIRKGYALGFLEEDVYV
MNGIESFANMILKEACRVQASDLHIVPRQKDVVVQLRIGKDLMTKYCIEKEFGEKLVSHFKFLASMDIGERRKPQNGSLY
LQMDGQEVYLRLSTLPTVYQESLVIRLHLQASIQPLSHLSLFPSTAKKLLSFLRYSHGLLVFTGPTGSGKTTTMYALLEV
IRKKKTRRIVTLEDPVEKRNDDVLQIQINEKAGITYEAGLKAILRHDPDVILVGEIRDEETAKIAIRASLTGHLVMTTLH
TNDARGAILRFMDFGITRQEIEQSLLAIAAQRLVELKCPFCKRKCSTLCKSMRQVRQASIYELLYGYELKQAIKEANGEC
VTYKHETLQSSIRKGYALGFLEEDVYV
Nucleotide
Download Length: 1044 bp
>NTDB_id=887121 RS401_RS21150 WP_001013200.1 4057185..4058228(-) (comGA) [Bacillus sp. SI2]
ATGAATGGAATTGAAAGCTTTGCGAATATGATTTTGAAAGAAGCGTGCAGGGTACAAGCGTCGGACTTACATATTGTGCC
CCGGCAGAAGGATGTAGTGGTTCAGCTGCGTATAGGAAAAGATTTAATGACGAAATATTGTATTGAAAAGGAATTTGGAG
AAAAACTTGTTTCACACTTTAAATTTTTAGCATCTATGGATATAGGGGAGAGGCGGAAGCCACAAAATGGTTCACTGTAT
TTACAAATGGATGGACAAGAAGTGTATTTGCGCCTTTCCACGCTTCCAACCGTATACCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTATTCAGCCGTTATCTCATCTTTCGTTATTTCCAAGTACAGCGAAGAAACTACTTTCTTTTTTAC
GTTATTCTCATGGATTACTCGTATTTACTGGACCGACTGGTTCAGGGAAGACAACAACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGTCGCATCGTTACACTGGAAGATCCAGTTGAAAAAAGAAATGACGATGTATTACAAATTCA
AATAAATGAAAAAGCAGGTATCACATATGAGGCTGGACTAAAGGCTATTTTGCGTCATGATCCAGATGTTATTTTAGTCG
GTGAAATTCGTGATGAAGAAACAGCGAAAATTGCTATAAGAGCAAGTTTGACTGGCCATTTAGTAATGACGACATTGCAT
ACGAATGATGCGAGAGGGGCGATACTTCGGTTCATGGATTTTGGCATAACGAGGCAAGAAATCGAACAATCTTTATTAGC
TATAGCTGCACAGCGACTTGTCGAATTAAAGTGTCCGTTTTGCAAAAGAAAGTGCTCAACTTTATGCAAATCAATGAGGC
AAGTAAGGCAAGCGAGTATTTATGAGTTGTTATATGGATATGAGTTAAAACAAGCGATTAAAGAAGCAAACGGGGAATGT
GTCACATACAAGCACGAAACATTACAATCTTCGATACGAAAAGGATACGCTTTAGGGTTTTTAGAAGAAGATGTTTATGT
TTAA
ATGAATGGAATTGAAAGCTTTGCGAATATGATTTTGAAAGAAGCGTGCAGGGTACAAGCGTCGGACTTACATATTGTGCC
CCGGCAGAAGGATGTAGTGGTTCAGCTGCGTATAGGAAAAGATTTAATGACGAAATATTGTATTGAAAAGGAATTTGGAG
AAAAACTTGTTTCACACTTTAAATTTTTAGCATCTATGGATATAGGGGAGAGGCGGAAGCCACAAAATGGTTCACTGTAT
TTACAAATGGATGGACAAGAAGTGTATTTGCGCCTTTCCACGCTTCCAACCGTATACCAAGAAAGTCTCGTTATTCGTCT
TCATTTACAAGCATCTATTCAGCCGTTATCTCATCTTTCGTTATTTCCAAGTACAGCGAAGAAACTACTTTCTTTTTTAC
GTTATTCTCATGGATTACTCGTATTTACTGGACCGACTGGTTCAGGGAAGACAACAACAATGTATGCATTATTAGAGGTA
ATTAGAAAAAAGAAAACACGTCGCATCGTTACACTGGAAGATCCAGTTGAAAAAAGAAATGACGATGTATTACAAATTCA
AATAAATGAAAAAGCAGGTATCACATATGAGGCTGGACTAAAGGCTATTTTGCGTCATGATCCAGATGTTATTTTAGTCG
GTGAAATTCGTGATGAAGAAACAGCGAAAATTGCTATAAGAGCAAGTTTGACTGGCCATTTAGTAATGACGACATTGCAT
ACGAATGATGCGAGAGGGGCGATACTTCGGTTCATGGATTTTGGCATAACGAGGCAAGAAATCGAACAATCTTTATTAGC
TATAGCTGCACAGCGACTTGTCGAATTAAAGTGTCCGTTTTGCAAAAGAAAGTGCTCAACTTTATGCAAATCAATGAGGC
AAGTAAGGCAAGCGAGTATTTATGAGTTGTTATATGGATATGAGTTAAAACAAGCGATTAAAGAAGCAAACGGGGAATGT
GTCACATACAAGCACGAAACATTACAATCTTCGATACGAAAAGGATACGCTTTAGGGTTTTTAGAAGAAGATGTTTATGT
TTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comGA | Bacillus subtilis subsp. subtilis str. 168 |
57.925 |
100 |
0.579 |
| pilB | Vibrio campbellii strain DS40M4 |
35.977 |
100 |
0.366 |