Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | GSQ19_RS04065 | Genome accession | NZ_CP047242 |
| Coordinates | 1052892..1054520 (+) | Length | 542 a.a. |
| NCBI ID | WP_041456500.1 | Uniprot ID | - |
| Organism | Trichormus variabilis 0441 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 1047892..1059520
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GSQ19_RS04055 (GSQ19_04055) | - | 1049562..1051592 (+) | 2031 | WP_011321513.1 | serine/threonine-protein kinase | - |
| GSQ19_RS04060 (GSQ19_04060) | - | 1051609..1052778 (-) | 1170 | WP_011321514.1 | cysteine desulfurase family protein | - |
| GSQ19_RS04065 (GSQ19_04065) | comA | 1052892..1054520 (+) | 1629 | WP_041456500.1 | DUF655 domain-containing protein | Machinery gene |
| GSQ19_RS04070 (GSQ19_04070) | - | 1054621..1055103 (-) | 483 | WP_041456504.1 | gluconokinase | - |
| GSQ19_RS04075 (GSQ19_04075) | sfsA | 1055112..1055837 (-) | 726 | WP_011321517.1 | DNA/RNA nuclease SfsA | - |
| GSQ19_RS04080 (GSQ19_04080) | - | 1056309..1056977 (+) | 669 | WP_011321518.1 | 2OG-Fe dioxygenase family protein | - |
| GSQ19_RS04085 (GSQ19_04085) | gltX | 1057061..1058506 (-) | 1446 | WP_011321519.1 | glutamate--tRNA ligase | - |
Sequence
Protein
Download Length: 542 a.a. Molecular weight: 61021.15 Da Isoelectric Point: 7.8981
>NTDB_id=411811 GSQ19_RS04065 WP_041456500.1 1052892..1054520(+) (comA) [Trichormus variabilis 0441]
MRIFPAFRNFWVFFLIVAIAACQKVQSHNNRPAPLPQDSFVKVYFNQSESSEYREPYRQQTRLGDNLEQQIIDAISQAKS
TIDVAVQELRLPRIAQALKDKQKAGIKVRVILENTYTRSLSNLTPDEVKKLPEREQARYQEYFKFVDLNQDNQLSPEEVN
QRDALIILQNAKIPWIDDQADGSAGSKLMHHKFVVVDNRIVIVTSANFTLSDVFGDFSNSSSLGNANNLLHIDSPELAAL
VTEEFNLMWGDGVGGKPDSKFGLNKPVRPPQKITLGDNTITVHFSPTSPTLPWTQSSNGLINESLNLANKSIDMALFVFS
EQRLANTLEKRHQQQVSIRALIDKQFAYRYYSEALDMMGIALGNKCRYEIDNRPWSNPVTTVGVPTLREGDLLHHKFSVI
DNQTVITGSHNWSDAANHGNDETLIVINNPTIAAHYEREFARLYAKAQVGVPAKVQAQIQQEQKQCGQIKTPTSSELTPT
QVVNINTANLAELETLPGVGKKLAQKIITARQQRKFVSSQDLDKVPGISPKMIENWQGRIQF
MRIFPAFRNFWVFFLIVAIAACQKVQSHNNRPAPLPQDSFVKVYFNQSESSEYREPYRQQTRLGDNLEQQIIDAISQAKS
TIDVAVQELRLPRIAQALKDKQKAGIKVRVILENTYTRSLSNLTPDEVKKLPEREQARYQEYFKFVDLNQDNQLSPEEVN
QRDALIILQNAKIPWIDDQADGSAGSKLMHHKFVVVDNRIVIVTSANFTLSDVFGDFSNSSSLGNANNLLHIDSPELAAL
VTEEFNLMWGDGVGGKPDSKFGLNKPVRPPQKITLGDNTITVHFSPTSPTLPWTQSSNGLINESLNLANKSIDMALFVFS
EQRLANTLEKRHQQQVSIRALIDKQFAYRYYSEALDMMGIALGNKCRYEIDNRPWSNPVTTVGVPTLREGDLLHHKFSVI
DNQTVITGSHNWSDAANHGNDETLIVINNPTIAAHYEREFARLYAKAQVGVPAKVQAQIQQEQKQCGQIKTPTSSELTPT
QVVNINTANLAELETLPGVGKKLAQKIITARQQRKFVSSQDLDKVPGISPKMIENWQGRIQF
Nucleotide
Download Length: 1629 bp
>NTDB_id=411811 GSQ19_RS04065 WP_041456500.1 1052892..1054520(+) (comA) [Trichormus variabilis 0441]
GTGCGGATTTTCCCAGCATTTAGGAATTTTTGGGTATTTTTTTTGATAGTGGCGATCGCCGCCTGTCAAAAAGTCCAATC
TCACAATAATCGTCCTGCACCTCTACCGCAAGACTCATTTGTGAAAGTTTACTTTAATCAATCCGAATCCTCAGAATATC
GAGAACCTTACCGTCAACAAACTCGACTGGGAGATAACTTAGAACAGCAGATTATTGACGCTATTTCTCAAGCTAAATCT
ACTATCGATGTAGCAGTACAAGAATTGCGTTTACCGAGAATCGCCCAAGCCCTCAAAGACAAACAAAAAGCGGGAATCAA
AGTCAGAGTAATTTTAGAAAATACCTATACTCGTTCTTTGAGTAACTTGACACCAGATGAAGTCAAGAAATTACCTGAAC
GGGAACAAGCACGCTATCAAGAATACTTTAAATTTGTAGACCTAAACCAAGATAATCAACTCAGTCCTGAGGAAGTTAAT
CAGAGGGATGCACTGATAATTTTACAAAATGCCAAAATTCCTTGGATAGATGATCAAGCTGATGGTTCAGCAGGTAGTAA
GTTGATGCACCATAAGTTTGTGGTTGTAGATAATCGCATAGTAATTGTGACTTCGGCAAACTTCACCTTAAGCGACGTTT
TCGGGGATTTCTCTAATTCTTCAAGTTTGGGAAATGCCAACAACCTATTACACATTGATAGCCCAGAATTAGCAGCTTTG
GTCACAGAAGAATTCAACCTCATGTGGGGTGATGGTGTTGGAGGTAAACCAGACAGTAAATTCGGTTTAAATAAACCTGT
ACGTCCTCCCCAAAAAATTACCTTGGGTGACAACACAATTACTGTGCATTTTTCCCCAACTTCACCCACCTTACCTTGGA
CTCAAAGCAGCAATGGCTTAATTAATGAAAGCTTAAATTTAGCGAATAAATCTATTGATATGGCGTTGTTTGTTTTTTCC
GAACAGCGTCTTGCTAATACATTAGAAAAACGTCATCAACAACAAGTCTCAATTCGAGCATTAATTGATAAACAATTCGC
CTATCGTTATTACAGCGAAGCTTTAGATATGATGGGAATTGCCCTGGGTAATAAATGCCGATATGAAATTGATAATCGAC
CTTGGTCTAATCCCGTTACTACGGTGGGCGTACCCACTTTACGAGAAGGAGACCTGCTACACCATAAATTTTCTGTTATC
GACAACCAAACGGTAATTACAGGTTCTCACAACTGGTCTGATGCAGCAAATCATGGCAATGATGAGACTTTGATAGTAAT
TAATAATCCCACAATTGCTGCTCATTATGAGCGTGAATTTGCTCGTCTTTACGCTAAAGCTCAAGTCGGTGTCCCAGCCA
AAGTCCAAGCACAAATTCAACAAGAACAAAAGCAATGTGGTCAAATTAAAACTCCTACTTCCAGTGAACTTACTCCTACT
CAAGTGGTGAATATCAATACAGCAAATTTGGCAGAATTGGAGACCTTACCCGGTGTAGGTAAAAAGCTAGCCCAAAAAAT
TATCACCGCCCGTCAGCAGAGAAAATTTGTCTCATCACAAGACTTGGATAAAGTACCTGGAATCAGTCCAAAGATGATAG
AAAATTGGCAAGGGCGTATTCAATTTTAG
GTGCGGATTTTCCCAGCATTTAGGAATTTTTGGGTATTTTTTTTGATAGTGGCGATCGCCGCCTGTCAAAAAGTCCAATC
TCACAATAATCGTCCTGCACCTCTACCGCAAGACTCATTTGTGAAAGTTTACTTTAATCAATCCGAATCCTCAGAATATC
GAGAACCTTACCGTCAACAAACTCGACTGGGAGATAACTTAGAACAGCAGATTATTGACGCTATTTCTCAAGCTAAATCT
ACTATCGATGTAGCAGTACAAGAATTGCGTTTACCGAGAATCGCCCAAGCCCTCAAAGACAAACAAAAAGCGGGAATCAA
AGTCAGAGTAATTTTAGAAAATACCTATACTCGTTCTTTGAGTAACTTGACACCAGATGAAGTCAAGAAATTACCTGAAC
GGGAACAAGCACGCTATCAAGAATACTTTAAATTTGTAGACCTAAACCAAGATAATCAACTCAGTCCTGAGGAAGTTAAT
CAGAGGGATGCACTGATAATTTTACAAAATGCCAAAATTCCTTGGATAGATGATCAAGCTGATGGTTCAGCAGGTAGTAA
GTTGATGCACCATAAGTTTGTGGTTGTAGATAATCGCATAGTAATTGTGACTTCGGCAAACTTCACCTTAAGCGACGTTT
TCGGGGATTTCTCTAATTCTTCAAGTTTGGGAAATGCCAACAACCTATTACACATTGATAGCCCAGAATTAGCAGCTTTG
GTCACAGAAGAATTCAACCTCATGTGGGGTGATGGTGTTGGAGGTAAACCAGACAGTAAATTCGGTTTAAATAAACCTGT
ACGTCCTCCCCAAAAAATTACCTTGGGTGACAACACAATTACTGTGCATTTTTCCCCAACTTCACCCACCTTACCTTGGA
CTCAAAGCAGCAATGGCTTAATTAATGAAAGCTTAAATTTAGCGAATAAATCTATTGATATGGCGTTGTTTGTTTTTTCC
GAACAGCGTCTTGCTAATACATTAGAAAAACGTCATCAACAACAAGTCTCAATTCGAGCATTAATTGATAAACAATTCGC
CTATCGTTATTACAGCGAAGCTTTAGATATGATGGGAATTGCCCTGGGTAATAAATGCCGATATGAAATTGATAATCGAC
CTTGGTCTAATCCCGTTACTACGGTGGGCGTACCCACTTTACGAGAAGGAGACCTGCTACACCATAAATTTTCTGTTATC
GACAACCAAACGGTAATTACAGGTTCTCACAACTGGTCTGATGCAGCAAATCATGGCAATGATGAGACTTTGATAGTAAT
TAATAATCCCACAATTGCTGCTCATTATGAGCGTGAATTTGCTCGTCTTTACGCTAAAGCTCAAGTCGGTGTCCCAGCCA
AAGTCCAAGCACAAATTCAACAAGAACAAAAGCAATGTGGTCAAATTAAAACTCCTACTTCCAGTGAACTTACTCCTACT
CAAGTGGTGAATATCAATACAGCAAATTTGGCAGAATTGGAGACCTTACCCGGTGTAGGTAAAAAGCTAGCCCAAAAAAT
TATCACCGCCCGTCAGCAGAGAAAATTTGTCTCATCACAAGACTTGGATAAAGTACCTGGAATCAGTCCAAAGATGATAG
AAAATTGGCAAGGGCGTATTCAATTTTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Synechocystis sp. PCC 6803 |
49.355 |
100 |
0.494 |