Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | EH233_RS01060 | Genome accession | NZ_CP034058 |
| Coordinates | 235591..237219 (-) | Length | 542 a.a. |
| NCBI ID | WP_041456500.1 | Uniprot ID | - |
| Organism | Anabaena sp. YBS01 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 230591..242219
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| EH233_RS01040 (EH233_01040) | gltX | 231605..233050 (+) | 1446 | WP_011321519.1 | glutamate--tRNA ligase | - |
| EH233_RS01045 (EH233_01045) | - | 233134..233802 (-) | 669 | WP_011321518.1 | 2OG-Fe dioxygenase family protein | - |
| EH233_RS01050 (EH233_01050) | sfsA | 234274..234999 (+) | 726 | WP_011321517.1 | DNA/RNA nuclease SfsA | - |
| EH233_RS01055 (EH233_01055) | - | 235008..235490 (+) | 483 | WP_041456504.1 | gluconokinase | - |
| EH233_RS01060 (EH233_01060) | comA | 235591..237219 (-) | 1629 | WP_041456500.1 | DUF655 domain-containing protein | Machinery gene |
| EH233_RS01065 (EH233_01065) | - | 237333..238502 (+) | 1170 | WP_011321514.1 | cysteine desulfurase family protein | - |
| EH233_RS01070 (EH233_01070) | - | 238519..240549 (-) | 2031 | WP_011321513.1 | serine/threonine-protein kinase | - |
Sequence
Protein
Download Length: 542 a.a. Molecular weight: 61021.15 Da Isoelectric Point: 7.8981
>NTDB_id=328627 EH233_RS01060 WP_041456500.1 235591..237219(-) (comA) [Anabaena sp. YBS01]
MRIFPAFRNFWVFFLIVAIAACQKVQSHNNRPAPLPQDSFVKVYFNQSESSEYREPYRQQTRLGDNLEQQIIDAISQAKS
TIDVAVQELRLPRIAQALKDKQKAGIKVRVILENTYTRSLSNLTPDEVKKLPEREQARYQEYFKFVDLNQDNQLSPEEVN
QRDALIILQNAKIPWIDDQADGSAGSKLMHHKFVVVDNRIVIVTSANFTLSDVFGDFSNSSSLGNANNLLHIDSPELAAL
VTEEFNLMWGDGVGGKPDSKFGLNKPVRPPQKITLGDNTITVHFSPTSPTLPWTQSSNGLINESLNLANKSIDMALFVFS
EQRLANTLEKRHQQQVSIRALIDKQFAYRYYSEALDMMGIALGNKCRYEIDNRPWSNPVTTVGVPTLREGDLLHHKFSVI
DNQTVITGSHNWSDAANHGNDETLIVINNPTIAAHYEREFARLYAKAQVGVPAKVQAQIQQEQKQCGQIKTPTSSELTPT
QVVNINTANLAELETLPGVGKKLAQKIITARQQRKFVSSQDLDKVPGISPKMIENWQGRIQF
MRIFPAFRNFWVFFLIVAIAACQKVQSHNNRPAPLPQDSFVKVYFNQSESSEYREPYRQQTRLGDNLEQQIIDAISQAKS
TIDVAVQELRLPRIAQALKDKQKAGIKVRVILENTYTRSLSNLTPDEVKKLPEREQARYQEYFKFVDLNQDNQLSPEEVN
QRDALIILQNAKIPWIDDQADGSAGSKLMHHKFVVVDNRIVIVTSANFTLSDVFGDFSNSSSLGNANNLLHIDSPELAAL
VTEEFNLMWGDGVGGKPDSKFGLNKPVRPPQKITLGDNTITVHFSPTSPTLPWTQSSNGLINESLNLANKSIDMALFVFS
EQRLANTLEKRHQQQVSIRALIDKQFAYRYYSEALDMMGIALGNKCRYEIDNRPWSNPVTTVGVPTLREGDLLHHKFSVI
DNQTVITGSHNWSDAANHGNDETLIVINNPTIAAHYEREFARLYAKAQVGVPAKVQAQIQQEQKQCGQIKTPTSSELTPT
QVVNINTANLAELETLPGVGKKLAQKIITARQQRKFVSSQDLDKVPGISPKMIENWQGRIQF
Nucleotide
Download Length: 1629 bp
>NTDB_id=328627 EH233_RS01060 WP_041456500.1 235591..237219(-) (comA) [Anabaena sp. YBS01]
GTGCGGATTTTCCCAGCATTTAGGAATTTTTGGGTATTTTTTTTGATAGTGGCGATCGCCGCCTGTCAAAAAGTCCAATC
TCACAATAATCGTCCTGCACCTCTACCGCAAGACTCATTTGTGAAAGTTTACTTTAATCAATCCGAATCCTCAGAATATC
GAGAACCTTACCGTCAACAAACTCGACTGGGAGATAACTTAGAACAGCAGATTATTGACGCTATTTCTCAAGCTAAATCT
ACTATCGATGTAGCAGTACAAGAATTGCGTTTACCGAGAATCGCCCAAGCCCTCAAAGACAAACAAAAAGCGGGAATCAA
AGTCAGAGTAATTTTAGAAAATACCTATACTCGTTCTTTGAGTAACTTGACACCAGATGAAGTCAAGAAATTACCTGAAC
GGGAACAAGCACGCTATCAAGAATACTTTAAATTTGTAGACCTAAACCAAGATAATCAACTCAGTCCTGAGGAAGTTAAT
CAGAGGGATGCACTGATAATTTTACAAAATGCCAAAATTCCTTGGATAGATGATCAAGCTGATGGTTCAGCAGGTAGTAA
GTTGATGCACCATAAGTTTGTGGTTGTAGATAATCGCATAGTAATTGTGACTTCGGCAAACTTCACCTTAAGCGACGTTT
TCGGGGATTTCTCTAATTCTTCAAGTTTGGGAAATGCCAACAACCTATTACACATTGATAGCCCAGAATTAGCAGCTTTG
GTCACAGAAGAATTCAACCTCATGTGGGGTGATGGTGTTGGAGGTAAACCAGACAGTAAATTCGGTTTAAATAAACCTGT
ACGTCCTCCCCAAAAAATTACCTTGGGTGACAACACAATTACTGTGCATTTTTCCCCAACTTCACCCACCTTACCTTGGA
CTCAAAGCAGCAATGGCTTAATTAATGAAAGCTTAAATTTAGCGAATAAATCTATTGATATGGCGTTGTTTGTTTTTTCC
GAACAGCGTCTTGCTAATACATTAGAAAAACGTCATCAACAACAAGTCTCAATTCGAGCATTAATTGATAAACAATTCGC
CTATCGTTATTACAGCGAAGCTTTAGATATGATGGGAATTGCCCTGGGTAATAAATGCCGATATGAAATTGATAATCGAC
CTTGGTCTAATCCCGTTACTACGGTGGGCGTACCCACTTTACGAGAAGGAGACCTGCTACACCATAAATTTTCTGTTATC
GACAACCAAACGGTAATTACAGGTTCTCACAACTGGTCTGATGCAGCAAATCATGGCAATGATGAGACTTTGATAGTAAT
TAATAATCCCACAATTGCTGCTCATTATGAGCGTGAATTTGCTCGTCTTTACGCTAAAGCTCAAGTCGGTGTCCCAGCCA
AAGTCCAAGCACAAATTCAACAAGAACAAAAGCAATGTGGTCAAATTAAAACTCCTACTTCCAGTGAACTTACTCCTACT
CAAGTGGTGAATATCAATACAGCAAATTTGGCAGAATTGGAGACCTTACCCGGTGTAGGTAAAAAGCTAGCCCAAAAAAT
TATCACCGCCCGTCAGCAGAGAAAATTTGTCTCATCACAAGACTTGGATAAAGTACCTGGAATCAGTCCAAAGATGATAG
AAAATTGGCAAGGGCGTATTCAATTTTAG
GTGCGGATTTTCCCAGCATTTAGGAATTTTTGGGTATTTTTTTTGATAGTGGCGATCGCCGCCTGTCAAAAAGTCCAATC
TCACAATAATCGTCCTGCACCTCTACCGCAAGACTCATTTGTGAAAGTTTACTTTAATCAATCCGAATCCTCAGAATATC
GAGAACCTTACCGTCAACAAACTCGACTGGGAGATAACTTAGAACAGCAGATTATTGACGCTATTTCTCAAGCTAAATCT
ACTATCGATGTAGCAGTACAAGAATTGCGTTTACCGAGAATCGCCCAAGCCCTCAAAGACAAACAAAAAGCGGGAATCAA
AGTCAGAGTAATTTTAGAAAATACCTATACTCGTTCTTTGAGTAACTTGACACCAGATGAAGTCAAGAAATTACCTGAAC
GGGAACAAGCACGCTATCAAGAATACTTTAAATTTGTAGACCTAAACCAAGATAATCAACTCAGTCCTGAGGAAGTTAAT
CAGAGGGATGCACTGATAATTTTACAAAATGCCAAAATTCCTTGGATAGATGATCAAGCTGATGGTTCAGCAGGTAGTAA
GTTGATGCACCATAAGTTTGTGGTTGTAGATAATCGCATAGTAATTGTGACTTCGGCAAACTTCACCTTAAGCGACGTTT
TCGGGGATTTCTCTAATTCTTCAAGTTTGGGAAATGCCAACAACCTATTACACATTGATAGCCCAGAATTAGCAGCTTTG
GTCACAGAAGAATTCAACCTCATGTGGGGTGATGGTGTTGGAGGTAAACCAGACAGTAAATTCGGTTTAAATAAACCTGT
ACGTCCTCCCCAAAAAATTACCTTGGGTGACAACACAATTACTGTGCATTTTTCCCCAACTTCACCCACCTTACCTTGGA
CTCAAAGCAGCAATGGCTTAATTAATGAAAGCTTAAATTTAGCGAATAAATCTATTGATATGGCGTTGTTTGTTTTTTCC
GAACAGCGTCTTGCTAATACATTAGAAAAACGTCATCAACAACAAGTCTCAATTCGAGCATTAATTGATAAACAATTCGC
CTATCGTTATTACAGCGAAGCTTTAGATATGATGGGAATTGCCCTGGGTAATAAATGCCGATATGAAATTGATAATCGAC
CTTGGTCTAATCCCGTTACTACGGTGGGCGTACCCACTTTACGAGAAGGAGACCTGCTACACCATAAATTTTCTGTTATC
GACAACCAAACGGTAATTACAGGTTCTCACAACTGGTCTGATGCAGCAAATCATGGCAATGATGAGACTTTGATAGTAAT
TAATAATCCCACAATTGCTGCTCATTATGAGCGTGAATTTGCTCGTCTTTACGCTAAAGCTCAAGTCGGTGTCCCAGCCA
AAGTCCAAGCACAAATTCAACAAGAACAAAAGCAATGTGGTCAAATTAAAACTCCTACTTCCAGTGAACTTACTCCTACT
CAAGTGGTGAATATCAATACAGCAAATTTGGCAGAATTGGAGACCTTACCCGGTGTAGGTAAAAAGCTAGCCCAAAAAAT
TATCACCGCCCGTCAGCAGAGAAAATTTGTCTCATCACAAGACTTGGATAAAGTACCTGGAATCAGTCCAAAGATGATAG
AAAATTGGCAAGGGCGTATTCAATTTTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Synechocystis sp. PCC 6803 |
49.355 |
100 |
0.494 |