Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | Dongsha4_RS18425 | Genome accession | NZ_CP084098 |
| Coordinates | 4220021..4221319 (-) | Length | 432 a.a. |
| NCBI ID | WP_330203725.1 | Uniprot ID | - |
| Organism | Cyanobacterium sp. Dongsha4 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 4215021..4226319
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| Dongsha4_RS18405 (Dongsha4_18380) | trmH | 4217879..4218556 (+) | 678 | WP_330203721.1 | tRNA (guanosine(18)-2'-O)-methyltransferase TrmH | - |
| Dongsha4_RS18410 (Dongsha4_18385) | - | 4218529..4218834 (-) | 306 | WP_330203722.1 | 2Fe-2S iron-sulfur cluster-binding protein | - |
| Dongsha4_RS18415 (Dongsha4_18390) | - | 4218961..4219179 (-) | 219 | WP_330203723.1 | DUF167 domain-containing protein | - |
| Dongsha4_RS18420 (Dongsha4_18395) | - | 4219213..4219932 (-) | 720 | WP_330203724.1 | pentapeptide repeat-containing protein | - |
| Dongsha4_RS18425 (Dongsha4_18400) | comA | 4220021..4221319 (-) | 1299 | WP_330203725.1 | phospholipase D-like domain-containing protein | Machinery gene |
| Dongsha4_RS18430 (Dongsha4_18405) | - | 4221340..4222020 (-) | 681 | WP_330203726.1 | uracil-DNA glycosylase family protein | - |
| Dongsha4_RS18435 (Dongsha4_18410) | purQ | 4222125..4222805 (-) | 681 | WP_330203727.1 | phosphoribosylformylglycinamidine synthase subunit PurQ | - |
| Dongsha4_RS18440 (Dongsha4_18415) | purS | 4222826..4223089 (-) | 264 | WP_015220033.1 | phosphoribosylformylglycinamidine synthase subunit PurS | - |
| Dongsha4_RS18445 (Dongsha4_18420) | - | 4223253..4223888 (-) | 636 | WP_330203728.1 | biopolymer transporter ExbD | - |
| Dongsha4_RS18450 (Dongsha4_18425) | sodN | 4224114..4224599 (+) | 486 | WP_330203729.1 | superoxide dismutase, Ni | - |
| Dongsha4_RS18455 (Dongsha4_18430) | sodX | 4224705..4224980 (+) | 276 | WP_330203730.1 | nickel-type superoxide dismutase maturation protease | - |
| Dongsha4_RS18460 (Dongsha4_18435) | dnaJ | 4225086..4226216 (+) | 1131 | WP_330203731.1 | molecular chaperone DnaJ | - |
Sequence
Protein
Download Length: 432 a.a. Molecular weight: 48584.27 Da Isoelectric Point: 7.3168
>NTDB_id=609533 Dongsha4_RS18425 WP_330203725.1 4220021..4221319(-) (comA) [Cyanobacterium sp. Dongsha4]
MLKIISFPLGLIFSLIVLGGCHNSPPKLPPLAQEESIQVYFNHNQAKGKEYEEPYRKIKRFGDDLETVLIEQINSANQTI
DIAIQELNLARLARVIVEKKKQGVKIRIVTENNYNKPLSQLPNNHGLAILKRANIPIIDDTEDGSKGSGLMHHKFVVIDQ
KKVVTGSANFTLSGIHGDFDNQETRGNTNHLIVFNSQEMAKIFTQEFNYLWGDGVGKKKDSLFGLNKPERRTNIVNIGSS
SIVVKFSPNSRSTLWENSSNGIIARELAKANSSIDLALFVFSDQEIANTLEKESLAGVKIRVLIDPNFAYQYYSEGLDLL
GVALSRKCVYENNNNPWRQPLETVGIPRLPLGDKLHHKFAIVDRYIVITGSHNWSNSANNINDETLLVIYNPLIAQHFQR
EFDYLYHDAILGVSSELKKEIDQENSKCGLSN
MLKIISFPLGLIFSLIVLGGCHNSPPKLPPLAQEESIQVYFNHNQAKGKEYEEPYRKIKRFGDDLETVLIEQINSANQTI
DIAIQELNLARLARVIVEKKKQGVKIRIVTENNYNKPLSQLPNNHGLAILKRANIPIIDDTEDGSKGSGLMHHKFVVIDQ
KKVVTGSANFTLSGIHGDFDNQETRGNTNHLIVFNSQEMAKIFTQEFNYLWGDGVGKKKDSLFGLNKPERRTNIVNIGSS
SIVVKFSPNSRSTLWENSSNGIIARELAKANSSIDLALFVFSDQEIANTLEKESLAGVKIRVLIDPNFAYQYYSEGLDLL
GVALSRKCVYENNNNPWRQPLETVGIPRLPLGDKLHHKFAIVDRYIVITGSHNWSNSANNINDETLLVIYNPLIAQHFQR
EFDYLYHDAILGVSSELKKEIDQENSKCGLSN
Nucleotide
Download Length: 1299 bp
>NTDB_id=609533 Dongsha4_RS18425 WP_330203725.1 4220021..4221319(-) (comA) [Cyanobacterium sp. Dongsha4]
ATGCTTAAAATAATTAGTTTTCCCCTAGGTCTAATTTTTAGCTTAATAGTATTGGGTGGTTGTCATAATTCTCCACCTAA
ATTGCCACCTTTAGCTCAAGAGGAAAGCATCCAAGTTTATTTTAATCATAATCAAGCGAAGGGAAAAGAATATGAAGAAC
CTTATCGTAAAATTAAAAGATTTGGAGATGATTTAGAAACAGTTTTAATTGAGCAAATTAATAGTGCAAATCAAACTATT
GATATAGCGATTCAAGAATTAAATTTGGCAAGATTAGCAAGGGTGATTGTAGAGAAAAAAAAGCAGGGTGTAAAAATAAG
AATTGTCACGGAAAATAATTATAATAAACCTTTAAGTCAGTTGCCGAATAATCATGGTTTAGCAATTCTAAAAAGGGCGA
ATATACCCATCATCGATGATACAGAAGATGGCAGTAAGGGCAGTGGTTTAATGCACCATAAATTTGTTGTTATTGATCAA
AAAAAAGTTGTTACAGGTTCAGCAAATTTCACTCTGAGTGGTATTCATGGAGATTTTGATAATCAAGAAACAAGGGGAAA
TACAAATCATTTAATTGTATTTAATAGTCAAGAAATGGCTAAAATTTTTACTCAAGAATTTAACTACTTATGGGGTGATG
GAGTAGGAAAAAAGAAGGATAGCTTATTTGGTTTAAATAAACCCGAAAGACGCACAAACATAGTAAATATTGGCAGTAGT
TCTATTGTGGTAAAATTTTCTCCTAATAGTCGATCAACTCTTTGGGAGAATAGTAGTAATGGTATTATTGCTAGGGAATT
AGCAAAGGCGAATAGTTCTATTGATTTAGCATTATTTGTTTTTAGTGATCAAGAAATAGCAAATACTCTTGAAAAAGAAT
CCTTAGCAGGAGTAAAAATAAGGGTTTTAATTGATCCTAATTTTGCTTACCAATACTATAGTGAAGGACTTGATTTGTTG
GGGGTTGCCCTTTCAAGAAAATGTGTTTATGAGAATAATAATAATCCTTGGCGACAACCATTAGAAACTGTCGGCATTCC
TCGTTTACCGTTGGGAGATAAGTTACATCATAAATTTGCTATAGTTGACCGTTATATCGTAATCACAGGCTCTCATAACT
GGTCTAATTCTGCAAATAATATTAACGATGAAACCTTATTAGTTATTTATAATCCTCTTATTGCTCAACATTTTCAGAGG
GAATTTGACTATCTTTATCACGATGCAATATTAGGAGTTTCTTCGGAGTTAAAAAAAGAAATTGACCAAGAAAATTCCAA
ATGTGGGCTATCGAATTAA
ATGCTTAAAATAATTAGTTTTCCCCTAGGTCTAATTTTTAGCTTAATAGTATTGGGTGGTTGTCATAATTCTCCACCTAA
ATTGCCACCTTTAGCTCAAGAGGAAAGCATCCAAGTTTATTTTAATCATAATCAAGCGAAGGGAAAAGAATATGAAGAAC
CTTATCGTAAAATTAAAAGATTTGGAGATGATTTAGAAACAGTTTTAATTGAGCAAATTAATAGTGCAAATCAAACTATT
GATATAGCGATTCAAGAATTAAATTTGGCAAGATTAGCAAGGGTGATTGTAGAGAAAAAAAAGCAGGGTGTAAAAATAAG
AATTGTCACGGAAAATAATTATAATAAACCTTTAAGTCAGTTGCCGAATAATCATGGTTTAGCAATTCTAAAAAGGGCGA
ATATACCCATCATCGATGATACAGAAGATGGCAGTAAGGGCAGTGGTTTAATGCACCATAAATTTGTTGTTATTGATCAA
AAAAAAGTTGTTACAGGTTCAGCAAATTTCACTCTGAGTGGTATTCATGGAGATTTTGATAATCAAGAAACAAGGGGAAA
TACAAATCATTTAATTGTATTTAATAGTCAAGAAATGGCTAAAATTTTTACTCAAGAATTTAACTACTTATGGGGTGATG
GAGTAGGAAAAAAGAAGGATAGCTTATTTGGTTTAAATAAACCCGAAAGACGCACAAACATAGTAAATATTGGCAGTAGT
TCTATTGTGGTAAAATTTTCTCCTAATAGTCGATCAACTCTTTGGGAGAATAGTAGTAATGGTATTATTGCTAGGGAATT
AGCAAAGGCGAATAGTTCTATTGATTTAGCATTATTTGTTTTTAGTGATCAAGAAATAGCAAATACTCTTGAAAAAGAAT
CCTTAGCAGGAGTAAAAATAAGGGTTTTAATTGATCCTAATTTTGCTTACCAATACTATAGTGAAGGACTTGATTTGTTG
GGGGTTGCCCTTTCAAGAAAATGTGTTTATGAGAATAATAATAATCCTTGGCGACAACCATTAGAAACTGTCGGCATTCC
TCGTTTACCGTTGGGAGATAAGTTACATCATAAATTTGCTATAGTTGACCGTTATATCGTAATCACAGGCTCTCATAACT
GGTCTAATTCTGCAAATAATATTAACGATGAAACCTTATTAGTTATTTATAATCCTCTTATTGCTCAACATTTTCAGAGG
GAATTTGACTATCTTTATCACGATGCAATATTAGGAGTTTCTTCGGAGTTAAAAAAAGAAATTGACCAAGAAAATTCCAA
ATGTGGGCTATCGAATTAA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Synechocystis sp. PCC 6803 |
47.732 |
100 |
0.512 |