Detailed information
Overview
| Name | comA | Type | Machinery gene |
| Locus tag | MCP04_RS21045 | Genome accession | NZ_CP092419 |
| Coordinates | 4381921..4383477 (+) | Length | 518 a.a. |
| NCBI ID | WP_144056228.1 | Uniprot ID | - |
| Organism | Leptolyngbya boryana IU 594 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 4376921..4388477
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| MCP04_RS21025 (MCP04_21025) | psbD | 4377575..4378630 (+) | 1056 | WP_017287122.1 | photosystem II D2 protein (photosystem q(a) protein) | - |
| MCP04_RS21030 (MCP04_21030) | psbC | 4378614..4379996 (+) | 1383 | WP_017288960.1 | photosystem II reaction center protein CP43 | - |
| MCP04_RS21035 (MCP04_21035) | - | 4380128..4380661 (+) | 534 | WP_017288961.1 | hypothetical protein | - |
| MCP04_RS21040 (MCP04_21040) | - | 4380658..4381827 (-) | 1170 | WP_017288962.1 | cysteine desulfurase family protein | - |
| MCP04_RS21045 (MCP04_21045) | comA | 4381921..4383477 (+) | 1557 | WP_144056228.1 | phospholipase D-like domain-containing protein | Machinery gene |
| MCP04_RS21050 (MCP04_21050) | - | 4383529..4383888 (-) | 360 | WP_017288964.1 | DUF1818 family protein | - |
| MCP04_RS21055 (MCP04_21055) | - | 4383885..4384322 (-) | 438 | WP_017288965.1 | hypothetical protein | - |
| MCP04_RS21060 (MCP04_21060) | - | 4384369..4384614 (-) | 246 | WP_017288966.1 | DNA-directed RNA polymerase subunit omega | - |
| MCP04_RS21065 (MCP04_21065) | - | 4384759..4385451 (+) | 693 | WP_017288967.1 | GDSL-type esterase/lipase family protein | - |
| MCP04_RS21070 (MCP04_21070) | rdgB | 4385448..4386023 (+) | 576 | WP_017288968.1 | RdgB/HAM1 family non-canonical purine NTP pyrophosphatase | - |
| MCP04_RS21075 (MCP04_21075) | - | 4386094..4386855 (-) | 762 | WP_017288969.1 | PEP-CTERM sorting domain-containing protein | - |
Sequence
Protein
Download Length: 518 a.a. Molecular weight: 57530.37 Da Isoelectric Point: 7.2386
>NTDB_id=657350 MCP04_RS21045 WP_144056228.1 4381921..4383477(+) (comA) [Leptolyngbya boryana IU 594]
MVRLNHLIQFSLCTLLFAGCRAASSSISVRPLPQDPNIQVFTNQAAASEYTEPYRKITRAGDNLEEVLIDTIAQAKSSID
VAVQEFRLPNVAKALRDRAAAGVTVRVILENQYARPYSSYSAAEVEKLPEREKARYQESRTLIDQDQDGQLSLTEIQDRD
ALVMLDVAKIPRIDDRADGSRGSNLMHHKFIVVDGKTTIVTSANFTMSDVHGDFARPSSRGNANNLLKMESVEVAQAFTA
EFELMWSDRKFGVKKPFRPAQQFKLGESVISLQFSPNSRTIPWEQTPNGLIAKTLQQSTQAIDLALFVFSDQDLVNTIEG
KSIRALIEPSFMYRPFSEGLDMLGVALSNNCKWEVNNRPWQNPIKTAGVPQLPPGDMLHHKFGIVDGHTVITGSHNWTLA
ANRGNDETVIVIRNPVVAAHFQREFDRLYANSVLGIPPAIQRKVEAEQKQCPPPSQSRTGLVNLNTATQAELEALPGVGS
GLAKRIIQARPFQSLEDLDHVPGVGKKLIERLSDRVTW
MVRLNHLIQFSLCTLLFAGCRAASSSISVRPLPQDPNIQVFTNQAAASEYTEPYRKITRAGDNLEEVLIDTIAQAKSSID
VAVQEFRLPNVAKALRDRAAAGVTVRVILENQYARPYSSYSAAEVEKLPEREKARYQESRTLIDQDQDGQLSLTEIQDRD
ALVMLDVAKIPRIDDRADGSRGSNLMHHKFIVVDGKTTIVTSANFTMSDVHGDFARPSSRGNANNLLKMESVEVAQAFTA
EFELMWSDRKFGVKKPFRPAQQFKLGESVISLQFSPNSRTIPWEQTPNGLIAKTLQQSTQAIDLALFVFSDQDLVNTIEG
KSIRALIEPSFMYRPFSEGLDMLGVALSNNCKWEVNNRPWQNPIKTAGVPQLPPGDMLHHKFGIVDGHTVITGSHNWTLA
ANRGNDETVIVIRNPVVAAHFQREFDRLYANSVLGIPPAIQRKVEAEQKQCPPPSQSRTGLVNLNTATQAELEALPGVGS
GLAKRIIQARPFQSLEDLDHVPGVGKKLIERLSDRVTW
Nucleotide
Download Length: 1557 bp
>NTDB_id=657350 MCP04_RS21045 WP_144056228.1 4381921..4383477(+) (comA) [Leptolyngbya boryana IU 594]
GTGGTCAGACTGAATCACCTTATCCAGTTTAGTCTCTGTACGCTCCTATTTGCGGGATGCCGAGCAGCTTCATCTTCGAT
CAGTGTGCGCCCACTGCCACAAGATCCCAACATCCAAGTCTTTACGAATCAAGCAGCGGCCTCTGAATATACTGAGCCGT
ATCGCAAAATTACAAGGGCAGGCGATAACTTAGAAGAAGTTTTAATCGATACGATCGCACAGGCAAAGTCGAGCATTGAT
GTTGCTGTTCAAGAATTTCGATTGCCTAATGTGGCGAAGGCATTGCGCGATCGAGCCGCAGCAGGCGTGACCGTGCGAGT
GATTCTCGAAAATCAGTACGCTCGTCCTTACAGCAGTTATTCTGCCGCAGAAGTTGAGAAATTGCCGGAGCGAGAAAAAG
CGAGATATCAGGAATCCAGAACGCTGATCGATCAAGATCAAGATGGACAGTTAAGCCTGACTGAAATTCAGGATCGAGAT
GCCTTAGTGATGTTGGATGTTGCGAAGATTCCTCGGATCGATGATCGAGCCGATGGCAGCCGAGGCAGTAACTTAATGCA
CCACAAATTCATAGTTGTGGATGGAAAGACTACGATCGTGACTTCGGCAAACTTCACGATGTCAGACGTACACGGAGACT
TTGCTCGTCCTAGCAGCCGCGGCAATGCTAATAATTTGCTGAAGATGGAAAGTGTGGAAGTTGCACAGGCTTTTACAGCA
GAATTTGAACTCATGTGGAGCGATCGCAAATTTGGCGTGAAAAAGCCATTTCGTCCAGCACAGCAGTTCAAACTGGGCGA
GAGCGTGATTTCCTTGCAGTTCTCACCCAATTCCAGAACGATTCCCTGGGAACAAACACCCAATGGTTTGATTGCCAAAA
CGCTTCAGCAATCCACGCAAGCGATCGATTTAGCGTTGTTTGTGTTCTCGGATCAGGATCTAGTTAATACGATCGAAGGA
AAATCAATTCGGGCATTAATCGAACCCAGCTTCATGTATAGACCTTTCAGCGAAGGTTTAGATATGTTAGGCGTGGCGCT
AAGTAATAACTGCAAGTGGGAAGTCAATAATCGTCCTTGGCAAAATCCCATCAAGACAGCTGGAGTGCCGCAACTTCCGC
CCGGTGATATGTTGCATCACAAGTTTGGAATTGTAGACGGACATACCGTGATTACAGGGTCTCATAACTGGACGCTTGCC
GCGAATCGTGGAAATGATGAAACGGTTATCGTGATTCGGAATCCGGTAGTTGCAGCCCATTTTCAACGAGAATTTGATCG
ACTGTATGCAAATTCAGTTCTGGGGATTCCTCCAGCAATTCAGCGCAAGGTTGAGGCTGAACAGAAACAATGTCCTCCAC
CAAGCCAATCTCGCACCGGGTTAGTCAATCTCAATACTGCAACTCAGGCAGAGTTAGAAGCATTGCCGGGAGTTGGCTCA
GGATTAGCGAAACGAATCATTCAAGCCCGTCCATTTCAGTCCTTGGAAGATCTCGATCATGTGCCTGGTGTCGGGAAGAA
GTTGATAGAACGCTTGAGCGATCGTGTGACCTGGTAG
GTGGTCAGACTGAATCACCTTATCCAGTTTAGTCTCTGTACGCTCCTATTTGCGGGATGCCGAGCAGCTTCATCTTCGAT
CAGTGTGCGCCCACTGCCACAAGATCCCAACATCCAAGTCTTTACGAATCAAGCAGCGGCCTCTGAATATACTGAGCCGT
ATCGCAAAATTACAAGGGCAGGCGATAACTTAGAAGAAGTTTTAATCGATACGATCGCACAGGCAAAGTCGAGCATTGAT
GTTGCTGTTCAAGAATTTCGATTGCCTAATGTGGCGAAGGCATTGCGCGATCGAGCCGCAGCAGGCGTGACCGTGCGAGT
GATTCTCGAAAATCAGTACGCTCGTCCTTACAGCAGTTATTCTGCCGCAGAAGTTGAGAAATTGCCGGAGCGAGAAAAAG
CGAGATATCAGGAATCCAGAACGCTGATCGATCAAGATCAAGATGGACAGTTAAGCCTGACTGAAATTCAGGATCGAGAT
GCCTTAGTGATGTTGGATGTTGCGAAGATTCCTCGGATCGATGATCGAGCCGATGGCAGCCGAGGCAGTAACTTAATGCA
CCACAAATTCATAGTTGTGGATGGAAAGACTACGATCGTGACTTCGGCAAACTTCACGATGTCAGACGTACACGGAGACT
TTGCTCGTCCTAGCAGCCGCGGCAATGCTAATAATTTGCTGAAGATGGAAAGTGTGGAAGTTGCACAGGCTTTTACAGCA
GAATTTGAACTCATGTGGAGCGATCGCAAATTTGGCGTGAAAAAGCCATTTCGTCCAGCACAGCAGTTCAAACTGGGCGA
GAGCGTGATTTCCTTGCAGTTCTCACCCAATTCCAGAACGATTCCCTGGGAACAAACACCCAATGGTTTGATTGCCAAAA
CGCTTCAGCAATCCACGCAAGCGATCGATTTAGCGTTGTTTGTGTTCTCGGATCAGGATCTAGTTAATACGATCGAAGGA
AAATCAATTCGGGCATTAATCGAACCCAGCTTCATGTATAGACCTTTCAGCGAAGGTTTAGATATGTTAGGCGTGGCGCT
AAGTAATAACTGCAAGTGGGAAGTCAATAATCGTCCTTGGCAAAATCCCATCAAGACAGCTGGAGTGCCGCAACTTCCGC
CCGGTGATATGTTGCATCACAAGTTTGGAATTGTAGACGGACATACCGTGATTACAGGGTCTCATAACTGGACGCTTGCC
GCGAATCGTGGAAATGATGAAACGGTTATCGTGATTCGGAATCCGGTAGTTGCAGCCCATTTTCAACGAGAATTTGATCG
ACTGTATGCAAATTCAGTTCTGGGGATTCCTCCAGCAATTCAGCGCAAGGTTGAGGCTGAACAGAAACAATGTCCTCCAC
CAAGCCAATCTCGCACCGGGTTAGTCAATCTCAATACTGCAACTCAGGCAGAGTTAGAAGCATTGCCGGGAGTTGGCTCA
GGATTAGCGAAACGAATCATTCAAGCCCGTCCATTTCAGTCCTTGGAAGATCTCGATCATGTGCCTGGTGTCGGGAAGAA
GTTGATAGAACGCTTGAGCGATCGTGTGACCTGGTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comA | Synechocystis sp. PCC 6803 |
48.733 |
99.035 |
0.483 |