Detailed information    

in silico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   GM3708_RS09460 Genome accession   NZ_AP014815
Coordinates   2134773..2136068 (+) Length   431 a.a.
NCBI ID   WP_066345992.1    Uniprot ID   A0A0D6AFG4
Organism   Geminocystis sp. NIES-3708     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake
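The coordinates and length listed above are mutually consistent: the inclusive range 2134773..2136068 spans 1296 bp, which is 432 codons, one of them the stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count of a complete CDS, assuming a single terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(2134773, 2136068)
print(cds_bp, protein_length(cds_bp))  # 1296 431
```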

Genomic Context


Location: 2129773..2141068
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GM3708_RS09425 (GM3708_1955) - 2129876..2130409 (-) 534 WP_066345968.1 pentapeptide repeat-containing protein -
  GM3708_RS09430 (GM3708_1956) - 2130422..2131060 (-) 639 WP_066345970.1 hypothetical protein -
  GM3708_RS09435 (GM3708_1957) gloB 2131318..2132091 (+) 774 WP_066345974.1 hydroxyacylglutathione hydrolase -
  GM3708_RS09440 (GM3708_1958) - 2132367..2132597 (-) 231 WP_066345975.1 ferredoxin-thioredoxin reductase variable chain -
  GM3708_RS09445 (GM3708_1959) purS 2133028..2133288 (+) 261 WP_066345977.1 phosphoribosylformylglycinamidine synthase subunit PurS -
  GM3708_RS09450 (GM3708_1960) purQ 2133335..2134015 (+) 681 WP_066345984.1 phosphoribosylformylglycinamidine synthase subunit PurQ -
  GM3708_RS09455 (GM3708_1961) - 2134079..2134759 (+) 681 WP_066345990.1 uracil-DNA glycosylase family protein -
  GM3708_RS09460 (GM3708_1962) comA 2134773..2136068 (+) 1296 WP_066345992.1 phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthase family protein Machinery gene
  GM3708_RS09465 (GM3708_1963) groES 2136390..2136701 (+) 312 WP_066345994.1 co-chaperone GroES -
  GM3708_RS09470 (GM3708_1964) groL 2136821..2138446 (+) 1626 WP_066346001.1 chaperonin GroEL -
  GM3708_RS09475 (GM3708_1965) - 2138754..2140280 (+) 1527 WP_066346003.1 YifB family Mg chelatase-like AAA ATPase -
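The Location range appears to be the comA coordinates extended by 5000 bp on each side, and each Size (bp) value equals end - start + 1. The sketch below checks both against the rows listed above. The flank size is an assumption inferred from the numbers on this page, and `FLANK`, `context_window` and `GENES` are hypothetical names.

```python
FLANK = 5000  # assumed flank size, inferred from the Location range on this page

# (locus tag, start, end, size in bp), copied from the Genomic Context table
GENES = [
    ("GM3708_RS09425", 2129876, 2130409, 534),
    ("GM3708_RS09430", 2130422, 2131060, 639),
    ("GM3708_RS09435", 2131318, 2132091, 774),
    ("GM3708_RS09440", 2132367, 2132597, 231),
    ("GM3708_RS09445", 2133028, 2133288, 261),
    ("GM3708_RS09450", 2133335, 2134015, 681),
    ("GM3708_RS09455", 2134079, 2134759, 681),
    ("GM3708_RS09460", 2134773, 2136068, 1296),
    ("GM3708_RS09465", 2136390, 2136701, 312),
    ("GM3708_RS09470", 2136821, 2138446, 1626),
    ("GM3708_RS09475", 2138754, 2140280, 1527),
]


def context_window(start, end, flank=FLANK):
    """Extend a gene's coordinates by `flank` bp on each side (never below 1)."""
    return max(1, start - flank), end + flank


window = context_window(2134773, 2136068)
print(window)  # (2129773, 2141068)

# Every listed gene should lie inside the window and have a consistent size.
for locus, start, end, size in GENES:
    assert window[0] <= start <= end <= window[1], locus
    assert end - start + 1 == size, locus
```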

Sequence


Protein


Length: 431 a.a.        Molecular weight: 48315.99 Da        Isoelectric Point: 6.6631

>NTDB_id=66929 GM3708_RS09460 WP_066345992.1 2134773..2136068(+) (comA) [Geminocystis sp. NIES-3708]
MVKIISLSLTIISYLLFLNGCQSQVKKLPPLSQDSAIEVYFNHNQAKGKEYLDPYRNIQRTGDNLEAIIIEQINTAKSTL
DIAVQEINLTNLAKAIIDKKKQGVKVRIVIENNYNLPLNQLKNNDGLAILKQVNIPIIDDREDGSKGSGLMHHKFIIIDG
EKIVTGSANFTLSDIHGDFENLETIGNANHILVINSSSLANIFTEEFNYLWGDGVGGKKDSLFGLKKPYRREKIVKIGEN
TIVIKFSPTSQSKSWIESTNGLISHTLKSASNSIDLALFVFSDQDIANTLKNESLQGVKIRALIDPNFAYQYYSEGLDLL
GLALPNKCKYEKNNNPWQQPIETVGIPNLAQGDKLHHKFGIVDNYTIITGSHNWSNAANNLNDETLLIIYNPLIAQHFRR
EFDYLYRDAILGIPSNIQEKIDTEITKCSNN
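The FASTA header above packs several fields into one line: the NTDB id, locus tag, protein accession, coordinates with strand, gene name and organism. A minimal parsing sketch follows, assuming every record uses exactly the layout shown here; the regular expression and the `parse_record` helper are illustrative and not an official parser. The example sequence is shortened to the first 20 residues.

```python
import re

# Pattern written against the single header shown on this page (assumed layout).
HEADER = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) (?P<locus_tag>\S+) (?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) \[(?P<organism>[^\]]+)\]$"
)


def parse_record(text):
    """Split one FASTA record into header fields and the joined sequence."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    match = HEADER.match(lines[0])
    if match is None:
        raise ValueError("header does not match the expected layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    fields["sequence"] = "".join(lines[1:])  # wrapped lines are concatenated
    return fields


record = parse_record(
    """>NTDB_id=66929 GM3708_RS09460 WP_066345992.1 2134773..2136068(+) (comA) [Geminocystis sp. NIES-3708]
MVKIISLSLT
IISYLLFLNG
"""
)
print(record["gene"], record["strand"], record["organism"])
print(record["sequence"])
```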

Nucleotide


Length: 1296 bp

>NTDB_id=66929 GM3708_RS09460 WP_066345992.1 2134773..2136068(+) (comA) [Geminocystis sp. NIES-3708]
ATGGTCAAAATTATCTCTTTATCTTTAACTATAATCAGTTATTTATTATTTTTAAATGGTTGTCAAAGTCAAGTTAAAAA
ATTGCCTCCATTATCTCAAGATTCTGCCATAGAAGTTTATTTTAATCACAATCAGGCAAAGGGGAAAGAATATTTAGATC
CTTATCGTAATATTCAACGTACTGGAGATAATTTAGAAGCTATTATTATCGAACAAATAAACACAGCAAAATCAACTTTA
GATATAGCAGTACAAGAGATAAATTTAACAAACTTAGCAAAAGCGATTATAGATAAAAAAAAACAAGGTGTAAAAGTTAG
AATTGTTATAGAAAATAACTATAATTTACCCTTGAATCAATTGAAAAATAATGATGGTTTAGCTATTTTAAAACAAGTCA
ATATTCCCATAATAGATGATAGAGAAGATGGTAGTAAAGGCAGTGGATTAATGCACCATAAGTTTATCATTATTGATGGA
GAAAAAATTGTTACAGGTTCAGCTAATTTTACCCTCAGTGATATTCATGGAGATTTTGAAAATTTAGAAACAATCGGCAA
TGCTAATCATATTCTAGTAATAAATAGCTCTTCATTAGCCAATATTTTTACAGAAGAATTTAACTATCTTTGGGGAGATG
GAGTAGGGGGAAAAAAAGATAGTTTATTTGGTTTAAAAAAGCCTTATCGCCGAGAGAAAATAGTTAAAATAGGAGAAAAC
ACTATAGTTATAAAATTTTCACCTACTTCTCAAAGTAAATCATGGATAGAAAGCACCAACGGTTTAATATCTCATACTCT
CAAATCTGCCAGTAATTCTATTGATTTAGCATTATTTGTTTTTAGCGATCAAGATATAGCAAATACTTTAAAAAATGAAT
CGTTACAAGGAGTAAAAATAAGAGCCTTAATTGATCCTAATTTTGCCTATCAATACTATAGTGAAGGACTAGATTTGTTA
GGATTAGCTTTACCCAATAAATGCAAATATGAAAAGAATAATAATCCTTGGCAACAACCCATCGAAACTGTTGGAATTCC
TAATTTAGCTCAAGGAGACAAATTACATCATAAATTTGGCATTGTTGATAACTATACCATTATCACAGGGTCTCATAATT
GGTCAAATGCGGCGAATAATCTTAATGATGAAACTCTTTTAATAATTTATAATCCATTAATTGCCCAACATTTTCGTCGA
GAATTTGATTACCTTTATCGTGATGCAATTTTAGGTATCCCTTCCAATATTCAAGAAAAAATAGACACTGAAATAACAAA
ATGTTCTAATAATTAG
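The nucleotide record above encodes the protein record: reading the 1296 bp in frame from the first base gives 431 residues followed by a TAG stop codon. The sketch below translates DNA with the standard codon assignments. The page does not state which genetic code was used, so that choice is an assumption, although the bacterial table assigns the same amino acids to every codon. `translate` is a hypothetical helper, not a database function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, to_stop=True):
    """Translate DNA in frame 1; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()  # drop line breaks from wrapped FASTA
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 nucleotides of the sequence listed above.
print(translate("ATGGTCAAAATTATCTCTTTATCTTTAACT"))  # MVKIISLSLT
print(translate("AAATGTTCTAATAATTAG"))              # KCSNN
```

Passing the full 1296 bp sequence should return a string that can be compared directly with the protein record.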

Domains


Predicted by InterProScan.

(263-405)

(69-211)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0D6AFG4

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Synechocystis sp. PCC 6803 47.846 100 0.49


Multiple sequence alignment