Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   CLI64_RS02510 Genome accession   NZ_CP023278
Coordinates   588926..590545 (-) Length   539 a.a.
NCBI ID   WP_225977593.1    Uniprot ID   -
Organism   Nostoc sp. CENA543     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 583926..595545
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CLI64_RS02485 (CLI64_02470) - 584689..585765 (+) 1077 WP_103135747.1 hybrid sensor histidine kinase/response regulator -
  CLI64_RS02490 (CLI64_02475) - 585987..586640 (-) 654 WP_103135748.1 HAD family hydrolase -
  CLI64_RS02495 (CLI64_02480) - 586718..587923 (-) 1206 WP_103135749.1 NAD(P)/FAD-dependent oxidoreductase -
  CLI64_RS02500 (CLI64_02485) - 588138..588332 (-) 195 WP_103135750.1 RNA-binding S4 domain-containing protein -
  CLI64_RS02505 (CLI64_02490) - 588556..588801 (-) 246 WP_103135751.1 ChaB family protein -
  CLI64_RS02510 (CLI64_02495) comA 588926..590545 (-) 1620 WP_225977593.1 DUF655 domain-containing protein Machinery gene
  CLI64_RS02515 (CLI64_02500) - 590757..591926 (+) 1170 WP_103135753.1 cysteine desulfurase family protein -
  CLI64_RS02520 (CLI64_02505) - 591944..592306 (+) 363 WP_157943179.1 S1 RNA-binding domain-containing protein -
  CLI64_RS02525 (CLI64_02510) - 592337..592564 (+) 228 WP_103135754.1 hypothetical protein -
  CLI64_RS02530 (CLI64_02515) - 592561..592983 (+) 423 WP_103135755.1 type II toxin-antitoxin system VapC family toxin -
  CLI64_RS02535 (CLI64_02520) - 593089..593829 (+) 741 WP_103135756.1 DUF1995 family protein -
  CLI64_RS02545 (CLI64_02525) - 593996..594691 (-) 696 WP_103135758.1 carbonic anhydrase -
  CLI64_RS02550 (CLI64_02530) - 594876..595187 (-) 312 WP_103135759.1 P-II family nitrogen regulator -

Sequence


Protein


Download         Length: 539 a.a.        Molecular weight: 60496.95 Da        Isoelectric Point: 7.8328

>NTDB_id=245705 CLI64_RS02510 WP_225977593.1 588926..590545(-) (comA) [Nostoc sp. CENA543]
MLLKLQKSLVLFLLLAIAGCQKVQSQNTRLPALPQDPFVQVYFNQSESSEYREPYRQQTRLGDNLEQQIIEIISQAKSTI
DIAVQELRLPGIAKALSDKQKAGIQVRVILENNYSRPWSSFTDAEVKKLPPREKDRYQEYFKFVDQNQDNQLSPEEINQR
DALIILKAANIPVIDDRADGSAGSNLMHHKFVIVDNRMVIITSANFTLSDTFGDFTNPSSLGNPNNFLQIDSRELANLFT
EEFNLMWGDGVGGKPDSKFGVNKPVRPPKTITLGDNKITVNFSPTSPTKPWSNTSNGLIGETLNSSTQSVDMALFVFSEQ
RLANILEIRHQQNVAIRALIDKQFAYRPYSEALDMMGVALSNKCKYELDNKPWQNPLTTVGVPILPKGDLLHHKFAVVDQ
KTVITGSHNWSDAANKANDETLIIIENPNIAAHYVREFNRLYTKAKVGVPENIQAKIQAEQKQCPQISAPNSAEKIIKPI
NINTASLEELSTLPGVGKKLAQKIITARQQQKFTSLQDLEKIPGISERMIANWQGYIEL

Nucleotide


Download         Length: 1620 bp        

>NTDB_id=245705 CLI64_RS02510 WP_225977593.1 588926..590545(-) (comA) [Nostoc sp. CENA543]
ATTTTACTGAAATTGCAGAAATCTTTAGTCTTATTTTTACTGTTGGCGATCGCAGGTTGTCAAAAAGTCCAATCCCAAAA
TACGCGCCTTCCTGCTTTGCCACAAGACCCCTTTGTGCAAGTTTACTTTAATCAGTCTGAGTCTAGTGAATATCGAGAAC
CTTACCGCCAGCAAACTCGACTAGGAGATAATTTAGAACAGCAAATTATTGAGATAATTTCTCAAGCTAAATCTACTATA
GATATTGCAGTCCAAGAATTACGTTTACCAGGAATTGCCAAGGCTTTAAGTGATAAACAAAAAGCAGGTATTCAAGTTAG
AGTTATCTTAGAAAATAACTACAGTCGTCCTTGGAGTAGTTTTACAGATGCGGAAGTGAAGAAATTACCACCAAGAGAAA
AAGATAGATATCAAGAATATTTTAAATTTGTAGACCAAAATCAAGATAATCAACTCAGCCCAGAGGAAATTAACCAAAGG
GATGCGTTAATCATTTTAAAAGCTGCTAACATCCCCGTCATTGATGATCGAGCGGATGGTTCAGCAGGTAGTAATTTGAT
GCACCATAAGTTTGTCATTGTAGACAATCGAATGGTGATTATTACTTCCGCTAATTTTACCTTAAGTGACACTTTTGGTG
ACTTTACTAATCCTAGCAGTTTAGGAAATCCTAATAACTTTTTACAAATTGATAGTCGAGAGTTAGCCAATTTATTCACA
GAAGAATTTAATCTGATGTGGGGTGATGGTGTTGGTGGTAAACCAGATAGTAAATTTGGCGTAAATAAACCTGTGCGTCC
TCCAAAAACGATTACATTAGGTGATAACAAAATTACTGTAAATTTTTCACCTACATCACCTACCAAACCTTGGAGTAATA
CTAGTAATGGACTCATTGGTGAAACTTTAAATTCGTCTACTCAATCTGTAGATATGGCTTTGTTTGTGTTTTCTGAACAG
CGACTTGCCAATATTTTAGAAATTCGTCATCAGCAAAATGTAGCTATTCGCGCCCTGATTGACAAGCAATTCGCCTATCG
TCCGTATAGCGAAGCCTTAGACATGATGGGCGTAGCTTTAAGTAATAAATGTAAATATGAGTTGGATAATAAACCTTGGC
AAAATCCTCTCACTACCGTCGGTGTTCCCATATTACCCAAAGGTGATTTATTGCATCATAAATTTGCAGTGGTTGACCAA
AAAACGGTAATCACAGGTTCTCATAATTGGTCAGATGCGGCGAATAAAGCTAATGATGAGACTTTAATTATTATTGAGAA
TCCGAATATCGCTGCTCATTATGTGCGTGAATTTAATCGGCTTTACACTAAGGCGAAAGTTGGCGTACCAGAAAATATTC
AAGCTAAGATTCAAGCCGAACAAAAGCAATGTCCGCAAATCTCTGCTCCTAATAGCGCAGAGAAAATTATTAAACCCATT
AATATTAATACAGCGAGTCTAGAAGAATTATCAACTTTGCCAGGGGTCGGCAAGAAATTAGCCCAGAAAATTATCACAGC
CCGTCAACAACAGAAATTCACTTCTTTACAAGATTTAGAAAAAATTCCAGGTATCAGTGAGAGAATGATAGCTAATTGGC
AAGGTTATATCGAATTGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Synechocystis sp. PCC 6803

47.97

100

0.482


Multiple sequence alignment