Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   EH233_RS01060 Genome accession   NZ_CP034058
Coordinates   235591..237219 (-) Length   542 a.a.
NCBI ID   WP_041456500.1    Uniprot ID   -
Organism   Anabaena sp. YBS01     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 230591..242219
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EH233_RS01040 (EH233_01040) gltX 231605..233050 (+) 1446 WP_011321519.1 glutamate--tRNA ligase -
  EH233_RS01045 (EH233_01045) - 233134..233802 (-) 669 WP_011321518.1 2OG-Fe dioxygenase family protein -
  EH233_RS01050 (EH233_01050) sfsA 234274..234999 (+) 726 WP_011321517.1 DNA/RNA nuclease SfsA -
  EH233_RS01055 (EH233_01055) - 235008..235490 (+) 483 WP_041456504.1 gluconokinase -
  EH233_RS01060 (EH233_01060) comA 235591..237219 (-) 1629 WP_041456500.1 DUF655 domain-containing protein Machinery gene
  EH233_RS01065 (EH233_01065) - 237333..238502 (+) 1170 WP_011321514.1 cysteine desulfurase family protein -
  EH233_RS01070 (EH233_01070) - 238519..240549 (-) 2031 WP_011321513.1 serine/threonine-protein kinase -

Sequence


Protein


Download         Length: 542 a.a.        Molecular weight: 61021.15 Da        Isoelectric Point: 7.8981

>NTDB_id=328627 EH233_RS01060 WP_041456500.1 235591..237219(-) (comA) [Anabaena sp. YBS01]
MRIFPAFRNFWVFFLIVAIAACQKVQSHNNRPAPLPQDSFVKVYFNQSESSEYREPYRQQTRLGDNLEQQIIDAISQAKS
TIDVAVQELRLPRIAQALKDKQKAGIKVRVILENTYTRSLSNLTPDEVKKLPEREQARYQEYFKFVDLNQDNQLSPEEVN
QRDALIILQNAKIPWIDDQADGSAGSKLMHHKFVVVDNRIVIVTSANFTLSDVFGDFSNSSSLGNANNLLHIDSPELAAL
VTEEFNLMWGDGVGGKPDSKFGLNKPVRPPQKITLGDNTITVHFSPTSPTLPWTQSSNGLINESLNLANKSIDMALFVFS
EQRLANTLEKRHQQQVSIRALIDKQFAYRYYSEALDMMGIALGNKCRYEIDNRPWSNPVTTVGVPTLREGDLLHHKFSVI
DNQTVITGSHNWSDAANHGNDETLIVINNPTIAAHYEREFARLYAKAQVGVPAKVQAQIQQEQKQCGQIKTPTSSELTPT
QVVNINTANLAELETLPGVGKKLAQKIITARQQRKFVSSQDLDKVPGISPKMIENWQGRIQF

Nucleotide


Download         Length: 1629 bp        

>NTDB_id=328627 EH233_RS01060 WP_041456500.1 235591..237219(-) (comA) [Anabaena sp. YBS01]
GTGCGGATTTTCCCAGCATTTAGGAATTTTTGGGTATTTTTTTTGATAGTGGCGATCGCCGCCTGTCAAAAAGTCCAATC
TCACAATAATCGTCCTGCACCTCTACCGCAAGACTCATTTGTGAAAGTTTACTTTAATCAATCCGAATCCTCAGAATATC
GAGAACCTTACCGTCAACAAACTCGACTGGGAGATAACTTAGAACAGCAGATTATTGACGCTATTTCTCAAGCTAAATCT
ACTATCGATGTAGCAGTACAAGAATTGCGTTTACCGAGAATCGCCCAAGCCCTCAAAGACAAACAAAAAGCGGGAATCAA
AGTCAGAGTAATTTTAGAAAATACCTATACTCGTTCTTTGAGTAACTTGACACCAGATGAAGTCAAGAAATTACCTGAAC
GGGAACAAGCACGCTATCAAGAATACTTTAAATTTGTAGACCTAAACCAAGATAATCAACTCAGTCCTGAGGAAGTTAAT
CAGAGGGATGCACTGATAATTTTACAAAATGCCAAAATTCCTTGGATAGATGATCAAGCTGATGGTTCAGCAGGTAGTAA
GTTGATGCACCATAAGTTTGTGGTTGTAGATAATCGCATAGTAATTGTGACTTCGGCAAACTTCACCTTAAGCGACGTTT
TCGGGGATTTCTCTAATTCTTCAAGTTTGGGAAATGCCAACAACCTATTACACATTGATAGCCCAGAATTAGCAGCTTTG
GTCACAGAAGAATTCAACCTCATGTGGGGTGATGGTGTTGGAGGTAAACCAGACAGTAAATTCGGTTTAAATAAACCTGT
ACGTCCTCCCCAAAAAATTACCTTGGGTGACAACACAATTACTGTGCATTTTTCCCCAACTTCACCCACCTTACCTTGGA
CTCAAAGCAGCAATGGCTTAATTAATGAAAGCTTAAATTTAGCGAATAAATCTATTGATATGGCGTTGTTTGTTTTTTCC
GAACAGCGTCTTGCTAATACATTAGAAAAACGTCATCAACAACAAGTCTCAATTCGAGCATTAATTGATAAACAATTCGC
CTATCGTTATTACAGCGAAGCTTTAGATATGATGGGAATTGCCCTGGGTAATAAATGCCGATATGAAATTGATAATCGAC
CTTGGTCTAATCCCGTTACTACGGTGGGCGTACCCACTTTACGAGAAGGAGACCTGCTACACCATAAATTTTCTGTTATC
GACAACCAAACGGTAATTACAGGTTCTCACAACTGGTCTGATGCAGCAAATCATGGCAATGATGAGACTTTGATAGTAAT
TAATAATCCCACAATTGCTGCTCATTATGAGCGTGAATTTGCTCGTCTTTACGCTAAAGCTCAAGTCGGTGTCCCAGCCA
AAGTCCAAGCACAAATTCAACAAGAACAAAAGCAATGTGGTCAAATTAAAACTCCTACTTCCAGTGAACTTACTCCTACT
CAAGTGGTGAATATCAATACAGCAAATTTGGCAGAATTGGAGACCTTACCCGGTGTAGGTAAAAAGCTAGCCCAAAAAAT
TATCACCGCCCGTCAGCAGAGAAAATTTGTCTCATCACAAGACTTGGATAAAGTACCTGGAATCAGTCCAAAGATGATAG
AAAATTGGCAAGGGCGTATTCAATTTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Synechocystis sp. PCC 6803

49.355

100

0.494


Multiple sequence alignment