Detailed information    

In silico: bioinformatically predicted

Overview


Name: comA
Type: Machinery gene
Locus tag: AVA_RS24645
Genome accession: NC_007413
Coordinates: 6120224..6121852 (+)
Length: 542 a.a.
NCBI ID: WP_041456500.1
UniProt ID: -
Organism: Trichormus variabilis ATCC 29413
Function: DNA uptake (predicted from homology)
DNA binding and uptake
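
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length. The 5000 bp flank is inferred from the numbers in this record (the Genomic Context location is the gene span widened by 5000 bp on each side), not stated by the source. The function names are illustrative only.

```python
def parse_span(coords: str) -> tuple[int, int]:
    """Parse a 'start..end' coordinate string into two integers."""
    start, end = coords.split("..")
    return int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


start, end = parse_span("6120224..6121852")
gene_bp = span_length(start, end)

# One codon per residue, plus one stop codon that is not translated.
protein_aa = gene_bp // 3 - 1

# Flank size inferred from this record, not documented by the source.
FLANK_BP = 5000
context_window = (start - FLANK_BP, end + FLANK_BP)

print(gene_bp, protein_aa, context_window)
```

Under these assumptions the gene spans 1629 bp, which encodes 542 residues, and the context window matches the location 6115224..6126852 reported under Genomic Context.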

Genomic Context


Location: 6115224..6126852
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AVA_RS24635 (Ava_4855) - 6116894..6118924 (+) 2031 WP_011321513.1 serine/threonine-protein kinase -
  AVA_RS24640 (Ava_4856) - 6118941..6120110 (-) 1170 WP_011321514.1 cysteine desulfurase family protein -
  AVA_RS24645 (Ava_4857) comA 6120224..6121852 (+) 1629 WP_041456500.1 DUF655 domain-containing protein Machinery gene
  AVA_RS24650 (Ava_4858) - 6121953..6122435 (-) 483 WP_041456504.1 gluconokinase -
  AVA_RS24655 (Ava_4859) sfsA 6122444..6123169 (-) 726 WP_011321517.1 DNA/RNA nuclease SfsA -
  AVA_RS24660 (Ava_4860) - 6123641..6124309 (+) 669 WP_011321518.1 2OG-Fe dioxygenase family protein -
  AVA_RS24665 (Ava_4861) gltX 6124393..6125838 (-) 1446 WP_011321519.1 glutamate--tRNA ligase -

Sequence


Protein


Length: 542 a.a.        Molecular weight: 61021.15 Da        Isoelectric Point: 7.8981

>NTDB_id=24709 AVA_RS24645 WP_041456500.1 6120224..6121852(+) (comA) [Trichormus variabilis ATCC 29413]
MRIFPAFRNFWVFFLIVAIAACQKVQSHNNRPAPLPQDSFVKVYFNQSESSEYREPYRQQTRLGDNLEQQIIDAISQAKS
TIDVAVQELRLPRIAQALKDKQKAGIKVRVILENTYTRSLSNLTPDEVKKLPEREQARYQEYFKFVDLNQDNQLSPEEVN
QRDALIILQNAKIPWIDDQADGSAGSKLMHHKFVVVDNRIVIVTSANFTLSDVFGDFSNSSSLGNANNLLHIDSPELAAL
VTEEFNLMWGDGVGGKPDSKFGLNKPVRPPQKITLGDNTITVHFSPTSPTLPWTQSSNGLINESLNLANKSIDMALFVFS
EQRLANTLEKRHQQQVSIRALIDKQFAYRYYSEALDMMGIALGNKCRYEIDNRPWSNPVTTVGVPTLREGDLLHHKFSVI
DNQTVITGSHNWSDAANHGNDETLIVINNPTIAAHYEREFARLYAKAQVGVPAKVQAQIQQEQKQCGQIKTPTSSELTPT
QVVNINTANLAELETLPGVGKKLAQKIITARQQRKFVSSQDLDKVPGISPKMIENWQGRIQF

Nucleotide


Length: 1629 bp

>NTDB_id=24709 AVA_RS24645 WP_041456500.1 6120224..6121852(+) (comA) [Trichormus variabilis ATCC 29413]
GTGCGGATTTTCCCAGCATTTAGGAATTTTTGGGTATTTTTTTTGATAGTGGCGATCGCCGCCTGTCAAAAAGTCCAATC
TCACAATAATCGTCCTGCACCTCTACCGCAAGACTCATTTGTGAAAGTTTACTTTAATCAATCCGAATCCTCAGAATATC
GAGAACCTTACCGTCAACAAACTCGACTGGGAGATAACTTAGAACAGCAGATTATTGACGCTATTTCTCAAGCTAAATCT
ACTATCGATGTAGCAGTACAAGAATTGCGTTTACCGAGAATCGCCCAAGCCCTCAAAGACAAACAAAAAGCGGGAATCAA
AGTCAGAGTAATTTTAGAAAATACCTATACTCGTTCTTTGAGTAACTTGACACCAGATGAAGTCAAGAAATTACCTGAAC
GGGAACAAGCACGCTATCAAGAATACTTTAAATTTGTAGACCTAAACCAAGATAATCAACTCAGTCCTGAGGAAGTTAAT
CAGAGGGATGCACTGATAATTTTACAAAATGCCAAAATTCCTTGGATAGATGATCAAGCTGATGGTTCAGCAGGTAGTAA
GTTGATGCACCATAAGTTTGTGGTTGTAGATAATCGCATAGTAATTGTGACTTCGGCAAACTTCACCTTAAGCGACGTTT
TCGGGGATTTCTCTAATTCTTCAAGTTTGGGAAATGCCAACAACCTATTACACATTGATAGCCCAGAATTAGCAGCTTTG
GTCACAGAAGAATTCAACCTCATGTGGGGTGATGGTGTTGGAGGTAAACCAGACAGTAAATTCGGTTTAAATAAACCTGT
ACGTCCTCCCCAAAAAATTACCTTGGGTGACAACACAATTACTGTGCATTTTTCCCCAACTTCACCCACCTTACCTTGGA
CTCAAAGCAGCAATGGCTTAATTAATGAAAGCTTAAATTTAGCGAATAAATCTATTGATATGGCGTTGTTTGTTTTTTCC
GAACAGCGTCTTGCTAATACATTAGAAAAACGTCATCAACAACAAGTCTCAATTCGAGCATTAATTGATAAACAATTCGC
CTATCGTTATTACAGCGAAGCTTTAGATATGATGGGAATTGCCCTGGGTAATAAATGCCGATATGAAATTGATAATCGAC
CTTGGTCTAATCCCGTTACTACGGTGGGCGTACCCACTTTACGAGAAGGAGACCTGCTACACCATAAATTTTCTGTTATC
GACAACCAAACGGTAATTACAGGTTCTCACAACTGGTCTGATGCAGCAAATCATGGCAATGATGAGACTTTGATAGTAAT
TAATAATCCCACAATTGCTGCTCATTATGAGCGTGAATTTGCTCGTCTTTACGCTAAAGCTCAAGTCGGTGTCCCAGCCA
AAGTCCAAGCACAAATTCAACAAGAACAAAAGCAATGTGGTCAAATTAAAACTCCTACTTCCAGTGAACTTACTCCTACT
CAAGTGGTGAATATCAATACAGCAAATTTGGCAGAATTGGAGACCTTACCCGGTGTAGGTAAAAAGCTAGCCCAAAAAAT
TATCACCGCCCGTCAGCAGAGAAAATTTGTCTCATCACAAGACTTGGATAAAGTACCTGGAATCAGTCCAAAGATGATAG
AAAATTGGCAAGGGCGTATTCAATTTTAG
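
The nucleotide record starts with GTG, while the protein record starts with M. This is consistent with GTG acting as an alternative start codon, which is read as methionine at the initiation position in bacteria. The following is a minimal sketch of such a translation using the standard codon table written out by hand; the function name `translate_cds` and the set of accepted start codons are assumptions made for illustration, not the database's own pipeline.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumed set of start codons for this sketch (ATG plus two common
# bacterial alternatives); real annotation pipelines may accept more.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reading the start codon as Met
    and stopping at the first in-frame stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the record above, followed by its stop codon.
head = "GTG" "CGG" "ATT" "TTC" "CCA" "GCA" "TTT" "TAG"
print(translate_cds(head))  # MRIFPAF
```

The output matches the first seven residues of the protein record (MRIFPAF). Applying the same function to the full 1629 bp sequence should give the 542-residue protein, provided the sequence is first joined into a single string with line breaks removed.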


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Synechocystis sp. PCC 6803 49.355 100 0.494
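
The source does not define the Ha-value. The reported figures are consistent with it being the product of identity and coverage, each expressed as a fraction. The sketch below is written under that assumption only; the function name `ha_value` is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: identity fraction multiplied by coverage fraction."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the comA hit in Synechocystis sp. PCC 6803.
score = ha_value(49.355, 100)
print(round(score, 3))  # 0.494
```

Under this assumed definition, a hit with full coverage has an Ha-value equal to its identity fraction, which is what the row above shows.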

