Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   GSQ19_RS04065 Genome accession   NZ_CP047242
Coordinates   1052892..1054520 (+) Length   542 a.a.
NCBI ID   WP_041456500.1    Uniprot ID   -
Organism   Trichormus variabilis 0441     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1047892..1059520
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GSQ19_RS04055 (GSQ19_04055) - 1049562..1051592 (+) 2031 WP_011321513.1 serine/threonine-protein kinase -
  GSQ19_RS04060 (GSQ19_04060) - 1051609..1052778 (-) 1170 WP_011321514.1 cysteine desulfurase family protein -
  GSQ19_RS04065 (GSQ19_04065) comA 1052892..1054520 (+) 1629 WP_041456500.1 DUF655 domain-containing protein Machinery gene
  GSQ19_RS04070 (GSQ19_04070) - 1054621..1055103 (-) 483 WP_041456504.1 gluconokinase -
  GSQ19_RS04075 (GSQ19_04075) sfsA 1055112..1055837 (-) 726 WP_011321517.1 DNA/RNA nuclease SfsA -
  GSQ19_RS04080 (GSQ19_04080) - 1056309..1056977 (+) 669 WP_011321518.1 2OG-Fe dioxygenase family protein -
  GSQ19_RS04085 (GSQ19_04085) gltX 1057061..1058506 (-) 1446 WP_011321519.1 glutamate--tRNA ligase -

Sequence


Protein


Download         Length: 542 a.a.        Molecular weight: 61021.15 Da        Isoelectric Point: 7.8981

>NTDB_id=411811 GSQ19_RS04065 WP_041456500.1 1052892..1054520(+) (comA) [Trichormus variabilis 0441]
MRIFPAFRNFWVFFLIVAIAACQKVQSHNNRPAPLPQDSFVKVYFNQSESSEYREPYRQQTRLGDNLEQQIIDAISQAKS
TIDVAVQELRLPRIAQALKDKQKAGIKVRVILENTYTRSLSNLTPDEVKKLPEREQARYQEYFKFVDLNQDNQLSPEEVN
QRDALIILQNAKIPWIDDQADGSAGSKLMHHKFVVVDNRIVIVTSANFTLSDVFGDFSNSSSLGNANNLLHIDSPELAAL
VTEEFNLMWGDGVGGKPDSKFGLNKPVRPPQKITLGDNTITVHFSPTSPTLPWTQSSNGLINESLNLANKSIDMALFVFS
EQRLANTLEKRHQQQVSIRALIDKQFAYRYYSEALDMMGIALGNKCRYEIDNRPWSNPVTTVGVPTLREGDLLHHKFSVI
DNQTVITGSHNWSDAANHGNDETLIVINNPTIAAHYEREFARLYAKAQVGVPAKVQAQIQQEQKQCGQIKTPTSSELTPT
QVVNINTANLAELETLPGVGKKLAQKIITARQQRKFVSSQDLDKVPGISPKMIENWQGRIQF

Nucleotide


Download         Length: 1629 bp        

>NTDB_id=411811 GSQ19_RS04065 WP_041456500.1 1052892..1054520(+) (comA) [Trichormus variabilis 0441]
GTGCGGATTTTCCCAGCATTTAGGAATTTTTGGGTATTTTTTTTGATAGTGGCGATCGCCGCCTGTCAAAAAGTCCAATC
TCACAATAATCGTCCTGCACCTCTACCGCAAGACTCATTTGTGAAAGTTTACTTTAATCAATCCGAATCCTCAGAATATC
GAGAACCTTACCGTCAACAAACTCGACTGGGAGATAACTTAGAACAGCAGATTATTGACGCTATTTCTCAAGCTAAATCT
ACTATCGATGTAGCAGTACAAGAATTGCGTTTACCGAGAATCGCCCAAGCCCTCAAAGACAAACAAAAAGCGGGAATCAA
AGTCAGAGTAATTTTAGAAAATACCTATACTCGTTCTTTGAGTAACTTGACACCAGATGAAGTCAAGAAATTACCTGAAC
GGGAACAAGCACGCTATCAAGAATACTTTAAATTTGTAGACCTAAACCAAGATAATCAACTCAGTCCTGAGGAAGTTAAT
CAGAGGGATGCACTGATAATTTTACAAAATGCCAAAATTCCTTGGATAGATGATCAAGCTGATGGTTCAGCAGGTAGTAA
GTTGATGCACCATAAGTTTGTGGTTGTAGATAATCGCATAGTAATTGTGACTTCGGCAAACTTCACCTTAAGCGACGTTT
TCGGGGATTTCTCTAATTCTTCAAGTTTGGGAAATGCCAACAACCTATTACACATTGATAGCCCAGAATTAGCAGCTTTG
GTCACAGAAGAATTCAACCTCATGTGGGGTGATGGTGTTGGAGGTAAACCAGACAGTAAATTCGGTTTAAATAAACCTGT
ACGTCCTCCCCAAAAAATTACCTTGGGTGACAACACAATTACTGTGCATTTTTCCCCAACTTCACCCACCTTACCTTGGA
CTCAAAGCAGCAATGGCTTAATTAATGAAAGCTTAAATTTAGCGAATAAATCTATTGATATGGCGTTGTTTGTTTTTTCC
GAACAGCGTCTTGCTAATACATTAGAAAAACGTCATCAACAACAAGTCTCAATTCGAGCATTAATTGATAAACAATTCGC
CTATCGTTATTACAGCGAAGCTTTAGATATGATGGGAATTGCCCTGGGTAATAAATGCCGATATGAAATTGATAATCGAC
CTTGGTCTAATCCCGTTACTACGGTGGGCGTACCCACTTTACGAGAAGGAGACCTGCTACACCATAAATTTTCTGTTATC
GACAACCAAACGGTAATTACAGGTTCTCACAACTGGTCTGATGCAGCAAATCATGGCAATGATGAGACTTTGATAGTAAT
TAATAATCCCACAATTGCTGCTCATTATGAGCGTGAATTTGCTCGTCTTTACGCTAAAGCTCAAGTCGGTGTCCCAGCCA
AAGTCCAAGCACAAATTCAACAAGAACAAAAGCAATGTGGTCAAATTAAAACTCCTACTTCCAGTGAACTTACTCCTACT
CAAGTGGTGAATATCAATACAGCAAATTTGGCAGAATTGGAGACCTTACCCGGTGTAGGTAAAAAGCTAGCCCAAAAAAT
TATCACCGCCCGTCAGCAGAGAAAATTTGTCTCATCACAAGACTTGGATAAAGTACCTGGAATCAGTCCAAAGATGATAG
AAAATTGGCAAGGGCGTATTCAATTTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Synechocystis sp. PCC 6803

49.355

100

0.494


Multiple sequence alignment