Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   GQR42_RS17910 Genome accession   NZ_CP046973
Coordinates   3578126..3579766 (+) Length   546 a.a.
NCBI ID   WP_158200981.1    Uniprot ID   A0A857D6L8
Organism   Microcystis aeruginosa FD4     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3573126..3584766
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GQR42_RS28945 (GQR42_17935) - 3573950..3574141 (-) 192 Protein_3585 Uma2 family endonuclease -
  GQR42_RS29575 (GQR42_17940) - 3574140..3574451 (-) 312 Protein_3586 nucleotidyltransferase family protein -
  GQR42_RS17900 (GQR42_17945) - 3574411..3575271 (-) 861 WP_158200979.1 formylglycine-generating enzyme family protein -
  GQR42_RS17905 (GQR42_17950) - 3575478..3577904 (-) 2427 WP_158200980.1 dynamin-like GTPase family protein -
  GQR42_RS17910 (GQR42_17955) comA 3578126..3579766 (+) 1641 WP_158200981.1 DUF655 domain-containing protein Machinery gene
  GQR42_RS17915 (GQR42_17960) - 3579828..3580139 (-) 312 WP_002797828.1 MgPME-cyclase complex family protein -
  GQR42_RS17920 (GQR42_17965) - 3580198..3580920 (-) 723 WP_199273208.1 pyridoxine 5'-phosphate synthase -
  GQR42_RS17925 (GQR42_17970) - 3580980..3581324 (-) 345 WP_158200983.1 YbaB/EbfC family nucleoid-associated protein -
  GQR42_RS17930 (GQR42_17975) - 3581775..3582971 (+) 1197 WP_158200984.1 META domain-containing protein -
  GQR42_RS17935 (GQR42_17980) - 3583204..3583980 (-) 777 WP_158200985.1 hypothetical protein -

Sequence


Protein


Download         Length: 546 a.a.        Molecular weight: 61484.32 Da        Isoelectric Point: 9.4536

>NTDB_id=409771 GQR42_RS17910 WP_158200981.1 3578126..3579766(+) (comA) [Microcystis aeruginosa FD4]
MWTKMAIDKLRLKKIRPMKQLGFLLGTVFLLGGCHSLNAIQNRPQPLPQDQYMQVYFNYNQSQGSNYTDPYRQINRPGDN
LEQVIIDNINSAKSSIDLAVQELRLPAIAQALVARQQQGIKVRVILENIYNQPINSLAKNQEQLSEREQERYQNLFDLVD
LNRDGKLSTEEIAQRDAITILKKAGIPLIDDSADRSKGSGLMHHKFMVIDHHTTLISSANYTLSDIHGDFSNPKTVGNAN
HLLVITNPPLARVFQREFNLMWGREDQPKFGLDKPHRKPQKITIGNSSLMVKFSPDSITYPWVITSNGLIGETLNKAEKS
VNLALFVLTEQPLVNILAQRHQKGIEIKALIDPSFAFRYYSEGLDLLGVALSNKCRYEPDNQPWQNPINTLGVPALAAGD
KLHHKFGLIDDKIVITGSHNWSAAANHQNDEALVIIDNPTVAAHFDREFQYLYRTAKLGLPETIKNRIEQDRKNCPQSVT
IKSDRLINLNTATKEELDTLPGISGKLAEKIIAARQEKPFTSLEDLDRVSGIGKGKINKIKGKVSW

Nucleotide


Download         Length: 1641 bp        

>NTDB_id=409771 GQR42_RS17910 WP_158200981.1 3578126..3579766(+) (comA) [Microcystis aeruginosa FD4]
ATGTGGACAAAAATGGCAATAGATAAATTGAGATTAAAAAAAATTCGGCCTATGAAACAGTTAGGATTTTTGTTAGGGAC
AGTTTTTCTTTTGGGGGGTTGTCATTCTCTGAATGCTATCCAAAATCGTCCCCAACCCTTGCCCCAAGATCAATATATGC
AAGTCTATTTTAATTATAATCAGTCCCAGGGTTCAAATTATACCGATCCCTATCGCCAAATTAATCGCCCAGGGGATAAT
TTAGAACAGGTTATTATTGATAATATCAACTCGGCAAAAAGTTCGATCGATCTAGCGGTGCAAGAGTTACGTTTACCCGC
CATCGCTCAAGCTTTAGTTGCTCGTCAGCAACAGGGAATTAAAGTTAGGGTTATTCTCGAAAATATTTACAATCAACCTA
TTAATAGTTTAGCCAAAAACCAAGAACAATTAAGCGAACGAGAACAGGAACGCTATCAAAATCTTTTTGATTTAGTTGAT
CTAAATCGAGATGGTAAATTAAGTACAGAAGAAATCGCCCAAAGAGATGCAATTACTATCCTCAAAAAAGCTGGTATTCC
CCTAATTGATGATAGTGCAGATCGCTCAAAAGGTAGCGGTTTAATGCACCATAAATTCATGGTTATTGATCATCACACTA
CTTTAATCAGTTCAGCTAACTATACCTTAAGTGATATTCATGGAGACTTTTCTAATCCCAAAACTGTCGGGAATGCCAAT
CACTTATTAGTAATTACTAATCCCCCACTAGCTAGAGTTTTTCAAAGAGAATTTAATCTGATGTGGGGAAGGGAAGATCA
ACCTAAATTTGGCTTAGATAAACCCCACAGAAAACCGCAAAAAATTACTATCGGTAATAGTAGTCTAATGGTAAAATTTT
CCCCCGATTCTATCACTTATCCCTGGGTAATTACTAGCAATGGTCTGATCGGTGAAACTTTAAATAAAGCTGAAAAATCC
GTGAATTTAGCCCTGTTTGTCTTGACGGAACAGCCTCTAGTTAATATTCTCGCCCAACGACATCAAAAAGGGATAGAAAT
TAAAGCGTTAATTGATCCTAGTTTTGCTTTCCGATACTATAGCGAAGGATTAGATTTATTAGGAGTTGCTCTCAGTAATA
AATGTCGTTATGAACCAGATAATCAGCCTTGGCAAAACCCAATTAATACCCTAGGAGTTCCCGCTCTAGCGGCGGGAGAT
AAATTACACCATAAATTCGGTTTAATCGATGATAAAATTGTGATTACTGGTTCTCATAATTGGTCAGCAGCTGCTAACCA
TCAAAACGATGAAGCTTTAGTTATTATCGATAATCCCACCGTTGCCGCTCATTTCGATCGAGAATTTCAATATCTTTACC
GTACCGCTAAATTAGGACTGCCGGAAACTATTAAAAATCGGATCGAACAGGATAGAAAAAACTGCCCCCAATCTGTAACA
ATTAAAAGCGATCGCTTAATTAATTTAAACACTGCCACAAAAGAAGAATTAGATACTTTACCCGGTATCAGTGGCAAATT
AGCCGAAAAAATTATCGCAGCCCGGCAAGAAAAACCCTTTACTTCCCTAGAAGATTTGGATAGAGTATCGGGAATCGGTA
AGGGAAAAATTAACAAGATCAAAGGCAAAGTTAGCTGGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A857D6L8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Synechocystis sp. PCC 6803

51.032

97.619

0.498


Multiple sequence alignment