Detailed information    

in silico: bioinformatically predicted

Overview


Name: comA
Type: Machinery gene
Locus tag: NSMS1_RS30715
Genome accession: NZ_AP023441
Coordinates: 7107002..7108594 (-)
Length: 530 a.a.
NCBI ID: WP_411908654.1
UniProt ID: -
Organism: Nostoc sp. MS1
Function: DNA uptake (predicted from homology); DNA binding and uptake
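
The coordinates and the protein length in this overview can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence includes its stop codon; the function names are illustrative and not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


# Coordinates of comA (NSMS1_RS30715) as listed in the overview.
cds_bp = span_length(7107002, 7108594)
aa = protein_length(cds_bp)
print(cds_bp, aa)  # 1593 530
```

Under these assumptions the result agrees with the 1593 bp and 530 a.a. reported for this entry.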

Genomic Context


Location: 7102002..7113594
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
NSMS1_RS30700 (NSMS1_60160) | - | 7103793..7105139 (-) | 1347 | WP_224089260.1 | P-loop NTPase fold protein | -
NSMS1_RS30705 (NSMS1_60170) | - | 7105607..7106077 (+) | 471 | WP_224095402.1 | beta-lactamase hydrolase domain-containing protein | -
NSMS1_RS30710 (NSMS1_60180) | - | 7106469..7106951 (+) | 483 | WP_224089265.1 | gluconokinase | -
NSMS1_RS30715 (NSMS1_60190) | comA | 7107002..7108594 (-) | 1593 | WP_411908654.1 | phospholipase D-like domain-containing protein | Machinery gene
NSMS1_RS30720 | - | 7108545..7108688 (-) | 144 | WP_224089267.1 | hypothetical protein | -
NSMS1_RS30725 (NSMS1_60200) | - | 7108761..7109927 (+) | 1167 | WP_224089270.1 | cysteine desulfurase family protein | -
NSMS1_RS30730 (NSMS1_60210) | - | 7110300..7112327 (-) | 2028 | WP_224089273.1 | serine/threonine-protein kinase | -
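
The genomic context table can be checked for internal consistency. The following is a minimal sketch, assuming inclusive 1-based coordinates; the tuple layout and helper names are hypothetical. It recomputes each gene size from its coordinates and reports overlaps between neighbouring genes only.

```python
# (locus tag, start, end, strand, size in bp as listed in the table)
GENES = [
    ("NSMS1_RS30700", 7103793, 7105139, "-", 1347),
    ("NSMS1_RS30705", 7105607, 7106077, "+", 471),
    ("NSMS1_RS30710", 7106469, 7106951, "+", 483),
    ("NSMS1_RS30715", 7107002, 7108594, "-", 1593),  # comA
    ("NSMS1_RS30720", 7108545, 7108688, "-", 144),
    ("NSMS1_RS30725", 7108761, 7109927, "+", 1167),
    ("NSMS1_RS30730", 7110300, 7112327, "-", 2028),
]


def size_bp(start, end):
    """Size of an inclusive coordinate range."""
    return end - start + 1


def neighbour_overlaps(genes):
    """Return (left tag, right tag, shared bp) for overlapping adjacent genes."""
    ordered = sorted(genes, key=lambda g: g[1])
    found = []
    for left, right in zip(ordered, ordered[1:]):
        shared = min(left[2], right[2]) - max(left[1], right[1]) + 1
        if shared > 0:
            found.append((left[0], right[0], shared))
    return found


sizes_ok = all(size_bp(s, e) == listed for _, s, e, _, listed in GENES)
overlaps = neighbour_overlaps(GENES)
print(sizes_ok, overlaps)
```

With the rows above, every listed size matches its coordinates, and the only overlap reported is the 50 bp shared by NSMS1_RS30715 (comA) and NSMS1_RS30720.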

Sequence


Protein


Length: 530 a.a.; Molecular weight: 59704.23 Da; Isoelectric point: 9.1787

>NTDB_id=83265 NSMS1_RS30715 WP_411908654.1 7107002..7108594(-) (comA) [Nostoc sp. MS1]
MFLIIAIAACQKVESHNNRPALLPQDPFVKVYFNQSEASEYKEPYRQQTRLGDNLEQQMIDAISQAKSTVDIAVQELRLP
RIAQALADKQKAGIKVRVILENTYSRPLSSLTPDEVKKLNPREKARYQEYFKFIDLNQDNKITSEEINQQDALIILRNAK
IPWIDDQADGSAGSKLMHHKFIVVDNRIIIVTSANFTLSDVLGDFTNPDSLGNANNLLYIDSPELAVLFTEEFNLMWGDG
PGGKPDSKFGIRKPPRSFKTIILGDNKITVHFSPTSPTLPWSQSSNGLIDKTLKSAKKSVDMALFVFSEQRLANTLEQVH
QQDISIRALIDKQFAYRYYSEGLDMMGVALSNECKYEINNKPWSNPVTTVGIPNLRKGDILHHKFALVDNQTVITGSHNW
SDTANYGNDETLIVINNPIITAHYEREFNRLYAKAQIGVPTKIQAQIQKEKKQCKQIKSPTSAELTTPKVVNLNTASPTE
LETLPGVGKKLAQKIIIARQQQKITSIQDLDKIPGVSKKMIDKWQGNIQF
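
The FASTA header used in this entry packs several fields into one line. As an illustration, the sketch below pulls them apart with a regular expression; the pattern is inferred from this single header and is an assumption, not a documented format specification, and the field names are hypothetical.

```python
import re

# Pattern inferred from the header of this record (assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]+)\) "
    r"\[(?P<organism>[^\]]+)\]$"
)


def parse_header(line):
    """Split a header of this form into a dict; raise if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognised header layout")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (">NTDB_id=83265 NSMS1_RS30715 WP_411908654.1 "
          "7107002..7108594(-) (comA) [Nostoc sp. MS1]")
record = parse_header(header)
print(record["gene"], record["strand"], record["organism"])
```

Headers from other entries may deviate from this layout, so the pattern should be treated as a starting point.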

Nucleotide


Length: 1593 bp

>NTDB_id=83265 NSMS1_RS30715 WP_411908654.1 7107002..7108594(-) (comA) [Nostoc sp. MS1]
TTGTTTTTGATAATTGCGATCGCTGCTTGTCAAAAAGTCGAATCCCATAATAATCGTCCTGCACTTCTGCCGCAAGATCC
ATTTGTGAAAGTTTACTTTAATCAGTCCGAAGCGTCAGAATATAAAGAACCATACCGTCAGCAAACTCGACTGGGAGATA
ATTTAGAACAGCAGATGATTGATGCTATTTCTCAAGCTAAATCTACTGTCGATATAGCAGTACAAGAATTACGTTTACCC
AGAATAGCCCAAGCCCTTGCAGATAAGCAAAAAGCAGGAATCAAAGTCAGAGTAATTTTAGAAAATACCTACAGCCGTCC
TTTGAGTAGCTTGACACCAGATGAAGTCAAGAAATTAAACCCTAGAGAAAAGGCACGATATCAAGAATACTTTAAATTTA
TCGACCTGAATCAAGATAATAAAATTACTTCTGAGGAAATCAATCAGCAAGATGCACTAATAATTTTACGCAATGCCAAA
ATTCCCTGGATAGATGATCAAGCCGATGGTTCAGCCGGAAGTAAGCTGATGCACCACAAATTTATAGTTGTGGATAATCG
AATAATAATTGTCACTTCAGCGAATTTTACTCTTAGCGATGTACTAGGAGACTTTACAAATCCTGATAGTTTAGGAAATG
CCAATAATTTACTATATATCGACAGTCCAGAGTTAGCTGTTTTATTTACAGAAGAGTTTAATTTGATGTGGGGTGATGGC
CCTGGCGGTAAACCGGACAGTAAATTTGGTATACGAAAACCCCCACGTTCCTTCAAAACAATTATTTTAGGTGATAACAA
AATTACTGTGCATTTTTCACCCACATCACCTACCTTACCTTGGAGTCAAAGCAGTAATGGTTTAATTGATAAAACTTTAA
AATCAGCTAAGAAATCTGTTGATATGGCATTATTTGTTTTTTCCGAACAACGTCTTGCTAATACTCTAGAGCAAGTTCAT
CAACAAGACATATCAATTAGAGCATTAATTGATAAACAATTTGCATATCGTTATTATAGCGAAGGCTTAGATATGATGGG
TGTTGCCCTGAGTAACGAATGCAAATATGAAATTAATAATAAGCCTTGGTCTAATCCAGTTACTACAGTGGGCATACCCA
ATTTAAGAAAAGGAGATATTCTTCATCACAAGTTTGCCCTGGTTGACAATCAAACAGTAATTACAGGTTCCCACAACTGG
TCTGACACAGCAAATTATGGTAATGATGAAACTCTTATAGTAATTAATAATCCCATAATTACTGCCCATTATGAGCGGGA
ATTTAATCGTCTTTATGCAAAGGCTCAAATAGGTGTACCAACAAAAATCCAAGCACAAATTCAAAAAGAGAAAAAACAAT
GTAAGCAGATTAAAAGTCCTACTTCCGCAGAACTCACCACACCCAAAGTAGTCAATCTAAATACAGCCAGTCCAACAGAA
CTAGAAACCTTACCCGGTGTTGGTAAAAAACTTGCTCAAAAAATCATTATCGCCCGTCAACAGCAAAAAATTACATCAAT
ACAAGATTTAGATAAAATACCAGGGGTCAGCAAAAAAATGATAGATAAATGGCAAGGAAATATTCAATTCTAA
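
The nucleotide record begins with TTG and ends with TAA, while the protein begins with M. A minimal sketch of how the two sequences relate is given below, assuming the standard bacterial codon assignments and assuming that an initial ATG, GTG or TTG is read as methionine; the start-codon set is deliberately limited to these three and the function name is illustrative.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial starts only


def translate(cds):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 39 and last 18 nucleotides of the record above.
n_term = translate("TTGTTTTTGATAATTGCGATCGCTGCTTGTCAAAAAGTC")
c_term = translate("GGAAATATTCAATTCTAA")
print(n_term, c_term)  # MFLIIAIAACQKV GNIQF
```

Both fragments agree with the start (MFLIIAIAACQKV) and end (GNIQF) of the protein sequence listed for this entry.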


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comA | Synechocystis sp. PCC 6803 | 47.681 | 100 | 0.485


Multiple sequence alignment