Detailed information    

In silico: bioinformatically predicted

Overview


Name: comA
Type: Machinery gene
Locus tag: BJP34_RS08270
Genome accession: NZ_CP017599
Coordinates: 2146720..2148150 (+)
Length: 476 a.a.
NCBI ID: WP_324611035.1
UniProt ID: -
Organism: Moorena producens PAL-8-15-08-1
Function: DNA uptake (predicted from homology)
DNA binding and uptake
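
The coordinates and the two lengths quoted in this record are related by simple arithmetic. The following is a minimal sketch of that check; the function names are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
def parse_location(location: str) -> tuple[int, int, str]:
    """Parse a location such as '2146720..2148150 (+)' into (start, end, strand)."""
    span, strand = location.split()
    start, end = (int(part) for part in span.split(".."))
    return start, end, strand.strip("()")


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return length_bp // 3 - 1


start, end, strand = parse_location("2146720..2148150 (+)")
length_bp = gene_length_bp(start, end)
print(length_bp, protein_length_aa(length_bp), strand)
```

For this entry the arithmetic gives 1431 bp and 476 residues, matching the lengths listed in the Sequence section.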

Genomic Context


Location: 2141720..2153150
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
BJP34_RS08250 (BJP34_08250) | - | 2142299..2143546 (-) | 1248 | WP_229424287.1 | RNA-guided endonuclease InsQ/TnpB family protein | -
BJP34_RS08255 (BJP34_08255) | - | 2143921..2144676 (-) | 756 | WP_070391931.1 | DUF1995 family protein | -
BJP34_RS08260 (BJP34_08260) | - | 2144804..2145967 (-) | 1164 | WP_070391932.1 | cysteine desulfurase family protein | -
BJP34_RS08265 (BJP34_08265) | - | 2146086..2146466 (-) | 381 | WP_070391933.1 | DUF7682 family zinc-binding protein | -
BJP34_RS08270 (BJP34_08270) | comA | 2146720..2148150 (+) | 1431 | WP_324611035.1 | phospholipase D-like domain-containing protein | Machinery gene
BJP34_RS36340 | - | 2148645..2149796 (+) | 1152 | WP_083305054.1 | aspartate carbamoyltransferase catalytic subunit | -
BJP34_RS08280 (BJP34_08280) | - | 2149844..2150704 (-) | 861 | WP_070391935.1 | hypothetical protein | -
BJP34_RS39080 (BJP34_08285) | - | 2150772..2152874 (-) | 2103 | WP_070391936.1 | hypothetical protein | -
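
The Location range above equals the comA coordinates extended by 5000 bp on each side. The sketch below reproduces that window and checks which neighbouring genes fall inside it. The 5000 bp flank is inferred from the numbers on this page rather than stated by the database, and the function names are hypothetical.

```python
FLANK_BP = 5000  # assumption: inferred from the coordinates shown on this page


def context_window(start: int, end: int, flank: int = FLANK_BP) -> tuple[int, int]:
    """Extend a gene's coordinates by `flank` bp on both sides (never below 1)."""
    return max(1, start - flank), end + flank


def is_within(window: tuple[int, int], feature: tuple[int, int]) -> bool:
    """True if the feature lies entirely inside the window."""
    return window[0] <= feature[0] and feature[1] <= window[1]


# Coordinates copied from the table above.
neighbours = {
    "BJP34_RS08250": (2142299, 2143546),
    "BJP34_RS08265": (2146086, 2146466),
    "BJP34_RS36340": (2148645, 2149796),
    "BJP34_RS39080": (2150772, 2152874),
}

window = context_window(2146720, 2148150)
print(window)
for locus_tag, coords in neighbours.items():
    print(locus_tag, is_within(window, coords))
```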

Sequence


Protein


Length: 476 a.a.; Molecular weight: 52598.85 Da; Isoelectric point: 7.1680

>NTDB_id=200604 BJP34_RS08270 WP_324611035.1 2146720..2148150(+) (comA) [Moorena producens PAL-8-15-08-1]
MLILLILCGCKQAQPQVSLEPPLPQDPWVQAYFNHSRSAQYIDPYRGVTREGDNLEQVIVDAIASAKSTVDVAVQELRMP
RIAQALVERKQAGVKVRVILDNNYSLPVSKLTAESVAKLRQRERSRYNESLILIDRDGDGQLSAREISEGDALIILGNGG
VPMIDDTADGSRGSGLMHHKFIVIDNKVSIITSANFTLSDLHGDFQFRESRGNANNLLKIDSVELANQFTQEFNLMWGDG
AGGKPDSKFGLKKPWRPVEQVTLGDTSVGDTSVGDTTVGDTTIAIQFSPTSTTKPWHQSSNGLIAEILPQAKETIHIALF
VFSEQRLTNVLEESHQQGVKVKALIEPSFAFRYYSEGLDMMGVAVAHKCKYETGNRPWRQPIATVGVPQLPEGDVLHHKF
AILDGKIVITGSHNWSKAANTRNDETVLVIHSPMVAAHFEREFQRLYGKAVKGVPVRVQQKIKAQQQKCVPLDPHS

Nucleotide


Length: 1431 bp

>NTDB_id=200604 BJP34_RS08270 WP_324611035.1 2146720..2148150(+) (comA) [Moorena producens PAL-8-15-08-1]
TTGCTGATACTATTGATTTTGTGTGGCTGTAAACAAGCTCAGCCACAGGTTTCTCTTGAGCCCCCTCTGCCCCAAGACCC
TTGGGTTCAGGCTTACTTTAATCATAGTAGGTCAGCCCAATACATAGATCCTTACCGGGGAGTAACTCGGGAGGGAGATA
ACTTGGAGCAAGTGATTGTGGATGCGATCGCATCAGCAAAGTCAACAGTAGATGTGGCGGTTCAGGAGTTGCGTATGCCC
AGAATTGCCCAAGCTCTAGTAGAACGTAAGCAAGCTGGTGTCAAGGTCAGAGTCATTTTAGACAATAACTACAGTCTTCC
GGTCAGTAAACTTACCGCTGAGTCAGTGGCAAAACTGCGCCAAAGAGAACGCTCACGCTACAATGAGTCCCTAATATTGA
TCGACCGTGACGGAGATGGTCAGCTGAGTGCAAGGGAAATTTCAGAAGGAGACGCTTTAATCATCTTAGGAAATGGTGGT
GTACCTATGATTGATGACACAGCGGATGGTTCTCGGGGCAGTGGCTTAATGCATCATAAGTTTATAGTTATTGATAACAA
AGTTTCTATTATTACCTCAGCCAACTTCACTCTTAGTGATCTACATGGTGACTTTCAGTTTCGAGAGAGTCGTGGCAATG
CCAACAATCTGCTCAAGATTGACAGTGTAGAGTTAGCTAACCAATTCACCCAGGAATTTAACCTGATGTGGGGGGATGGT
GCTGGGGGTAAACCAGACAGTAAGTTTGGTCTCAAGAAGCCATGGCGACCAGTAGAGCAGGTGACACTGGGGGATACCAG
TGTGGGGGATACCAGTGTGGGGGATACGACTGTGGGGGATACAACTATCGCGATACAATTCTCCCCAACCTCCACTACCA
AACCTTGGCATCAGAGTAGCAATGGTCTGATCGCTGAAATCCTTCCTCAGGCTAAGGAAACCATTCACATAGCTTTGTTT
GTCTTTTCCGAGCAACGGCTAACTAATGTTTTAGAGGAATCCCATCAACAAGGTGTTAAGGTCAAAGCGCTAATTGAGCC
AAGTTTTGCCTTTCGCTATTACAGTGAGGGTTTGGACATGATGGGAGTAGCTGTTGCTCACAAGTGTAAATATGAGACTG
GTAACCGTCCTTGGCGTCAACCGATTGCTACCGTCGGTGTGCCTCAGTTGCCAGAAGGAGATGTCTTACATCACAAATTT
GCCATTCTGGATGGAAAGATAGTGATTACAGGCTCTCACAACTGGTCAAAAGCAGCAAATACCAGGAACGATGAGACGGT
ATTAGTTATTCATAGTCCTATGGTTGCTGCCCATTTTGAGCGGGAGTTTCAACGCCTCTATGGCAAAGCAGTGAAGGGAG
TACCAGTTAGAGTTCAGCAAAAGATTAAAGCACAGCAACAAAAATGTGTGCCTCTTGATCCACATTCTTGA
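
The nucleotide record begins with TTG, while the protein record begins with M. An alternative start codon is read as methionine at the initiation position. The sketch below illustrates this on the first and last codons of the sequence above. It builds the standard codon table by hand, treats ATG, GTG and TTG as start codons (a subset of those in the bacterial translation table), and uses hypothetical names throughout; it is an illustration, not the pipeline used by the database.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: subset of bacterial start codons


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reading a start codon in position 1 as Met."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(cds), 3):
        codon = cds[index:index + 3]
        residue = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is translated as methionine
        if residue == "*":
            break  # stop codon ends translation
        residues.append(residue)
    return "".join(residues)


first_30_nt = "TTGCTGATACTATTGATTTTGTGTGGCTGT"  # start of the record above
last_12_nt = "CCACATTCTTGA"  # end of the record above, including the stop codon
print(translate_cds(first_30_nt))
print(translate_cds(last_12_nt))
```

The two fragments translate to MLILLILCGC and PHS, which match the first ten and last three residues of the protein record.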

Domains


Predicted by InterProScan.

Residues 59-237

Residues 306-446


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comA | Synechocystis sp. PCC 6803 | 48.319 | 100 | 0.483
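
This page does not define the Ha-value. The figures in the row above are consistent with the product of identity and coverage, both expressed as fractions; the sketch below treats that formula as an assumption, and the function name is hypothetical.

```python
def ha_value(identity_pct: float, coverage_pct: float) -> float:
    """Assumed definition: (identity / 100) * (coverage / 100)."""
    return (identity_pct / 100.0) * (coverage_pct / 100.0)


# Values from the Similar proteins row for comA of Synechocystis sp. PCC 6803.
print(round(ha_value(48.319, 100), 3))
```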


Multiple sequence alignment