Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   HH212_RS23215 Genome accession   NZ_CP051685
Coordinates   5536437..5538923 (-) Length   828 a.a.
NCBI ID   WP_170204653.1    Uniprot ID   A0A7Z2W196
Organism   Massilia forsythiae strain GN2-R2     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 5531437..5543923
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  HH212_RS23200 (HH212_23200) - 5531830..5535090 (-) 3261 WP_170204651.1 DEAD/DEAH box helicase -
  HH212_RS23205 (HH212_23205) - 5535282..5535641 (+) 360 WP_211172400.1 hypothetical protein -
  HH212_RS23210 (HH212_23210) - 5535719..5536390 (-) 672 WP_170204652.1 hypothetical protein -
  HH212_RS23215 (HH212_23215) comA 5536437..5538923 (-) 2487 WP_170204653.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  HH212_RS27860 (HH212_23220) - 5539012..5540961 (-) 1950 WP_370663889.1 T6SS immunity protein Tli4 family protein -
  HH212_RS23225 (HH212_23225) - 5540981..5542651 (-) 1671 WP_170204655.1 triacylglycerol lipase -

Sequence


Protein


Download         Length: 828 a.a.        Molecular weight: 87445.82 Da        Isoelectric Point: 9.2228

>NTDB_id=440470 HH212_RS23215 WP_170204653.1 5536437..5538923(-) (comA) [Massilia forsythiae strain GN2-R2]
MRCLILGFAAGVFWLQQASSLPAALSLAGCAAAALALSILAAFVRRAAPSRLRGAAAVVLMLAAAGGLSGYAWAALLAQR
ALAPMLAATDEGRDLAIVGVVDNLPASFEQGVRFNFLVERTLTAGAAAPPRVALSWYANPRGASGRAAAASSVSAQDALP
EIEPGQRWQLTVRLQRPHGNANPGGFDYEAWLLEQGVRATGYVRTGRAAAGVPAAVLLDEFAPSLPGVVERCRAWLRERI
LRALAGRQYAGVIVALVIGDQRGIDQADWQVFNRTGIGHLVSISGLHITMIAGLAALGASALWRRSFFTDAQLPLLLPAQ
KVAALAGAVTALLYVLLAGFGVPAQRTLYMLSVVALALWSGRLTAVSHVLCAALGVVLLLDPWAVLWPGFWLSFGAVAMI
LFAGHGRINPPLRGLCGTLLGAGHTQWAVTLGLVPLTMLLFGQVSLVSPLANAVAIPLVSFVVTPLALAGSLLPDPLCGW
LLALAHAAVAALAWLLGWMAGLPLAVWRAPAPQAWVFLLALGGTLWLLMPRGWPLRWSGAIAWLPLLLHLPDHPPAGSVR
VTAFDVGQGMALLVETAGHRLLYDTGPAYAPGADAGSRVILPYLRMRGIGALDGIVVSHGDLDHTGGALALLGELEVGWL
ASSLGEEHAIARAAPRHLHCMAGQRWEWDGIRFEMLHPAPSSYGDAGLKANARSCVLRIVNATHALLLAGDIEAAQEAGL
VADRAQALRADVLLAPHHGSGTSSTPAFLQAVRPSIGIFQVGYRNRYRHPKAEVYERYRMLGIERLRTDTSGAVTFDVDA
AVTVEAYRSAHARYWYAGGMTADHLQSY

Nucleotide


Download         Length: 2487 bp        

>NTDB_id=440470 HH212_RS23215 WP_170204653.1 5536437..5538923(-) (comA) [Massilia forsythiae strain GN2-R2]
ATGCGCTGCCTGATCCTGGGGTTTGCCGCCGGCGTGTTCTGGCTGCAGCAGGCGTCGTCCCTGCCGGCGGCACTGTCGCT
GGCCGGCTGCGCCGCAGCCGCACTGGCGCTGTCGATATTGGCCGCGTTTGTCCGCCGCGCAGCCCCGTCTCGTTTGCGTG
GCGCCGCCGCTGTGGTCCTGATGCTCGCGGCCGCCGGCGGCTTGTCCGGTTACGCATGGGCCGCGCTGCTGGCGCAGCGC
GCATTGGCGCCAATGCTGGCGGCCACCGACGAAGGGCGCGACCTGGCCATCGTCGGCGTGGTCGACAACCTGCCGGCCAG
CTTCGAGCAGGGCGTGCGTTTCAACTTCCTGGTCGAGCGCACGCTCACGGCGGGCGCGGCCGCGCCGCCGCGCGTGGCGC
TGTCCTGGTATGCCAACCCGCGCGGCGCGTCCGGCCGCGCCGCTGCGGCGTCCAGCGTGTCCGCGCAAGACGCGCTGCCC
GAGATCGAGCCCGGCCAGCGCTGGCAGTTGACGGTGCGCCTGCAGCGCCCGCACGGCAACGCCAATCCCGGCGGCTTCGA
CTACGAAGCCTGGCTGCTGGAACAGGGCGTGCGCGCCACCGGCTACGTGCGCACCGGGCGCGCTGCGGCAGGAGTGCCGG
CGGCGGTCCTGCTGGACGAGTTCGCGCCCAGCCTGCCGGGCGTGGTCGAGCGCTGCCGCGCCTGGCTGCGCGAACGCATC
CTGCGCGCGCTGGCGGGGCGCCAGTACGCCGGCGTCATCGTCGCGCTGGTGATCGGCGACCAGCGCGGCATCGACCAGGC
CGACTGGCAGGTGTTCAACCGCACCGGCATCGGCCACCTGGTTTCGATCTCGGGATTGCACATCACGATGATCGCCGGGC
TTGCGGCACTCGGCGCCTCGGCGCTGTGGCGGCGCTCGTTCTTCACCGATGCCCAGTTGCCGCTGCTGCTGCCGGCGCAG
AAGGTGGCGGCGCTGGCCGGCGCCGTCACCGCGCTGCTGTACGTGCTGCTGGCCGGCTTCGGGGTGCCGGCCCAGCGCAC
CTTGTACATGCTGTCGGTGGTGGCGCTGGCGCTGTGGAGCGGACGGCTGACGGCGGTCTCGCACGTGCTGTGCGCGGCGC
TCGGCGTAGTGCTGCTGCTGGACCCCTGGGCGGTGTTGTGGCCGGGCTTCTGGCTGTCGTTCGGCGCGGTGGCGATGATC
CTGTTCGCCGGCCATGGCCGCATCAATCCGCCGCTGCGCGGCCTGTGCGGCACGCTGCTGGGGGCGGGGCACACGCAGTG
GGCGGTGACGCTCGGCCTGGTGCCGCTGACGATGCTGCTGTTCGGCCAGGTATCGCTGGTCAGTCCGCTGGCGAATGCGG
TGGCGATCCCGCTGGTGAGCTTCGTGGTCACGCCGCTGGCACTGGCCGGCAGCCTGCTGCCCGATCCGTTGTGCGGCTGG
CTGCTGGCGCTGGCGCACGCGGCGGTCGCGGCGCTGGCCTGGCTGCTGGGCTGGATGGCGGGACTGCCGCTGGCGGTATG
GCGCGCACCGGCGCCGCAGGCCTGGGTATTCCTGCTGGCGCTGGGCGGCACGCTGTGGCTGTTGATGCCGCGCGGCTGGC
CGCTGCGCTGGAGCGGCGCGATCGCCTGGCTGCCCTTGCTGCTGCACCTGCCCGATCACCCGCCGGCAGGCAGCGTGCGC
GTCACCGCCTTCGACGTCGGCCAGGGCATGGCGCTGCTGGTCGAGACCGCGGGCCACCGCCTGCTGTACGACACCGGCCC
GGCCTATGCGCCCGGCGCGGATGCCGGCAGCCGCGTGATCCTGCCGTACCTGCGCATGCGCGGCATCGGGGCGCTGGACG
GCATCGTGGTCAGCCATGGCGACCTCGATCACACCGGCGGCGCCCTGGCGCTGCTGGGGGAACTCGAGGTCGGCTGGCTG
GCGTCGTCGCTCGGTGAAGAGCACGCGATCGCGCGGGCGGCGCCGCGCCACCTGCATTGCATGGCCGGCCAGCGCTGGGA
GTGGGACGGCATCCGCTTCGAGATGCTGCATCCGGCGCCGTCGAGTTATGGCGACGCCGGCCTGAAGGCGAATGCGCGCA
GCTGCGTGCTGCGCATCGTCAATGCCACGCATGCGTTGCTGCTGGCGGGCGACATCGAGGCGGCGCAGGAAGCCGGCCTG
GTGGCGGATCGAGCGCAGGCGCTGCGCGCCGACGTGCTGCTGGCGCCGCACCACGGCAGCGGCACCTCGTCCACGCCGGC
CTTTTTGCAGGCGGTGCGGCCGTCGATCGGCATCTTCCAGGTGGGGTACCGCAACCGTTACCGCCATCCCAAGGCCGAGG
TCTACGAGCGTTACCGGATGCTGGGCATCGAGCGCCTGCGCACCGACACCAGCGGCGCCGTGACGTTCGACGTGGATGCA
GCCGTGACGGTGGAAGCCTACCGCAGCGCGCACGCCCGTTACTGGTATGCGGGGGGCATGACGGCAGACCATTTACAAAG
TTATTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7Z2W196

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

49.11

100

0.5