Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   AM586_RS00840 Genome accession   NZ_CP012640
Coordinates   164655..167039 (+) Length   794 a.a.
NCBI ID   WP_047825929.1    Uniprot ID   A0A0J1DA05
Organism   Massilia sp. WG5     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 159655..172039
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AM586_RS00830 (AM586_13645) - 161162..162829 (+) 1668 WP_047825927.1 alpha/beta fold hydrolase -
  AM586_RS00835 (AM586_13650) - 163065..164624 (+) 1560 WP_047825928.1 T6SS immunity protein Tli4 family protein -
  AM586_RS00840 (AM586_13655) comA 164655..167039 (+) 2385 WP_047825929.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  AM586_RS00845 (AM586_13660) - 167041..168552 (-) 1512 WP_047825930.1 cell wall metabolism sensor histidine kinase WalK -
  AM586_RS00850 (AM586_13665) - 168555..169241 (-) 687 WP_047825931.1 response regulator transcription factor -
  AM586_RS00855 (AM586_13670) - 169378..169866 (+) 489 WP_162600506.1 Spy/CpxP family protein refolding chaperone -
  AM586_RS00860 (AM586_13675) - 169986..170813 (+) 828 WP_047825972.1 alpha/beta fold hydrolase -
  AM586_RS00865 (AM586_13680) dcd 170853..171419 (+) 567 WP_047825933.1 dCTP deaminase -

Sequence


Protein


Download         Length: 794 a.a.        Molecular weight: 84718.50 Da        Isoelectric Point: 9.9465

>NTDB_id=155930 AM586_RS00840 WP_047825929.1 164655..167039(+) (comA) [Massilia sp. WG5]
MRSAILGFTAGVLALQSSAALPSAALMLACAAGAVPCILLARYAERTRGTACLAACAGVLLGYAWAAWLAHQALAPALGA
ADEGRDLSVVGVIDSLPYRFEQGLRFNLLVERSDTPGATVPPRIALSWYADRRGEAGLPERLMPGERWRLTVRLQRPHGN
ANPGGFDYEAWLLEQGVRATGYVRPAPGNAKLDPFVPRFGTFVERSRAVLRERILRALDGRQYAGVIVALVIGDQRGIDQ
SDWQVFNRTGIGHLISISGLHITMVAGLAALGVSSLWRRSFFTGLQLPLRLPAQKVAALAGALAALLYVLLAGFGVPAQR
TLYMLAVIALALWLGRLSSVSHVLCAALGAVLLLDPWAVLWPGFWLSFGAVAVILFGGHGRIAARAPGWREKLRAAGRTQ
WSVTLGLVPLSMLLFGQVSLVSPLANAVAIPLVSFVVTPLALLGSLLPEPLCGWLLLPAHAVVAGLAGALGWLAALPAAV
WRAPAPQAWLFLLALGGTLWMLMPRGWPHRWAGATAWLPLLLQVPAYPEHGAFRVTAFDVGQGMALLVETAGHRLLYDTG
PGYAPGADAGSRVLLPYLRMRGIGALDGMVVSHGDTDHTGGALALLAEQRIGWVASSLDAGHPIARAARRHLHCAAGQRW
EWDGVRFEMLHPAASVYADAGPKANARSCTLRIGNGRTAILLAGDIEAAQEAALVRTAPESLRADVLLAPHHGSGTSSTE
AFLQAVHPAVGVFQVGYRNRYKHPKKEVYERYGQLGIERLRTDELGAVTLDFDGAVAHQAYRTAHARYWSAPPP

Nucleotide


Download         Length: 2385 bp        

>NTDB_id=155930 AM586_RS00840 WP_047825929.1 164655..167039(+) (comA) [Massilia sp. WG5]
ATGCGCAGCGCCATCCTTGGCTTCACGGCCGGCGTGCTGGCGCTGCAATCCTCCGCCGCCTTGCCGTCCGCCGCCCTCAT
GCTCGCCTGCGCGGCCGGCGCCGTGCCGTGCATCCTTCTCGCCCGCTACGCCGAACGGACCCGGGGCACGGCATGCCTCG
CCGCCTGCGCCGGCGTCTTGCTCGGCTACGCCTGGGCAGCATGGCTGGCCCACCAGGCCCTCGCGCCGGCACTGGGCGCC
GCCGACGAAGGCCGCGACCTGAGCGTTGTCGGCGTGATCGACAGCCTTCCTTACCGCTTCGAGCAGGGCCTGCGCTTCAA
CCTGCTGGTCGAACGCAGCGACACGCCCGGCGCCACGGTCCCGCCGCGGATCGCGCTGTCCTGGTACGCCGACCGGCGCG
GCGAGGCGGGCCTGCCCGAGCGCTTGATGCCCGGCGAACGCTGGCGCCTGACGGTGCGCCTGCAGCGCCCGCACGGCAAC
GCCAATCCCGGCGGCTTCGACTACGAGGCCTGGCTGCTGGAGCAGGGTGTGCGCGCCACCGGCTACGTGCGCCCGGCGCC
GGGCAACGCGAAGCTGGATCCGTTCGTGCCGCGTTTCGGCACATTCGTCGAACGCAGCCGCGCCGTGCTGCGCGAGCGCA
TCCTGCGCGCGCTCGACGGGCGGCAGTACGCGGGCGTGATCGTCGCGCTGGTGATCGGCGACCAGCGCGGCATCGACCAG
TCGGACTGGCAGGTCTTCAACCGCACCGGCATCGGCCACCTGATCTCGATCTCGGGCCTGCATATCACGATGGTCGCCGG
GCTCGCCGCGCTGGGGGTGTCGAGCCTGTGGCGGCGTTCCTTCTTCACCGGCCTCCAATTGCCGCTGCGACTGCCCGCGC
AGAAGGTGGCGGCGCTGGCCGGGGCGCTGGCCGCGCTGCTGTACGTGCTGCTGGCCGGCTTCGGCGTGCCGGCCCAGCGC
ACGCTCTACATGCTGGCGGTGATCGCGCTGGCGCTGTGGCTGGGCCGGCTCTCGAGCGTGTCGCACGTGCTGTGCGCGGC
GCTCGGCGCGGTGCTGTTGCTCGATCCGTGGGCCGTGCTGTGGCCGGGCTTCTGGCTGTCCTTCGGCGCGGTGGCGGTCA
TCCTGTTCGGCGGCCACGGCAGGATCGCCGCGCGCGCGCCGGGCTGGCGCGAGAAGCTGCGCGCGGCCGGCCGCACCCAG
TGGTCGGTGACGCTGGGGCTGGTGCCGCTGTCGATGCTGCTTTTCGGCCAGGTGTCGCTGGTCAGCCCGCTGGCGAATGC
GGTCGCGATCCCGCTGGTGAGCTTCGTGGTGACGCCGCTGGCGCTGCTGGGCAGCCTCTTGCCGGAACCCCTGTGCGGCT
GGCTGCTGCTGCCGGCGCATGCCGTCGTCGCCGGGCTGGCGGGCGCCCTCGGCTGGCTGGCCGCGCTGCCGGCCGCGGTA
TGGCGCGCGCCCGCGCCGCAAGCCTGGCTGTTCCTGCTGGCGCTGGGCGGCACGCTGTGGATGCTGATGCCGCGCGGCTG
GCCGCATCGCTGGGCCGGCGCCACGGCTTGGCTGCCCTTGCTGTTGCAGGTGCCGGCGTATCCCGAGCATGGCGCCTTCC
GCGTCACCGCCTTCGACGTCGGCCAGGGCATGGCGCTGCTGGTCGAAACCGCCGGCCACCGCCTGCTCTACGACACCGGC
CCCGGCTACGCACCGGGCGCCGACGCCGGCAGCCGGGTGCTGCTGCCTTACCTGCGCATGCGCGGCATCGGCGCGCTCGA
CGGCATGGTCGTCAGCCACGGCGACACCGACCACACCGGCGGCGCGCTGGCGCTGCTGGCCGAGCAGCGAATCGGCTGGG
TGGCGTCCTCGCTGGATGCCGGCCATCCGATCGCGCGCGCGGCGCGGCGCCACCTGCATTGCGCGGCCGGCCAGCGGTGG
GAATGGGATGGCGTGCGCTTCGAGATGCTGCATCCGGCCGCGTCGGTCTACGCCGATGCCGGTCCGAAGGCGAACGCGCG
CAGCTGCACGCTGCGGATCGGGAACGGGCGCACGGCGATCCTGCTGGCAGGCGACATCGAAGCGGCCCAGGAAGCGGCGC
TGGTGAGGACGGCCCCGGAAAGCCTGCGCGCCGACGTGCTGCTGGCCCCGCACCACGGCAGCGGCACCTCGTCGACCGAG
GCCTTCCTGCAGGCGGTGCACCCGGCCGTCGGCGTGTTCCAGGTCGGCTACCGCAACCGTTACAAGCATCCGAAAAAAGA
GGTCTACGAGCGCTACGGCCAGCTCGGCATCGAGCGCCTGCGCACCGACGAACTGGGCGCCGTCACGCTCGACTTCGACG
GCGCCGTTGCGCACCAGGCCTACCGGACAGCGCACGCCCGCTACTGGAGCGCGCCGCCGCCCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0J1DA05

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

49.043

100

0.516

  comA Pseudomonas stutzeri DSM 10701

39.184

92.569

0.363


Multiple sequence alignment