Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   DPH57_RS07300 Genome accession   NZ_CP030092
Coordinates   1705598..1708030 (-) Length   810 a.a.
NCBI ID   WP_112937097.1    Uniprot ID   A0A7U5XX48
Organism   Massilia sp. YMA4     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1700598..1713030
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DPH57_RS07275 (DPH57_07275) - 1701164..1701733 (-) 570 WP_227470544.1 Spy/CpxP family protein refolding chaperone -
  DPH57_RS07280 (DPH57_07280) - 1701919..1702620 (+) 702 WP_112937091.1 response regulator transcription factor -
  DPH57_RS07285 (DPH57_07285) - 1702629..1704053 (+) 1425 WP_112937092.1 ATP-binding protein -
  DPH57_RS07290 (DPH57_07290) dcd 1704072..1704638 (-) 567 WP_112937093.1 dCTP deaminase -
  DPH57_RS07295 (DPH57_07295) - 1704707..1705537 (-) 831 WP_112937094.1 alpha/beta fold hydrolase -
  DPH57_RS07300 (DPH57_07300) comA 1705598..1708030 (-) 2433 WP_112937097.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  DPH57_RS07305 (DPH57_07305) - 1708134..1709153 (-) 1020 WP_112937099.1 LacI family DNA-binding transcriptional regulator -
  DPH57_RS07310 (DPH57_07310) - 1709202..1710556 (-) 1355 Protein_1474 MFS transporter -
  DPH57_RS07315 (DPH57_07315) gudD 1710864..1712204 (+) 1341 WP_112937101.1 glucarate dehydratase -

Sequence


Protein


Download         Length: 810 a.a.        Molecular weight: 86048.40 Da        Isoelectric Point: 10.8591

>NTDB_id=299056 DPH57_RS07300 WP_112937097.1 1705598..1708030(-) (comA) [Massilia sp. YMA4]
MRGAILGFVGGSALLQLQSALPGTAMLACAALLCCATLALQWRRTGTSGSLLRIAAGTLAGFCWAAIVAARALAPALAPQ
DEGRDATVIGTIDSLPAPGQGSTRFHFALEDVLAPAKMAIPPRIALSWYGGDGSLTRSVRPGERWQLKVRLQRPHGNANP
GGFDYELWLLEQGVRATGYVRNEGPNRRLAAFVPTPHNLVQRSRDRLRERILAALADRPYAGVIVALVVGDQRGIGQADW
QVFTRTGIGHLVSISGLHITMIAGLVAGIASWLWRRSFLMPTLQLPLRLPAQKVAALAGMLAALGYVLLAGFGVPAQRTL
YMLGVVALALWTGRLPGVTHVLALAAGVVVLLDPWAVMWPGFWLSFGAVALILYATVGRTAVYKQAPAADAPPDQTRRGR
PLTRRQRLMQALYGAALTQYAVTLGLVPLTMLLFAQVSVVSPLANAIAIPVVSLLVTPLALAGGVLPEPLATPLLLTAHA
VVELLVALLRWLSAGRLAVWAAPAPAWWTFALALAGTCWLLAPRGWPNRWLGLATWVPLLAAQPDAPPAGTLRIVAFDVG
QGMAVLVETSGHRLLYDTGPAYGPDADAGSRVIVPYLRWRGIERLDGVIVSHGDTDHVGGALSLLRAVPAGWLLSSLAPA
HPVVQAAPRQVPCAAGQGWTWDGIRFDVLHPTAASHADGTIKSNARSCVLRIGAPGGTVLLAGDIEAAQETELLARGARL
RADVLLAPHHGSGTSSTPAFLAAVRPRVAVFQVGYRNRYRHPKAAVYERYGALGVERVRTDEAGAVTLWLDGAVGMTAQR
QARPRYWHAR

Nucleotide


Download         Length: 2433 bp        

>NTDB_id=299056 DPH57_RS07300 WP_112937097.1 1705598..1708030(-) (comA) [Massilia sp. YMA4]
ATGCGCGGTGCAATCCTCGGCTTCGTGGGCGGTAGCGCGTTGCTGCAACTGCAGTCTGCGCTGCCCGGTACGGCCATGCT
GGCATGCGCGGCGCTGCTGTGTTGCGCGACGCTCGCACTGCAGTGGCGTCGGACCGGAACGTCCGGTTCGCTGCTGCGCA
TCGCCGCTGGCACCTTGGCGGGGTTCTGCTGGGCCGCCATCGTCGCGGCTCGAGCGCTTGCGCCAGCCCTGGCGCCGCAA
GACGAAGGCCGCGATGCGACCGTCATCGGCACCATCGACAGCCTGCCGGCCCCCGGTCAGGGCAGCACCCGCTTTCACTT
CGCGCTAGAAGATGTCCTGGCGCCCGCCAAGATGGCGATACCGCCGCGCATTGCGCTGTCCTGGTACGGCGGCGACGGCA
GCTTGACGCGGTCCGTGCGCCCGGGCGAGCGCTGGCAACTCAAGGTGCGGCTGCAGCGGCCGCACGGCAACGCCAACCCG
GGCGGCTTCGATTACGAGCTGTGGCTGCTGGAGCAGGGCGTGCGTGCCACGGGCTACGTCCGCAACGAAGGACCGAACCG
GCGACTGGCCGCGTTCGTGCCGACGCCGCACAACCTCGTGCAGCGCTCGCGCGACCGGTTGCGCGAGCGTATCCTGGCCG
CGCTGGCGGACCGGCCGTACGCCGGCGTCATCGTCGCGCTCGTGGTCGGCGACCAGCGCGGCATCGGGCAGGCCGACTGG
CAGGTATTCACCCGCACCGGCATCGGCCACCTGGTGTCGATCTCCGGCCTGCACATCACGATGATTGCTGGCCTGGTGGC
CGGTATCGCATCCTGGCTGTGGCGCCGGTCCTTCCTGATGCCCACCCTGCAGCTGCCCCTGCGCCTGCCGGCGCAGAAGG
TGGCCGCGCTGGCCGGCATGCTGGCGGCGCTGGGCTACGTGCTGCTGGCCGGCTTCGGGGTGCCGGCGCAGCGCACCCTG
TACATGCTGGGCGTCGTCGCGCTGGCGTTGTGGACAGGTCGCTTGCCGGGCGTAACGCACGTGCTGGCACTGGCGGCCGG
CGTCGTCGTGCTGCTCGATCCGTGGGCCGTGATGTGGCCGGGGTTCTGGCTGTCGTTCGGCGCCGTGGCGCTGATTCTGT
ACGCGACCGTGGGGCGCACCGCCGTCTACAAGCAGGCGCCGGCCGCCGACGCGCCGCCTGACCAGACCCGGCGCGGCCGT
CCATTGACGCGGCGCCAGCGGCTGATGCAGGCCCTGTACGGCGCCGCGCTGACGCAGTATGCGGTGACACTCGGGCTGGT
GCCGCTGACGATGCTGCTGTTCGCCCAGGTCTCCGTAGTCAGCCCGCTGGCCAACGCCATCGCGATCCCGGTCGTCAGCC
TGCTGGTGACGCCGCTGGCCCTGGCGGGCGGCGTGCTGCCCGAGCCGCTTGCCACGCCGCTGCTCCTGACGGCACATGCG
GTGGTCGAACTGCTGGTGGCGCTGCTGCGCTGGCTCAGCGCCGGCCGCCTGGCCGTGTGGGCGGCACCGGCGCCGGCGTG
GTGGACGTTCGCGCTGGCCCTGGCCGGCACCTGTTGGCTGCTGGCCCCGCGCGGCTGGCCCAACCGCTGGCTGGGCCTGG
CGACCTGGGTGCCGCTGCTGGCCGCCCAGCCGGACGCGCCGCCAGCGGGCACGCTGCGCATCGTCGCTTTCGACGTGGGG
CAGGGCATGGCCGTGCTGGTCGAAACGTCCGGCCACCGCTTGCTGTACGACACCGGGCCGGCCTACGGTCCCGATGCCGA
CGCGGGCAGCCGCGTCATCGTTCCCTACCTGCGCTGGCGCGGCATCGAGCGGCTCGACGGCGTCATCGTCAGCCATGGCG
ACACCGACCACGTGGGCGGTGCGCTGTCGCTGCTGCGCGCCGTGCCCGCCGGCTGGCTGCTGTCGTCGCTGGCGCCGGCC
CATCCGGTCGTGCAGGCAGCGCCGCGCCAGGTGCCATGCGCCGCGGGCCAGGGCTGGACCTGGGACGGCATCCGCTTCGA
CGTGCTGCATCCGACAGCGGCCAGCCACGCCGACGGTACGATCAAGTCGAACGCGCGCAGCTGCGTCCTGCGCATCGGCG
CGCCCGGCGGGACGGTGCTGCTGGCCGGCGACATCGAGGCGGCGCAGGAAACCGAATTGCTGGCGCGTGGCGCGCGGCTG
CGGGCCGACGTGCTGCTGGCGCCGCACCACGGCAGCGGCACCTCGTCCACGCCGGCGTTCCTGGCCGCCGTGCGGCCGCG
TGTGGCCGTCTTCCAGGTGGGCTACCGCAACCGTTACCGGCATCCGAAAGCCGCAGTGTACGAGCGCTATGGGGCGCTGG
GCGTGGAGCGGGTGCGGACGGACGAGGCGGGGGCCGTAACGCTATGGCTGGACGGGGCCGTCGGCATGACAGCCCAGCGG
CAGGCCCGGCCCCGCTACTGGCACGCCCGCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7U5XX48

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

47.63

100

0.496

  comA Pseudomonas stutzeri DSM 10701

39.136

94.321

0.369


Multiple sequence alignment