Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   NRS07_RS10385 Genome accession   NZ_CP103371
Coordinates   2321248..2323602 (+) Length   784 a.a.
NCBI ID   WP_259206075.1    Uniprot ID   -
Organism   Massilia sp. H6     
Function   DNA uptake (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2316248..2328602
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NRS07_RS10370 (NRS07_10370) - 2316641..2318347 (+) 1707 WP_259206056.1 triacylglycerol lipase -
  NRS07_RS10375 (NRS07_10375) - 2318467..2320044 (+) 1578 WP_259206059.1 T6SS immunity protein Tli4 family protein -
  NRS07_RS10380 (NRS07_10380) - 2320037..2321275 (+) 1239 WP_259206072.1 T6SS immunity protein Tli4 family protein -
  NRS07_RS10385 (NRS07_10385) comA 2321248..2323602 (+) 2355 WP_259206075.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  NRS07_RS10390 (NRS07_10390) - 2323698..2324534 (+) 837 WP_259206076.1 alpha/beta fold hydrolase -
  NRS07_RS10395 (NRS07_10395) dcd 2324572..2325138 (+) 567 WP_259206078.1 dCTP deaminase -
  NRS07_RS10400 (NRS07_10400) - 2325218..2327479 (-) 2262 WP_259206080.1 arginine/lysine/ornithine decarboxylase -
  NRS07_RS10405 (NRS07_10405) - 2327665..2328303 (-) 639 WP_259206081.1 DUF4337 domain-containing protein -

Sequence


Protein


Download         Length: 784 a.a.        Molecular weight: 82464.99 Da        Isoelectric Point: 10.4479

>NTDB_id=722372 NRS07_RS10385 WP_259206075.1 2321248..2323602(+) (comA) [Massilia sp. H6]
MRCAILAFLSGVAWLQTLAALPAPPTIFGAAVIALLLCAFKRRAATLLASALLGAAWAALSATLALAEALPRELEGRDLA
IIGTVDSLPHEFEGGTRFNFRVEQVLEPGATVPARVALSWYGRITGQANALTVAPGERWQLVVRLQRPHGNANPHGFDYE
GWLLEQGVRATGYVRPAGSGNQRLAAFVPGVGHAVERARAALRRHILAQLEGKPYAGVIVALVIGDQRGIDQPDWDVFNR
TGISHLVSISGLHVTMLAGLAAWAMSALWRRSFFVEGAALPLLLPAQKAAAVAGALTALLYVLLAGFGVPAQRTLYMLAA
GALALWCGRVVTLSQVLCLALGVVVIIDPWAVLSAGFWLSFGAVAVILYANLGRVDFATGWRRTLRTAVQTQYAITVGLV
PLTLMLFSQVSIVSPLANAVAIPVVSLLVTPLALAGALLPAPLAAWVLGLAHALVALLAEGLAWLAARPYAAGSAPAPQP
WGLALALAGTAWMLAPRGWPHRWAGLTAWLPMLTQLPSAPEDGSFTVTAFDVGQGMALLVETHRHRLLYDAGPQYAPGSD
AAGRILLPYLRGRGIGKLDALVVSHSDTDHAGGARALLAGVPVAQVRSSLAPAHAVVRAARSHTRCAAGQAWNWDGVRFD
MLGPAPESYANAGLKANGRSCVLRVSSGGRAILLAGDIEAAQEAQLVAQAPGALRADVLLAPHHGSGTSSTVGFLAAVKP
TVVIFQVGYRNRYRHPKPAVLARYQALGIGSLRTDEAGAVTIGFGQSIGLSAYRLSHRRYWHGR

Nucleotide


Download         Length: 2355 bp        

>NTDB_id=722372 NRS07_RS10385 WP_259206075.1 2321248..2323602(+) (comA) [Massilia sp. H6]
ATGCGCTGTGCGATCCTCGCGTTTCTGAGCGGCGTAGCCTGGCTCCAGACGCTCGCCGCGCTGCCCGCGCCGCCGACGAT
CTTCGGCGCGGCCGTCATTGCGCTGCTCTTGTGCGCGTTCAAGCGCCGCGCGGCGACCCTGCTTGCCAGCGCGCTGCTCG
GCGCCGCGTGGGCGGCGCTCAGTGCGACGCTGGCACTGGCCGAAGCGCTGCCGCGCGAGCTCGAAGGCAGGGACCTTGCC
ATCATTGGTACGGTCGACAGCCTGCCGCATGAGTTCGAGGGTGGCACCCGCTTTAACTTTCGTGTCGAGCAGGTGCTAGA
GCCAGGCGCGACGGTCCCGGCCCGCGTTGCGCTGTCGTGGTATGGGCGCATCACGGGGCAGGCCAATGCGCTGACAGTCG
CGCCGGGCGAGCGCTGGCAGCTGGTAGTGCGCCTGCAGCGTCCGCACGGCAACGCCAACCCGCACGGCTTCGACTACGAA
GGCTGGCTGCTGGAGCAGGGCGTGCGCGCCACCGGCTATGTGCGTCCCGCCGGCAGCGGCAACCAGCGCCTGGCTGCCTT
CGTCCCGGGCGTGGGCCATGCCGTCGAGCGCGCCCGTGCGGCGCTGCGCCGCCATATCCTTGCTCAGCTCGAAGGCAAGC
CTTACGCCGGCGTGATCGTCGCCCTGGTGATAGGCGACCAGCGCGGCATCGACCAGCCTGACTGGGACGTCTTCAATCGC
ACCGGCATCAGTCACCTGGTGTCGATCTCCGGCCTGCACGTGACCATGCTCGCTGGCCTGGCGGCGTGGGCGATGTCGGC
CCTGTGGCGGCGTTCCTTCTTCGTCGAAGGCGCCGCGCTGCCGCTGCTGCTGCCGGCGCAAAAGGCCGCCGCCGTGGCCG
GCGCGCTGACCGCGCTGCTGTATGTCCTGCTGGCCGGTTTCGGGGTGCCGGCCCAGCGCACCCTCTACATGCTCGCTGCC
GGCGCGCTGGCGCTCTGGTGCGGGCGCGTCGTCACGCTGTCGCAGGTGCTGTGCCTGGCGCTTGGGGTGGTGGTGATCAT
CGATCCCTGGGCCGTGCTCAGCGCGGGATTCTGGCTTTCGTTCGGGGCGGTCGCCGTGATCCTGTATGCAAACCTCGGTC
GGGTCGATTTTGCCACTGGCTGGCGCCGTACGCTGCGCACGGCGGTGCAGACGCAATACGCGATCACGGTCGGACTGGTG
CCGCTCACGCTGATGCTGTTTTCGCAGGTCTCGATCGTCAGCCCACTGGCCAACGCGGTGGCGATTCCAGTCGTTAGCCT
GTTGGTCACGCCGCTTGCGCTGGCCGGCGCGCTGCTGCCGGCCCCGCTGGCGGCTTGGGTGCTCGGCCTGGCGCATGCAC
TGGTCGCGCTGCTGGCCGAGGGCCTGGCCTGGCTGGCGGCGCGGCCGTACGCGGCCGGGAGCGCACCGGCGCCGCAGCCA
TGGGGGCTGGCCCTGGCGCTGGCCGGTACCGCGTGGATGCTGGCACCGCGCGGCTGGCCGCACCGGTGGGCGGGGTTGAC
CGCCTGGCTGCCGATGCTGACCCAGCTGCCCAGCGCACCCGAGGATGGCAGCTTCACGGTTACCGCCTTCGACGTTGGCC
AAGGCATGGCACTGCTGGTCGAGACGCACCGGCACCGGTTGCTGTACGACGCTGGTCCGCAGTATGCGCCAGGATCCGAC
GCCGCCGGGCGCATCCTGCTGCCTTACCTGCGCGGGCGCGGCATCGGCAAGCTCGATGCGCTGGTGGTGTCGCACAGCGA
CACCGACCATGCCGGCGGCGCCCGCGCGCTGCTGGCAGGGGTGCCGGTGGCGCAGGTGCGCTCTTCGCTCGCCCCGGCTC
ATGCGGTGGTGCGGGCGGCGCGCAGCCATACACGTTGCGCGGCGGGCCAGGCTTGGAACTGGGACGGTGTGCGCTTTGAC
ATGCTCGGGCCGGCGCCGGAGTCGTATGCGAATGCGGGCCTGAAGGCAAACGGGCGCAGCTGCGTACTGCGGGTCAGTAG
TGGTGGACGGGCCATCCTGCTGGCGGGAGACATCGAAGCGGCGCAGGAAGCGCAGCTGGTGGCGCAGGCGCCCGGGGCCC
TGCGCGCCGACGTGCTGCTGGCCCCGCACCATGGAAGCGGCACCTCGTCGACGGTCGGCTTCCTGGCGGCTGTGAAACCG
ACGGTCGTCATTTTCCAGGTCGGCTACCGCAATCGCTACCGCCATCCGAAGCCCGCCGTGCTCGCGCGCTACCAGGCACT
GGGAATTGGCAGCCTTCGCACCGACGAAGCTGGCGCCGTCACCATCGGCTTTGGCCAGTCCATCGGGTTGTCGGCTTATC
GCCTGAGCCATCGGCGCTACTGGCATGGCCGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000

47.172

100

0.5

  comA Pseudomonas stutzeri DSM 10701

37.922

98.214

0.372