Detailed information    

In silico: bioinformatically predicted

Overview


Name: comA
Type: Machinery gene
Locus tag: NQE15_RS11240
Genome accession: NZ_CP102486
Coordinates: 2251184..2252719 (+)
Length: 511 a.a.
NCBI ID: WP_416336519.1
Uniprot ID: -
Organism: Dechloromonas sp. A34
Function: DNA uptake (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 2220869..2251042 2251184..2252719 flank 142
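
The distance in the summary row can be reproduced from the two coordinate ranges. The sketch below assumes the distance is the plain difference between the nearest ends of the two intervals, which matches the 142 bp reported here; the function name `flank_distance` is hypothetical and is not part of VRprofile2.

```python
def flank_distance(mge, gene):
    """Coordinate difference between the nearest ends of two intervals.

    Each interval is a (start, end) tuple of 1-based coordinates.
    Returns 0 when the intervals overlap.
    Assumption: distance = nearest start minus nearest end, with no
    off-by-one correction, because that reproduces the tabulated value.
    """
    mge_start, mge_end = mge
    gene_start, gene_end = gene
    if gene_start > mge_end:      # gene lies downstream of the MGE
        return gene_start - mge_end
    if mge_start > gene_end:      # gene lies upstream of the MGE
        return mge_start - gene_end
    return 0                      # intervals overlap


island = (2220869, 2251042)   # genomic island coordinates from the table
coma = (2251184, 2252719)     # comA coordinates from the table
print(flank_distance(island, coma))  # 142
```

A result of 0 would mean the gene sits inside or overlaps the element; a positive value corresponds to the "flank" relative position.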


Gene organization within MGE regions


Location: 2220869..2252719
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NQE15_RS11075 - 2220869..2223037 (+) 2169 WP_265949763.1 AAA family ATPase -
  NQE15_RS11080 - 2223601..2224365 (-) 765 WP_265949765.1 DUF6710 family protein -
  NQE15_RS11085 - 2224461..2225348 (+) 888 WP_265949767.1 FRG domain-containing protein -
  NQE15_RS11090 - 2225388..2225624 (-) 237 WP_265949769.1 hypothetical protein -
  NQE15_RS11095 - 2225658..2225969 (-) 312 WP_265949770.1 hypothetical protein -
  NQE15_RS11100 - 2226131..2226301 (+) 171 WP_265949771.1 hypothetical protein -
  NQE15_RS11105 - 2226363..2228447 (-) 2085 WP_265949772.1 tetratricopeptide repeat protein -
  NQE15_RS11110 - 2228610..2228795 (-) 186 WP_265949773.1 hypothetical protein -
  NQE15_RS11115 - 2228810..2229880 (-) 1071 WP_265949774.1 tyrosine-type recombinase/integrase -
  NQE15_RS11120 - 2230040..2230222 (+) 183 WP_265949776.1 hypothetical protein -
  NQE15_RS11125 - 2230984..2231973 (+) 990 WP_265949777.1 nuclease-related domain-containing protein -
  NQE15_RS11130 - 2232044..2232877 (+) 834 WP_265949778.1 vWA domain-containing protein -
  NQE15_RS11135 - 2232885..2233637 (+) 753 WP_265949779.1 hypothetical protein -
  NQE15_RS11140 - 2233618..2235252 (+) 1635 WP_265949780.1 hypothetical protein -
  NQE15_RS11145 - 2235237..2235797 (+) 561 WP_265949781.1 thermonuclease family protein -
  NQE15_RS11150 - 2235858..2236151 (-) 294 WP_265949782.1 hypothetical protein -
  NQE15_RS11155 - 2236144..2236338 (-) 195 WP_265949783.1 hypothetical protein -
  NQE15_RS11160 - 2236490..2236756 (+) 267 WP_265949784.1 hypothetical protein -
  NQE15_RS11165 - 2236825..2237445 (-) 621 WP_265949785.1 hypothetical protein -
  NQE15_RS11170 - 2237800..2238588 (+) 789 WP_265949786.1 DUF5131 family protein -
  NQE15_RS11175 - 2238684..2239352 (+) 669 WP_265949787.1 NADAR family protein -
  NQE15_RS11180 - 2239354..2239587 (+) 234 WP_265949788.1 hypothetical protein -
  NQE15_RS11185 - 2239661..2242339 (+) 2679 WP_265949789.1 DNA polymerase -
  NQE15_RS11190 - 2242351..2243766 (+) 1416 WP_265949790.1 DNA primase family protein -
  NQE15_RS11195 - 2243894..2244964 (-) 1071 WP_265949791.1 hypothetical protein -
  NQE15_RS11200 - 2245028..2245402 (+) 375 WP_265942633.1 transposase -
  NQE15_RS11205 tnpB 2245345..2245728 (+) 384 WP_265942632.1 IS66 family insertion sequence element accessory protein TnpB -
  NQE15_RS11210 tnpC 2245791..2247299 (+) 1509 WP_265950075.1 IS66 family transposase -
  NQE15_RS11215 - 2247443..2247904 (-) 462 WP_265949792.1 hypothetical protein -
  NQE15_RS11220 - 2248014..2249156 (-) 1143 WP_265949793.1 XRE family transcriptional regulator -
  NQE15_RS11225 - 2249146..2249691 (-) 546 WP_265949794.1 HEPN domain-containing protein -
  NQE15_RS11230 - 2249984..2250358 (-) 375 WP_265949795.1 hypothetical protein -
  NQE15_RS11235 - 2250392..2250949 (-) 558 WP_265949796.1 hypothetical protein -
  NQE15_RS11240 comA 2251184..2252719 (+) 1536 WP_416336519.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene

Sequence


Protein


Length: 511 a.a.    Molecular weight: 54909.34 Da    Isoelectric Point: 9.3072

>NTDB_id=717250 NQE15_RS11240 WP_416336519.1 2251184..2252719(+) (comA) [Dechloromonas sp. A34]
MVAALFGWLVSFGWRRVPRLALALPAQKAGLLAACTAAMLYALLAGFAVPAQRTLYMLMVAALAMLSGRIVAPSRTLLLG
LLVVLLIDPWAVLAAGFWLSFGAVGALLYVGSAKIGRGRGWRERLGAWGGVQWAATLASLPVLLLVFQQFSLVSPLANAV
AIPVISFIVTPLALLGALVPWWPIAAVAHLAMDWLMLFLDWCATWPVWQAPAPPLWAAVAAGLGVALCLLPRGMPGRAIG
LFLLAPALFWPVDRPAPGEAWITVLDVGQGLASVVRTRDHTLIYDPGPLYSAESDAGQRVVVPYLRRLGINRVDLLMVTH
RDSDHAGGSASVQSALTVDAIRSSVPEIPGEPCLAGQGWNWDGVLFEVLHPVAEQATARQKTNHHSCVLRVTAGDKRMLL
TSDIEAADEAALLARYPGQLAADVLLVPHHGSRTSSTPAFLDAVAPASAVIPVGYRNRFGHPKSEVLERYAAREIPLWRT
DRDGAVEIRLAVGGLALSGWRAQYRRYWQGE

Nucleotide


Length: 1536 bp

>NTDB_id=717250 NQE15_RS11240 WP_416336519.1 2251184..2252719(+) (comA) [Dechloromonas sp. A34]
ATGGTGGCGGCGCTCTTTGGCTGGCTGGTCAGCTTTGGCTGGCGCCGGGTGCCCCGCCTGGCCCTTGCGCTACCGGCACA
GAAGGCCGGTCTGCTGGCGGCGTGTACTGCCGCCATGCTCTATGCCTTGCTGGCCGGCTTTGCCGTGCCCGCCCAGCGCA
CCCTCTACATGCTGATGGTTGCCGCGCTGGCCATGCTTTCGGGGCGGATCGTCGCGCCCAGCCGGACCTTGCTGCTCGGC
CTGCTGGTGGTCCTGCTGATCGATCCGTGGGCGGTGCTGGCAGCCGGCTTCTGGTTGTCGTTCGGCGCGGTCGGTGCCTT
GCTCTACGTCGGCTCGGCCAAGATCGGCCGGGGGCGGGGTTGGCGCGAGCGCCTGGGCGCCTGGGGCGGCGTTCAGTGGG
CGGCGACGCTGGCCTCGTTGCCGGTGTTGCTGCTGGTTTTCCAGCAATTTTCGCTGGTCTCACCGCTGGCCAATGCCGTG
GCGATTCCGGTGATTAGTTTTATCGTCACGCCGCTGGCGCTGCTTGGGGCACTCGTTCCGTGGTGGCCGATCGCCGCCGT
CGCCCATCTGGCCATGGATTGGCTCATGTTGTTCCTCGACTGGTGTGCCACCTGGCCGGTCTGGCAGGCGCCGGCACCGC
CGTTGTGGGCAGCGGTCGCTGCCGGCCTCGGGGTGGCGCTCTGCCTGTTGCCGCGCGGCATGCCAGGGCGCGCCATCGGG
CTGTTCCTGCTGGCGCCGGCGCTGTTTTGGCCGGTCGACCGACCCGCCCCGGGCGAAGCCTGGATCACCGTCCTCGATGT
CGGGCAGGGCCTGGCCAGCGTGGTGCGGACCCGCGACCATACGCTGATCTATGACCCGGGGCCGCTCTATAGCGCCGAAT
CGGATGCCGGGCAGCGCGTCGTGGTGCCTTACCTGCGTCGCCTGGGAATCAACCGGGTTGATCTGCTGATGGTGACGCAT
CGCGATAGCGACCATGCCGGCGGTAGCGCTTCGGTCCAGTCGGCACTGACGGTCGATGCGATCCGCTCCTCGGTGCCGGA
AATTCCCGGCGAGCCTTGCCTGGCCGGGCAGGGCTGGAATTGGGACGGCGTACTGTTCGAGGTTTTGCATCCGGTGGCGG
AGCAGGCGACCGCCCGTCAGAAAACCAACCACCACTCCTGCGTCTTGCGCGTGACGGCCGGCGATAAACGCATGCTGCTG
ACTTCAGATATCGAAGCCGCCGACGAGGCGGCCCTGCTTGCGCGCTATCCCGGGCAACTGGCGGCGGATGTCCTGCTGGT
GCCGCACCACGGCTCGCGGACCTCGTCGACTCCAGCCTTTCTGGATGCCGTGGCGCCCGCCTCGGCGGTGATTCCGGTGG
GCTATCGCAACCGTTTCGGGCACCCCAAGAGCGAGGTGCTCGAGCGCTATGCCGCCCGGGAAATTCCGCTGTGGCGAACC
GACCGCGACGGGGCGGTCGAGATCAGGCTGGCGGTCGGTGGACTTGCGCTTTCCGGCTGGCGGGCACAGTATCGGCGCTA
CTGGCAGGGGGAATGA
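
The lengths quoted in this entry are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon (the nucleotide sequence above ends in TGA); the helper names are hypothetical.

```python
def span_length(start, end):
    """Length in bp of an inclusive 1-based coordinate range."""
    return end - start + 1


def protein_length(cds_bp):
    """Number of amino acids encoded by a CDS that includes one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # subtract the stop codon


cds_bp = span_length(2251184, 2252719)
print(cds_bp)                  # 1536
print(protein_length(cds_bp))  # 511
```

The same `span_length` calculation reproduces the Size (bp) column of the gene organization table, for example 2220869..2223037 gives 2169 bp.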


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Ralstonia pseudosolanacearum GMI1000 45.208 100 0.489
  comA Neisseria gonorrhoeae MS11 40.194 100 0.405
  comA Pseudomonas stutzeri DSM 10701 43.432 92.368 0.401