Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   AUC44_RS12725 Genome accession   NZ_CP013910
Coordinates   2613708..2614307 (-) Length   199 a.a.
NCBI ID   WP_062159089.1    Uniprot ID   -
Organism   Deinococcus actinosclerus strain BM2     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2571597..2613722 2613708..2614307 flank -14


Gene organization within MGE regions


Location: 2571597..2614307
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  AUC44_RS12475 (AUC44_12455) - 2571597..2572202 (-) 606 WP_062159041.1 glycoside hydrolase family 108 protein -
  AUC44_RS12480 (AUC44_12460) - 2572192..2572521 (-) 330 WP_157445351.1 hypothetical protein -
  AUC44_RS12485 (AUC44_12465) - 2572530..2572880 (-) 351 WP_157445352.1 hypothetical protein -
  AUC44_RS12490 (AUC44_12470) - 2572861..2573052 (-) 192 WP_058975531.1 hypothetical protein -
  AUC44_RS12495 (AUC44_12475) - 2573178..2575679 (-) 2502 WP_062159044.1 hypothetical protein -
  AUC44_RS12500 (AUC44_12480) - 2575688..2576122 (-) 435 WP_062159045.1 hypothetical protein -
  AUC44_RS12505 (AUC44_12485) - 2576119..2577624 (-) 1506 WP_157445353.1 hypothetical protein -
  AUC44_RS12510 (AUC44_12490) - 2577635..2581453 (-) 3819 WP_197408538.1 phage tail tape measure protein -
  AUC44_RS16250 - 2581803..2582399 (-) 597 WP_197408539.1 HNH endonuclease -
  AUC44_RS17375 - 2582392..2582712 (-) 321 WP_082689058.1 HNH endonuclease -
  AUC44_RS12515 (AUC44_12495) - 2582712..2583005 (-) 294 WP_157445354.1 hypothetical protein -
  AUC44_RS16600 (AUC44_12500) - 2583050..2583307 (-) 258 WP_335338667.1 hypothetical protein -
  AUC44_RS16260 - 2583449..2583661 (+) 213 WP_231724438.1 helix-turn-helix domain-containing protein -
  AUC44_RS16605 - 2583679..2585277 (-) 1599 WP_157445356.1 hypothetical protein -
  AUC44_RS12525 (AUC44_12505) - 2585462..2585755 (-) 294 WP_157445357.1 hypothetical protein -
  AUC44_RS12530 (AUC44_12510) - 2585843..2586145 (-) 303 WP_062159051.1 hypothetical protein -
  AUC44_RS12535 (AUC44_12515) - 2586163..2586657 (-) 495 WP_157445358.1 hypothetical protein -
  AUC44_RS12540 (AUC44_12520) - 2586654..2586893 (-) 240 WP_062159053.1 hypothetical protein -
  AUC44_RS12545 (AUC44_12525) - 2586896..2587357 (-) 462 WP_062159054.1 hypothetical protein -
  AUC44_RS12550 (AUC44_12530) - 2587360..2587704 (-) 345 WP_062159055.1 DUF3168 domain-containing protein -
  AUC44_RS16840 (AUC44_12535) - 2587701..2587973 (-) 273 WP_062159056.1 HK97 gp10 family phage protein -
  AUC44_RS16845 (AUC44_12540) - 2587970..2588287 (-) 318 WP_062159057.1 hypothetical protein -
  AUC44_RS16615 (AUC44_12545) - 2588287..2588823 (-) 537 WP_062159058.1 hypothetical protein -
  AUC44_RS12570 (AUC44_12550) - 2588792..2589217 (-) 426 WP_062159059.1 hypothetical protein -
  AUC44_RS12575 (AUC44_12555) - 2589270..2590526 (-) 1257 WP_062159060.1 phage major capsid protein -
  AUC44_RS12580 (AUC44_12560) - 2590523..2591368 (-) 846 WP_062159061.1 HK97 family phage prohead protease -
  AUC44_RS16620 - 2591445..2591726 (-) 282 WP_157445360.1 hypothetical protein -
  AUC44_RS16625 - 2591727..2591927 (-) 201 WP_157445361.1 hypothetical protein -
  AUC44_RS12585 (AUC44_12565) - 2591932..2593908 (-) 1977 WP_062159062.1 phage portal protein -
  AUC44_RS12590 (AUC44_12570) - 2593922..2595259 (-) 1338 WP_062159063.1 hypothetical protein -
  AUC44_RS12595 (AUC44_12575) - 2595256..2595984 (-) 729 WP_157445362.1 terminase small subunit -
  AUC44_RS16630 - 2596020..2596316 (+) 297 WP_157445363.1 hypothetical protein -
  AUC44_RS12600 (AUC44_12580) - 2597214..2597729 (-) 516 WP_062159065.1 hypothetical protein -
  AUC44_RS12605 (AUC44_12585) - 2597726..2598070 (-) 345 WP_197408541.1 RusA family crossover junction endodeoxyribonuclease -
  AUC44_RS16635 - 2598067..2598222 (-) 156 WP_157445364.1 hypothetical protein -
  AUC44_RS12610 (AUC44_12590) - 2598219..2598413 (-) 195 WP_062159066.1 hypothetical protein -
  AUC44_RS12615 (AUC44_12595) - 2598410..2598664 (-) 255 WP_062159067.1 hypothetical protein -
  AUC44_RS12620 (AUC44_12600) - 2598661..2599053 (-) 393 WP_157445365.1 hypothetical protein -
  AUC44_RS12625 (AUC44_12605) - 2599064..2599660 (-) 597 WP_062159069.1 hypothetical protein -
  AUC44_RS12630 (AUC44_12610) - 2599944..2602673 (-) 2730 WP_062159070.1 bifunctional DNA primase/polymerase -
  AUC44_RS17380 - 2602670..2603896 (-) 1227 WP_417926375.1 DNA cytosine methyltransferase -
  AUC44_RS17105 (AUC44_12620) - 2603837..2603986 (+) 150 WP_231724439.1 hypothetical protein -
  AUC44_RS12645 (AUC44_12625) - 2604148..2604396 (-) 249 WP_062159073.1 hypothetical protein -
  AUC44_RS12650 (AUC44_12630) - 2604514..2605152 (-) 639 WP_062159074.1 hypothetical protein -
  AUC44_RS12655 (AUC44_12635) - 2605279..2605464 (-) 186 WP_157445366.1 hypothetical protein -
  AUC44_RS12660 (AUC44_12640) - 2605485..2605814 (-) 330 WP_062159076.1 hypothetical protein -
  AUC44_RS12665 (AUC44_12645) - 2605811..2606083 (-) 273 WP_062159077.1 hypothetical protein -
  AUC44_RS12670 (AUC44_12650) - 2606077..2606286 (-) 210 WP_062159078.1 hypothetical protein -
  AUC44_RS12675 (AUC44_12655) - 2606286..2606489 (-) 204 WP_062159079.1 hypothetical protein -
  AUC44_RS12680 (AUC44_12660) - 2606486..2606677 (-) 192 WP_062159080.1 hypothetical protein -
  AUC44_RS12685 (AUC44_12665) - 2606674..2606967 (-) 294 WP_062159081.1 hypothetical protein -
  AUC44_RS16640 - 2607028..2607183 (-) 156 WP_157445367.1 hypothetical protein -
  AUC44_RS12690 (AUC44_12670) - 2607180..2607935 (-) 756 WP_157445368.1 hypothetical protein -
  AUC44_RS12695 (AUC44_12675) - 2608000..2608224 (-) 225 WP_062159083.1 helix-turn-helix transcriptional regulator -
  AUC44_RS16850 - 2608283..2608900 (+) 618 WP_197408542.1 helix-turn-helix domain-containing protein -
  AUC44_RS17110 - 2608860..2609219 (+) 360 Protein_2574 ImmA/IrrE family metallo-endopeptidase -
  AUC44_RS16280 - 2609305..2610078 (-) 774 WP_082688957.1 IS982 family transposase -
  AUC44_RS12710 (AUC44_12690) - 2610167..2610391 (+) 225 WP_157445370.1 hypothetical protein -
  AUC44_RS12715 (AUC44_12695) - 2610477..2610869 (+) 393 WP_062159087.1 hypothetical protein -
  AUC44_RS16645 - 2611053..2611919 (+) 867 WP_157445371.1 hypothetical protein -
  AUC44_RS12720 (AUC44_12700) - 2612310..2613722 (+) 1413 WP_062159088.1 recombinase family protein -
  AUC44_RS12725 (AUC44_12705) comF 2613708..2614307 (-) 600 WP_062159089.1 ComF family protein Machinery gene

Sequence


Protein


Download         Length: 199 a.a.        Molecular weight: 21345.76 Da        Isoelectric Point: 11.8368

>NTDB_id=165983 AUC44_RS12725 WP_062159089.1 2613708..2614307(-) (comF) [Deinococcus actinosclerus strain BM2]
MTAAFGTALLGALRTLLPRACPGCGAQLGAHAGLCPACRAALRPQVQAHSPLRAHPEPHLVTLGTYSGVRRRAVRELKFA
QARDLARVLGETLATGVPDTWNVQAVIPVPLHPTRQRERGFNQSELLGRALAGALAVPCVPALTRTRAGAQQARRHGAQR
EDLHGAFRAHEGFLPTGAVLLIDDVLTTTSPLMHWLRRV

Nucleotide


Download         Length: 600 bp        

>NTDB_id=165983 AUC44_RS12725 WP_062159089.1 2613708..2614307(-) (comF) [Deinococcus actinosclerus strain BM2]
ATGACCGCTGCGTTCGGAACGGCCCTGCTCGGCGCGCTGCGCACCCTGCTGCCCCGCGCCTGCCCCGGCTGCGGCGCGCA
GCTCGGGGCGCACGCCGGACTGTGCCCCGCCTGCCGCGCCGCCCTGCGCCCCCAGGTGCAGGCCCACAGTCCCCTGCGCG
CCCACCCGGAGCCGCACCTCGTGACGCTCGGGACCTACAGCGGCGTGCGCCGGCGCGCCGTGCGGGAACTGAAATTCGCG
CAGGCCCGCGACCTCGCCCGCGTCCTCGGGGAGACCCTCGCCACCGGCGTGCCCGACACCTGGAACGTGCAGGCGGTCAT
CCCGGTGCCGCTGCATCCCACCCGGCAGCGCGAACGCGGCTTCAACCAGTCGGAACTGCTGGGCCGCGCCCTGGCCGGCG
CCCTGGCGGTGCCGTGCGTGCCCGCCCTGACCCGCACCCGCGCTGGCGCGCAGCAGGCCCGGCGTCACGGCGCGCAGCGC
GAGGACCTCCACGGGGCCTTCCGCGCCCACGAGGGCTTCCTGCCCACCGGCGCGGTGCTGCTGATCGACGACGTGCTCAC
CACCACATCGCCACTCATGCACTGGCTACGACGGGTCTAG

Domains


Predicted by InterproScan.

(13-46)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539

62.366

93.467

0.583


Multiple sequence alignment