Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comF
Type: Machinery gene
Locus tag: DGO_RS04695
Genome accession: NC_017790
Coordinates: 1021940..1022584 (+)
Length: 214 a.a.
NCBI ID: WP_050920685.1
UniProt ID: -
Organism: Deinococcus gobiensis I-0
Function: ssDNA transport into the cell (predicted from homology); DNA binding and uptake
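The coordinates and the protein length listed in this overview agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon (assumed)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_length(1021940, 1022584)
print(cds_bp, protein_length(cds_bp))  # 645 214
```

The 645 bp span gives 215 codons, and dropping the stop codon leaves the 214 residues reported for the protein.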

Genomic Context


Location: 1016940..1027584

| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| DGO_RS04675 (DGo_CA0922) | - | 1017436..1017708 (-) | 273 | WP_014684332.1 | hypothetical protein | - |
| DGO_RS24025 (DGo_CA0924) | - | 1017876..1019036 (+) | 1161 | WP_014684334.1 | hypothetical protein | - |
| DGO_RS04685 (DGo_CA0925) | - | 1019026..1020147 (+) | 1122 | WP_050920684.1 | [LysW]-lysine hydrolase | - |
| DGO_RS04690 (DGo_CA0926) | - | 1020166..1021932 (+) | 1767 | WP_043801100.1 | SLC13 family permease | - |
| DGO_RS04695 (DGo_CA0927) | comF | 1021940..1022584 (+) | 645 | WP_050920685.1 | ComF family protein | Machinery gene |
| DGO_RS04700 (DGo_CA0928) | tatA | 1022698..1022967 (+) | 270 | WP_014684338.1 | twin-arginine translocase TatA/TatE family subunit | - |
| DGO_RS04705 (DGo_CA0929) | - | 1022984..1023556 (-) | 573 | WP_145975246.1 | DUF1684 domain-containing protein | - |
| DGO_RS04710 (DGo_CA0930) | - | 1023553..1023771 (-) | 219 | WP_043801101.1 | RNA-binding S4 domain-containing protein | - |
| DGO_RS04715 (DGo_CA0931) | - | 1023800..1024555 (-) | 756 | WP_226991452.1 | dockerin type I domain-containing protein | - |
| DGO_RS23165 (DGo_CA0932) | - | 1024555..1025106 (-) | 552 | WP_226991453.1 | hypothetical protein | - |
| DGO_RS21000 (DGo_CA0933) | - | 1025103..1026176 (-) | 1074 | WP_083847219.1 | Ig domain-containing protein | - |
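The Location range appears to be the comF coordinates extended by 5,000 bp on each side (1021940 - 5000 = 1016940 and 1022584 + 5000 = 1027584). The sketch below reproduces that window. The flank size is inferred from these numbers rather than stated by the source, and the function name and the clamping to genome bounds are assumptions added for illustration.

```python
from typing import Optional, Tuple


def flanking_window(start: int, end: int, flank: int = 5000,
                    genome_length: Optional[int] = None) -> Tuple[int, int]:
    """Extend a 1-based inclusive range by `flank` bp on both sides.

    The window is clamped to position 1 and, if given, to genome_length.
    """
    lo = max(1, start - flank)
    hi = end + flank
    if genome_length is not None:
        hi = min(hi, genome_length)
    return lo, hi


window = flanking_window(1021940, 1022584)
print(window)  # (1016940, 1027584)

# The first and last genes listed in the context table lie inside the window.
print(window[0] <= 1017436 and 1026176 <= window[1])  # True
```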

Sequence


Protein


Length: 214 a.a.
Molecular weight: 22315.00 Da
Isoelectric point: 10.7089

>NTDB_id=50882 DGO_RS04695 WP_050920685.1 1021940..1022584(+) (comF) [Deinococcus gobiensis I-0]
MPPDPRPTLGGWLRALLPRPCPGCGAQLGAAAGLCGVCRAALRVRAESHSPLSTRVTPHLLSCGPYQGVARRSVRALKYG
GAREVAAPLGELLAAAVPPDWQVAAVVPVPLHPARQRERGYNQAELLARATARGLGVPCVCALERTRATAQQARQHAAAR
QANLAGAFRVCAPLPPGTVLLVDDVATTGSTLLACRDALLAAGPRELRYAVVAR

Nucleotide


Length: 645 bp

>NTDB_id=50882 DGO_RS04695 WP_050920685.1 1021940..1022584(+) (comF) [Deinococcus gobiensis I-0]
ATGCCCCCGGACCCCCGCCCCACCCTCGGCGGCTGGCTGCGCGCCCTGCTGCCGCGCCCCTGCCCCGGCTGCGGCGCGCA
GCTCGGGGCGGCGGCGGGGCTGTGCGGGGTGTGCCGTGCCGCGCTGCGCGTGCGGGCCGAGAGCCACTCGCCCCTAAGCA
CTCGCGTGACTCCGCACCTGCTGAGTTGCGGACCGTACCAGGGCGTGGCCCGGCGCAGCGTACGCGCCCTGAAATACGGC
GGCGCGCGCGAGGTGGCCGCGCCGCTGGGCGAGTTGCTGGCCGCCGCCGTGCCCCCCGACTGGCAGGTGGCGGCGGTGGT
GCCGGTGCCGCTGCACCCGGCGCGGCAGCGCGAGCGCGGCTACAACCAGGCCGAGCTGCTGGCACGGGCCACCGCGCGCG
GCCTGGGCGTGCCGTGCGTCTGCGCGCTGGAGCGTACCCGCGCCACTGCCCAGCAGGCCCGGCAACACGCGGCGGCCCGG
CAGGCCAACCTCGCCGGGGCCTTCCGGGTGTGCGCACCCCTGCCGCCCGGCACGGTGCTGCTGGTGGACGATGTGGCGAC
CACCGGCAGCACCCTGCTGGCCTGCCGCGACGCGCTGCTGGCCGCCGGGCCGCGCGAGCTGCGTTACGCGGTGGTGGCGC
GCTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comF | Deinococcus radiodurans R1 = ATCC 13939 = DSM 20539 | 72.277 | 94.393 | 0.682 |


Multiple sequence alignment