Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   EB819_RS00835 Genome accession   NZ_CP034157
Coordinates   179474..180124 (-) Length   216 a.a.
NCBI ID   WP_069799636.1    Uniprot ID   A0A1E5UDI8
Organism   Cloacibacterium normanense strain NRS-1     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 174474..185124
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EB819_RS00820 (EB819_00820) - 176528..177028 (-) 501 WP_069799641.1 hypothetical protein -
  EB819_RS00825 (EB819_00825) der 177377..178684 (+) 1308 WP_069799639.1 ribosome biogenesis GTPase Der -
  EB819_RS00830 (EB819_00830) upp 178820..179470 (+) 651 WP_069799638.1 uracil phosphoribosyltransferase -
  EB819_RS00835 (EB819_00835) comF 179474..180124 (-) 651 WP_069799636.1 ComF family protein Machinery gene
  EB819_RS00840 (EB819_00840) - 180275..180724 (-) 450 WP_069799634.1 DUF4442 domain-containing protein -
  EB819_RS00845 (EB819_00845) - 180836..181702 (+) 867 WP_245993190.1 hypothetical protein -
  EB819_RS00850 (EB819_00850) - 181776..183905 (-) 2130 WP_069799632.1 M3 family metallopeptidase -
  EB819_RS00855 (EB819_00855) - 184031..184834 (+) 804 WP_069799629.1 head GIN domain-containing protein -

Sequence


Protein


Download         Length: 216 a.a.        Molecular weight: 25300.37 Da        Isoelectric Point: 8.8504

>NTDB_id=329392 EB819_RS00835 WP_069799636.1 179474..180124(-) (comF) [Cloacibacterium normanense strain NRS-1]
MLLDLLFPNRCLHCNTIISKDELVCHVCFPQIKFSHFNFYEENPLKQRCKLLFPVKNAYAVMEFQEEALSQKIIHQLKYR
SQEKVGKIMAEWTLERIYLSEKPDVLITVPLHPKKLKKRGYNQLHVFADTLSKNWEIPHHKEALKRNSYQKAQAQKDKSH
RAETKYDFSLTEEISGKHVLLIDDVFTTGNTISAIAWEILKNPRNEVSVLVMAFDS

Nucleotide


Download         Length: 651 bp        

>NTDB_id=329392 EB819_RS00835 WP_069799636.1 179474..180124(-) (comF) [Cloacibacterium normanense strain NRS-1]
ATGCTGCTGGATTTACTGTTCCCGAACAGATGTCTACATTGCAATACCATCATTTCAAAAGACGAATTGGTTTGCCATGT
ATGTTTCCCTCAAATTAAGTTTTCTCATTTTAATTTCTACGAAGAAAATCCTCTGAAACAGAGATGTAAGCTTTTATTTC
CCGTGAAAAATGCTTATGCTGTGATGGAATTTCAAGAAGAAGCATTGAGCCAAAAAATTATTCATCAACTCAAATATCGT
TCTCAGGAAAAAGTTGGAAAAATCATGGCTGAATGGACTTTGGAAAGAATTTACCTTTCAGAAAAACCGGATGTTTTAAT
TACGGTTCCACTTCATCCTAAAAAATTAAAAAAACGAGGTTATAATCAGTTGCATGTTTTTGCAGACACGCTTTCTAAAA
ATTGGGAAATCCCACATCATAAAGAAGCGCTTAAAAGAAATTCTTACCAAAAAGCACAAGCCCAAAAAGACAAATCTCAC
CGCGCCGAAACGAAATACGATTTCTCGCTAACCGAAGAAATTTCCGGGAAACATGTTTTGTTGATAGATGATGTTTTCAC
CACGGGAAATACAATAAGCGCCATTGCTTGGGAAATTCTGAAAAATCCAAGGAATGAAGTGAGTGTTTTGGTAATGGCTT
TTGATTCGTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1E5UDI8

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Riemerella anatipestifer ATCC 11845 = DSM 15868

50.926

100

0.509


Multiple sequence alignment