Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   EIH08_RS10400 Genome accession   NZ_CP034171
Coordinates   2209462..2210112 (+) Length   216 a.a.
NCBI ID   WP_124785211.1    Uniprot ID   A0A3G8WIQ0
Organism   Chryseobacterium taklimakanense strain H4753     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2204462..2215112
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EIH08_RS10385 (EIH08_10385) aceA 2204624..2205922 (-) 1299 WP_124785209.1 isocitrate lyase -
  EIH08_RS10390 (EIH08_10390) aceB 2206096..2207672 (-) 1577 Protein_2105 malate synthase A -
  EIH08_RS10395 (EIH08_10395) - 2207947..2209416 (+) 1470 WP_124785210.1 helix-turn-helix domain-containing protein -
  EIH08_RS10400 (EIH08_10400) comF 2209462..2210112 (+) 651 WP_124785211.1 ComF family protein Machinery gene
  EIH08_RS10405 (EIH08_10405) upp 2210232..2210882 (-) 651 WP_124785212.1 uracil phosphoribosyltransferase -
  EIH08_RS10410 (EIH08_10410) - 2210983..2211330 (-) 348 WP_124785213.1 four helix bundle protein -
  EIH08_RS10415 (EIH08_10415) der 2211444..2212754 (-) 1311 WP_095072912.1 ribosome biogenesis GTPase Der -
  EIH08_RS10420 (EIH08_10420) - 2213020..2213421 (+) 402 WP_317127109.1 heme-binding domain-containing protein -
  EIH08_RS10425 (EIH08_10425) - 2213408..2213818 (+) 411 WP_124785214.1 thiol-disulfide oxidoreductase DCC family protein -
  EIH08_RS10430 (EIH08_10430) - 2213855..2214496 (+) 642 WP_095072905.1 DUF4290 domain-containing protein -

Sequence


Protein


Download         Length: 216 a.a.        Molecular weight: 25210.26 Da        Isoelectric Point: 8.6095

>NTDB_id=329570 EIH08_RS10400 WP_124785211.1 2209462..2210112(+) (comF) [Chryseobacterium taklimakanense strain H4753]
MFLDFLFPNRCLNCNSIIDAENLVCEACMDQIHFTHHIFGEENELKKRCRLLFPVEYAYALMQFGEANLSRKIVHDLKYK
SREKVGKILAEWTAKRLDFKNEKPDLLVSVPLHPKKLKERGYNQLHLFTETLSKHFDIPYEHTLIKRNFYKKAQALKDKA
HRSETEKLFSISKPVQNQHVLLIDDVFTTGNTMSSVAWEILKEGNNKVSVLVMAVD

Nucleotide


Download         Length: 651 bp        

>NTDB_id=329570 EIH08_RS10400 WP_124785211.1 2209462..2210112(+) (comF) [Chryseobacterium taklimakanense strain H4753]
ATGTTTCTGGATTTTCTTTTTCCGAACAGGTGCCTGAACTGCAACAGCATTATTGATGCGGAAAATTTGGTTTGTGAAGC
GTGCATGGATCAGATTCATTTCACCCACCATATTTTTGGAGAAGAAAACGAACTGAAAAAGCGGTGCAGATTACTTTTCC
CTGTTGAATATGCTTATGCATTGATGCAGTTTGGGGAGGCAAACCTTTCCCGAAAAATTGTGCATGATCTGAAATATAAA
AGCCGCGAAAAAGTTGGGAAAATTCTGGCAGAATGGACAGCAAAACGTCTGGATTTTAAAAATGAAAAGCCTGATTTGTT
GGTATCTGTTCCGCTGCATCCGAAGAAGCTGAAAGAGCGCGGGTATAATCAGCTGCATCTGTTTACGGAAACCCTTTCGA
AACATTTTGATATTCCTTATGAACACACTTTAATCAAAAGAAATTTCTATAAAAAGGCACAGGCTCTGAAAGACAAAGCC
CACCGTTCGGAAACGGAAAAACTTTTTTCAATTTCAAAACCAGTTCAAAATCAACATGTTTTGCTTATTGATGATGTTTT
CACCACTGGAAATACGATGAGTTCCGTTGCGTGGGAAATTTTGAAAGAGGGGAATAATAAGGTGAGTGTATTGGTGATGG
CGGTGGATTGA

Domains


Predicted by InterproScan.

(2-35)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A3G8WIQ0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Riemerella anatipestifer ATCC 11845 = DSM 15868

52.315

100

0.523


Multiple sequence alignment