Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   CO230_RS08135 Genome accession   NZ_CP023540
Coordinates   1758815..1759468 (+) Length   217 a.a.
NCBI ID   WP_122028143.1    Uniprot ID   -
Organism   Chryseobacterium sp. 6424     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1753815..1764468
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CO230_RS08115 (CO230_08110) aceA 1754272..1755558 (-) 1287 WP_122028934.1 isocitrate lyase -
  CO230_RS08120 (CO230_08115) aceB 1755590..1757167 (-) 1578 WP_122028140.1 malate synthase A -
  CO230_RS12590 (CO230_08120) - 1757333..1758043 (+) 711 WP_122028141.1 helix-turn-helix domain-containing protein -
  CO230_RS12595 (CO230_08125) - 1758048..1758809 (+) 762 WP_410492839.1 hypothetical protein -
  CO230_RS08135 (CO230_08130) comF 1758815..1759468 (+) 654 WP_122028143.1 ComF family protein Machinery gene
  CO230_RS08140 (CO230_08135) upp 1759465..1760115 (-) 651 WP_122028144.1 uracil phosphoribosyltransferase -
  CO230_RS08145 (CO230_08140) der 1760194..1761504 (-) 1311 WP_122028145.1 ribosome biogenesis GTPase Der -
  CO230_RS08150 (CO230_08145) murA 1761661..1762968 (-) 1308 WP_122028146.1 UDP-N-acetylglucosamine 1-carboxyvinyltransferase -
  CO230_RS08155 (CO230_08150) - 1762972..1763634 (-) 663 WP_122028147.1 DUF4290 domain-containing protein -
  CO230_RS08160 (CO230_08155) - 1763703..1764116 (-) 414 WP_122028148.1 thiol-disulfide oxidoreductase DCC family protein -

Sequence


Protein


Download         Length: 217 a.a.        Molecular weight: 24927.76 Da        Isoelectric Point: 8.1741

>NTDB_id=248281 CO230_RS08135 WP_122028143.1 1758815..1759468(+) (comF) [Chryseobacterium sp. 6424]
MSVTDLLFPNRCLECNRIIAADDLVCALCYSLINFRHYDFEEKNPLQEKCGLLFPVKYAFGLMQFEDESLSRKIVHQLKY
ANREKVGSILADWVAERVKFNKDKPDVLITIPLHPKKMKSRGYNQLHLFAERLSQRLGIPCDHSLLKRNSHKTAQARKDR
KHRAETENQFSCTREVSGQHILIIDDVFTTGNTMSSAAWELLQSGDNMVSVLVMAVD

Nucleotide


Download         Length: 654 bp        

>NTDB_id=248281 CO230_RS08135 WP_122028143.1 1758815..1759468(+) (comF) [Chryseobacterium sp. 6424]
ATGTCAGTCACCGATCTGCTATTCCCGAACCGCTGCCTGGAGTGTAACAGGATTATCGCCGCAGATGATCTTGTTTGCGC
GTTGTGCTATAGCCTGATTAACTTCCGGCATTATGATTTTGAGGAAAAAAATCCATTACAGGAGAAATGTGGTCTGTTAT
TTCCGGTGAAATATGCATTTGGACTGATGCAATTTGAAGATGAAAGCCTGAGCCGGAAAATTGTACATCAACTTAAATAT
GCCAACCGGGAGAAAGTGGGCAGCATCCTTGCCGACTGGGTTGCGGAAAGAGTTAAATTCAACAAAGACAAACCGGACGT
ACTAATCACCATTCCGCTGCATCCAAAGAAAATGAAATCCCGAGGCTATAATCAGTTACATTTATTTGCAGAACGCTTAT
CACAAAGATTAGGCATCCCGTGTGACCACAGTCTGCTAAAAAGAAATTCGCACAAAACAGCACAGGCCAGAAAAGACCGT
AAACACCGCGCAGAAACGGAAAACCAATTTTCGTGCACGCGAGAAGTATCAGGTCAGCATATACTGATCATTGATGATGT
TTTCACGACCGGCAATACCATGAGTTCGGCCGCGTGGGAGTTGCTACAATCGGGTGACAACATGGTTTCTGTACTTGTAA
TGGCGGTGGATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Riemerella anatipestifer ATCC 11845 = DSM 15868

51.643

98.157

0.507


Multiple sequence alignment