Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   EAG08_RS14850 Genome accession   NZ_CP033070
Coordinates   3239329..3239982 (-) Length   217 a.a.
NCBI ID   WP_129536116.1    Uniprot ID   A0A3G2GBP1
Organism   Chryseobacterium sp. 3008163     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 3234329..3244982
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  EAG08_RS14820 (EAG08_14965) - 3234630..3235298 (-) 669 WP_129536110.1 DUF4290 domain-containing protein -
  EAG08_RS14825 (EAG08_14970) - 3235416..3235829 (-) 414 WP_129536111.1 thiol-disulfide oxidoreductase DCC family protein -
  EAG08_RS14830 (EAG08_14975) - 3235841..3236275 (-) 435 WP_129536112.1 heme-binding domain-containing protein -
  EAG08_RS14835 (EAG08_14980) der 3236526..3237836 (+) 1311 WP_129536113.1 ribosome biogenesis GTPase Der -
  EAG08_RS14840 (EAG08_14985) upp 3237945..3238595 (+) 651 WP_129536114.1 uracil phosphoribosyltransferase -
  EAG08_RS14845 (EAG08_14990) - 3238596..3239285 (-) 690 WP_129536115.1 alpha/beta fold hydrolase -
  EAG08_RS14850 (EAG08_14995) comF 3239329..3239982 (-) 654 WP_129536116.1 ComF family protein Machinery gene
  EAG08_RS22965 - 3240070..3240219 (-) 150 WP_317126280.1 hypothetical protein -
  EAG08_RS14855 (EAG08_15000) - 3240231..3241544 (-) 1314 WP_317126281.1 helix-turn-helix transcriptional regulator -
  EAG08_RS14860 (EAG08_15005) aceB 3241672..3243246 (+) 1575 WP_129536117.1 malate synthase A -
  EAG08_RS14865 (EAG08_15010) aceA 3243400..3244683 (+) 1284 WP_410493339.1 isocitrate lyase -

Sequence


Protein


Download         Length: 217 a.a.        Molecular weight: 25448.40 Da        Isoelectric Point: 8.3568

>NTDB_id=321169 EAG08_RS14850 WP_129536116.1 3239329..3239982(-) (comF) [Chryseobacterium sp. 3008163]
MILDLFFPNRCIHCNRIIDGNLLVCSICFEQIHFTHFDYLSENSLKEKCKLLFPIENAYALMQFEQENLSRKIIHQLKYR
SREKTGKTVADWVTERLDFKSEKPDVLVSVPLHPKKEKERGYNQLHLFTKTLSDSYGIPFEHHLLKRNHYSKAQALKDKQ
HRLETQNTFSLTKKISNQHILLIDDVFTTGNTLATIAWEILKEGNNKVSILVMAMDE

Nucleotide


Download         Length: 654 bp        

>NTDB_id=321169 EAG08_RS14850 WP_129536116.1 3239329..3239982(-) (comF) [Chryseobacterium sp. 3008163]
ATGATTTTAGATTTATTTTTCCCAAACCGCTGCATTCATTGCAATAGAATTATTGATGGCAACTTACTTGTTTGCAGTAT
TTGTTTTGAACAGATTCACTTTACTCATTTTGATTATTTATCTGAAAACAGCTTAAAAGAAAAATGCAAATTGTTATTTC
CGATTGAGAATGCATACGCTTTGATGCAGTTTGAACAGGAAAACTTAAGCCGGAAAATCATTCACCAATTAAAGTACAGA
AGTCGAGAAAAAACCGGAAAAACTGTTGCAGATTGGGTGACAGAAAGATTAGATTTTAAAAGCGAAAAGCCGGATGTACT
AGTCAGCGTTCCTCTTCATCCGAAGAAAGAAAAGGAAAGAGGTTATAATCAATTGCATCTGTTCACAAAAACGCTTTCTG
ATTCTTATGGAATTCCATTTGAGCATCATTTACTGAAGAGAAATCATTACTCAAAAGCTCAAGCGTTGAAAGATAAACAA
CACCGTCTTGAAACTCAAAATACTTTCTCATTAACAAAAAAGATTTCTAATCAACACATCTTATTGATTGATGATGTTTT
TACAACCGGAAATACTTTGGCAACAATTGCATGGGAAATTCTGAAAGAAGGAAATAATAAAGTGAGCATTTTGGTAATGG
CAATGGATGAATAG

Domains


Predicted by InterproScan.

(2-34)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A3G2GBP1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Riemerella anatipestifer ATCC 11845 = DSM 15868

52.995

100

0.53


Multiple sequence alignment