Detailed information    

In silico   Bioinformatically predicted

Overview


Name   waaF
Type   Regulator
Locus tag   R8947_RS03180
Genome accession   NZ_AP028371
Coordinates   639360..640313 (-)
Length   317 a.a.
NCBI ID   WP_002839071.1
UniProt ID   A0A5Z1EHW0
Organism   Campylobacter coli strain BCH-10879
Function   represses natural transformation (predicted from homology)
Competence regulation
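
The coordinates and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with one stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def span_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is part of the CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


cds_bp = span_length_bp(639360, 640313)
print(cds_bp, protein_length_aa(cds_bp))  # 954 317
```

The result agrees with the 954 bp nucleotide length and the 317 a.a. protein length reported in this record.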

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
Prophage   635710..655048   639360..640313   within   0
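
The relative position and distance in the summary can be reproduced from the two coordinate ranges. This is a minimal sketch, not the VRprofile2 method: it assumes 1-based inclusive intervals, and the labels other than "within" as well as the distance definition (gap between the nearest ends) are hypothetical choices made for illustration.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples, 1-based and inclusive.
    Returns (label, distance_bp). Only the "within" label appears in the
    record; the other labels are assumptions for this sketch.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        return "upstream", m_start - g_end
    if g_start > m_end:
        return "downstream", g_start - m_end
    return "overlapping", 0


print(relative_position((639360, 640313), (635710, 655048)))  # ('within', 0)
```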


Gene organization within MGE regions


Location: 635710..655048
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
R8947_RS03160 (B10879_06280)   rfaD   635710..636663 (+)   954   WP_002780616.1   ADP-glyceromanno-heptose 6-epimerase   -
R8947_RS03165 (B10879_06290)   rfaE1   636656..638041 (+)   1386   WP_002786305.1   D-glycero-beta-D-manno-heptose-7-phosphate kinase   -
R8947_RS03170 (B10879_06300)   gmhA   638038..638598 (+)   561   WP_002806601.1   D-sedoheptulose 7-phosphate isomerase   -
R8947_RS03175 (B10879_06310)   -   638598..639353 (+)   756   WP_072214063.1   glycosyltransferase family 25 protein   -
R8947_RS03180 (B10879_06320)   waaF   639360..640313 (-)   954   WP_002839071.1   glycosyltransferase family 9 protein   Regulator
R8947_RS03185 (B10879_06330)   -   640372..641221 (+)   850   Protein_628   glycosyltransferase family 2 protein   -
R8947_RS03190 (B10879_06350)   -   641599..641922 (-)   324   WP_072214065.1   hypothetical protein   -
R8947_RS03195   -   642433..642714 (-)   282   Protein_630   hypothetical protein   -
R8947_RS03200 (B10879_06370)   -   642829..643770 (+)   942   WP_072214066.1   glycosyltransferase family 2 protein   -
R8947_RS03205 (B10879_06380)   -   643758..644969 (+)   1212   WP_072214067.1   glycosyltransferase family 8 protein   -
R8947_RS03210 (B10879_06390)   -   644956..646011 (-)   1056   WP_057991437.1   glycosyltransferase family 4 protein   -
R8947_RS03215 (B10879_06400)   -   646008..647555 (-)   1548   WP_072214068.1   glycosyltransferase   -
R8947_RS03220 (B10879_06410)   -   647552..648439 (-)   888   WP_072214069.1   lipid A biosynthesis lauroyl acyltransferase   -
R8947_RS03225 (B10879_06420)   waaC   648432..649460 (-)   1029   WP_072214070.1   lipopolysaccharide heptosyltransferase I   -
R8947_RS03230 (B10879_06430)   -   649525..650316 (+)   792   WP_025443299.1   3'-5' exonuclease   -
R8947_RS03235 (B10879_06440)   galE   650373..651359 (+)   987   WP_002786297.1   UDP-glucose 4-epimerase GalE   -
R8947_RS03240 (B10879_06450)   -   651353..653053 (+)   1701   WP_002786296.1   ABC transporter ATP-binding protein   -
R8947_RS03245 (B10879_06460)   -   653050..654126 (+)   1077   WP_002780642.1   glycosyltransferase   -
R8947_RS03250 (B10879_06470)   -   654119..655048 (+)   930   WP_002777213.1   glycosyltransferase family 2 protein   -

Sequence


Protein


Length: 317 a.a.   Molecular weight: 36557.15 Da   Isoelectric point: 10.2875

>NTDB_id=104999 R8947_RS03180 WP_002839071.1 639360..640313(-) (waaF) [Campylobacter coli strain BCH-10879]
MKIFIHLPTWLGDAVMASPALYAIKEHFKNAQFILYGSLVSTALFKEFPNSKIIIENKQSRYKQALSLRKELGKIDFSFA
FRSAFSSKIILHILKTKQRYFFDKNKHKEEHQVLKYLYFIENSLSIKAHFKDLKLPFKLKFQNPLVLKNGKKILGLNPGA
SFGSAKRWDVSYFAKVALNFSQSHEILIFGAGKAEQELCNEIYQILKEQNIKVKNLCNKTTIKTLCQNIAFCDLFITNDS
GPMHISAVYKVKTVAIFGPTKFTQTSPWQNENARWVHLNLACMPCMQKTCPLKHHKCMKDLKPEIIIEVSKQLLFLN
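
The FASTA header of this record packs several fields into one line. The sketch below shows one way to split it with a regular expression. The field layout is inferred only from the header shown in this record, so the pattern and the field names (`ntdb_id`, `locus_tag`, and so on) are assumptions rather than a documented format.

```python
import re

# Pattern inferred from the single header in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; numeric fields become int."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the expected layout")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (">NTDB_id=104999 R8947_RS03180 WP_002839071.1 "
          "639360..640313(-) (waaF) [Campylobacter coli strain BCH-10879]")
print(parse_header(header)["gene"])  # waaF
```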

Nucleotide


Length: 954 bp

>NTDB_id=104999 R8947_RS03180 WP_002839071.1 639360..640313(-) (waaF) [Campylobacter coli strain BCH-10879]
ATGAAAATTTTTATACATCTTCCCACTTGGCTAGGCGATGCAGTGATGGCTTCGCCTGCTTTATACGCTATAAAAGAACA
TTTTAAAAATGCCCAATTTATCCTTTATGGCTCTTTGGTTTCCACAGCACTTTTTAAAGAATTTCCTAATTCTAAAATCA
TCATAGAAAATAAACAATCTCGCTATAAACAAGCCCTATCTTTACGCAAAGAACTTGGCAAAATCGATTTTAGCTTTGCT
TTTAGGTCTGCGTTTTCTTCTAAGATTATCTTACATATTCTTAAAACAAAGCAAAGATATTTTTTTGACAAAAACAAGCA
CAAAGAAGAACATCAAGTTTTAAAATACCTTTATTTTATAGAAAACTCACTTAGTATAAAAGCTCATTTTAAAGACTTAA
AGCTTCCCTTTAAGCTAAAATTTCAAAACCCTCTTGTCTTAAAAAATGGTAAAAAAATTCTAGGACTCAACCCTGGTGCA
AGCTTTGGAAGTGCAAAAAGATGGGATGTGAGTTATTTTGCTAAAGTGGCTTTAAATTTCAGCCAAAGTCATGAGATTTT
AATCTTTGGTGCAGGAAAAGCCGAACAAGAACTTTGTAATGAAATTTATCAAATTTTAAAAGAACAAAACATAAAAGTAA
AAAATCTTTGCAATAAAACTACCATCAAAACCCTTTGTCAAAATATCGCTTTTTGCGATCTTTTCATCACAAATGACAGT
GGACCTATGCACATAAGTGCGGTTTATAAAGTAAAAACCGTGGCTATTTTTGGGCCTACCAAATTTACTCAAACTTCACC
TTGGCAAAATGAAAATGCAAGATGGGTGCATTTAAATCTAGCTTGTATGCCTTGTATGCAAAAAACCTGCCCTTTAAAAC
ACCACAAATGCATGAAAGATTTAAAACCAGAAATCATCATAGAAGTAAGTAAACAATTGCTATTTCTAAATTAA
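
The nucleotide sequence above starts with ATG and ends with TAA, so it appears to be given in the coding orientation even though the gene lies on the minus strand. As a rough check, the first and last codons can be translated and compared with the protein sequence. This is a minimal sketch using the standard codon assignments, which bacterial translation table 11 shares apart from its alternative start codons; the `translate` helper is illustrative and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAAAATTTTTATACATCTTCCC"))  # first 24 nt -> MKIFIHLP
print(translate("TTGCTATTTCTAAATTAA"))        # last 18 nt  -> LLFLN
```

Both fragments match the start (MKIFIHLP) and the end (LLFLN) of the protein sequence listed in this record.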


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source   ID
AlphaFold DB   A0A5Z1EHW0

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
waaF   Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819   93.631   99.054   0.927


Multiple sequence alignment