Detailed information    

insolico Bioinformatically predicted

Overview


Name   waaF   Type   Regulator
Locus tag   P4E82_RS03275 Genome accession   NZ_OX442413
Coordinates   658793..659737 (-) Length   314 a.a.
NCBI ID   WP_052774038.1    Uniprot ID   -
Organism   Campylobacter coli isolate pure strain     
Function   repress natural transformation (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 648659..686929 658793..659737 within 0


Gene organization within MGE regions


Location: 648659..686929
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  P4E82_RS03225 - 648659..650215 (-) 1557 WP_318764467.1 DNA polymerase III subunit gamma/tau -
  P4E82_RS03230 rho 650218..651516 (-) 1299 WP_002780613.1 transcription termination factor Rho -
  P4E82_RS03235 - 651629..653992 (+) 2364 WP_072216344.1 heavy metal translocating P-type ATPase -
  P4E82_RS03240 - 653985..654185 (+) 201 WP_002777240.1 cbb3-type cytochrome oxidase assembly protein -
  P4E82_RS03245 - 654215..654514 (-) 300 WP_002777238.1 cytochrome c -
  P4E82_RS03250 - 654591..655154 (+) 564 WP_002787747.1 D-glycero-beta-D-manno-heptose 1,7-bisphosphate 7-phosphatase -
  P4E82_RS03255 rfaD 655151..656104 (+) 954 WP_038853884.1 ADP-glyceromanno-heptose 6-epimerase -
  P4E82_RS03260 rfaE1 656097..657482 (+) 1386 WP_318764468.1 D-glycero-beta-D-manno-heptose-7-phosphate kinase -
  P4E82_RS03265 gmhA 657479..658039 (+) 561 WP_002831598.1 D-sedoheptulose 7-phosphate isomerase -
  P4E82_RS03270 - 658039..658800 (+) 762 WP_002826801.1 glycosyltransferase family 25 protein -
  P4E82_RS03275 waaF 658793..659737 (-) 945 WP_052774038.1 lipopolysaccharide heptosyltransferase II Regulator
  P4E82_RS03280 - 659796..660617 (+) 822 WP_032685539.1 glycosyltransferase family 2 protein -
  P4E82_RS09570 - 660958..661104 (-) 147 WP_168848228.1 hypothetical protein -
  P4E82_RS03285 - 661101..661337 (-) 237 WP_255296982.1 alpha-2,3-sialyltransferase -
  P4E82_RS09575 - 661345..661658 (-) 314 Protein_644 alpha-2,3-sialyltransferase -
  P4E82_RS03295 - 661786..662586 (+) 801 WP_002777224.1 class I SAM-dependent methyltransferase -
  P4E82_RS03300 - 662667..663602 (+) 936 WP_002777223.1 glycosyltransferase family 2 protein -
  P4E82_RS03305 - 663590..664801 (+) 1212 WP_002826798.1 glycosyltransferase family 8 protein -
  P4E82_RS03310 - 664788..665843 (-) 1056 WP_002783678.1 glycosyltransferase family 4 protein -
  P4E82_RS03315 - 665840..667387 (-) 1548 WP_002777220.1 glycosyltransferase -
  P4E82_RS03320 - 667384..668271 (-) 888 WP_002780636.1 lipid A biosynthesis lauroyl acyltransferase -
  P4E82_RS03325 waaC 668264..669292 (-) 1029 WP_002787741.1 lipopolysaccharide heptosyltransferase I -
  P4E82_RS03330 - 669357..670148 (+) 792 WP_025443299.1 3'-5' exonuclease -
  P4E82_RS03335 galE 670205..671191 (+) 987 WP_002777216.1 UDP-glucose 4-epimerase GalE -
  P4E82_RS03340 - 671185..672885 (+) 1701 WP_318764469.1 ATP-binding cassette domain-containing protein -
  P4E82_RS03345 - 672882..673958 (+) 1077 WP_002783686.1 glycosyltransferase -
  P4E82_RS03350 - 673951..674880 (+) 930 WP_002777213.1 glycosyltransferase family 2 protein -
  P4E82_RS03355 - 674868..675965 (+) 1098 WP_002777212.1 glycosyltransferase -
  P4E82_RS03360 - 675969..678113 (+) 2145 WP_318764470.1 STT3 domain-containing protein -
  P4E82_RS03365 pglA 678125..679255 (+) 1131 WP_002777209.1 N, N'-diacetylbacillosaminyl-diphospho-undecaprenol alpha-1,3-N-acetylgalactosaminyltransferase -
  P4E82_RS03370 pglC 679248..679850 (+) 603 WP_002790131.1 undecaprenyl phosphate N,N'-diacetylbacillosamine 1-phosphate transferase -
  P4E82_RS03375 pglD 679837..680424 (+) 588 WP_002845778.1 UDP-N-acetylbacillosamine N-acetyltransferase -
  P4E82_RS03380 pglE 680579..681739 (+) 1161 WP_002777202.1 UDP-N-acetylbacillosamine transaminase -
  P4E82_RS03385 pglF 681742..683514 (+) 1773 WP_002777199.1 UDP-N-acetylglucosamine 4,6-dehydratase (configuration-retaining) -
  P4E82_RS03390 - 683729..684121 (+) 393 WP_002777196.1 chemotaxis response regulator CheY -
  P4E82_RS03395 - 684135..684983 (+) 849 WP_002777195.1 50S ribosomal protein L11 methyltransferase -
  P4E82_RS03400 ftsH 684983..686929 (+) 1947 WP_002780655.1 ATP-dependent zinc metalloprotease FtsH -

Sequence


Protein


Download         Length: 314 a.a.        Molecular weight: 36095.63 Da        Isoelectric Point: 10.2686

>NTDB_id=1158368 P4E82_RS03275 WP_052774038.1 658793..659737(-) (waaF) [Campylobacter coli isolate pure strain]
MKVFIHLPTWLGDAVMTSPALYGVYHYFKNAEFILYGSFVSTALFKEFPNAKIIVENKKSRYKQALSLRKKLGKIDLALS
FRSALSSKIILHILKVKQRYFFNKYDFKEEHQVLKYLYFIENSLEIKAHSKDLKLPFKLKFQNPIVLKNGKKILGLNPGA
SFGSAKRWDASYFAQVALNFSKTHEILIFGAGKAEQELCDEIFHILKEKNVKVKNLCNKTTIKTLCQNIVFCDIFITNDS
GPMHIAAAYKTKTIAIFGPTKFTQTSPWQNQNAKLVHLNLSCMPCMQKTCPLKHHRCMKDLKPEKILEAIQNFT

Nucleotide


Download         Length: 945 bp        

>NTDB_id=1158368 P4E82_RS03275 WP_052774038.1 658793..659737(-) (waaF) [Campylobacter coli isolate pure strain]
ATGAAAGTTTTTATCCATCTTCCCACTTGGCTAGGCGATGCAGTCATGACCTCACCTGCTTTATATGGAGTTTATCATTA
TTTTAAAAATGCTGAATTTATCCTTTATGGATCTTTTGTATCTACAGCGCTTTTTAAGGAATTTCCTAATGCTAAAATCA
TTGTAGAAAATAAAAAATCACGCTACAAACAAGCCCTTTCTTTACGCAAAAAACTTGGAAAAATCGATCTAGCCCTTTCT
TTTAGATCAGCCCTTTCCTCCAAGATCATTTTACACATCCTTAAGGTAAAACAAAGATATTTTTTTAACAAATATGATTT
TAAAGAAGAACATCAGGTTTTAAAATATCTTTATTTTATAGAAAATTCATTAGAAATTAAAGCTCATTCTAAGGATTTAA
AACTTCCTTTTAAATTAAAATTTCAAAATCCTATTGTTTTAAAAAATGGCAAAAAAATCCTAGGATTAAACCCTGGAGCG
AGTTTTGGAAGTGCTAAAAGATGGGATGCGAGCTATTTTGCACAAGTTGCTTTAAATTTTAGCAAAACTCATGAAATTTT
GATTTTTGGAGCAGGTAAAGCAGAACAAGAACTTTGCGATGAAATTTTTCATATACTAAAAGAAAAAAATGTCAAAGTAA
AAAATCTTTGCAATAAAACCACTATCAAAACCCTTTGTCAAAATATCGTTTTTTGTGATATTTTCATCACAAACGATAGC
GGCCCTATGCACATAGCTGCCGCTTATAAAACAAAAACCATAGCTATTTTTGGGCCTACTAAATTTACCCAAACCTCACC
TTGGCAAAATCAAAATGCAAAATTAGTACATCTAAATTTATCTTGCATGCCTTGCATGCAAAAAACCTGTCCTTTAAAAC
ATCATCGGTGCATGAAAGATTTAAAACCGGAAAAAATTTTAGAAGCAATCCAAAATTTTACTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  waaF Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819

82.748

99.682

0.825


Multiple sequence alignment