Detailed information    

In silico: bioinformatically predicted

Overview


Name: waaF
Type: Regulator
Locus tag: CLCT_RS05990
Genome accession: NZ_CP043426
Coordinates: 1161040..1161954 (-)
Length: 304 a.a.
NCBI ID: WP_149062571.1
UniProt ID: -
Organism: Campylobacter lari subsp. concheus strain LMG 21009
Function: represses natural transformation (predicted from homology)
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates     Gene coordinates    Relative position   Distance (bp)
Prophage   1155218..1179494    1161040..1161954    within              0
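
The "within" call and the distance of 0 bp follow directly from comparing the two coordinate ranges. The sketch below illustrates one way such a classification could be computed. It is not VRprofile2's actual method; the function name `relative_position` and the labels "upstream", "downstream" and "overlapping" are assumptions introduced for illustration (only "within" appears in this record). Coordinates are treated as 1-based and inclusive.

```python
def relative_position(gene, mge):
    """Classify a gene interval against an MGE interval.

    Both arguments are (start, end) tuples, 1-based and inclusive.
    Returns (label, distance_bp). Labels other than "within" are
    hypothetical names chosen for this sketch.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if g_start >= m_start and g_end <= m_end:
        return "within", 0                     # gene fully inside the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end     # gene ends before the MGE starts
    if g_start > m_end:
        return "downstream", g_start - m_end   # gene starts after the MGE ends
    return "overlapping", 0                    # partial overlap at one boundary


# Coordinates taken from the summary table above.
waaf = (1161040, 1161954)
prophage = (1155218, 1179494)
print(relative_position(waaf, prophage))
```

With the coordinates from this record, the gene range lies entirely inside the prophage range, so the function returns the "within" label with distance 0.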


Gene organization within MGE regions


Location: 1155218..1179494
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CLCT_RS05960 (CLCT_1183) - 1155218..1156375 (+) 1158 WP_039668725.1 peptidoglycan DD-metalloendopeptidase family protein -
  CLCT_RS05965 (CLCT_1184) mgtE 1156377..1157720 (+) 1344 WP_039668726.1 magnesium transporter -
  CLCT_RS05970 (CLCT_1185) tpx 1157733..1158251 (+) 519 WP_149062569.1 thiol peroxidase -
  CLCT_RS05975 (CLCT_1186) - 1158248..1159633 (-) 1386 WP_039668728.1 replicative DNA helicase -
  CLCT_RS05980 (CLCT_1187) cysK 1159646..1160548 (-) 903 WP_149062570.1 cysteine synthase A -
  CLCT_RS05985 (CLCT_1188) - 1160646..1160948 (-) 303 WP_039619003.1 HU family DNA-binding protein -
  CLCT_RS05990 (CLCT_1189) waaF 1161040..1161954 (-) 915 WP_149062571.1 glycosyltransferase family 9 protein Regulator
  CLCT_RS05995 (CLCT_1190) - 1162011..1162826 (+) 816 WP_149062572.1 glycosyltransferase family 2 protein -
  CLCT_RS06000 (CLCT_1191) - 1162874..1163548 (+) 675 WP_249040763.1 glycosyltransferase family 25 protein -
  CLCT_RS06005 (CLCT_1192) rfbA 1163545..1164420 (+) 876 WP_149062574.1 glucose-1-phosphate thymidylyltransferase RfbA -
  CLCT_RS06010 (CLCT_1193) rfbB 1164421..1165449 (+) 1029 WP_149062575.1 dTDP-glucose 4,6-dehydratase -
  CLCT_RS06015 (CLCT_1194) wlaRB 1165451..1165855 (+) 405 WP_149062576.1 class E lipooligosaccharide biosynthesis 3,4-ketoisomerase WlaRB -
  CLCT_RS06020 (CLCT_1195) - 1165845..1166585 (+) 741 WP_249040749.1 bifunctional 2-polyprenyl-6-hydroxyphenol methylase/3-demethylubiquinol 3-O-methyltransferase UbiG -
  CLCT_RS06025 (CLCT_1196) - 1166582..1167511 (+) 930 WP_149062577.1 GNAT family N-acetyltransferase -
  CLCT_RS06030 (CLCT_1197) - 1167508..1167903 (+) 396 WP_149062578.1 adenylyltransferase/cytidyltransferase family protein -
  CLCT_RS06035 (CLCT_1198) - 1167903..1169168 (+) 1266 WP_149062579.1 CDP-glycerol glycerophosphotransferase family protein -
  CLCT_RS06040 (CLCT_1199) - 1169165..1170223 (+) 1059 WP_149062580.1 glycosyltransferase family 2 protein -
  CLCT_RS06045 (CLCT_1200) - 1170228..1171313 (+) 1086 WP_149062581.1 DegT/DnrJ/EryC1/StrS aminotransferase family protein -
  CLCT_RS06050 (CLCT_1201) - 1171310..1172467 (-) 1158 WP_149062582.1 glycosyltransferase family 2 protein -
  CLCT_RS06055 (CLCT_1202) - 1172471..1174018 (-) 1548 WP_149062583.1 glycosyltransferase -
  CLCT_RS06060 (CLCT_1203) - 1174027..1174899 (-) 873 WP_149062584.1 lipid A biosynthesis lauroyl acyltransferase -
  CLCT_RS06065 (CLCT_1204) waaC 1174896..1175915 (-) 1020 WP_149062585.1 lipopolysaccharide heptosyltransferase I -
  CLCT_RS06070 (CLCT_1205) - 1175982..1176761 (+) 780 WP_149062586.1 3'-5' exonuclease -
  CLCT_RS06075 (CLCT_1206) galE 1176820..1177806 (+) 987 WP_039626879.1 UDP-glucose 4-epimerase GalE -
  CLCT_RS06080 (CLCT_1207) - 1177800..1179494 (+) 1695 WP_149062587.1 ABC transporter ATP-binding protein -

Sequence


Protein


Length: 304 a.a.        Molecular weight: 34781.12 Da        Isoelectric point: 10.3198

>NTDB_id=383981 CLCT_RS05990 WP_149062571.1 1161040..1161954(-) (waaF) [Campylobacter lari subsp. concheus strain LMG 21009]
MNIFINLPTWLGDAVMASAAIYAIKEKYPQAKFTFYGSFVSTELFKYFENAQILVENKKQRYKQILKARKNLGKFDLAFS
FRSAFSSKIILNLIKAKKRFYFDKNILKEEHQVLKYLNFIEKALNFKATLNVLKLPIKAKSTQKLLGINAGAHFGSAKRW
EASYFARVAKEFSFTHKILIFGVESEREICDEIEHYLLKEGIKAKNLCGKTNIYTLCKNISMLDLLITNDSGPMHIGAVY
GVKTVAIFGSTKFSQTSPWQKNAKIAHLNLACMPCMQKTCPLKHHKCMKDLKPEVVINLAKTFF

Nucleotide


Length: 915 bp

>NTDB_id=383981 CLCT_RS05990 WP_149062571.1 1161040..1161954(-) (waaF) [Campylobacter lari subsp. concheus strain LMG 21009]
ATGAATATTTTTATCAACCTTCCCACTTGGCTTGGCGATGCGGTGATGGCTAGTGCGGCTATTTATGCTATAAAAGAAAA
ATACCCCCAAGCTAAATTTACTTTTTATGGTTCTTTTGTGAGTACAGAGCTTTTTAAATACTTTGAAAATGCTCAAATTT
TAGTAGAAAATAAAAAACAAAGATATAAGCAAATTTTAAAAGCTAGAAAAAACCTTGGTAAATTTGATTTAGCTTTTTCG
TTTCGCTCGGCATTCTCAAGCAAGATTATTTTAAATCTAATCAAAGCAAAAAAAAGATTTTATTTTGATAAAAATATTCT
AAAAGAAGAACACCAAGTCTTAAAATACTTAAATTTCATAGAAAAAGCTTTAAATTTTAAAGCAACTTTAAATGTTTTAA
AACTCCCTATAAAAGCAAAATCAACTCAAAAACTTTTGGGTATAAATGCAGGTGCGCATTTTGGAAGTGCGAAAAGATGG
GAAGCGAGTTATTTTGCAAGGGTAGCAAAAGAATTTAGTTTTACGCATAAAATTTTAATCTTTGGAGTAGAAAGCGAGAG
AGAAATTTGTGATGAAATTGAACATTATCTTTTAAAAGAAGGCATAAAGGCAAAAAATCTTTGTGGTAAAACTAACATTT
ATACTTTATGTAAAAATATTTCTATGCTTGATTTACTCATCACAAACGATAGTGGTCCTATGCATATAGGTGCAGTTTAT
GGGGTGAAAACAGTAGCTATTTTTGGTTCGACTAAATTTAGTCAAACTTCACCTTGGCAAAAAAATGCTAAAATAGCGCA
TTTAAATTTAGCTTGTATGCCTTGTATGCAAAAGACTTGTCCTTTAAAACATCACAAATGCATGAAAGACTTAAAGCCTG
AGGTGGTGATAAATTTAGCAAAAACATTTTTTTAA
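
The reported lengths can be cross-checked against the coordinates in the FASTA header: the span 1161040..1161954 covers 915 bp, which is 305 codons, that is 304 amino acids plus the terminal TAA stop codon. The snippet below is a minimal sketch of that check; `parse_header` is a hypothetical helper, and the regular expression assumes the header layout shown in this record.

```python
import re

# FASTA header copied from this record.
HEADER = (">NTDB_id=383981 CLCT_RS05990 WP_149062571.1 "
          "1161040..1161954(-) (waaF) "
          "[Campylobacter lari subsp. concheus strain LMG 21009]")


def parse_header(line):
    """Extract (start, end, strand) from a 'start..end(strand)' token."""
    match = re.search(r"(\d+)\.\.(\d+)\(([+-])\)", line)
    if match is None:
        raise ValueError("no coordinate token found in header")
    return int(match.group(1)), int(match.group(2)), match.group(3)


start, end, strand = parse_header(HEADER)
span_bp = end - start + 1        # inclusive coordinates
codons = span_bp // 3
protein_len = codons - 1         # the last codon is the stop codon

print(span_bp, codons, protein_len, strand)
```

Both sequences above are given in the coding orientation, so no reverse complement is needed even though the gene lies on the minus strand.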


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
waaF   Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819   64.968   100   0.671


Multiple sequence alignment