The work is divided into two steps:
- Offline model
  - Form a bipartite graph
  - Cluster the queries into concepts (groups of similar queries)
  - Create a concept sequence suffix tree
- Online model
  - Map queries into concepts
  - Look up the suffix tree and suggest queries
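
The suffix tree steps above can be illustrated with a minimal sketch. It is not the code in `main.cpp`: every name here (`ConceptId`, `SuffixNode`, `ConceptSuffixTree`, `toConcepts`, `maxContext`) is hypothetical, concepts are assumed to be integer IDs already produced by the clustering step, and the ranking is a plain frequency count chosen only for illustration. The tree is keyed from the most recent concept backwards, so a lookup falls back to the longest context suffix it has seen.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <memory>
#include <string>
#include <unordered_map>
#include <vector>

// All names below are hypothetical and chosen for illustration only.
using ConceptId = int;

struct SuffixNode {
    // Children are keyed by the next-older concept in the context.
    std::map<ConceptId, std::unique_ptr<SuffixNode>> children;
    // How often each concept followed the context this node represents.
    std::map<ConceptId, int> nextCounts;
};

class ConceptSuffixTree {
public:
    // Offline step: for every position in a session, record the concept at
    // that position under each preceding context of length 1..maxContext.
    void addSession(const std::vector<ConceptId>& session,
                    std::size_t maxContext = 3) {
        assert(maxContext > 0);
        for (std::size_t i = 1; i < session.size(); ++i) {
            SuffixNode* node = &root_;
            for (std::size_t k = 1; k <= maxContext && k <= i; ++k) {
                std::unique_ptr<SuffixNode>& child =
                    node->children[session[i - k]];
                if (!child) child.reset(new SuffixNode());
                node = child.get();
                ++node->nextCounts[session[i]];
            }
        }
    }

    // Online step: walk the context from newest to oldest, stop at the
    // first mismatch, and return the most frequent follower of the deepest
    // node reached. Ties go to the smallest ID; -1 means no match at all.
    ConceptId suggest(const std::vector<ConceptId>& context) const {
        const SuffixNode* node = &root_;
        const SuffixNode* best = nullptr;
        for (auto it = context.rbegin(); it != context.rend(); ++it) {
            auto child = node->children.find(*it);
            if (child == node->children.end()) break;
            node = child->second.get();
            best = node;
        }
        if (best == nullptr) return -1;
        ConceptId bestConcept = -1;
        int bestCount = 0;
        for (const auto& entry : best->nextCounts) {
            if (entry.second > bestCount) {
                bestCount = entry.second;
                bestConcept = entry.first;
            }
        }
        return bestConcept;
    }

private:
    SuffixNode root_;
};

// Online step: map raw queries to concept IDs, skipping unknown queries.
std::vector<ConceptId> toConcepts(
    const std::vector<std::string>& queries,
    const std::unordered_map<std::string, ConceptId>& queryToConcept) {
    std::vector<ConceptId> concepts;
    for (const std::string& q : queries) {
        auto it = queryToConcept.find(q);
        if (it != queryToConcept.end()) concepts.push_back(it->second);
    }
    return concepts;
}
```

In this sketch, `addSession` would be called once per session during the offline phase. At query time, the user's recent queries go through `toConcepts` and the result is passed to `suggest`. Turning the returned concept back into concrete query suggestions (for example, its most popular member queries) is left out.
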

To build and run:

    g++ main.cpp
    ./a.out input_file

Here `input_file` should be your data log.

Due to copyright issues, we cannot share the dataset publicly.