LLM-ConvRec is a prompting-based, semi-structured conversational system that leverages the generative power of GPT to provide flexible and natural interaction. Unlike fully-structured conversational systems such as Siri, where utterances are often predefined and inflexible, LLM-ConvRec is designed for versatility and the production of more natural responses. Moreover, it incorporates past memory into the conversation, a feature often lacking in fully-structured systems.
While unstructured conversational systems like ChatGPT can produce fluid, engaging responses, their approach to utterance handling is often a "black box", which can lead to the generation of inappropriate or incorrect responses, or cause the conversation to go off the rails. This is where LLM-ConvRec distinguishes itself: although it provides the flexibility and naturalness of an unstructured system, its semi-structured nature ensures that utterance handling is not opaque, and that inappropriate responses can be avoided through structural constraints.
The system retains important information about the conversation, ensuring that context and past interactions are reflected in the responses. This makes LLM-ConvRec not just a conversational system, but a conversational partner capable of delivering precise, personalized recommendations across diverse domains.
- Introduction: A Semi-Structured Conversational Recommendation System
- Example Conversation
- Quick Start
- Installation and Running the System
- Overall Conversation Flow
- Domain Initialization and Customization
Demonstration using the restaurant domain:
User: I am looking for Japanese restaurants that serve excellent sushi, preferably in a casual setting.
Recommender: Can you provide the location?
User: Tower Road NW & Kingsway NW
Recommender: How about trying Washoku Bistro for a comfortable and laid-back vibe while enjoying some delicious Japanese sushi? But if you're looking for something quick and easy without sacrificing taste, Tokyo Express is another great option near the same location.
User: What kind of menu do they offer?
Recommender: For Washoku Bistro, they offer a lunch menu with bento boxes and entrées, with limited options. They also have sushi rolls, such as spicy salmon rolls and California rolls. As for Tokyo Express, they have a wide variety of options including bento boxes, sushi combos, feature rolls, tempura, and noodles.
User: Ok, then the first one doesn't seem to match my preference.
Recommender: I'm sorry that you did not like the recommendation. Is there anything else I can assist you with?
User: Does Tokyo Express have a parking lot?
Recommender: Tokyo Express has a parking lot.
User: Great. I will go there then. Thanks!
Recommender: Great! Enjoy your meal! If you need any more assistance, feel free to ask.
Here is the link to the Google Colab for a quick start:
https://apoj.short.gy/d3m-llm-convrec-demo
Before you can use the system, you must first ensure that you have Python (version 3.7 or higher) installed on your machine. You will also need pip for installing the necessary Python packages.
Clone the repository from GitHub to your local machine by running the following command in your terminal:
git clone https://github.com/D3Mlab/llm-convrec.git
Once you've cloned the repository, use the command line to navigate into the project's directory:
cd llm-convrec
The project has a number of dependencies that need to be installed. These are listed in the requirements.txt
file. To install these dependencies, run the following command in your terminal:
pip install -r requirements.txt
Before you can run the system, you need an API key from OpenAI. This key enables the model to interact with the OpenAI's servers to process and generate conversational responses.
Here are the steps to obtain an API key:
-
Visit OpenAI's website and create an account or log in if you already have an account.
-
Navigate to the 'API Keys' section in your account settings.
-
Click on the 'Create a new API key' button.
-
Name your key and click on 'Create secret key' to generate your new API key.
Note that the system cannot be used without entering your credit card information to your OpenAI account since the system interactions exceed the free API limitations. A typical conversation costs < $0.02.
After you have the API key, you need to configure your .env
file:
-
Create a new file in your project root directory and name it
.env
. -
Inside the
.env
file, create a new line and writeOPENAI_API_KEY=
, and then paste your API key after the equals sign. For example:
OPENAI_API_KEY='sk1234567890abcdef`.
- Save the
.env
file.
Please ensure you do not upload your .env
file to public repositories to keep your OpenAI API key secure.
If you want to run the restaurant demo execute following command in the terminal:
python restaurant_main.py
The restaurant demo uses geocoding in order to verify location. It currently uses Nominatim
, but can be configured to use Google Maps API. To do this, you must create a Google API Key. To get the API Key, go to https://console.developers.google.com/ and navigate to 'Credentials' tab after logging in to your google cloud account and then click on 'CREATE CREDENTIALS'. Make sure to have both Geocoding API and Time Zone API services enabled for this API key.
Inside the .env
file, create a new line and write GOOGLE_API_KEY=
and then paste the API key in after the equals sign. For example:
GOOGLE_API_KEY = hghrjkdkxhgyrujjedksdk
Or, here is the link to the Google Colab for a quick start:
https://apoj.short.gy/d3m-llm-convrec-demo
The LLM-ConvRec system follows a precise process during each conversation turn to ensure the highest quality of responses. Let's break down this process, from the initial user input to the final recommender response:
Examples to demonstrate these categories are from the restaurant recommendation domain.
The conversation begins with user input. This input is passed to an Intent Classifier, which determines the user's intent. The intent could be any or multiple of the following:
- Provide Preference: The user expresses a preference or interest. (Example: "I'd like to eat some sushi.")
- Inquire: The user asks a question about a specific item or detail. (Example: "What's on their menu? Does the restaurant have a patio?")
- Accept/Reject Recommendation: The user responds to a previous recommendation made by the system.(Example: "Sure, that first one sounds good!")
The prompts used for intent classification can be found here: https://github.com/D3Mlab/llm-convrec/tree/main/prompt_files/user_intent_prompts
After identifying the user's intent, the system updates its internal state. This state stores critical information gathered during the conversation, including:
- User's Preferences and Constraints: These include their location, budget, dietary restrictions, etc.
- Current Item of Interest: The specific item (e.g., restaurant) the user is currently referring to or interested in.
- Accepted and Rejected Items: The system keeps track of the items that the user has accepted or rejected.
The prompts used for preferences and constraints extraction can be found here: https://github.com/D3Mlab/llm-convrec/tree/main/prompt_files/constraints_prompts
The prompts used for current, accepted, and rejected item extraction can be found here: https://github.com/D3Mlab/llm-convrec/tree/main/prompt_files/items_extractor_prompts
With the updated state, the system then decides the action to take next. This could be any of the following:
- Request Information: The system may ask the user for additional details to refine its understanding or recommendations. (Example: "Can you provide your location?")
- Give Recommendation: The system might suggest an item that matches the user's stated preferences.
- Answer a Question: If the user asked a question in their last utterance, the system would provide an appropriate answer.
Following the action classification, the system produces a response that corresponds with the determined action. This response creation leverages retrieval augmented information retrieval, which is particularly employed in 'recommend' and 'answer' actions.
The response is not only contextually in sync with the ongoing dialogue, but also respects the semi-structured conversation pattern that the system adheres to.
The prompts used for recommendation and explanation can be found here: https://github.com/D3Mlab/llm-convrec/tree/main/prompt_files/recaction_prompts/recommend_prompts
The prompts used for answering questions can be found here: https://github.com/D3Mlab/llm-convrec/tree/main/prompt_files/recaction_prompts/answer_prompts
This system is designed to be flexible and adaptable, allowing you to initialize and customize your own domain. With a configuration process involving providing some key files, you can utilize our robust system architecture tailored to your specific needs.
If you're looking to get started quickly, we've already set up a pre-configured domain: Restaurant.
-
Restaurant Domain: This domain utilizes a Yelp dataset containing all Edmonton restaurants. The domain is already initialized and ready to use, providing a wide range of restaurant data.
Due to data available, the location is restricted to Edmonton, Canada.
Here is the link to the Google Colab for a quick start:
https://apoj.short.gy/d3m-llm-convrec-demo
The domain specific files reside in the domain_specific folder, where there are two subfolders: classes
and configs
.
- The
classes
folder stores all of the user defined domain specific classes (more information on this below), which are completely optional but allow for a better user experience. - The
config
folder stores all of the domain specific configs necessary in order to make a recommendation.
If you want to follow our structure, create a folder representing the new domain in both the config
and classes
folders. For example, we named the restaurant domain folders restaurant_configs
(domain_specific/configs/restaurant_configs
) and restaurants (domain_specific/classes/restaurants
).
Note that key files should be stored in domain_specific folder, with all CSV and yaml files being stored in domain_specific/configs/<YOUR_DOMAIN_CONFIG_FOLDER>
.
To provide personalized recommendations, the LLM-ConvRec system takes into account user constraints that can be both explicit (provided directly by the user) or implicit (derived from the user's input). For efficient constraint management, it is crucial to set up a constraints_config.csv
that defines the various constraints and their properties.
The constraints_config.csv
file should include the following columns:
- key: The constraint's key name.
- description: Description of the constraint.
- is_cumulative: A Boolean value (TRUE or FALSE) indicating if the constraint's value is cumulative. If
is_cumulative
is TRUE, the system appends the newly extracted values to the existing values of the constraint rather than overwriting them. If FALSE, any newly identified value replaces the previous value. - in_explanation: A Boolean value (TRUE or FALSE) indicating if the constraint's value should be considered when giving explanation about the item to the user, during recommendation stage.
- default_value: The default value of the constraint when it is not specified by the user.
Below is an example of how the constraints_config.csv
file should look:
key | description | is_cumulative | in_explanation | default_value |
---|---|---|---|---|
location | The desired location of the restaurants. | FALSE | FALSE | None |
cuisine type | The desired specific style of cooking or cuisine offered by the restaurants (e.g., "Italian", "Mexican", "Chinese"). This can be implicitly provided through dish type (e.g "italian" if dish type is "pizza"). | FALSE | TRUE | None |
dish type | The desired menu item or dish in the restaurant that user shows interests. | TRUE | TRUE | None |
type of meal | The desired category of food consumption associated with specific times of day (e.g., "breakfast", "lunch", "dinner"). | TRUE | TRUE | None |
The configuration of these constraints will allow the system to capture user preferences more accurately, leading to more personalized and relevant recommendations.
Few-shot examples are crucial in training the LLM-ConvRec system. They are a set of input-output pairs that demonstrate the type of behavior we want the system to exhibit. In the context of our system, few-shot examples help train the classifiers and provide the necessary prompts for information extraction.
When providing few-shot examples, make sure that they are representative of the tasks you want the model to perform. For instance, if you want the system to recognize when a user is expressing a preference, include examples where users express preferences in different ways.
Please note that these examples are illustrative. The content of your files will be determined by the specific nature of your domain and the complexity of the user's queries. Remember, the quality of the few-shot examples can significantly impact the performance of the system. Carefully curating these examples will lead to a more responsive and accurate conversational system.
For effective intent classification, few-shot examples must be provided for each intent. This should be done in the form of CSV files with two columns: User Input
and Response
. User Input
should contain examples of user utterances, while Response
indicates whether the input corresponds to the respective intent (True) or not (False).
This file should contain examples of user utterances that express acceptance of a recommendation.
User Input | Response |
---|---|
That sounds good, let's go there | True |
I don't like that type of food | False |
This file should contain examples where the user rejects a recommendation.
User Input | Response |
---|---|
No, I don't want to go to that place | True |
Sure, that sounds nice | False |
This file should contain examples where the user is inquiring or asking a question.
User Input | Response |
---|---|
What's on their menu? | True |
I think we should try something else | False |
The 'constraints_updater_fewshots.csv' file plays an instrumental role in enhancing the personalization of recommendations by keeping track of explicit and implicit user preferences. It's fundamental in identifying and storing the constraints from the user's conversation, categorizing them as hard or soft constraints. Hard constraints are strict requirements, such as a specific location or cuisine type, which must be adhered to. On the other hand, soft constraints denote user preferences that are desirable but not necessarily mandatory, like a preference for a place offering free parking or a patio.
This file should be structured with five columns: 'user_input', 'old_hard_constraints', 'old_soft_constraints', 'new_hard_constraints', and 'new_soft_constraints'.
- 'user_input' represents a user's utterance.
- 'old_hard_constraints' and 'old_soft_constraints' denote the previously identified hard and soft constraints, respectively.
- 'new_hard_constraints' and 'new_soft_constraints' represent the updated set of hard and soft constraints after processing the user's input.
Here is an example of the structure of this CSV file:
user_input | old_hard_constraints | old_soft_constraints | new_hard_constraints | new_soft_constraints |
---|---|---|---|---|
pizza and pasta | "location=["Toronto"]" | "location=["toronto"], cuisine type=["italian"], dish type=["pizza", "pasta"]" | ||
does it have a patio? | "location=["jasper avenue, edmonton"], cuisine type=["japanese"]" | "price range=["moderate"], others=["free parking"]" | "location=["jasper avenue, edmonton"], cuisine type=["japanese"]" | "price range=["moderate"], others=["free parking", "patio"]" |
What kind of menu does I Love Sushi offer? | "location=["jasper avenue"], cuisine type=["japanese"]" | "location=["jasper avenue"], cuisine type=["japanese"]" |
The ability to track and update these evolving constraints allows the system to fine-tune its recommendations, significantly enhancing the overall conversation experience.
This section provides details about the few-shot prompt CSV files required for the 'Answer' recommender action.
This file helps in mapping user queries to metadata categories. It needs two columns: input
(user's question) and output
(one of the metadata field or one of the key in optional field).
input | output |
---|---|
What's their addresses? | address |
Can you recommend any dishes or specialties? | none |
Can I make a reservation? | HasReservations |
What are the meals it's known for? | PopularMeals |
This file instructs the model to extract answers from reviews based on the user's question. It requires question
(user's question), information
(reviews retrieved by the system), and answer
(the answer to the question derived from the provided information).
Note that the answer "I do not know." to the corresponding prompt is a special one that triggers a default response that is responsible for the situation where the question is not relevant to the information we have access to. Hence, one of the fewshot examples should always be an example question that should be answered with "I do not know.". This is illustrated in the below example too.
question | information | answer |
---|---|---|
Do they have a slide in the restaurant? | I really like this place. They have great food. | I do not know. |
This file aids the system in breaking down complex user queries into simpler, individual questions. It requires question
(user's question) and individual_questions
(decomposed questions).
question | individual_questions |
---|---|
Do they have wine? | Do they have wine? |
What are dishes, cocktails and types of wine do you recommend? | What dishes do you recommend?\nWhat cocktails do you recommend?\nWhat types of wine do you recommend? |
The hard_coded_responses.csv
file includes specific, structured responses that the system should use under certain conditions. This file has three columns: Action
, Response
, and Constraints
.
- Action: This column refers to the action that the system should take.
- Response: This column provides the exact response that should be given when the action is chosen.
- Constraints: This column specifies one or more constraints. If any of these constraints are missing in the system's state, then the corresponding response will be prioritized. If no constraints are applicable, this field should be left empty.
Descriptions of each action:
- PostAcceptanceAction: The response given after the user accepts a recommendation.
- PostRejectionAction: The response given after the user rejects a recommendation.
- RequestInformation: The system is requesting additional information from the user. The required information type depends on the 'Constraints' column.
- DefaultResponse: Used when the system failed to classify an intent and cannot decide on any specific action until the user provides more information.
- NoRecommendation: This response is given when the system cannot find a restaurant that meets the user's constraints.
- NoAnswer: Used when the system cannot answer the user's question with the information it has retrieved.
- InitMessage: This message is given to the user at the beginning.
Here are some examples:
Action | Response | Constraints |
---|---|---|
PostAcceptanceAction | Great! If you need any more assistance, feel free to ask. | |
PostRejectionAction | I'm sorry that you did not like the recommendation. Is there anything else I can assist you with? | |
RequestInformation | Could you provide the location? | location |
RequestInformation | Could you provide the cuisine type or dish type? | cuisine type, dish type |
RequestInformation | Do you have any other preferences? | |
DefaultResponse | Could you provide more information? | |
NoRecommendation | Sorry, there is no restaurant that matches your constraints. | |
NoAnswer | Please only ask questions about previously recommended restaurant. | |
InitMessage | Hello there! I am a restaurant recommender. Please provide me with some preferences for what you are looking for. For example, location, cuisine type, or dish type. Thanks! |
By providing these hard-coded responses, you can control the behavior of the system and ensure that the conversation flow remains on track.
The filter_config.csv
is a configuration file that allows you to specify the filters you want to apply in the system. This CSV file consists of three columns: type_of_filter
, key_in_state
and metadata_field
.
-
type_of_filter
: This column specifies the type of filter you want to use. Valid types include"exact word matching"
,"word in"
,"value range"
, and"item"
. -
key_in_state
: This column lists the keys in the hard constraint that you want to check againstmetadata_field
. Depending on the filter type, this could be a list of keys or a single key. -
metadata_field
: This column contains the field you want to check againstkey_in_state
.
Here is an example of what this file might look like:
type_of_filter | key_in_state | metadata_field |
---|---|---|
word in | cuisine type, dish type | categories |
item | recommended_items | name |
Below, we describe several types of filters that can be used in the system. They allow you to filter out items based on various criteria.
This filter retains an item if any value in key_in_state
exactly matches a value in metadata_field
. This filter is case insensitive.
Type of filter: "exact word matching"
key_in_state: List of keys in the hard constraint you want to check against metadata_field
.
metadata_field: The field you want to check against key_in_state
. The value in this field should be a string, a list-like string (e.g. “A, B, C”), or a list. If it is neither a string nor a list, the item will be retained.
This filter retains an item if any value in key_in_state
is present in metadata_field
, or vice versa. It also considers plural forms. This filter is case insensitive.
Type of filter: "word in"
key_in_state: List of keys in the hard constraint you want to check against metadata_field
.
metadata_field: The field you want to check against key_in_state
. The value in this field should be a string, a list-like string (e.g. “A, B, C”), or a list. If it is neither a string nor a list, the item will be retained.
This filter retains an item if any value in metadata_field
falls within a value range in key_in_state
, or if any value range in metadata_field
overlaps with a value range in key_in_state
.
Type of filter: "value range"
key_in_state: A key in the hard constraint containing the value ranges to check against metadata_field
. Value ranges should be in the format of "(lower bound value (with units))-(upper bound value (with units))".
metadata_field: The field you want to check against key_in_state
. The value in this field should be a string (either a value or a value range), a list-like string of values (e.g. “A, B, C”), or a list of values. If it is neither a string nor a list, the item will be retained.
This filter retains an item if it is not in the list of recommended items specified by key_in_state
.
Type of filter: "item"
key_in_state: A key in the state manager that contains a list of recommended items to check against metadata_field
.
metadata_field: The field to use for checking whether an item is not in the value of key_in_state
. Must be either item_id
or name
.
The LLM-ConvRec system requires two main types of data: metadata and reviews.
The metadata must include name (name) and unique item identifiers (item_id) as keys(columns). Each item can have various other keys representing different types of metadata, such as location, type of cuisine, cost, etc. It is not necessary for all items to have a value for every metadata field.
In addition, the metadata must have an "optional" key where the value contains the key-value pairs of optional categories.
An example of a metadata structure is as follows:
{
"item_id": "001",
"name": "Brits Fish & Chips",
"address": "6940 77 Street NW",
"city": "Edmonton",
"categories": ["Fish & Chips", "Restaurants"],
"optional": {"GoodForKids": "True", "OutdoorSeating": "False"}
}...
The review data must have two keys(columns): "item_id" and "Review".
An example of a review file structure is as follows:
item_id | Review |
---|---|
001 | great! ... |
001 | I like it ... |
002 | this restaurant... |
Note that the order of item_id in the review data must correspond to the order in the metadata.
User defined classes are used to implement domain specific tasks, for example, merging constraints in a specialized way. You can create your own implementations of the following classes:
Defines special way to merge constraint
Example implementation: Location Merger
(merges two location using geocoding)
Path to abstract class: state/constraints/constraint_merger.py
Allows constraint to have custom status and corresponding to response from the recommender
Example implementation: Location Status
(location can be “invalid”, “valid”, or “specific”)
Path to abstract class: state/status.py
Defines a way of filtering item based on the constraints and metadata
Path to abstract class: information_retriever/filter/filter.py
Example implementation: Location Filter
(filters based off of location)
Once all the mentioned csv files and data files have been created, they can be put into one folder under the domain_specific/configs
folder, and then modify the PATH_TO_DOMAIN_CONFIGS
to be:
PATH_TO_DOMAIN_CONFIGS: "domain_specific/configs/<YOUR_DOMAIN_CONFIG_FOLDER>"
Then, create a domain_specific_config.yaml
file and modify the file paths, similar to the below example:
DOMAIN: < DOMAIN_NAME_AS_A_NOUN >
EXPLANATION_METADATA_BLACKLIST: < all metadata keys that should be ignored when giving explanation of the item to the user, during recommendation stage >
INQUIRE_CLASSIFICATION_FEWSHOTS_FILE: "inquire_classification_fewshots.csv"
ACCEPT_CLASSIFICATION_FEWSHOTS_FILE: "accept_classification_fewshots.csv"
REJECT_CLASSIFICATION_FEWSHOTS_FILE: "reject_classification_fewshots.csv"
CONSTRAINTS_CATEGORIES: "constraints_config.csv"
CONSTRAINTS_UPDATER_FEWSHOTS: "constraints_updater_fewshots.csv"
ACCEPTED_ITEMS_EXTRACTOR_FEWSHOTS_FILE: "accepted_items_extractor_fewshots.csv"
REJECTED_ITEMS_EXTRACTOR_FEWSHOTS_FILE: "rejected_items_extractor_fewshots.csv"
CURRENT_ITEMS_EXTRACTOR_FEWSHOTS_FILE: "current_items_extractor_fewshots.csv"
ANSWER_EXTRACT_CATEGORY_FEWSHOTS_FILE: "answer_extract_category_fewshots.csv"
ANSWER_IR_FEWSHOTS_FILE: "answer_ir_fewshots.csv"
ANSWER_SEPARATE_QUESTIONS_FEWSHOTS_FILE: "answer_separate_questions_fewshots.csv"
HARD_CODED_RESPONSES_FILE: "hard_coded_responses.csv"
FILTER_CONFIG_FILE: "filter_config.csv"
PATH_TO_ITEM_METADATA: "data/item_metadata.json"
PATH_TO_REVIEWS: "data/items_reviews.csv"
PATH_TO_EMBEDDING_MATRIX: "data/reviews_embedding_matrix.pt"
PATH_TO_DATABASE: "data/database.faiss"
If you do not want to follow our structure, then modify the paths above to where you stored the corresponding files.