OLA_project

Online Learning Application Project

Course held @ Politecnico di Milano
Acadamic year 2021 - 2022

Developed by

Name	Surname	person code	github
Sofia	Martellozzo	10623060	link
Vlad Marian	Cimpeanu	10606922	link
Lorenzo	Rossi	10698834	link

Topic

Social influence techniques and Pricing

Motivation

Nowadays one big problem of e-commerces is to allocate the best price to its products so that, the seller can maximize its margin.
The main issue is that increasing the price of a product leads to fewer people interested in that product, thus increasing the price is not necessarily beneficial to the seller. In contrast, decreasing the price will increase the number of people interested in the product, but the revenue will be of course sub-optimal.
In order to maximize the revenue, we can analyze the demand curve of a given product, which is a graphical representation of the relationship between the price $p_i$ of a good or service $i$ and the quantity demanded $q_i(p_i)$ for a given period of time, and find the price $\hat{p}$ such that:

$\hat{p} = argmax(pq(p))$

Unfortunately, in real-world problems, the demand curve is not available, furthermore, we need to estimate this curve by interacting with the environment. One main problem of interacting with an unknown environment is that exploration costs a lot of money, so we want to find the best prices in the shortest amount of time to decrease the regret. In order to do so, we can use reinforcement learning techniques such as Multi Armed Bandit (MAB) algorithms.

Problem

In our project we deal with a website supported by a recommender system: once a customer buys a specific product, it will suggest up to 2 new products (called secondary products), thus, a customer might buy multiple products during the same visit.
Here an example of the recommender system graph: From this graph we can see for instance, if a customer decides to buy a shirt, the system will suggest to buy a hoodie and a t-shirt.
This means our algorithm should consider the indirect reward a specific product may lead.
For more details about the environment and its settings, read the report.
In order to solve this problem:

each day we estimate the customers' conversion rate with MAB algoirhtms based on the collected observations.
according to the estimated conversion rates, we solve a combinatorial problem to find the best prices using a dynamic programming approach. DP gives a correct estimate of the margin given by the prices (assuming the conversion rate estimates are correct) but is computationally expensive, thus, this approach is affordable in small optimization problems as this one. In case of more complex problems, one may consider using Montecarlo simulations to have a raw estimate of the reward in reasonable time.

We compare the performances achieved by UCB-1 and Thompson sampling algorithms.

Repository structure

Code folder contains all the classes used to solve the problem:
- environment package content is mainly used to simulate the environment, which our alogrithms interact with.
- data folder contains json files having all the parameters used to model the customers (real conversion rates, click rates, ...) and the graph used by the recommender system.
Report folder contains the latex code used for the pdf report.

Here a simplified UML of the project:

Set up

Use the following command to compile the report.

pdflatex -shell-escape  main.tex

Name		Name	Last commit message	Last commit date
Latest commit History 141 Commits
Code		Code
Report		Report
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
customers_parameters.ipynb		customers_parameters.ipynb
step2.ipynb		step2.ipynb
step3.ipynb		step3.ipynb
step4.ipynb		step4.ipynb
step5.ipynb		step5.ipynb
step6.ipynb		step6.ipynb
step7.ipynb		step7.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OLA_project

Table of contents

Developed by

Topic

Motivation

Problem

Repository structure

Set up

About

Releases

Packages

Contributors 3

Languages

License

VladMarianCimpeanu/OLA_project

Folders and files

Latest commit

History

Repository files navigation

OLA_project

Table of contents

Developed by

Topic

Motivation

Problem

Repository structure

Set up

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages