Skip to content

ajnetzley/Seattle-Restaurant-Reviews

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Seattle-Restaurant-Reviews

Abstract:

This study explores online restaurant reviews in Seattle by analyzing data collected from Tripadvisor, Yelp, and Google. Relationships between rating and review count, cuisine type, platform, and neighborhood were analyzed via a suite of point and interval inferential procedures and statistical tests, including t-test, ANOVA, chi-square, and linear regression. We found that less-reviewed restaurants had a greater proportion of highly-rated restaurants on some platforms, possibly indicating overrating. We also found that ratings and rating distributions of top rated restaurants differed significantly by review platform. While Seattle lacks a definitive top cuisine, we discovered that it does have top neighborhoods, with certain areas consistently receiving higher ratings than others. We also discovered that review count and star rating were significant predictors for a restaurant’s Tripadvisor ranking. These findings suggest diners should be cautious of highly-rated restaurants with few reviews, consider the source of a rating, and explore neighborhood trends before putting too much stock in an online score.

Introduction:

When in an unfamiliar city and hungry, it’s common to let online restaurant reviews guide your choice of where to eat. Oftentimes, this strategy pays off, and well-reviewed locations serve amazing food. However, online ratings depend on a variety of factors that can move the goalposts and lead to misleading scores. Compared to other cities, Seattle seems especially problematic in this regard. Therefore, our group set out to address these difficulties and improve our decision-making as diners using data from top review platforms. In particular, after scraping and merging our cross-platform dataset, our group broke our efforts out into individual lines of questioning. These questions are listed below:

How does review count affect rating trustworthiness? Are certain cuisines rated significantly higher than others? Is there a bias related to reviews platform? Is there a difference in restaurant rating and price by neighborhood? Can Tripadvisor restaurant rankings be predicted using a regression model?

Repository Details

This repository contains three subfolders with data and code:

1.) data

Data contains all of the raw and processed data files from the webscraping, cleaning, and merging

2.) preprocessing

Preprocessing contains two notebooks, one that loads and cleans each raw data file, and another that merges the three cleaned data files together.

3.) analysis

Analysis contains a few notebooks containing the statistical analysis code for the specific questions we answered in this analysis.

Creators

This project was developed by the following members of the UW MSDS '25 cohort:

Alex Netzley, Jake Flynn, Edouard Seryozhenkov, Mark Daniel, Elliott Sanger

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •