Skip to content

CodyPenny/predict_austin_prices

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 

Repository files navigation

Responsive Austin Real Estate Price Predictor


Introduction

15,175 listings from the Austin, TX area were sourced from a data science platform, then cleaned, evaluated, and preprocessed to build a machine learning model which predicts the value of the property. The dataset was then trained and fitted to a model which was next integrated to the backend of the web application.

A listing can have an excessive amount of detail unnecessary for the model. These were removed while nulls, duplicates, and outliers were also evaluated. Real estate prices were also appreciated to the current time scale to give a more accurate prediction.

Once the dataset is ready, an 80/20 training split was applied and a standard linear regression model was used for the prediction.

Project Description

The data frame form of the dataset was exported as a pickle file and the predictive model was built using Python's sci-kit learn library. 4 API endpoints were built with Python, one of which queries a third-party geolocation API to obtain the street address's latitude, longitude, and zipcode. The second API accepts the characteristics of the property and the third and fourth API returns the data for the UI visualization.

For deployment, Nginx was used as a reverse proxy and uWSGI as the application server in AWS EC2.

Technologies

  • Python's sci-kit learn
  • Flask
  • Geolocation
  • Swagger
  • React
  • Semantic UI
  • Styled Components
  • D3
  • AWS
  • Nginx
  • uWSGI

Backend Setup

To run this project, use the configuration file requirements.txt or the Pipfile to install specified packages with the specified version.

pipenv install --dev

Pipenv uses pip and virtualenv underneath. Activate the virtual environment:

pipenv run python server/app.py

Frontend Setup

Install specified packages

npm i

Run the project

npm run dev-server

Features

1.The app accepts an address string and validates the address with geolocation API integration. It uses the validated zipcode in its computation

Screen Shot 2022-02-26 at 1 58 37 AM

Here are starter addresses that can be used:

  • 7905 Bannock Ln
  • 11905 Glenda Ct

The app uses the geocoding API of https://positionstack.com/

  1. Uses a multi-step form process to collect details about the property with input validation checks along the way.

screen

screen

  1. The collected data runs through the model and computes a prediction.

screen

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published