A program that predicts the outcome of NBA games using Hadoop and Spark.

houony/NBA_Predictor

Basketball-Reference Webcrawler

Scrapy-based webcrawler which collects all data for a specific NBA season from basketball-reference.com.

Prerequisites

Requires the scrapy and pandas Python packages.

Usage

The webcrawler can be started from the project directory using the command

scrapy crawl basketball-reference -a season=2020

where the season for which data should be collected is given by the season argument (default is current season).
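
As a rough illustration of how the season argument could map to a schedule page, the sketch below fills the URL template listed in the Dataset section. The helper names (default_season, schedule_url) and the rule used for the default are assumptions for illustration, not the crawler's actual code: the sketch assumes a season is named after the year in which it ends and that a new season starts in October.

```python
from datetime import date
from typing import Optional

# URL template quoted in this README for the season schedule pages.
SCHEDULE_URL = "https://www.basketball-reference.com/leagues/NBA_{season}_games.html"


def default_season(today: date) -> int:
    """Assumed rule (not taken from the crawler): a season is named after
    the year it ends in, and a new season starts in October."""
    return today.year + 1 if today.month >= 10 else today.year


def schedule_url(season: Optional[int] = None, today: Optional[date] = None) -> str:
    """Build the schedule URL, falling back to the assumed default season."""
    if season is None:
        season = default_season(today or date.today())
    return SCHEDULE_URL.format(season=season)


if __name__ == "__main__":
    # Mirrors `scrapy crawl basketball-reference -a season=2020`.
    print(schedule_url(2020))
```

Passing season explicitly mirrors the -a season=2020 argument; omitting it falls back to the assumed default.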

Dataset

  • odds data : Odds data are collected with sbrscrape, which scrapes FanDuel odds ➡️ test.py
      python3 test.py
    In test.py, year = ["2023", "2024"] and season = ["2023-24"]; change these values to the season you want to collect.
    Betting information for tomorrow's games is also available ➡️ bet_api.py, accessible with your own API key (https://api.the-odds-api.com/v4/sports)
  • season data : Season data are collected from https://www.basketball-reference.com/leagues/NBA_{self.season}_games.html ➡️ br_spider.py
      scrapy crawl basketball-reference -a season=2020 ➡️ crawl command; change the season argument to the season you want.
  • merged data : Odds data and season data are merged on [date, home, away] ➡️ data_preprocess.py
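
The merge step can be sketched with pandas as follows. This is a minimal sketch, not the contents of data_preprocess.py: only the key columns [date, home, away] come from this README, while the function name merge_odds_and_games, the other column names, and the sample rows are made up for illustration.

```python
import pandas as pd

MERGE_KEYS = ["date", "home", "away"]  # key columns named in this README


def merge_odds_and_games(odds: pd.DataFrame, games: pd.DataFrame) -> pd.DataFrame:
    """Join season data with odds data on (date, home, away).

    Dates are normalized first so that string and datetime columns compare equal.
    An inner join keeps only games that have an odds row.
    """
    odds = odds.copy()
    games = games.copy()
    for df in (odds, games):
        df["date"] = pd.to_datetime(df["date"])
    return games.merge(odds, on=MERGE_KEYS, how="inner")


if __name__ == "__main__":
    # Placeholder rows; team names, scores, and odds are invented for the example.
    games = pd.DataFrame({
        "date": ["2024-01-01", "2024-01-02"],
        "home": ["Team A", "Team C"],
        "away": ["Team B", "Team D"],
        "home_pts": [110, 98],
        "away_pts": [102, 105],
    })
    odds = pd.DataFrame({
        "date": ["2024-01-01"],
        "home": ["Team A"],
        "away": ["Team B"],
        "home_odds": [-150],
    })
    print(merge_odds_and_games(odds, games))
```

The inner join drops games without a matching odds row; how="left" would keep them with missing odds values instead. If the two sources spell team names differently, the names need to be normalized before merging.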

NBA_Predictor

A program that predicts the outcome of NBA games using Hadoop and Spark.

You can ignore the first commit.
