alisonqiu/Peak-ProphetPro

Peak-ProphetPro

Inspiration

Peak-ProphetPro was inspired by the Chevron challenge, focusing on predicting peak oil production across various well profiles.

What it does

Peak-ProphetPro combines a neural network for predicting peak oil production with a full-stack data visualization dashboard built with Taipy! The dashboard lets users examine the dataset and generate dynamic graphs, including heatmaps, scatter plots, and histograms.

[Screen recording, 2024-01-20]

How we built it

Handling Missing Values:

We first restricted the data to rows with a non-missing target value, then assessed the proportion of missing values in each variable. For instance, we dropped the 'frac_type' variable because every row with a non-missing target had the same 'frac_type', so it gave the model nothing to learn from.
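The filtering step can be sketched roughly as follows. This is a minimal illustration with pandas, not the project's actual code: the helper name `drop_uninformative_columns`, the target column `peak_oil`, and the `depth` column are hypothetical, and only 'frac_type' comes from the description above.

```python
import numpy as np
import pandas as pd

def drop_uninformative_columns(df: pd.DataFrame, target: str):
    """Keep rows with a known target, then drop columns that are constant there.

    Returns the cleaned frame and the per-column share of missing values
    (computed on the labeled rows only).
    """
    labeled = df[df[target].notna()]
    missing_share = labeled.isna().mean()
    # A column with at most one distinct non-missing value carries no signal.
    constant_cols = [
        col for col in labeled.columns
        if col != target and labeled[col].nunique(dropna=True) <= 1
    ]
    return labeled.drop(columns=constant_cols), missing_share

# Toy data: 'frac_type' only varies in the row whose target is missing.
df = pd.DataFrame({
    "frac_type": ["A", "A", "B", "A"],
    "depth": [100.0, np.nan, 300.0, 250.0],
    "peak_oil": [10.0, 12.0, np.nan, 9.0],
})
cleaned, missing_share = drop_uninformative_columns(df, target="peak_oil")
print(list(cleaned.columns))
print(missing_share.round(2).to_dict())
```

Checking constancy after filtering on the target matters here: in the toy data 'frac_type' has two values overall but only one among the labeled rows.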

Imputing Numerical Values:

To decide between the mean and the median for imputing numerical values, we examined the data distributions. Because the distributions were skewed or contained outliers, we opted for the median, which is more robust for asymmetric distributions and outliers.
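A small sketch of that decision, assuming pandas and a hypothetical numeric column named `proppant`: the skewness is inspected first, then missing entries are filled with the column median.

```python
import pandas as pd

def impute_numeric_with_median(df: pd.DataFrame, columns: list) -> pd.DataFrame:
    """Return a copy of df with NaNs in the given columns replaced by each column's median."""
    out = df.copy()
    for col in columns:
        out[col] = out[col].fillna(out[col].median())
    return out

# Toy column with one large outlier (100.0) and one missing value.
df = pd.DataFrame({"proppant": [1.0, 2.0, 3.0, None, 100.0]})

skewness = df["proppant"].skew()      # strongly positive for this column
mean_value = df["proppant"].mean()    # pulled up by the outlier
median_value = df["proppant"].median()

filled = impute_numeric_with_median(df, ["proppant"])
print(skewness > 0, mean_value, median_value)
```

In this toy column the mean is 26.5 while the median is 2.5, so a mean fill would place the missing well far from the bulk of the data.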

Imputing Categorical Variables:

We decided against the common approach of replacing missing categorical values with the most frequent category, because the frequencies of the top few categories did not differ significantly. We first tried random sampling, and later improved performance by building a predictive model to impute the missing values.
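Model-based imputation of a categorical column could look something like the sketch below. The README does not say which model was used, so the `RandomForestClassifier` is an assumption, as are the helper name and the columns `well_type`, `depth`, and `lateral_length`.

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

def impute_categorical_with_model(df, target_col, feature_cols):
    """Fill missing values of target_col by predicting them from feature_cols.

    Assumes feature_cols are numeric and fully populated.
    """
    out = df.copy()
    known = out[target_col].notna()
    if known.all() or not known.any():
        return out  # nothing to impute, or nothing to learn from
    clf = RandomForestClassifier(n_estimators=50, random_state=0)
    clf.fit(out.loc[known, feature_cols], out.loc[known, target_col])
    out.loc[~known, target_col] = clf.predict(out.loc[~known, feature_cols])
    return out

# Toy data: two well-separated groups, with the category missing in the last two rows.
df = pd.DataFrame({
    "depth": [100, 110, 120, 900, 910, 920, 105, 915],
    "lateral_length": [10, 11, 12, 90, 91, 92, 10, 91],
    "well_type": ["X", "X", "X", "Y", "Y", "Y", None, None],
})
imputed = impute_categorical_with_model(df, "well_type", ["depth", "lateral_length"])
print(imputed["well_type"].tolist())
```

Unlike a most-frequent fill, this approach lets rows with different feature values receive different categories.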

Modeling:

We built four baseline models (Linear Regression, Decision Trees, Random Forest, XGBoost) and an ensemble of Random Forest and XGBoost. We also built a Neural Network with four hidden layers. The best-performing final model is a Stacking Ensemble of three base models: RandomForestRegressor, XGBRegressor, and LinearRegression. These are combined using a StackingRegressor whose final estimator is another LinearRegression model.

[Screenshot, 2024-01-20]
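The stacking setup described above can be sketched with scikit-learn as follows. The hyperparameters, the synthetic data from `make_regression`, and the 5-fold setting are assumptions for illustration only. If `xgboost` is not installed, the sketch falls back to scikit-learn's `GradientBoostingRegressor` as a stand-in, which is not the model named above.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import (
    GradientBoostingRegressor,
    RandomForestRegressor,
    StackingRegressor,
)
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

try:
    from xgboost import XGBRegressor
    boosted = XGBRegressor(n_estimators=100, random_state=0)
except ImportError:
    # Stand-in only, used when xgboost is unavailable.
    boosted = GradientBoostingRegressor(n_estimators=100, random_state=0)

# Synthetic regression data in place of the well dataset.
X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

stack = StackingRegressor(
    estimators=[
        ("rf", RandomForestRegressor(n_estimators=100, random_state=0)),
        ("xgb", boosted),
        ("lr", LinearRegression()),
    ],
    final_estimator=LinearRegression(),  # combines the base models' predictions
    cv=5,  # out-of-fold predictions are used to fit the final estimator
)
stack.fit(X_train, y_train)
r2 = stack.score(X_test, y_test)
print(f"Held-out R^2: {r2:.3f}")
```

The final estimator is trained on cross-validated predictions from the base models, which limits leakage from the base models' fit on the training data.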

What's next for Peak-ProphetPro

Although we wrote the code for model training and prediction, we didn't have time to expose it as a webpage, so next we will integrate model training, prediction, and management functionality into the Taipy visualization tool!
