This repository contains the data and setup for the CP4D Tutorial, which is based on Cloud Pak for Data v2.1. For more information on this data science platform, please visit ibm.com. Cloud Pak for Data (CP4D) provides an end-to-end, integrated, and governed data and analytics platform where Data Engineers, Data Stewards, Data Scientists, and Business Users collaborate to bring forward the best insights from the enterprise's existing data.
- Download and load the core setup modules.
- Import the dataset into IBM CP4Data.
- Prepare and shape the dataset using Data Transform.
- Using the imported Jupyter notebook, train a simple linear regression model.
- Save the resulting model into CP4Data.
- Use the saved model for predictions.
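The train, save, and predict steps above can be sketched in plain Python. This is only an illustration under stated assumptions: it does not use the tutorial's notebook, its dataset, or any Cloud Pak for Data API. The toy data, the JSON file name, and all function names are hypothetical, and a local JSON file stands in for saving the model into the platform.

```python
import json
import os
import tempfile


def fit_simple_linear_regression(xs, ys):
    """Ordinary least squares for one feature: y = slope * x + intercept."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return {"slope": slope, "intercept": intercept}


def save_model(model, path):
    """Persist the fitted coefficients (stand-in for saving to the platform)."""
    with open(path, "w") as f:
        json.dump(model, f)


def load_model(path):
    """Read the coefficients back from disk."""
    with open(path) as f:
        return json.load(f)


def predict(model, x):
    """Apply the fitted line to a new input value."""
    return model["slope"] * x + model["intercept"]


# Hypothetical toy data (not the tutorial dataset), generated from y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

model = fit_simple_linear_regression(xs, ys)

with tempfile.TemporaryDirectory() as tmp_dir:
    model_path = os.path.join(tmp_dir, "linear_model.json")
    save_model(model, model_path)
    restored = load_model(model_path)

prediction = predict(restored, 5.0)
print(prediction)
```

In the tutorial itself, the notebook and the platform's own model repository take the place of these helper functions.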
- [Data Shaping and Cleansing](https://en.wikipedia.org/wiki/Data_science): Tools to shape and prepare the data.
- Data Science: Systems and scientific methods to analyze structured and unstructured data in order to extract knowledge and insights.
- Artificial Intelligence: Artificial intelligence can be applied to disparate solution spaces to deliver disruptive technologies.
- Python: Python is a programming language that lets you work more quickly and integrate your systems more effectively.
Follow these steps to create the required services and run the notebook locally.
Clone the Cloud Pak for Data tutorial repository locally. In a terminal, run the following command:
$ git clone git@github.com:IBM-ICP4D/ICP4DTutorial.git
$ ./load_samples.sh --list
- banking
- manufacturing
- retail
Pass the domain you are interested in to the loader with the `-t` option:
$ ./load_samples.sh -t banking
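If the loader is driven from a script rather than typed by hand, the domain can be checked before the command is run. The sketch below is a hypothetical helper, not part of the repository: `build_loader_command` and `KNOWN_DOMAINS` are illustrative names, and the domain list is copied from the sample `--list` output above, so it may not match what the script reports in a given release.

```python
# Domains copied from the sample `--list` output above (an assumption;
# run `./load_samples.sh --list` to see the actual values).
KNOWN_DOMAINS = ("banking", "manufacturing", "retail")


def build_loader_command(domain, script="./load_samples.sh"):
    """Return the argument list for loading one sample domain."""
    if domain not in KNOWN_DOMAINS:
        raise ValueError(
            f"unknown domain {domain!r}; expected one of {', '.join(KNOWN_DOMAINS)}"
        )
    # `-t <domain>` is the option shown in the command above.
    return [script, "-t", domain]


command = build_loader_command("banking")
print(" ".join(command))

# To actually run it from the cloned repository directory:
#   import subprocess
#   subprocess.run(command, check=True)
```

Building the command as a list rather than a single string avoids shell quoting issues when it is handed to `subprocess.run`.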
- Data Analytics Code Patterns: Enjoyed this Code Pattern? Check out our other Data Analytics Code Patterns
- AI and Data Code Pattern Playlist: Bookmark our playlist with all of our Code Pattern videos
- Watson Studio: Master the art of data science with IBM's Watson Studio
For further questions, please contact the ICP4D Customer Experience Team or Slack us.