MindSpore Data Special Interest Group (SIG)

This is the working repo for the Data special interest group (SIG). It contains all the artifacts, materials, meeting notes, and proposals regarding dataset (data processing) and mindrecord (data format) in MindSpore. Feedback and contributions are welcome.

Data Processing: This is the Dataset component. It reads the user's data into a Dataset, applies data augmentation and processing operations (such as resize, onehot, rotate, shuffle, and batch), and then provides the Dataset to the training process.
Data Format: This component normalizes the user's training data into a unified format (MindRecord). The user defines a schema for the training data and calls the Python API to convert the data into MindRecord files. The files are then read into a Dataset through MindDataset and provided to the training process.
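The read, transform, shuffle, and batch flow described above can be illustrated with a minimal plain-Python sketch. This is not the MindSpore API: the `pipeline` and `one_hot` helpers and the toy `raw` data are hypothetical names used only for illustration, and the real Dataset component provides its own operations for these steps.

```python
import random
from typing import Callable, Iterator, List, Sequence, Tuple

Sample = Tuple[float, int]


def one_hot(label: int, num_classes: int) -> List[int]:
    """Encode an integer label as a one-hot list (hypothetical helper)."""
    return [1 if i == label else 0 for i in range(num_classes)]


def pipeline(
    samples: Sequence[Sample],
    transform: Callable[[Sample], tuple],
    batch_size: int,
    seed: int = 0,
) -> Iterator[list]:
    """Shuffle the samples, apply a transform to each, and yield batches."""
    rng = random.Random(seed)
    shuffled = list(samples)  # copy so the caller's data is left untouched
    rng.shuffle(shuffled)
    batch = []
    for sample in shuffled:
        batch.append(transform(sample))
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch  # keep the final partial batch


# Hypothetical toy data: (feature, label) pairs with three classes.
raw = [(float(i), i % 3) for i in range(7)]
batches = list(pipeline(raw, lambda s: (s[0], one_hot(s[1], 3)), batch_size=3))

for b in batches:
    print(len(b), b)
```

Each batch is what a training loop would consume in one step. The final batch is smaller because seven samples do not divide evenly by three.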

SIG Leads

Liu Cunwei (Huawei)

Logistics

SIG leads will drive the meeting.
Meeting announcements will be posted on our Gitee channel: https://gitee.com/mindspore/community/tree/master/sigs/data
Feedback and topic requests are welcome from everyone.

Discussion

Slack channel: https://app.slack.com/client/TUKCY4QDR/C010RPN6QNP?cdn_fallback=2
Documents and artifacts: https://gitee.com/mindspore/community/tree/master/sigs/data

Representative videos

MindSpore data processing introduction
MindSpore data loading and data format conversion
Optimize data processing

Main issues to be solved

We invite developers to join us in building a better Dataset processing system. The main issues for each quarter are listed below.
Please comment on an issue if you have any questions; this helps communication. You can also find all related issues on Gitee by filtering with the label comp/data.

Main issues of Q2

Meeting notes

Thursday April 2, 2020
Friday May 15, 2020
Wednesday June 03, 2020
Friday July 03, 2020
Wednesday August 05, 2020
Thursday August 06, 2020
Thursday September 03, 2020
Friday October 16, 2020
Wednesday November 04, 2020
Monday November 23, 2020
Wednesday April 14, 2021
