This repository has been archived by the owner on Jun 6, 2024. It is now read-only.

P0 - Documentation ready for Beta Release #4023

Closed
74 of 94 tasks
scarlett2018 opened this issue Dec 17, 2019 · 2 comments

Comments

@scarlett2018
Member

scarlett2018 commented Dec 17, 2019

Checklist for public release

We plan to separate the docs into 4 parts:

  • Manual for data scientists: This manual is for data scientists to learn how to submit jobs, debug jobs, manage data, and use the Marketplace, the VSCode / Jupyter extensions, and CLI support on OpenPAI.
  • Manual for cluster administrators: This manual is for cluster administrators to learn the installation and uninstallation of OpenPAI, some basic management operations, storage management, troubleshooting, etc.
  • Manual for PAI developers: This manual is for potential PAI developers to quickly go through the architecture of OpenPAI and learn how to leverage the SDK and develop new plugins.
  • Design documentation (Reference): PAI design documentation and API reference for different PAI components.

I have created a branch, user_manual, for writing these 3 manuals; the manuals are at https://github.com/microsoft/pai/tree/user_manual/manual . The planned contents of these manuals are as follows:

Contents for data scientists manual

Contents for cluster administrator manual

  • Installation Guide (Quick Start)
  • Major breaking changes compared with the previous version
  • Installation FAQ (need help) @abuccts
    • Questions about Installation
    • Installation troubleshooting
  • Basic Management Operations
  • How to Manage Users and Groups
  • How to Setup Team-wise Storage (need help) @Binyang2014
    • NFS server (storage manager)
    • Samba
    • AzureFile
    • Azureblob
  • How to Setup Virtual Clusters
    • Introduction to hivedscheduler and how to configure it
    • Example: single-type GPUs in different VCs
    • Example: multi-type GPUs in different VCs
    • How to grant VC to user groups (both AAD mode and auth mode)
  • How to Add / Remove Node (need help) @ydye
    • Guide to add / remove nodes (add link to "How to Setup Virtual Clusters" because hivedscheduler config needs change, too)
    • Example: set up multi-type GPUs and split them into multiple Virtual Clusters
  • How to use CPU Nodes
    • How to add CPU nodes and setup CPU-only VC
    • How to submit CPU-only job
  • How to customize cluster by plugins
  • Troubleshooting (need help)
    • What if GPU is not detected
    • What if some node is lost
    • What if some PAI service is not working
  • How to Uninstall OpenPAI
  • Upgrade Guide (needed in future)

Contents for PAI developer Manual

Contents for design documentation (to be updated @hzy46 )

  • OpenPAI overview
  • OpenPAI architecture and component introduction
@fanyangCS
Contributor

fanyangCS commented Jan 2, 2020

related #2130 and #1763
