Gradient is a machine learning assistance framework that helps developers monitor GPU usage and the progress of model training.
Project Demo Video: https://youtu.be/UsKgrigKVm8
- Monitors GPU usage by reporting general GPU usage statistics while the user's machine learning program is running.
- Tracks the progress of model training by watching the training's iteration loop and the changes in its variables.
- Alerts on error situations such as gradient explosion or out-of-memory errors by sending a push notification to the developer.
Generally, there are three components in this project:
- A Node.js web server, which hosts the website
- A MongoDB server, which stores the data (normally the Node.js web server and the MongoDB server run on the same machine, but you can configure them separately)
- Clients (GPU servers), which send GPU data and training progress data
For the following guide, I will assume that the web server, the MongoDB server, and the GPU server all run on the same machine. (You can still configure them differently; see the connection sketch below.)
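If the MongoDB server lives on a different machine from the clients, the collector and the client library have to be pointed at that host. The snippet below is only a minimal pymongo connection sketch; the hostname, port, and database name are placeholders for illustration, not values defined by this project.

from pymongo import MongoClient

# Hypothetical connection settings -- replace with your deployment's values.
MONGO_HOST = "mongo.example.com"   # machine running the MongoDB server
MONGO_PORT = 27017                 # default MongoDB port
DB_NAME = "gradient"               # placeholder database name

client = MongoClient(MONGO_HOST, MONGO_PORT)
db = client[DB_NAME]

# Quick connectivity check: raises an exception if the server is unreachable.
print(db.command("ping"))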
Set up your virtualenv (or use the global environment, if you wish).
(This guide assumes that MongoDB is installed on localhost.)
$ pip install pymongo
$ python setup.py install
Assuming pymongo is installed and MongoDB is running on localhost:
$ cd gpu_monitor_nvidia/
$ python gpu_status_collector.py
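The collector's job is to sample GPU statistics and push them into MongoDB so the web server can display them. As a rough illustration of that idea (not the actual contents of gpu_status_collector.py; the collection and field names below are made up), a minimal poller built on nvidia-smi and pymongo could look like this:

import datetime
import subprocess
import time

from pymongo import MongoClient

db = MongoClient("localhost", 27017)["gradient"]  # placeholder database name

QUERY = ["nvidia-smi",
         "--query-gpu=index,utilization.gpu,memory.used,memory.total",
         "--format=csv,noheader,nounits"]

while True:
    # nvidia-smi prints one CSV line per GPU, e.g. "0, 35, 2048, 11178"
    for line in subprocess.check_output(QUERY).decode().strip().splitlines():
        idx, util, mem_used, mem_total = [v.strip() for v in line.split(",")]
        db.gpu_status.insert_one({
            "gpu_index": int(idx),
            "utilization": int(util),        # percent
            "memory_used": int(mem_used),    # MiB
            "memory_total": int(mem_total),  # MiB
            "timestamp": datetime.datetime.utcnow(),
        })
    time.sleep(5)  # sampling interval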
$ cd gradient
$ npm install -d
$ gulp dev
import gradient
gradient.registerProgress("DeepMNIST")
(parameters: name)
gradient.updateProgress("DeepMNIST", i, 20000)
(parameters: name, current_iteration, maximum_iteration)
For example, here is the training loop of the TensorFlow Deep MNIST tutorial instrumented with Gradient:

import gradient

sess.run(tf.global_variables_initializer())
# Register this training run with Gradient under the name "DeepMNIST".
gradient.registerProgress("DeepMNIST")
for i in range(20000):
    batch = mnist.train.next_batch(50)
    if i % 100 == 0:
        train_accuracy = accuracy.eval(feed_dict={
            x: batch[0], y_: batch[1], keep_prob: 1.0})
        print("step %d, training accuracy %g" % (i, train_accuracy))
    train_step.run(feed_dict={x: batch[0], y_: batch[1], keep_prob: 0.5})
    # Report the current iteration out of the 20000 total iterations.
    gradient.updateProgress("DeepMNIST", i, 20000)
print("test accuracy %g" % accuracy.eval(feed_dict={
    x: mnist.test.images, y_: mnist.test.labels, keep_prob: 1.0}))
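This README does not show the internals of the gradient client library, but given the MongoDB backend a reasonable guess is that registerProgress and updateProgress simply upsert a progress document that the web server reads. A hedged sketch under that assumption (collection and field names are invented for illustration):

import datetime
from pymongo import MongoClient

_db = MongoClient("localhost", 27017)["gradient"]  # placeholder database name

def registerProgress(name):
    # Create (or reset) a progress document for this training run.
    _db.progress.update_one(
        {"name": name},
        {"$set": {"current": 0, "maximum": None,
                  "started_at": datetime.datetime.utcnow()}},
        upsert=True)

def updateProgress(name, current_iteration, maximum_iteration):
    # Record how far the training loop has advanced.
    _db.progress.update_one(
        {"name": name},
        {"$set": {"current": current_iteration,
                  "maximum": maximum_iteration,
                  "updated_at": datetime.datetime.utcnow()}},
        upsert=True)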