We actively encourage community contributions to DeepChem. The first
place to start getting involved is
the tutorials.
Afterwards, we encourage contributors to give a shot to improving our documentation.
While we take effort to provide good docs, there's plenty of room
for improvement. All docs are hosted on Github, either in README.md
file, or in the docs/
directory.
Once you've got a sense of how the package works, we encourage the use of Github issues to discuss more complex changes, raise requests for new features or propose changes to the global architecture of DeepChem. Once consensus is reached on the issue, please submit a PR with proposed modifications. All contributed code to DeepChem will be reviewed by a member of the DeepChem team, so please make sure your code style and documentation style match our guidelines!
To develop DeepChem on your machine, we recommend using Anaconda for managing packages. If you want to manage multiple builds of DeepChem, you can make use of conda environments to maintain seperate Python package environments, each of which can be tied to a specific build of DeepChem. Here are some tips to get started:
- Fork the DeepChem repository and clone the forked repository
git clone https://github.com/YOUR-USERNAME/deepchem.git
cd deepchem
1.1. If you already have DeepChem from source, update it by running
git fetch upstream
git rebase upstream/master
- Set up a new conda environment for DeepChem
conda create -n deepchem python=3.8
conda activate deepchem
2.1. DeepChem provides backend support for deep learning using tensorflow, pytorch or jax. By default, the deep learning frameworks gets installed in CPU. If GPU support is required, make sure CUDA is installed and then install the desired deep learning framework before installing DeepChem:
- Install DeepChem in
develop
mode
python setup.py develop
This mode will symlink the Python files from current local source tree into the Python install. Hence, if you modify a Python file, you do not need to reinstall DeepChem again and again.
In case you need to reinstall, uninstall DeepChem first by running
pip uninstall deepchem
until you see Warning: Skipping deepchem as it is not installed
; run python setup.py clean
and install in develop
mode again.
Some other tips:
- Every contribution must pass the unit tests. Some tests are
marked with custom
markers like
@pytest.mark.tensorflow
. This helps mainly in two ways: 1) restricting the tests only to the part of code marked with the marker 2) giving flexibility in running the unit tests depending on the environment. - DeepChem has a number of soft requirements which can be found here.
- If a commit is simple and doesn't affect any code (keep in mind that some
docstrings contain code that is used in tests), you can add
[skip ci]
(case sensitive) somewhere in your commit message to skip all build / test steps. Note that changing the pull request body or title on GitHub itself has no effect.
Every contribution, must be a pull request and must have adequate time for review by other committers.
A member of the Technical Steering Committee will review the pull request. The default path of every contribution should be to merge. The discussion, review, and merge process should be designed as corrections that move the contribution into the path to merge. Once there are no more corrections, (dissent) changes should merge without further process.
On successful merge the author will be added as a member of the DeepChem organization.
DeepChem uses these tools or styles for keeping our codes healthy.
- YAPF (code format)
- Flake8 (code style check)
- mypy (type check)
- doctest (interactive examples)
- pytest (unit testing)
Before making a PR, please check your codes using them. You can confirm how to check your codes from Coding Conventions.
DeepChem uses Sphinx to build the document. The document is automatically built by Numpy style docstrings in source codes and Napoleon extension. For any changes or modification to source code in a PR, please don't forget to add or modify Numpy style docstrings.
Contributor offers to license certain software (a “Contribution” or multiple “Contributions”) to DeepChem, and DeepChem agrees to accept said Contributions, under the terms of the open source license The MIT License
The Contributor understands and agrees that DeepChem shall have the irrevocable and perpetual right to make and distribute copies of any Contribution, as well as to create and distribute collective works and derivative works of any Contribution, under The MIT License.
DeepChem understands and agrees that Contributor retains copyright in its Contributions. Nothing in this Contributor Agreement shall be interpreted to prohibit Contributor from licensing its Contributions under different terms from the The MIT License or this Contributor Agreement.
The Technical Steering Committee admits and oversees all top-level of DeepChem.
The TSC exercises autonomy in setting up and maintaining procedures, policies, and management and administrative structures as it deems appropriate for the maintenance and operation of these projects and resources.
Included in the responsibilities of the TSC are:
- Managing code and documentation creation and changes for the listed projects and resources
- Performing code reviews on incoming pull requests and merging suitable code changes.
- Setting and maintaining standards covering contributions of code, documentation and other materials
- Managing code and binary releases: types, schedules, frequency, delivery mechanisms
- Making decisions regarding dependencies of DeepChem, including what those dependencies are and how they are bundled with source code and releases
- Creating new repositories and projects under the deepchem GitHub organization as required
- Setting overall technical direction for the DeepChem project, including high-level goals and low-level specifics regarding features and functionality
- Setting and maintaining appropriate standards for community discourse via the various mediums under TSC control (gitter, facebook, blog)
Members of the TSC will meet regularly (over phone or video conferencing) to coordinate efforts. Minutes from the TSC meetings will be published publicly on an ongoing basis. The current members of the TSC are (alphabetically)
- Peter Eastman
- Karl Leswing
- Bharath Ramsundar
- Zhenqin Wu
If you want to join the technical steering committee, you will need to submit an application. The application process is relatively lightweight: submit a one page document discussing your past contributions to DeepChem and propose potential projects you could commit to working on as a member of the steering committee. Note that steering committee membership comes with responsibilities. In particular, you will need to commit to spending about 10 hours a week working on DeepChem. The committee will review your application, and if suitable, will accept you as a probationary member of the TSC. Your application will be posted publicly to the DeepChem blog if accepted. Membership on the committee will be confirmed after 6 months if you’ve successfully implemented some of your proposed projects and demonstrated your ability to meet the necessary time commitment.