Skip to content

google/sedpack

Sedpack - Scalable and efficient data packing

Coverage Status

Mainly refactored from the SCAAML project.

Available components

  • TODO

Install

Dependencies

To use this library you need to have a working version of TensorFlow 2.x.

Development dependencies:

  • cmake for Rust flate2 with zlib-ng

  • python-dev and gcc for xxhash

Dataset install

Development install

  1. Clone the repository: git clone https://github.com/google/sedpack
  2. Install dependencies: python3 -m pip install --require-hashes -r requirements.txt
  3. Install the package in development mode: python3 -m pip install --editable . (short pip install -e . or legacy python setup.py develop)

Rust install

  • Activate your Python virtual environment
  • Install Rust
  • Run maturin develop --release
  • Run python -m pytest from the project root directory -- no tests should be skipped

Update dependencies

Make sure to have: sudo apt install python3 python3-pip python3-venv and activated the virtual environment.

Install requirements: pip install --require-hashes -r base-tooling-requirements.txt

Update: pip-compile requirements.in --generate-hashes --upgrade and commit requirements.txt.

Package install

pip install sedpack

Tutorial

Tutorials available in the docs/tutorials/ directory. For a "hello world" see docs/tutorials/quick_start/mnist_save.py and docs/tutorials/quick_start/mnist_save.py.

Disclaimer

This is not an official Google product.

About

No description, website, or topics provided.

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published