diff --git a/README.md b/README.md index 4a2e8f46..2154fe62 100644 --- a/README.md +++ b/README.md @@ -64,7 +64,30 @@ mse = np.square(C_halut - C).mean() print(mse) ``` -# Hardware - OpenROAD flow results from CI - NOT OPTIMIZED +## Install + +```bash +# install conda environment & activate +# mamba is recommended for faster install +conda env create -f environment_gpu.yml +conda activate halutmatmul + +# IIS prefixed env +conda env create -f environment_gpu.yml --prefix /scratch/janniss/conda/halutmatmul_gpu +``` + +# Citation + +```bibtex +@article{schonleber2023stella, + title={Stella Nera: Achieving 161 TOp/s/W with Multiplier-free DNN Acceleration based on Approximate Matrix Multiplication}, + author={Sch{\"o}nleber, Jannis and Cavigelli, Lukas and Andri, Renzo and Perotti, Matteo and Benini, Luca}, + journal={arXiv preprint arXiv:2311.10207}, + year={2023} +} +``` + +## Hardware - OpenROAD flow results from CI - NOT OPTIMIZED All completely open hardware results are NOT OPTIMIZED! The results are only for reference and to show the flow works. @@ -94,8 +117,8 @@ ACC_TYPE=INT DATA_WIDTH=8 NUM_M=8 NUM_DECODER_UNITS=4 NUM_C=16 make halut-open-s | Voltage [V] | 1.1 | | Util [%] | 50.4 | | TNS | 0 | -| Clock Net | Clock Net | -| Routing | Routing | +| Clock Net | Clock Net | +| Routing | Routing | | GDS | [GDS Download](https://raw.githubusercontent.com/joennlae/halutmatmul-openroad-reports/main/latest/nangate45/halut_matmul/results/6_final.gds) | @@ -109,8 +132,8 @@ ACC_TYPE=INT DATA_WIDTH=8 NUM_M=8 NUM_DECODER_UNITS=4 NUM_C=16 make halut-open-s | Voltage [V] | 1.1 | | Util [%] | 48.7 | | TNS | 0 | -| Clock Net | Clock Net | -| Routing | Routing | +| Clock Net | Clock Net | +| Routing | Routing | | GDS | [GDS Download](https://raw.githubusercontent.com/joennlae/halutmatmul-openroad-reports/main/latest/nangate45/halut_encoder_4/results/6_final.gds) | @@ -124,34 +147,11 @@ ACC_TYPE=INT DATA_WIDTH=8 NUM_M=8 NUM_DECODER_UNITS=4 NUM_C=16 make halut-open-s | Voltage [V] | 1.1 | | Util [%] | 52.1 | | TNS | 0 | -| Clock Net | Clock Net | -| Routing | Routing | +| Clock Net | Clock Net | +| Routing | Routing | | GDS | [GDS Download](https://raw.githubusercontent.com/joennlae/halutmatmul-openroad-reports/main/latest/nangate45/halut_decoder/results/6_final.gds) | -## Install - -```bash -# install conda environment & activate -# mamba is recommended for faster install -conda env create -f environment_gpu.yml -conda activate halutmatmul - -# IIS prefixed env -conda env create -f environment_gpu.yml --prefix /scratch/janniss/conda/halutmatmul_gpu -``` - -# Citation - -```bibtex -@article{schonleber2023stella, - title={Stella Nera: Achieving 161 TOp/s/W with Multiplier-free DNN Acceleration based on Approximate Matrix Multiplication}, - author={Sch{\"o}nleber, Jannis and Cavigelli, Lukas and Andri, Renzo and Perotti, Matteo and Benini, Luca}, - journal={arXiv preprint arXiv:2311.10207}, - year={2023} -} -``` - ### References * [arXiv](https://arxiv.org/abs/2106.10860) Maddness paper