Add trt support for BF16 #195

andompesta · 2024-11-14T22:47:24Z

This pull request aims to add support for TensortRT integration

Summary of Changes

CLI Support for TensorRT
- Added a new variable in src/flux/cli.py to enable TensorRT inference.
- environment variable TRT_ENGINE_DIR specifies the directory for storing TensorRT engines
- environment variable ONNX_DIRspecifies the directory for ONNX model exports.
- surrently supports the bf16 (fp16 and fp8 coming soon)
- supports for model offloading can be added
Modifications for ONNX Export
- Minor changes to src/flux/math and src/flux/modules/autoencoder.py to enable proper export in ONNX
- Additional changes are required to address numerical stability issues within the Flux-Transformer model.
TensorRT Exporter
- Added the flux/trt/exporter package, containing code to export PyTorch models to ONNX and build TensorRT engines.
TensorRT Engine Execution
- flux/trt/engine package is responsible to execute inference using TRT
TensorRT Mixin Classes
- Added the flux/trt/mixin package with mixin classes to share parameters between the model building and inference phases.
TensorRT Manager
- Introduced flux/trt/trt_manager.py as the main TensorRT management class. It handles the conversion of PyTorch models to TensorRT engines and manages the TensorRT context for inference.

https://replicate.com/collections/flux-fine-tunes

Co-authored-by: Neil Movva <[email protected]>

* Remove unused import * Remove extraneous `f` prefix --------- Co-authored-by: Emil Sadek <[email protected]>

Add trt support push

Add trt support cli conflict

zeke and others added 30 commits August 29, 2024 13:56

Add link to fine-tunes collection on Replicate (black-forest-labs#130)

a4b0f13

https://replicate.com/collections/flux-fine-tunes

Add Torch CUDA sync to fix timing code in cli.py (black-forest-labs#147)

ed51d5e

Co-authored-by: Neil Movva <[email protected]>

Update API interface for FLUX.1.1 [pro]

bc22ee3

CLI: /n is for steps, not seeds (black-forest-labs#169)

c5ebf2b

Update README.md

933e54a

Update README.md

d171e39

Remove unused import and extraneous f prefix (black-forest-labs#171)

a94a546

* Remove unused import * Remove extraneous `f` prefix --------- Co-authored-by: Emil Sadek <[email protected]>

update readme for 1.1

16fc5e2

add question logger

f8747c2

add cli input for TRT support

bf71e81

initial support for TRT engine builder

223017d

add onnx export functions taken from

fd7057e

base class for convert to onnx

0713049

add missing dependencies

d0ba09a

fix imports

959eaa7

moved to wrappers package

50149bd

moved to wrapper package and renamed into base wrapper

71c1b0d

implement load engines function

407a3ac

remove old wrapper

ca691f1

add additional parameters to base class constructor

78fbb25

implemented CLIP wrapper

3548085

remove model as a property

6b01977

enable float16 optimization

2477223

reorder arguments

969e06d

first wrapper fir onnx build

f8a183b

add load_engines with minimal parameters

5572ae7

fix get_sample_input interface format and add get_model_to_trace

713a66e

fix stage name

b4bebd2

ad imports for wrappers

e56078c

set assert error message for missing stage

f1fa53b

andompesta and others added 17 commits November 14, 2024 22:36

remove trt dependencies from toml

63e29cc

rename requirements and fix readme

c978cc3

remove unused files

3087c60

fix import format

5c2cba1

remove comments

08fbb60

add gitignore

1b4a41a

reset dependencies

a404144

add hidden setup files

a8b8478

solve ruff check

8fa1d22

fix imports with rufs

3f20508

run ruff formatter

7662313

update gitignore

4691502

simplify dependencies

deb5633

remove gitignore

1de2799

add cli formatting

64cbb8f

fix import orders

fd1455e

Merge pull request #1 from andompesta/add-trt-support-push

095ee89

Add trt support push

This was referenced Nov 24, 2024

Awful Flux Fill results. (blurry, grainy results) comfyanonymous/ComfyUI#5746

Closed

Flux fill is prone to crashing comfyanonymous/ComfyUI#5715

Open

andompesta and others added 10 commits November 26, 2024 13:38

simplify dependencies

3d3741e

solve vae quality issue

f31ffd4

Merge branch 'main' of https://github.com/black-forest-labs/flux

728c018

Merge branch 'main' into add-trt-support

1cd9476

Merge branch 'main' into add-trt-support-cli-conflict

bee6c45

fix ruff format

f80058f

fix merge changes

079778f

format and sort src/flux/cli

a5986b5

fix merge conflicts

c7fdb64

Merge pull request #2 from andompesta/add-trt-support-cli-conflict

74c4c7a

Add trt support cli conflict

andompesta marked this pull request as ready for review November 26, 2024 13:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add trt support for BF16 #195

Add trt support for BF16 #195

andompesta commented Nov 14, 2024

Add trt support for BF16 #195

Are you sure you want to change the base?

Add trt support for BF16 #195

Conversation

andompesta commented Nov 14, 2024

Summary of Changes