Pull requests: huggingface/optimum-habana


Pull requests list

Flux Image-To-Image pipeline
#1524 opened Nov 25, 2024 by dsocek
4 tasks done
add video-llava model support
#1522 opened Nov 25, 2024 by kaixuanliu
Added custom mamba op and fix the mamba cache issue
#1521 opened Nov 22, 2024 by zzhang37
3 tasks
implement fused sdpa for wav2vec2 (#18) (label: run-test)
#1520 opened Nov 22, 2024 by astachowiczhabana
3 tasks
Add support for optimized SDXL pipeline
#1519 opened Nov 22, 2024 by sushildubey171
Add DynamicMoE support for Mixtral (#10)
#1518 opened Nov 22, 2024 by astachowiczhabana
Disable default sdpa in Albert (#22) (label: run-test)
#1517 opened Nov 22, 2024 by astachowiczhabana
Removed workaround for NaN bug causing graph break. (label: run-test)
#1516 opened Nov 22, 2024 by astachowiczhabana
pass "lazy_mode" arg to GaudiLlamaModel GaudiTrainer (label: run-test)
#1515 opened Nov 22, 2024 by astachowiczhabana
Add option to use bf16 in PT sdp (#5)
#1514 opened Nov 22, 2024 by astachowiczhabana
Memory optimization for gpt_bitcode (#4) (label: run-test)
#1513 opened Nov 22, 2024 by astachowiczhabana
add Qwen2-VL static generation
#1512 opened Nov 22, 2024 by Spycsh
1 of 3 tasks
Add DynamicMoE support for Mixtral
#1511 opened Nov 21, 2024 by kwisniewski98
Fixed Gemma FP8 flash_attention lower throughput issue (label: run-test)
#1510 opened Nov 21, 2024 by kplau1128
3 tasks
enable dynamic compile for mpi
#1509 opened Nov 21, 2024 by chaojun-zhang
3 tasks
Migrate OH CLIP (roberta-clip) training to torch.compile (label: run-test)
#1507 opened Nov 21, 2024 by chaojun-zhang
3 tasks
Migrate OH T5-large training to torch.compile (label: run-test)
#1506 opened Nov 21, 2024 by chaojun-zhang
3 tasks
Support FP8 model fallback KVCache to bfloat16
#1505 opened Nov 20, 2024 by changwangss
add fused kernel config support for run_clm.py
#1502 opened Nov 20, 2024 by ranzhejiang
Adding support for Context Parallelism using Deepseed's DistributedAt… (labels: run-test, synapse_1.19_dependency)
#1501 opened Nov 20, 2024 by bhargaveede
3 tasks
add check_neural_compressor_min_version for 4 bit behavior (label: run-test)
#1500 opened Nov 20, 2024 by xin3he
3 tasks
Makes the with_stack of the profiler changeable (label: run-test)
#1497 opened Nov 18, 2024 by ranzhejiang