Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

NVIDIA / FasterTransformer Public

Notifications You must be signed in to change notification settings
Fork 894
Star 5.9k

Code
Issues 249
Pull requests 40
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Pull requests: NVIDIA/FasterTransformer

Labels 9 Milestones 0

Labels 9 Milestones 0

New pull request New

40 Open 129 Closed

40 Open 129 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

fix: fix position_encoding_table memory error.

#791 opened Mar 27, 2024 by johnson-magic

Loading…

Update README.md

#776 opened Oct 29, 2023 by eltociear

Loading…

Fix shape mismatch on the masked_tokens param in decoder masked multi-head attention kernel.

#773 opened Oct 24, 2023 by FengDSP

Loading…

Include stdio.h

#770 opened Oct 19, 2023 by JihaoXin

Loading…

2

Ft llama opt

#762 opened Oct 2, 2023 by dypshong

Loading…

Support Seq length up to 8K

#756 opened Sep 4, 2023 by zhen-jia

Loading…

[BugFix] GPT inference error when pipeline_para_size > 1 and int8_mode != 0

#750 opened Aug 23, 2023 by 00why00

Loading…

[Bugfix] GptJ & GptNeoX batch inference error

#742 opened Aug 11, 2023 by YZP17121579

Loading…

1

Add fusion-for-decoder-only for llama

#733 opened Jul 28, 2023 by binxuan

Loading…

Fix beam search output_log_prob index error

#732 opened Jul 25, 2023 by cpm0722

Loading…

[Doc] Add projects section in README which is developed based on FasterTransformer

#731 opened Jul 25, 2023 by lvhan028

Loading…

2

Add triton fastertransformer backend support for deberta

#725 opened Jul 19, 2023 by sfc-gh-zhwang

Loading…

1

Add cuDNN include path as a common include dir

#724 opened Jul 18, 2023 by jacobkahn

Loading…

fix: initialize tiled_prompt_lengths_buf_ to zero in gptneox

#716 opened Jul 13, 2023 by yandai

Loading…

Remove parenthesis from asserts

#699 opened Jul 2, 2023 by miguelusque

Loading…

Huggingface gptj convert script supports sharded checkpoint

#695 opened Jun 29, 2023 by skyser2003

Loading…

[Doc] Fix typo in gpt_guide.md

#682 opened Jun 26, 2023 by myry96

Loading…

swin-transformer quantization readme files changes

#675 opened Jun 16, 2023 by Mhhhaster

Loading…

fix: fix Qk_vec_acum_fp32_ has already been declared

#659 opened Jun 9, 2023 by lkm2835

Loading…

gptneox & gptj int8 quantization & share context

#653 opened Jun 7, 2023 by rahuan

Loading…

Add missing headers

#648 opened Jun 1, 2023 by brian14708

Loading…

Fix TOC of gptneox_guide.md

#633 opened May 23, 2023 by xu-song

Loading…

Update gpt_guide.md: documentation link is invalid

#620 opened May 22, 2023 by treycheng

Loading…

fix multi-gpu build

#616 opened May 17, 2023 by dskhudia

Loading…

1

Fix mpi library linking issue

#612 opened May 16, 2023 by liangfu

Loading…

Previous 1 2 Next

Previous Next

ProTip! Updated in the last three days: updated:>2024-11-17.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.