Text-Generation-Inference v1.0+ new license: HFOIL 1.0 #726
Hello, is it a similar license as …?
@dongs0104 Correct.
TGI depends on vLLM, which is licensed under Apache. What if vLLM changes their license for the same reason and forbids TGI from using vLLM?
TGI depends on our fork of vLLM, which is Apache 2.0.
Transitioning to HFOIL 1.0 is understandable, but please consider introducing both free/growth and enterprise plans. It would be great if I could pass an access token to the args and have it function as a license key. I believe you could collect this and this for telemetry. Here are the benefits I foresee: …
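(For concreteness, here is a purely hypothetical sketch of the proposed flow. TGI has no `--license-token` flag; the wrapper, the validation endpoint, and the payload below are all invented to illustrate the suggestion.)

```python
# Hypothetical only: TGI's launcher has no --license-token flag; this wrapper,
# the validation URL, and the payload are invented to illustrate the proposal.
import argparse

import requests

parser = argparse.ArgumentParser(description="hypothetical TGI launcher wrapper")
parser.add_argument("--license-token", help="access token doubling as a license key")
parser.add_argument("--model-id", required=True)
args = parser.parse_args()

if args.license_token:
    # Proposed flow: validate the token with Hugging Face and report usage
    # telemetry under that account, instead of negotiating a contract by email.
    resp = requests.post(
        "https://huggingface.co/api/validate-license",  # invented endpoint
        json={"token": args.license_token, "model_id": args.model_id},
        timeout=10,
    )
    resp.raise_for_status()
    print("License validated; usage would be reported under this account.")
```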
@shibanovp I think you are misunderstanding the terms of the license. You can continue using TGI for free, without ever interacting with our sales team, if the service you are building or selling is not a container-serving engine tightly integrated with TGI (for example, a serverless TGI experience where users can pick a Hub model and it automatically starts a TGI container for them, like we offer in Inference Endpoints). Then, AND ONLY THEN, would you need to contact us to get a license, IF AND ONLY IF you want to use TGI 1.0+. To summarize:
I think if you write in the README that this is not open source software, then contributors will not be confused.
@rom1504 all contributors that we could contact received this information days in advance. We warn new contributors about the license change directly in their PRs when reviewing them (like in #617), and we might add a proper Contributor License Agreement in the future. Some contributors decided to close their PRs given the new license (#529, #719). What type of language would you like to see in the README?
@OlivierDehaene
@shibanovp I do not think I am the best interlocutor to answer such a precise question. Are you OK with contacting us at [email protected] to loop in our legal team?
I thought as much... This is what I was referring to in #726 (comment). The workflow I would prefer is as follows:
However, the current workflow is:
@OlivierDehaene "This tool is not opensource. We provide open access via license XYZ" |
Hi @julien-c 👋 Hi @OlivierDehaene 👋 Bad news 😭 Got it! You're trying to sell to big companies, but that might harm small startups. Please add a volume-based license. It will apply to the big companies making significant profits while still supporting smaller businesses.
@shibanovp now that you have explained your use case, this kind of telemetry is really interesting and an integration that we will consider for sure!
@camenduru Startups taking a severe hit is just collateral damage; I don't believe the intention was to penalize everyone who is not using this feature. @OlivierDehaene This makes things complex for teams without legal experts. NDAs must change, since I have to share my clients' work with you. Startups might suffer as they stick to …
A big thank you to @OlivierDehaene and @Narsil for creating this awesome project. I understand Hugging Face's motivation to go closed source. I was the maintainer of HHVM's OSS version when I was working for Meta. We were in a similar situation, where the only notable external user of HHVM was one of Meta's competitors. Meta finally removed my team and eliminated most of the OSS support workload for HHVM. Currently I am working for Preemo. We use text-generation-inference internally in a way that might be incompatible with the new license, so we decided to create an open-source fork of text-generation-inference: https://github.com/Preemo-Inc/text-generation-inference I am pretty familiar with the code base, and we will actively maintain it given that we are using it internally. Forking text-generation-inference is not an altogether bad thing.
I plan to redirect my unreviewed pull requests to the new repository, and I anticipate we will export other internal Preemo changes as public pull requests in the next couple of weeks. Currently, text-generation-inference's CI depends on Hugging Face's private infrastructure, and it surely would not work in the new repository. We plan to fix the CI in the next couple of weeks. You are welcome to contribute to the open source fork of text-generation-inference!
@Atry the reason we were not reviewing/merging your proposals is that we have been discussing this license change internally for a long time, and we felt that adding new contributors would make this process harder. You are more than welcome to fork TGI; a lot of great projects (Grafana comes to mind, for example) started this way. Thank you for your interest in TGI in the first place, and for the kind words.
Since there is a lot of criticism about this step, maybe you also want to hear some positive feedback. Maybe it's just not explained clearly enough; what do you think about adding some examples of real-world cases?
I mean, Hugging Face was entirely built on open source. Imagine if PyTorch came out one day with "you can still use PyTorch for commercial purposes, but if you are planning to just wrap some PyTorch functions to build a framework and a commercial service based on it, you need to get a license".
Hi @OlivierDehaene, questions:
1. Can this project be upgraded to v1.0+? Is it restricted by this license?
2. If Q1 is not restricted, can we use this image with TGI 1.0+ to deploy our own model on AWS? Is it restricted by this license?
@flozi00 We have created a FAQ which hopefully is clearer: #744. Do you think it's more understandable? As stated in the FAQ, I think our biggest mistake was not making the change sooner, or not making our intent public sooner. But we didn't have the license yet, and trying to figure out how we could keep the code open and protect our business at the same time took time.
Yeah, looks pretty good now.
Maybe a better option would have been to keep it Apache 2.0 and create an enterprise version on top, for which you provide a commercial license, like Red Hat and other companies that offer enterprise value on top of open source. Now, in this new model, why would a startup building a product use the open source version, knowing that it has an unknown commercial license?
We at Scale have decided to create a fork of text-generation-inference. This is a dependency we use inside of LLM Engine, which is our open source framework for serving and fine-tuning open source LLMs on your own infrastructure. We're committing to keeping both Apache 2.0 licensed.
@flozi00 The license isn't specific in its terms to Hugging Face. @jcrupi This has been discussed internally; however, doing that would mean keeping the best of our features reserved for commercial usage, not putting them in open source, and creating an internal fork. We could even have made it closed source and used only that for our own offerings. However, we felt that this wasn't really helping the community at large. 0.9.4 is still available for all without any restrictions.
Given this license change, we also created a fork of text-generation-inference here: … We are using TGI in our inference service. We will keep this fork under Apache 2.0.
The OS community was converging on TGI since, unlike competitors, it provides all the new algorithmic innovations in one bundle: tensor sharding, quantization, continuous batching, etc. It could have become the standard library for deployment, if it wasn't already. It wasn't "HF pays for development", but the very opposite.
If this was the case, HF could have made a condition based on the size of the company. Just copy Llama 2's clause 2: "...greater than 700 million monthly active users in the preceding calendar month, you must request a license from Meta...". Even better, make a general license based on this that could be adopted industry-wide. As is, HF is expecting to get free OS contributions while, at the same time, getting its revenue from OS startups. This should have been thought out better; it doesn't benefit Hugging Face or the OS community.
Hi, we are currently using tgi-1.0.0 as a backend to build a chatbot for our clients. Do we need to inform your sales team about this?
Hi, I work for a document AI company, where we want to use TGI to serve LLMs for a number of use cases related to document data extraction, chat-with-your-docs, document summarization, and other document-related use cases. We have no plans to give clients direct access to the LLMs; instead, these would sit behind an application layer designed to incorporate the power of LLMs for predefined use cases. Would we need to request a license for this purpose?
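(For context on what "behind an application layer" typically means in practice, here is a minimal sketch of such a layer calling TGI's REST `/generate` endpoint. The container address and the prompt template are assumptions for illustration.)

```python
# Minimal sketch: an application layer calling TGI's REST API instead of
# exposing the LLM directly. The URL and prompt template are assumptions.
import requests

TGI_URL = "http://localhost:8080"  # assumed address of the TGI container

def summarize(document: str) -> str:
    """Send a predefined summarization prompt to TGI's /generate endpoint."""
    payload = {
        "inputs": f"Summarize the following document:\n\n{document}\n\nSummary:",
        "parameters": {"max_new_tokens": 256, "temperature": 0.2},
    }
    resp = requests.post(f"{TGI_URL}/generate", json=payload, timeout=60)
    resp.raise_for_status()
    return resp.json()["generated_text"]

print(summarize("TGI v1.0 is released under a new license, HFOIL 1.0."))
```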
Practically anything built on any cloud today and made available to users in a browser counts as "a hosted or managed service". I do not begrudge HF for trying to prevent managed-service competitors from using their own code, BUT there is enough ambiguity in the actual licensing language that it would be irresponsible for any SaaS vendor to use this without a license from HF.
Does this license also restrict commercial usage of AWS DLCs with TGI v >= 1.0?
@abdullahsych no, your usage of TGI v1.0+ within the Hugging Face Text Generation Inference DLCs on SageMaker is not restricted by the HFOIL 1.0 license, thanks to the agreement between Hugging Face and Amazon.
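(For reference, deploying those DLCs usually looks something like the sketch below, using the `sagemaker` Python SDK. The container version, model ID, and instance type are illustrative, not prescriptive.)

```python
# Sketch of deploying the TGI DLC on SageMaker with the sagemaker Python SDK.
# Version, model ID, and instance type are illustrative, not prescriptive.
import sagemaker
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

role = sagemaker.get_execution_role()  # or an explicit IAM role ARN

# Resolve the Hugging Face TGI container image for the current region.
image_uri = get_huggingface_llm_image_uri("huggingface", version="1.0.3")

model = HuggingFaceModel(
    image_uri=image_uri,
    role=role,
    env={
        "HF_MODEL_ID": "tiiuae/falcon-7b-instruct",  # illustrative model
        "MAX_INPUT_LENGTH": "1024",
        "MAX_TOTAL_TOKENS": "2048",
    },
)

predictor = model.deploy(initial_instance_count=1, instance_type="ml.g5.2xlarge")
print(predictor.predict({"inputs": "Hello TGI on SageMaker"}))
```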
Which cloud providers besides AWS does this comment include? Also, for example, Azure?
Has this plan ever worked in the long run?
At some point, some of the developers will leave Hugging Face and regret this license change, then begin contributing to some open-source fork.
As "substantial" is not well-defined, you have to read this as You may not distribute the Software as a hosted or managed, and paid service. I've been experimenting with VLLM and TGI somewhat in parallel. I will focus on VLLM now. |
I am very happy to announce that the license was reverted to Apache 2.0. This concerns both TGI and the Text Embeddings Inference repository. We reflected a lot on the change to HFOIL since July. We understand how alienating this change must have felt for our users, and we are sorry. At the time, we felt that this change was necessary to safeguard our inference solutions from larger companies. A lot of things have changed in the past year, and the reasons we made the change in the first place are not applicable today. This decision is irreversible; the repository will remain under the Apache 2.0 license for all forthcoming releases. The team and I are super excited by this change and for the future of Hugging Face inference solutions :)
Text-Generation-Inference, aka TGI, is a project we started earlier this year to power optimized inference of Large Language Models, as an internal tool to power LLM inference on the Hugging Face Inference API and later Hugging Chat. Since then it has become a crucial component of our commercial products (like Inference Endpoints) and that of our commercial partners, like Amazon SageMaker, Azure Machine Learning and IBM watsonx. At the same time, the project quickly grew in popularity and was adopted by other open source projects like Open-Assistant and nat.dev.
TGI v1.0 new license: HFOIL 1.0
We are releasing TGI v1.0 under a new license: HFOIL 1.0.
All prior versions of TGI remain licensed under Apache 2.0, the last Apache 2.0 version being version 0.9.4.
HFOIL stands for Hugging Face Optimized Inference License, and it has been specifically designed for our optimized inference solutions. While the source code remains accessible, HFOIL is not a true open source license because we added a restriction: to sell a hosted or managed service built on top of TGI, we now require a separate agreement.
You can consult the new license here.
What does this mean for you?
This change in source code licensing has no impact on the overwhelming majority of our user community who use TGI for free. Additionally, both our Inference Endpoint customers and those of our commercial partners will also remain unaffected.
However, it will restrict non-partnered cloud service providers from offering TGI v1.0+ as a service without requesting a license.
To elaborate further:
If you are an existing user of TGI prior to v1.0, your current version is still Apache 2.0 and you can use it commercially without restrictions.
If you are using TGI for personal use or research purposes, the HFOIL 1.0 restrictions do not apply to you.
If you are using TGI for commercial purposes as part of an internal company project (that will not be sold to third parties as a hosted or managed service), the HFOIL 1.0 restrictions do not apply to you.
If you integrate TGI into a hosted or managed service that you sell to customers, then consider requesting a license to upgrade to v1.0 and later versions - you can email us at [email protected] with information about your service.
Why the new license?
TGI started as a project to power our internal products, and we see it as a critical component of our commercial solutions. TGI is not meant as a community-driven project, but as a production solution that’s widely accessible to the community. We want to continue building TGI in the open, and will continue to welcome contributions. But unlike community-driven projects like Transformers and Diffusers focused on making machine learning accessible, TGI is focused on performance and robustness in production contexts, with the goal of building commercial products.
What about Hugging Face contributions to open source?
Our mission as a company is to democratize good machine learning. An important component of democratization is making good machine learning more accessible. We achieve this through community-driven open source projects like Transformers, Diffusers, Datasets, our free courses (Transformers, Diffusers, Audio, RL), and many more libraries collectively garnering about 240k GitHub stars as of this writing. Our long-term commitment to open source has not changed.