-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consider supporting FOSS LLMs #58
Comments
Hi @yuiseki , Lovely you have already opened issue on this , Some open source LLM on your mind that you are looking to do research on ? |
@kshitijrajsharma Thanks for your comment! I have already done some research on some OSS LLMs. I share an overview of them below.
In summary, I have started my trial and error with most attention to llama.cpp and OpenLLaMA, but I will continue to keep an eye on LLaMA 2. |
Lovely , Are you seeking for help on any of these ? I can checkout few |
I get little bit hands on LLama model training : Here is the sample dataset that can be used to train LLama with RLHF https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences Now we need similar training dataset to be prepared for overpass questions and query ! There will be some challenges : We might search for training data , in order to retrain or play with it , it needs massive GPU and machine , I tried a demo with collab couldn't go through on free version Two references : Looks like it can run on 8GB of GPU which is good a standard personal computer nowadays have this , but a Solid GPU and training dataset is needed ,Training dataset is something we can generate by asking community , bootstrapping overpass query examples , Challenge is the machine |
@kshitijrajsharma Wow! I am amazed at your quickness! Sorry for my delay in responding to you. I was thinking of having 👍 and 👎 buttons on the frontend user interface of TRIDENT, like the ones on ChatGPT, to actively receive feedback from users. It should allow the model to be continuously improved by building and updating datasets that paired natural language sentences entered by the user and Overpass API queries generated by the model. But, currently, TRIDENT does not yet have a database to store data permanently, so this will be a major change. So that will be a time-consuming development process. If you will find the dataset that paired natural language sentences and Overpass API query, please let me know. https://stackoverflow.com/questions/tagged/overpass-api My PC has a 12GB VRAM GPU so I would be able to run the training. |
Does stackoverflow provides API to collect questions and answers ? I need to check license if it enables us to pull data from , I heard stackoverflow is also coming up with its own AI so assuming we might get data to train , if not we can store the result and query from the tool itself , We can design a small python service with database table attach it to the app and include like and dislike button . This seems good idea this will help us collect query and questions tagged with like button and can be used to retrain other LM |
@kshitijrajsharma Forgive me for contacting you at such a sudden moment. This Issue is my only connection to you. I feel I need your advice now.
Related: |
@yuiseki Sorry I was away previous week If you need training data probably this is something we can generate |
Currently, TRIDENT is only based on the OpenAI API as LLMs.
The OpenAI API is difficult to freely customize and control the base model.
Supporting FOSS LLMs should make it possible to develop TRIDENT as a fair and transparent FOSS AI assistant.
The text was updated successfully, but these errors were encountered: