Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider supporting FOSS LLMs #58

Open
yuiseki opened this issue Jul 5, 2023 · 8 comments
Open

Consider supporting FOSS LLMs #58

yuiseki opened this issue Jul 5, 2023 · 8 comments
Assignees

Comments

@yuiseki
Copy link
Owner

yuiseki commented Jul 5, 2023

Currently, TRIDENT is only based on the OpenAI API as LLMs.
The OpenAI API is difficult to freely customize and control the base model.
Supporting FOSS LLMs should make it possible to develop TRIDENT as a fair and transparent FOSS AI assistant.

@kshitijrajsharma
Copy link

Hi @yuiseki , Lovely you have already opened issue on this , Some open source LLM on your mind that you are looking to do research on ?

@yuiseki
Copy link
Owner Author

yuiseki commented Jul 24, 2023

@kshitijrajsharma Thanks for your comment! I have already done some research on some OSS LLMs. I share an overview of them below.

In summary, I have started my trial and error with most attention to llama.cpp and OpenLLaMA, but I will continue to keep an eye on LLaMA 2.

@yuiseki yuiseki changed the title Consider supporting FOSS LLMs such as Stable LM Consider supporting FOSS LLMs Jul 24, 2023
@yuiseki yuiseki self-assigned this Jul 24, 2023
@kshitijrajsharma
Copy link

Lovely , Are you seeking for help on any of these ? I can checkout few
Can you lay out what should I check and test ?

@kshitijrajsharma
Copy link

I get little bit hands on LLama model training :

Here is the sample dataset that can be used to train LLama with RLHF

https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences

Now we need similar training dataset to be prepared for overpass questions and query ! There will be some challenges : We might search for training data , in order to retrain or play with it , it needs massive GPU and machine , I tried a demo with collab couldn't go through on free version

Two references :
https://lightning.ai/pages/community/tutorial/accelerating-llama-with-fabric-a-comprehensive-guide-to-training-and-fine-tuning-llama/
https://huggingface.co/blog/stackllama

Looks like it can run on 8GB of GPU which is good a standard personal computer nowadays have this , but a Solid GPU and training dataset is needed ,Training dataset is something we can generate by asking community , bootstrapping overpass query examples , Challenge is the machine

@yuiseki
Copy link
Owner Author

yuiseki commented Jul 27, 2023

@kshitijrajsharma Wow! I am amazed at your quickness! Sorry for my delay in responding to you.

I was thinking of having 👍 and 👎 buttons on the frontend user interface of TRIDENT, like the ones on ChatGPT, to actively receive feedback from users. It should allow the model to be continuously improved by building and updating datasets that paired natural language sentences entered by the user and Overpass API queries generated by the model.

But, currently, TRIDENT does not yet have a database to store data permanently, so this will be a major change. So that will be a time-consuming development process.

If you will find the dataset that paired natural language sentences and Overpass API query, please let me know.
Narrowing it down from the Stackoverflow question and answer data set seems like a very realistic idea.

https://stackoverflow.com/questions/tagged/overpass-api
https://stackoverflow.com/questions/tagged/overpass-api?tab=Votes
https://stackoverflow.com/search?tab=votes&q=overpass&searchOn=3

My PC has a 12GB VRAM GPU so I would be able to run the training.
I also have a contract with Google Colaboratory Pro.

@kshitijrajsharma
Copy link

Does stackoverflow provides API to collect questions and answers ? I need to check license if it enables us to pull data from , I heard stackoverflow is also coming up with its own AI so assuming we might get data to train , if not we can store the result and query from the tool itself , We can design a small python service with database table attach it to the app and include like and dislike button . This seems good idea this will help us collect query and questions tagged with like button and can be used to retrain other LM

@yuiseki
Copy link
Owner Author

yuiseki commented Apr 8, 2024

@kshitijrajsharma Forgive me for contacting you at such a sudden moment. This Issue is my only connection to you. I feel I need your advice now.

  • I now have the most powerful computing resource of my life.
  • This is
    • Prize of the gold winner of Local AI Hackathon in Japan.
    • Exclusive use rights until the end of April.
    • 32 core, 64 thread CPU, 128 GB RAM.
    • 8 GPUs, total 192GB VRAM.
    • However, all results calculated with this computing resource must be published as OSS.
  • I am now swamped with this computational resource.
  • I feel I have already done enough of what I need to do.
  • Please let me know if there is anything else I should accomplish with this computing resource.

Related:

@kshitijrajsharma
Copy link

kshitijrajsharma commented Apr 16, 2024

@yuiseki Sorry I was away previous week
Thats awesome
While you have the resources , I would expect may be you can try running some of the llma with sample spatial queries to start with ? Have you found and updates regarding that issues . How do they performed ? Any closer ??

If you need training data probably this is something we can generate

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants