Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs: add lms for data post #8222

Merged
merged 16 commits into from
Feb 5, 2024
Merged

docs: add lms for data post #8222

merged 16 commits into from
Feb 5, 2024

Conversation

lostmygithubaccount
Copy link
Member

Description of changes

work in progress but code is finalized IMO

follow-up post for local "open source" LMs needed

covers:

  • use cases for language models w/ data
  • demonstration of each use case

Issues closed

@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 4, 2024
@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 4, 2024
@lostmygithubaccount
Copy link
Member Author

forgot to wrap some lines -- want to get feedback, then finish all the prose/cleanup

@ibis-docs-bot
Copy link

ibis-docs-bot bot commented Feb 4, 2024

@lostmygithubaccount lostmygithubaccount added docs-preview Add this label to trigger a docs preview docs Documentation related issues or PRs labels Feb 4, 2024
@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 4, 2024
@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 4, 2024
@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 4, 2024
@ibis-docs-bot
Copy link

ibis-docs-bot bot commented Feb 4, 2024

1 similar comment
@ibis-docs-bot
Copy link

ibis-docs-bot bot commented Feb 4, 2024

@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@ibis-docs-bot
Copy link

ibis-docs-bot bot commented Feb 5, 2024

@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@ibis-docs-bot
Copy link

ibis-docs-bot bot commented Feb 5, 2024

@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 5, 2024

## Natural language processing

This includes tasks like:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think it's powerful to be able to say: extract sentiment of each row, then do a join/groupby, all in GPU (instead of doing sentiment on gpus and join/groupby on cpus)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we're not there with Ibis just yet! soon...


We can think of three approaches to analytical code with language models:

1. Use LMs in an analytic subroutine
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

like an agent?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no, just what's shown above -- basically using LLMs in UDFs

@ibis-docs-bot
Copy link

ibis-docs-bot bot commented Feb 5, 2024

@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@ibis-docs-bot
Copy link

ibis-docs-bot bot commented Feb 5, 2024

@lostmygithubaccount lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@ibis-docs-bot ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Feb 5, 2024
@lostmygithubaccount
Copy link
Member Author

@cpcloud this should be good to merge!

@ibis-docs-bot
Copy link

ibis-docs-bot bot commented Feb 5, 2024

Copy link
Contributor

@jitingxu1 jitingxu1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm.

docs/posts/lms-for-data/index.qmd Show resolved Hide resolved
docs/posts/lms-for-data/index.qmd Outdated Show resolved Hide resolved
- summarization
- translation
- question answering

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is embedding or encoding good use cases in the future?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

you could but I haven't really understood why these are useful (as opposed to storing text)

```

1. Import Ibis, the data engineering toolkit
2. Import Marvin, the AI engineering toolkit
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe some confusion here, "our team" means ibis. I thought something like langchain.

I believe our team already invested lots of time on the toolkit, I am not very sure how popular marvin is.

@lostmygithubaccount
Copy link
Member Author

argh let me fix some of those last things before we merge

@lostmygithubaccount
Copy link
Member Author

ready to merge

@cpcloud cpcloud merged commit 8f35010 into ibis-project:main Feb 5, 2024
18 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
docs Documentation related issues or PRs
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants