diff --git a/README.md b/README.md index 296b766c52..d583c3ffb9 100644 --- a/README.md +++ b/README.md @@ -34,6 +34,9 @@ Argilla is a **collaboration platform for AI engineers and domain experts** that require **high-quality outputs, full data ownership, and overall efficiency**. +> [!NOTE] +> This README represents the release candidate for the 2.0.0 SDK version. The README for the last stable version of the 1x SDK can be found [1.x](./argilla-v1/README.md) + If you just want to get started, we recommend our [UI demo](https://demo.argilla.io/sign-in?auth=ZGVtbzoxMjM0NTY3OA%3D%3D) or our [free Hugging Face Spaces deployment integration](https://huggingface.co/new-space?template=argilla/argilla-template-space). Curious, and want to know more? Read our [documentation](https://argilla-io.github.io/argilla/latest/). ## Why use Argilla? diff --git a/argilla-frontend/README.md b/argilla-frontend/README.md index 33d042ba75..d36358ac47 100644 --- a/argilla-frontend/README.md +++ b/argilla-frontend/README.md @@ -1,67 +1,45 @@

Argilla
- ✨ Argilla ✨ + Argilla

+

Work on data together, make your model outputs better!

+

-CI +CI - Codecov -CI +CI - - + +

-

Open-source data curation platform for LLMs

-

MLOps for NLP: from data labeling to model monitoring

- -
- -https://github.com/argilla-io/argilla/assets/1107111/49e28d64-9799-4cac-be49-19dce0f6bd86 -

- - + + - - + + - - + +

-
+Argilla is a **collaboration platform for AI engineers and domain experts** that require **high-quality outputs, full data ownership, and overall efficiency**. -

-

-πŸ“„ Documentation | -πŸš€ Quickstart | -🎼 Cheatsheet | -πŸ«±πŸΎβ€πŸ«²πŸΌ Contribute | -πŸ—ΊοΈ Roadmap -

-

+If you just want to get started, we recommend our [UI demo](https://demo.argilla.io/sign-in?auth=ZGVtbzoxMjM0NTY3OA%3D%3D) or our [free Hugging Face Spaces deployment integration](https://huggingface.co/new-space?template=argilla/argilla-template-space). Curious, and want to know more? Read our [documentation](https://argilla-io.github.io/argilla/latest/). -## πŸš€ Quickstart +This repository only contains developer info about the front end. If you want to get started, we recommend taking a +look at our [main repository](https://github.com/argilla-io/argilla) or our [documentation](https://argilla-io.github.io/argilla/latest/). -Argilla is an open-source data curation platform for LLMs. Using Argilla, everyone can build robust language models through faster data curation using both human and machine feedback. We provide support for each step in the MLOps cycle, from data labeling to model monitoring. - -There are different options to get started: - -1. Take a look at our [quickstart page](https://docs.argilla.io/en/latest/getting_started/quickstart.html) πŸš€ - -2. Start contributing by looking at our [contributor guidelines](#πŸ«±πŸΎβ€πŸ«²πŸΌ-contribute) πŸ«±πŸΎβ€πŸ«²πŸΌ - -3. Skip some steps with our [cheatsheet](#🎼-cheatsheet) 🎼 +Are you a contributor or do you want to understand what is going on under the hood, please keep reading the +documentation below. ## πŸ–₯️ FRONTEND @@ -99,9 +77,7 @@ npm run generate ## πŸ«±πŸΎβ€πŸ«²πŸΌ Contribute - To help our community with the creation of contributions, we have created our [developer](https://docs.argilla.io/en/latest/community/developer_docs.html) and [contributor](https://docs.argilla.io/en/latest/community/contributing.html) docs. Additionally, you can always [schedule a meeting](https://calendly.com/david-berenstein-huggingface/30min) with our Developer Advocacy team so they can get you up to speed. - -## πŸ₯‡ Contributors +To help our community with the creation of contributions, we have created our [community](https://argilla-io.github.io/argilla/latest/community/) docs. Additionally, you can always [schedule a meeting](https://calendly.com/david-berenstein-huggingface/30min) with our Developer Advocacy team so they can get you up to speed. diff --git a/argilla-server/CHANGELOG.md b/argilla-server/CHANGELOG.md index a6fc455df0..a665acf0e3 100644 --- a/argilla-server/CHANGELOG.md +++ b/argilla-server/CHANGELOG.md @@ -16,6 +16,8 @@ These are the section headers that we use: ## [Unreleased]() +## [2.0.0rc1](https://github.com/argilla-io/argilla/compare/v1.29.0...v2.0.0rc1) + ### Removed - Removed all API v0 endpoints. ([#4852](https://github.com/argilla-io/argilla/pull/4852)) diff --git a/argilla-server/README.md b/argilla-server/README.md index ba2d4d6bda..b2de9c85ae 100644 --- a/argilla-server/README.md +++ b/argilla-server/README.md @@ -1,19 +1,18 @@

Argilla
- Argilla-Server + Argilla

-

The repository for the Python native FastAPI server for Argilla backend.

- +

Work on data together, make your model outputs better!

- + CI -Codecov - -CI +Codecov + +CI @@ -32,11 +31,13 @@

-Argilla is a **collaboration platform for AI engineers and domain experts** that require **high-quality outputs, full -data ownership, and overall efficiency**. +Argilla is a **collaboration platform for AI engineers and domain experts** that require **high-quality outputs, full data ownership, and overall efficiency**. + +> [!NOTE] +> This README represents the release candidate for the 2.0.0 SDK version. The README for the last stable version of the 1x SDK can be found [1.x](../argilla-v1/README.md) This repository only contains developer info about the backend server. If you want to get started, we recommend taking a -look at our [main repository](https://github.com/argilla-io/argilla) or our [documentation](https://docs.argilla.io/). +look at our [main repository](https://github.com/argilla-io/argilla) or our [documentation](https://argilla-io.github.io/argilla/latest/). Are you a contributor or do you want to understand what is going on under the hood, please keep reading the documentation below. @@ -108,3 +109,17 @@ the [argilla-frontend](/argilla-frontend/README.md) project ```sh pdm server ``` + +## πŸ«±πŸΎβ€πŸ«²πŸΌ Contribute + +To help our community with the creation of contributions, we have created our [community](https://argilla-io.github.io/argilla/latest/community/) docs. Additionally, you can always [schedule a meeting](https://calendly.com/david-berenstein-huggingface/30min) with our Developer Advocacy team so they can get you up to speed. + + + + + + + +## πŸ—ΊοΈ Roadmap + +We continuously work on updating [our plans and our roadmap](https://github.com/orgs/argilla-io/projects/10/views/1) and we love to discuss those with our community. Feel encouraged to participate. diff --git a/argilla-v1/CHANGELOG.md b/argilla-v1/CHANGELOG.md index 18e67b7cad..b68022463a 100644 --- a/argilla-v1/CHANGELOG.md +++ b/argilla-v1/CHANGELOG.md @@ -16,6 +16,8 @@ These are the section headers that we use: ## [Unreleased]() +## [2.0.0rc1](https://github.com/argilla-io/argilla/compare/v1.29.0...v2.0.0rc1) + > [!NOTE] > As per the release of our 2.0 SDK, this changelog is deprecated and will only contain potential bug fixes for the 1.x SDK, but it will not contain any new features. For the latest features and changes, please refer to the [2.0 SDK changelog](../argilla/CHANGELOG.md). diff --git a/argilla-v1/README.md b/argilla-v1/README.md index 37aff75763..a52ba8ed74 100644 --- a/argilla-v1/README.md +++ b/argilla-v1/README.md @@ -34,6 +34,9 @@ Argilla is a **collaboration platform for AI engineers and domain experts** that require **high-quality outputs, full data ownership, and overall efficiency**. +> [!NOTE] +> This README represents the 1.29 SDK version. We have stopped development for the 1.x SDK version, while still committing to bug fixes. If you are looking for the README of the 2.x SDK version take a look [here](../README.md). + If you just want to get started, we recommend our [UI demo](https://demo.argilla.io/sign-in?auth=ZGVtbzoxMjM0NTY3OA%3D%3D) or our [2-click deployment quick start](https://docs.argilla.io/en/latest/getting_started/cheatsheet.html). Curious, and want to know more? Read our [documentation](https://docs.argilla.io/). ## Why use Argilla? diff --git a/argilla/CHANGELOG.md b/argilla/CHANGELOG.md index 4d82977d1b..e37bc8f4f2 100644 --- a/argilla/CHANGELOG.md +++ b/argilla/CHANGELOG.md @@ -17,7 +17,7 @@ These are the section headers that we use: ## [2.0.0rc1](https://github.com/argilla-io/argilla/compare/v1.29.0...v2.0.0rc) > [!NOTE] -> This releas for 2.0.0rc1 does not contain any changelog entries because it is the first release candidate for the 2.0.0 version. The following versions will contain the changelog entries again. For a general overview of the changes in the 2.0.0 version, please refer to [our blog](https://argilla.io/blog/) or [our new documentation](https://argilla-io.github.io/argilla/latest). +> This release for 2.0.0rc1 does not contain any changelog entries because it is the first release candidate for the 2.0.0 version. The following versions will contain the changelog entries again. For a general overview of the changes in the 2.0.0 version, please refer to [our blog](https://argilla.io/blog/) or [our new documentation](https://argilla-io.github.io/argilla/latest). ## [1.29.0](https://github.com/argilla-io/argilla/compare/v1.28.0...v1.29.0) diff --git a/argilla/docs/how_to_guides/annotate.md b/argilla/docs/how_to_guides/annotate.md index 97d40eb396..f8e3ab44e6 100644 --- a/argilla/docs/how_to_guides/annotate.md +++ b/argilla/docs/how_to_guides/annotate.md @@ -172,7 +172,7 @@ If the dataset contains metadata, responses and suggestions, click onΒ **Filter* From the `Responses` dropdown, type and select the question. You can set a range for rating questions and select specific values for label, multi-label, and span questions. !!! note - The text and ranking questions are not available for filtering. + The text and ranking questions are not available for filtering. === "By suggestions" diff --git a/docs/_source/_common/snippets/start_page.md b/docs/_source/_common/snippets/start_page.md index 0d81692825..8b7b1ef96f 100644 --- a/docs/_source/_common/snippets/start_page.md +++ b/docs/_source/_common/snippets/start_page.md @@ -2,7 +2,7 @@ # Welcome to -## Argilla is a platform to build high-quality AI datasets +## Argilla is a platform for building high-quality AI datasets If you need support join the [Argilla Slack community](https://join.slack.com/t/rubrixworkspace/shared_invite/zt-whigkyjn-a3IUJLD7gDbTZ0rKlvcJ5g) @@ -12,19 +12,15 @@ If you need support join the [Argilla Slack community](https://join.slack.com/t/ Get started by publishing your first dataset. -### 1. Open an IDE, Jupyter or Collab - -If you're a Collab user, you can directly use our [introductory tutorial](https://colab.research.google.com/github/argilla-io/argilla/blob/develop/docs/_source/getting_started/quickstart_workflow_feedback.ipynb). - -### 2. Install the SDK with pip +### 1. Install the SDK with pip To work with Argilla datasets, you need to use the Argilla SDK. You can install the SDK with pip as follows: ```sh -pip install argilla -U +pip install argilla -U --pre ``` -### 3. Connect to your Argilla server +### 2. Connect to your Argilla server Get your `ARGILLA_API_URL`: @@ -38,51 +34,66 @@ Make sure to replace `ARGILLA_API_URL` and `ARGILLA_API_KEY` in the code below. ```python import argilla as rg -rg.init( - api_url="ARGILLA_API_URL", - api_key="ARGILLA_API_KEY", - # extra_headers={"Authorization": f"Bearer {"HF_TOKEN"}"} +client = rg.Argilla( + api_url="", + api_key="" + # extra_headers={"Authorization": f"Bearer {HF_TOKEN}"} ) ``` -### 4. Create your first dataset +### 3. Create your first dataset -Specify a workspace where the dataset will be created. Check your workspaces in ["My settings"](/user-settings). To create a new workspace, check the [docs](https://docs.argilla.io/en/latest/getting_started/installation/configurations/workspace_management.html). +Specify a workspace where the dataset will be created. Check your workspaces in ["My settings"](/user-settings). To create a new workspace, check the [docs](https://argilla-io.github.io/argilla/latest/how_to_guides/workspace/). -Create a Dataset with two labels ("sadness" and "joy"). Don't forget to replace "". Here, we are using a task template, check the docs to [create a fully custom dataset](https://docs.argilla.io/en/latest/practical_guides/create_update_dataset/create_dataset.html). +Here, we are defining a creating a dataset with a text field and a label question ("positive" and "negative"), check the docs to [create a fully custom dataset](https://argilla-io.github.io/argilla/latest/how_to_guides/dataset/). Don't forget to replace "". ```python -dataset = rg.FeedbackDataset.for_text_classification( - labels=["sadness", "joy"], - multi_label=False, - use_markdown=True, - guidelines=None, - metadata_properties=None, - vectors_settings=None, +settings = rg.Settings( + guidelines="Classify the reviews as positive or negative.", + fields=[ + rg.TextField( + name="review", + title="Text from the review", + use_markdown=False, + ), + ], + questions=[ + rg.LabelQuestion( + name="my_label", + title="In which category does this article fit?", + labels=["positive", "negative"], + ) + ], ) -dataset.push_to_argilla(name="my-first-dataset", workspace="") +dataset = rg.Dataset( + name=f"my_first_dataset", + workspace="", + settings=settings, + client=client, +) +dataset.create() ``` -### 5. Add records +### 4. Add records -Create a list with the records you want to add. Ensure that you match the fields with the ones specified in the previous step. +You can create a list with records that you want to add. Ensure that you match the fields with those specified in the question settings. -You can also use `pandas` or `load_dataset` to [read an existing dataset and create records from it](https://docs.argilla.io/en/latest/practical_guides/create_update_dataset/records.html#add-records). +You can also use `pandas` or `datasets.load_dataset` to [read an existing dataset and create records from it](https://argilla-io.github.io/argilla/latest/how_to_guides/record/). ```python records = [ - rg.FeedbackRecord( + rg.Record( fields={ - "text": "I am so happy today", + "review": "This is a great product.", }, ), - rg.FeedbackRecord( + rg.Record( fields={ - "text": "I feel sad today", + "review": "This is a bad product.", }, - ) + ), ] -dataset.add_records(records) +dataset.records.log(records) ```