Skip to content

Commit

Permalink
Update docs/source/examples.rst
Browse files Browse the repository at this point in the history
  • Loading branch information
NivekT authored and ejguan committed Feb 27, 2023
1 parent f5276c4 commit 05299f5
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/source/examples.rst
Original file line number Diff line number Diff line change
Expand Up @@ -78,7 +78,7 @@ created by our community.
laion2B-en-joined
^^^^^^^^^^^^^^^^^^^^^^
The `laion2B-en-joined dataset <https://huggingface.co/datasets/laion/laion2B-en-joined>`_ is a subset of the `LAION-5B dataset <https://laion.ai/blog/laion-5b/>`_ containing english captions, URls pointing to images,
and other metadata. It contains around 2.32 billion entries.
and other metadata. It contains around 2.32 billion entries.
Currently (February 2023) around 86% of the URLs still point to valid images. Here is a `DataPipe implementation of laion2B-en-joined
<https://github.com/pytorch/data/blob/main/examples/vision/laion5b.py>`_ that filters out unsafe images and images with watermarks and loads the images from the URLs.

Expand Down

0 comments on commit 05299f5

Please sign in to comment.