Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[gedi] Add new extractor #26946

Closed
wants to merge 13 commits into from
Closed

[gedi] Add new extractor #26946

wants to merge 13 commits into from

Conversation

nixxo
Copy link
Contributor

@nixxo nixxo commented Oct 20, 2020

Before submitting a pull request make sure you have:

In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under Unlicense. Check one of the following options:

  • I am the original author of this code and I am willing to release it under Unlicense

What is the purpose of your pull request?

  • New extractor

Description of your pull request and other information

A new extractor for the various Gedi group italian newspaper websites like repubblica.it and lastampa.it among the biggest.

It closes #7347 (and #20835)

youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
nixxo and others added 8 commits February 19, 2021 20:39
- improved and simplified formats extraction using _extract_akamai_formats
- test md5 fix
- improved thumbnail extraction
- improved title extraction with fallback value
- added timestamp (with tests)
- added fallback values for description and thumbnail
- improved regex to extract video player data
- removed custom IE_NAME
- removed http direct url creation (no longer necessary after extract_akamai_formats teke care of it)
- removed audio format clean-up
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
- removed 'image_full_play' from thumbnail url extraction and adjusted test value
- removed 'modified_time' as a value to timestamp generation
- put 'only_matching' to some no longer necessary tests
- added 'ipad' tag to hls formats selection
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
youtube_dl/extractor/gedidigital.py Outdated Show resolved Hide resolved
- removed md5 leftover in test
- added comment in test for .strip() necessity
- changed default='' to fatal=False in _og_search_property for timestamp genaration
@nixxo
Copy link
Contributor Author

nixxo commented Feb 22, 2021

@remitamine remember the looong conversation ;-) we had last month about the vvvvid extractor "season" title? ;-)

In the end I think the solution was to set the playlist_title tag as the more appropriate tag for the value.

When you have time can you give a look at it here #27618 so we can close the vvvvid extractor for good.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

support request for video.repubblica.it
2 participants