[Lumni] Add new extractor #26532

Surkal · 2020-09-06T16:45:08Z

Please follow the guide below

You will be asked some questions, please read them carefully and answer honestly
Put an x into all the boxes [ ] relevant to your pull request (like that [x])
Use Preview tab to see how your pull request will actually look like

Before submitting a pull request make sure you have:

At least skimmed through adding new extractor tutorial and youtube-dl coding conventions sections
Searched the bugtracker for similar pull requests
Checked the code with flake8

In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under Unlicense. Check one of the following options:

I am the original author of this code and I am willing to release it under Unlicense
I am not the original author of this code but it is in public domain or released under Unlicense (provide reliable evidence)

What is the purpose of your pull request?

Bug fix
Improvement
New extractor
New feature

Description of your pull request and other information

French website for school children.
Requested in #24574 and #25772.

llevrel · 2020-10-02T10:52:35Z

Hi,

Why has the "Lumni" commit not be applied? I'm eagerly waiting for it XD

Thanks

dirkf

Happy to re-review and merge once FranceTV IE is working again.

dirkf · 2022-06-09T14:52:36Z

youtube_dl/extractor/extractors.py

@@ -580,6 +580,7 @@
 from .localnews8 import LocalNews8IE
 from .lovehomeporn import LoveHomePornIE
 from .lrt import LRTIE
+from .lumni import LumniIE, LumniPlaylistIE


More maintainable:

Suggested change

from .lumni import LumniIE, LumniPlaylistIE

from .lumni import (

LumniIE,

LumniPlaylistIE,

)

dirkf · 2022-06-09T14:53:45Z

youtube_dl/extractor/lumni.py

+        entries = [self.url_result(
+            'https://lumni.fr/video/%s' % video_id,
+            ie=LumniIE.ie_key(), video_id=video_id)
+            for video_id in orderedSet(re.findall(
+                r'<a[^>]+\bhref=["\']/video/([0-9a-z-]+)', webpage))]
+
+        playlist_title = self._og_search_title(webpage)
+
+        return self.playlist_result(entries, playlist_id, playlist_title)


There's a utility method for this; also the regex can be relaxed:

Suggested change

entries = [self.url_result(

'https://lumni.fr/video/%s' % video_id,

ie=LumniIE.ie_key(), video_id=video_id)

for video_id in orderedSet(re.findall(

r'<a[^>]+\bhref=["\']/video/([0-9a-z-]+)', webpage))]

playlist_title = self._og_search_title(webpage)

return self.playlist_result(entries, playlist_id, playlist_title)

playlist_title = self._og_search_title(webpage)

return self.playlist_from_matches(

re.findall(r'''<a\b[^>]+\bhref\s*=\s*["']/video/([0-9a-z-]+)''', webpage),

playlist_id, playlist_title,

getter=lambda x: 'https://lumni.fr/video/' + x,

ie=LumniIE.ie_key())

dirkf · 2022-06-09T14:57:38Z

youtube_dl/extractor/lumni.py

+            'title': 'Les Fondamentaux : Vocabulaire',
+        },
+        'playlist_mincount': 39


Currently these are right:

Suggested change

'title': 'Les Fondamentaux : Vocabulaire',

},

'playlist_mincount': 39

'title': 're:(?i)^Les Fondamentaux : Vocabulaire$',

},

'playlist_mincount': 25

dirkf · 2022-06-09T14:58:25Z

youtube_dl/extractor/lumni.py

+
+from .common import InfoExtractor
+from .francetv import FranceTVIE
+from ..utils import orderedSet


Removed by final suggestion:

Suggested change

from ..utils import orderedSet

dirkf · 2022-06-09T15:00:22Z

youtube_dl/extractor/lumni.py

+        full_id = 'francetv:%s' % video_id
+
+        return self.url_result(full_id,


Roll this together:

Suggested change

full_id = 'francetv:%s' % video_id

return self.url_result(full_id,

return self.url_result( 'francetv:' + video_id,

Surkal added 3 commits September 6, 2020 18:43

[Lumni] Add new extractor

68b5101

fix regex

9d037f4

split regex

e062ec8

dstftw force-pushed the master branch 2 times, most recently from 5e26784 to da2069f Compare September 13, 2020 13:49

cypheron mentioned this pull request Feb 3, 2021

Evaluation / overview of new proposed extractors / sites #28054

Open

dirkf requested changes Jun 9, 2022

View reviewed changes

dirkf force-pushed the master branch from 01bf89e to 4c6fba3 Compare August 26, 2022 07:51

dirkf closed this Aug 1, 2023

dirkf added the defunct PR source branch is not accessible label Oct 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Lumni] Add new extractor #26532

[Lumni] Add new extractor #26532

Surkal commented Sep 6, 2020

llevrel commented Oct 2, 2020

dirkf left a comment

dirkf Jun 9, 2022

dirkf Jun 9, 2022

dirkf Jun 9, 2022

dirkf Jun 9, 2022

dirkf Jun 9, 2022

		full_id = 'francetv:%s' % video_id

		return self.url_result(full_id,

[Lumni] Add new extractor #26532

[Lumni] Add new extractor #26532

Conversation

Surkal commented Sep 6, 2020

Please follow the guide below

Before submitting a pull request make sure you have:

In order to be accepted and merged into youtube-dl each piece of code must be in public domain or released under Unlicense. Check one of the following options:

What is the purpose of your pull request?

Description of your pull request and other information

llevrel commented Oct 2, 2020

dirkf left a comment

Choose a reason for hiding this comment

dirkf Jun 9, 2022

Choose a reason for hiding this comment

dirkf Jun 9, 2022

Choose a reason for hiding this comment

dirkf Jun 9, 2022

Choose a reason for hiding this comment

dirkf Jun 9, 2022

Choose a reason for hiding this comment

dirkf Jun 9, 2022

Choose a reason for hiding this comment