-
Notifications
You must be signed in to change notification settings - Fork 10.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to make the search request only return videos with subtitle? #30794
Comments
A mechanism within yt-dl would have to filter based on the extracted metadata (like The v3 YT search API has this parameter:
The yt-dl extractor uses https://www.youtube.com/youtubei/v1, which doesn't have this option. |
With #27749, you can pass search URLs like https://www.youtube.com/results?search_query=python&sp=EgIoAQ%253D%253D. You can find the correct |
That's nice. I must find that YouTube 101 course ... Apparently just Unfortunately, with https://www.youtube.com/results?q=ditempat&sp=EgQQASgB we still appear to run out of result pages (31, 538 results). I guessed that the This presentation (PDF) was useful for terminology. |
I'm pretty sure the "subtitles/CC" filter excludes automatic captions |
Looks right. It's just that |
So far the YT extractor uses just a few canned search criteria like this. That's robust as long as people don't do a JavaScript on the ProtoBuf encoding spec. When someone identifies a need to create ProtoBuf criteria dynamically, it might be necessary to write or borrow a marshaller.
So read that
Thanks. |
Randomly changing the specification in incompatible ways, I mean. |
#30794 (comment) seems to be the answer OP wanted. PR branch or git master needed, though. |
Checklist
Question
My goal is to download videos with human uploaded subtitle. The pipeline which I am currently execute is search some query first then parse the info.json to filter out the videos with human made subtitle and download them.
But as we all know videos with human made subtitle is much less than the videos without human made subtitle and considering that each search request only return ~500 results. How can I grasp these precious resource to make the returned results more efficient, which means that I can make only the videos with subtile return rather than all videos with or without subtitle in advance.
Do youtube-dl have any parameter that I can set when do each search request to achieve this purpose?
The text was updated successfully, but these errors were encountered: