yt-dlp

mirror of https://github.com/yt-dlp/yt-dlp.git synced 2025-07-02 19:38:32 +00:00

Author	SHA1	Message	Date
GiorgosTsak	45d132a6be	Improved WebVTT parser with robust error handling and input validation Enhances the WebVTT partial parser by adding comprehensive error handling, type validation, and defensive checks to prevent unexpected failures during parsing. Specifically, input types are validated in _MatchParser and parse_fragment, ensuring only valid strings or bytes are accepted. Timestamp parsing now raises clear errors for invalid matches, while regex operations are guarded to avoid NoneType attribute errors. The .decode() step in parse_fragment uses safe fallback to handle invalid byte sequences gracefully.	2025-06-23 20:16:28 +03:00
sepro	add96eb9f8	[cleanup] Add more ruff rules (#10149) Authored by: seproDev Reviewed-by: bashonly <88596187+bashonly@users.noreply.github.com> Reviewed-by: Simon Sawicki <contact@grub4k.xyz>	2024-06-12 01:09:58 +02:00
pukkandan	615a84447e	[cleanup] Misc (#8968) Authored by: pukkandan, bashonly, seproDev	2024-03-11 00:52:28 +05:30
pukkandan	298230e550	[webvtt] Fix `15f22b4880`	2023-12-13 05:11:45 +05:30
TSRBerry	15f22b4880	[webvtt] Allow spaces before newlines for CueBlock (#7681) Closes #7453 Ref: https://www.w3.org/TR/webvtt1/#webvtt-cue-block	2023-11-29 04:50:06 +05:30
Marcel	f352a09778	[webvtt] Handle premature EOF Closes #2867, closes #5600 Authored by: flashdagger	2022-11-20 14:14:42 +05:30
pukkandan	2fa669f759	[docs] Misc improvements Closes #4987, Closes #4906, Closes #4919, Closes #4977, Closes #4979	2022-09-22 02:15:55 +05:30
pukkandan	c646d76f67	[webvtt, extractor/youtube] Extract auto-subs from livestream VODs Closes #4130 Authored by: pukkandan, fstirlitz	2022-07-31 02:20:11 +05:30
pukkandan	6929b41a21	Remove Python 3.6 support Closes #3764	2022-07-18 06:31:14 +05:30
pukkandan	0f06bcd759	[cleanup] Minor fixes (See desc) * [youtube] Fix `--youtube-skip-dash-manifest` * [build] Use `$()` in `Makefile`. Closes #3684 * Fix bug in `385ffb467b` * Fix bug in `43d7f5a5d0` * [cleanup] Remove unnecessary `utf-8` from `str.encode`/`bytes.decode` * [utils] LazyList: Expose unnecessarily "protected" attributes and other minor cleanup	2022-05-09 17:59:26 +05:30
felix	77f9033095	[compat] Split into sub-modules (#2173) Authored by: fstirlitz, pukkandan	2022-04-18 04:26:43 +05:30
pukkandan	19a0394044	[cleanup] Misc cleanup and refactor (#2173)	2022-04-18 02:28:28 +05:30
pukkandan	f82711587c	[cleanup] Sort imports Using https://github.com/PyCQA/isort isort -m VERTICAL_HANGING_INDENT --py 36 -l 80 --rr -n --tc .	2022-04-12 05:32:52 +05:30
pukkandan	86e5f3ed2e	[cleanup] Upgrade syntax Using https://github.com/asottile/pyupgrade 1. `__future__` imports and `coding: utf-8` were removed 2. Files were rewritten with `pyupgrade --py36-plus --keep-percent-format` 3. f-strings were cherry-picked from `pyupgrade --py36-plus` Extractors are left untouched (except removing header) to avoid unnecessary merge conflicts	2022-04-12 05:32:51 +05:30
pukkandan	f9934b9614	[cleanup] Mark some compat variables for removal (#2173) Authored by fstirlitz, pukkandan	2022-04-12 05:32:50 +05:30
pukkandan	aa7785f860	[utils] Standardize timestamp formatting code Closes #1285	2021-10-19 22:58:25 +05:30
pukkandan	81a136b80f	[WebVTT] Adjust parser to accommodate PBS subtitles (#922) Closes #921	2021-09-08 16:10:10 +05:30
Felix S	25a3f4f5d6	[webvtt] Merge daisy-chained duplicate cues (#638) Fixes: https://github.com/yt-dlp/yt-dlp/issues/631#issuecomment-893338552 Previous deduplication algorithm only removed duplicate cues with identical text, styles and timestamps. This change also merges cues that come in ‘daisy chains’, where sequences of cues with identical text and styles appear in which the ending timestamp of one equals the starting timestamp of the next. This deduplication algorithm has the somewhat unfortunate side effect that NOTE blocks between cues, if found, will be emitted in a different order relative to their original cues. This may be unwanted if perfect fidelity is desired, but then so is daisy-chain deduplication itself. NOTE blocks ought to be ignored by WebVTT players in any case. Authored by: fstirlitz	2021-08-10 01:52:30 +05:30
pukkandan	75722b037d	[webtt] Fix timestamps Closes #474	2021-07-12 05:20:12 +05:30
Felix S	333217f43e	[downloader/hls] Remove duplicate cues using a sliding window of candidates	2021-04-28 17:21:26 +05:30
Felix S	4a2f19abbd	[downloader/hls] Assemble single-file WebVTT subtitles from HLS segments	2021-04-28 17:21:14 +05:30

21 Commits
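
The message for commit 25a3f4f5d6 describes merging "daisy-chained" cues: runs of cues with identical text and styles where one cue's end timestamp equals the next cue's start. The sketch below illustrates that idea only; it is not the code from yt-dlp. The `Cue` class, its fields (timestamps assumed to be integer milliseconds), and the `merge_daisy_chains` function are hypothetical names chosen for this example.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Cue:
    """Hypothetical cue record; timestamps are integer milliseconds."""
    start: int
    end: int
    text: str
    style: str = ""


def merge_daisy_chains(cues):
    """Merge cues with identical text and style that repeat or chain.

    A cue is dropped when an earlier cue with the same text and style
    already covers its time range, and it extends that earlier cue when
    it starts exactly where the earlier cue ends.
    """
    merged = []
    last_index = {}  # (text, style) -> index of the latest such cue in merged
    for cue in sorted(cues, key=lambda c: (c.start, c.end)):
        key = (cue.text, cue.style)
        i = last_index.get(key)
        if i is not None:
            prev = merged[i]
            if prev.start <= cue.start and cue.end <= prev.end:
                continue  # duplicate: already covered by the earlier cue
            if prev.end == cue.start:
                merged[i] = replace(prev, end=cue.end)  # extend the chain
                continue
        last_index[key] = len(merged)
        merged.append(cue)
    return merged


example = [
    Cue(0, 2000, "Hello"),
    Cue(2000, 4000, "Hello"),
    Cue(2000, 4000, "Hello"),  # exact repeat, as seen across HLS segments
    Cue(4000, 5000, "World"),
    Cue(4000, 6000, "Hello"),
]
result = merge_daisy_chains(example)
```

Here the four "Hello" cues collapse into one cue spanning 0 to 6000 ms, while "World" is kept unchanged. Sorting by start time is a simplification made for this sketch; a streaming downloader would more plausibly keep a bounded window of recent cues, as the subject of commit 333217f43e suggests.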