fix: VTTCues with identical time intervals being incorrectly removed #1005

alex-barstow · 2020-11-18T23:03:05Z

Description

This is a replacement for #995, which was an attempt to revert a PR that resulted in WebVTT cues with identical time-intervals (which the spec allows) being removed from the cue list. However, simply reverting the change broke 608 captions, so another solution was required.

The main goals of this PR are to:

Make sure WebVTT cues are not removed from the TextTrackList just because they have the same time-intervals
Satisfy the original purpose of this logic (originally this) that I removed, which is to make sure any cues that "overlap" VTT segments (cues that are both the last cue of one segment and the first cue of the next segment) are removed.

Solution

We can meet those criteria by only removing cues that have identical time intervals and identical text. This will ensure we remove any cues that overlap VTT segments, while keeping any cues that are actually intended to be displayed at the same time (which we can reasonably assume will have different text)

Specific Additions

Create removeDuplicateCuesFromTrack() function in text-track utils
Modify the vtt-segment-loader so that we run the de-duping after adding VTTcues from a new segment
Unit tests

gkatsev

Just tested 608 and the stream that promted this and looking good.

gkatsev · 2020-11-19T15:58:18Z

src/util/text-tracks.js

+    if (duplicates.length) {
+      duplicates.forEach(dupe => track.removeCue(dupe));
+    }


Should this happen after both forloops? It'll make it less likely that the loop fails because the cues list got updated from under it.

Following some offline discussion, we've decided to leave this as is. The loop will start with the first cue, remove any duplicates of it, progress to the second cue, remove any dupes of that, etc. And the length of the cues array is updated accordingly as duplicates are removed, so this should be safe.

brandonocasey

The logic is way better here 👍 👍

…1005)

robflynn · 2020-12-04T21:09:00Z

@gkatsev @alex-barstow

I submitted the bug report that I believe led to this PR.

In reviewing the code ran across a possible additional issue that wan't covered in the fix regarding simultaneous captions.

In the code you mention that you're dropping captions if their time codes and text match.

You might want to also consider cue position in this check as duplicate text is not always safe to purge. I have seen multiple examples in the past of simultaneous captions WITH the same text, but on opposite ends of the screen. (Think two people surprised and shouting: "Dude!?" from opposite ends of the screen.)

gkatsev · 2020-12-09T19:41:06Z

Thanks @robflynn that definitely makes sense. Really, I think that our deduping should be limited to our 608 captions but I think it's not easy for us to differentiate where we do the deduping right now.

remove duplicate cues with same time interval and text

6e4a643

gkatsev reviewed Nov 19, 2020

View reviewed changes

brandonocasey reviewed Nov 19, 2020

View reviewed changes

gkatsev approved these changes Nov 19, 2020

View reviewed changes

brandonocasey approved these changes Nov 19, 2020

View reviewed changes

gkatsev merged commit 6db2b6a into main Nov 19, 2020

gkatsev deleted the fix-vtt-cue-removal branch November 19, 2020 19:59

gkatsev pushed a commit that referenced this pull request Nov 19, 2020

fix: remove duplicate cues with same time interval and text (#1005)

fb1c909

evanfarina pushed a commit to evanfarina/http-streaming that referenced this pull request Nov 26, 2020

fix: remove duplicate cues with same time interval and text (videojs#…

ee4817a

…1005)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: VTTCues with identical time intervals being incorrectly removed #1005

fix: VTTCues with identical time intervals being incorrectly removed #1005

alex-barstow commented Nov 18, 2020 •

edited

Loading

gkatsev left a comment

gkatsev Nov 19, 2020

alex-barstow Nov 19, 2020

brandonocasey left a comment

robflynn commented Dec 4, 2020

gkatsev commented Dec 9, 2020

fix: VTTCues with identical time intervals being incorrectly removed #1005

fix: VTTCues with identical time intervals being incorrectly removed #1005

Conversation

alex-barstow commented Nov 18, 2020 • edited Loading

Description

Solution

Specific Additions

gkatsev left a comment

Choose a reason for hiding this comment

gkatsev Nov 19, 2020

Choose a reason for hiding this comment

alex-barstow Nov 19, 2020

Choose a reason for hiding this comment

brandonocasey left a comment

Choose a reason for hiding this comment

robflynn commented Dec 4, 2020

gkatsev commented Dec 9, 2020

alex-barstow commented Nov 18, 2020 •

edited

Loading