Add SubStation Alpha (SSA) support #3060

avelad · 2020-12-23T11:47:14Z

Add SubStation Alpha (SSA) support

Format info: https://en.wikipedia.org/wiki/SubStation_Alpha

Note: SSA and ASS are supported

avelad · 2020-12-23T11:48:23Z

Before merging this, it is necessary to previously accept #3036 so that I can add a part that is the same for the conversion to WebVTT in src=.

avelad · 2020-12-29T07:09:05Z

Merge ready, it is now possible to review this PR.

avelad · 2021-01-12T15:58:16Z

@joeyparrish , do you think you could find a time this week to review this?

TheModMaker · 2021-01-19T23:22:18Z

lib/text/ssa_text_parser.js

+    /** @type {!Array.<!shaka.extern.Cue>} */
+    const cues = [];
+    const parts = str.split(/\r?\n\s*\r?\n/);
+    for (const part of parts) {


I think it would be easier to understand if you split the raw parsing and the name handling into separate parts. For example, have a parse method that produces a nested object map:

{ "Script Info": { "Title": "Foo", "Original Script": "" }, "V4+ Styles": { "Format": "..." } }

I'm going to think of a way to make it easier, and hope to come back tomorrow with a different implementation.

I was suggesting a more generic parser that didn't check for tag names. For example, something like:

const ret = {}; let section = null; for (const line of lines) { const match = /\[([^\]]+)\]/.exec(line); if (match) { section = match[1]; ret[section] = {}; } else if (/\s*;/.test(line)) { // Skip comments. } else { const parts = line.split(':'); ret[section][parts[0].trim()] = parts.slice(1).join(':').trim(); // You'll need to special-case Dialogue to support multiple cues. } }

I would also suggest putting this in another function to convert the data buffer to the object. Then you can just use data['V4 Styles'] or data['Events'] later.

Given that it is a popular format, what is the advantage of doing what you propose? At this time, what is supported are known tags and those that we add to sections. Adding unknown sections I don't think has any advantage ... Sorry if I didn't understand you.

Your code "adds" unknown sections too by ignoring them in the if statements below. I was suggesting we separate the text parsing from the handling of the sections. Like you would split XML parsing from the reading of the elements. By splitting them up, you can focus on reading the tag names without seeing all the regex and string parsing.

Sorry, I still don't see it :(. Right now with the last changes the events and styles are already separated, and also the parsing of them is made independent. Adding a generic parse here does not add much since it is a known format.In your example above, script information is added to the object that is not really useful.
On the other hand, even if I do a generic parse, I still have to know the tags.
The execution time of my proposal and yours should be similar.

lib/player.js

test/text/ssa_text_parser_unit.js

avelad · 2021-01-21T07:40:18Z

@TheModMaker I have applied all your comments. Can you review again?

lib/text/ssa_text_parser.js

avelad · 2021-01-21T20:08:54Z

Ready for review!

TheModMaker · 2021-01-21T20:36:56Z

lib/text/lrc_text_parser.js

@@ -33,16 +33,16 @@ shaka.text.LrcTextParser = class {
   * @export
   */
  parseMedia(data, time) {
-    return shaka.text.LrcTextParser.getCues_(data, time.segmentEnd);
+    return shaka.text.LrcTextParser.getCues(data, time.segmentEnd);


Since this just forwards, I suggest just using parseMedia directly.

We cannot do this because getCues is static and is necessary in the player.

Part of my generic suggestion was to avoid checking if (mimeType == 'foo') and instead use the parser interface directly. Doing something like this:

const factory = TextEngine.findParser(mimeType); if (factory) { const obj = factory(); cues = obj.parseMedia(data); }

But you don't have to do that now. You could do it in another PR or I could do it instead.

I'm going to address it in another PR, I keep it on my radar. Thanks!

lib/text/lrc_text_parser.js

lib/text/web_vtt_generator.js

lib/text/ssa_text_parser.js

TheModMaker · 2021-01-21T21:29:31Z

lib/text/ssa_text_parser.js

+    /** @type {!Array.<!shaka.extern.Cue>} */
+    const cues = [];
+    const parts = str.split(/\r?\n\s*\r?\n/);
+    for (const part of parts) {


I was suggesting a more generic parser that didn't check for tag names. For example, something like:

const ret = {}; let section = null; for (const line of lines) { const match = /\[([^\]]+)\]/.exec(line); if (match) { section = match[1]; ret[section] = {}; } else if (/\s*;/.test(line)) { // Skip comments. } else { const parts = line.split(':'); ret[section][parts[0].trim()] = parts.slice(1).join(':').trim(); // You'll need to special-case Dialogue to support multiple cues. } }

I would also suggest putting this in another function to convert the data buffer to the object. Then you can just use data['V4 Styles'] or data['Events'] later.

lib/text/ssa_text_parser.js

TheModMaker · 2021-01-27T18:26:48Z

lib/text/ssa_text_parser.js

+    /** @type {!Array.<!shaka.extern.Cue>} */
+    const cues = [];
+    const parts = str.split(/\r?\n\s*\r?\n/);
+    for (const part of parts) {


Your code "adds" unknown sections too by ignoring them in the if statements below. I was suggesting we separate the text parsing from the handling of the sections. Like you would split XML parsing from the reading of the elements. By splitting them up, you can focus on reading the tag names without seeing all the regex and string parsing.

TheModMaker · 2021-01-27T18:31:13Z

lib/text/lrc_text_parser.js

@@ -33,16 +33,16 @@ shaka.text.LrcTextParser = class {
   * @export
   */
  parseMedia(data, time) {
-    return shaka.text.LrcTextParser.getCues_(data, time.segmentEnd);
+    return shaka.text.LrcTextParser.getCues(data, time.segmentEnd);


Part of my generic suggestion was to avoid checking if (mimeType == 'foo') and instead use the parser interface directly. Doing something like this:

const factory = TextEngine.findParser(mimeType); if (factory) { const obj = factory(); cues = obj.parseMedia(data); }

But you don't have to do that now. You could do it in another PR or I could do it instead.

shaka-bot · 2021-02-01T20:14:35Z

Test Failure:

Generating Closure dependencies...
Linting JavaScript...

/var/lib/jenkins/workspace/Manual PR Test (local-tests)/lib/text/ssa_text_parser.js
272:11  error  'alpha' is never reassigned. Use 'const' instead  prefer-const

✖ 1 problem (1 error, 0 warnings)
1 error and 0 warnings potentially fixable with the `--fix` option.

END-BUILD: FAILURE
Build step 'Execute shell' marked build as failure

avelad · 2021-02-01T20:21:02Z

Fixed lint error. Sorry....

shaka-bot · 2021-02-01T20:42:24Z

All tests passed!

1. Added hdr as a property in stream when constructing. Fixes build failure from commit 7137286 . PR #3116 Issue #2813 2. Fixed the test error from commit d3640d1 . PR #3044 Issue #3029 3. Fixed the new line with no other arguments from commit 0845843 . PR #3060 Change-Id: I5833e49c1a95172742c4ec820960c9c5a7bf0cca

Add SubStation Alpha (SSA) support

115419d

Alvaro Velad added 7 commits December 28, 2020 08:32

Add new link

8169f7a

Fix example

e2c8309

Update regex

39509d3

Add new tests for different time formats

02a6db3

Merge branch 'master' into substation-alpha

dee5103

Fix regex

71ade8b

Add conversion to webvtt

c3d05a0

Merge branch 'master' into substation-alpha

e8bc3c1

TheModMaker suggested changes Jan 19, 2021

View reviewed changes

Alvaro Velad added 3 commits January 20, 2021 07:54

Merge branch 'master' into substation-alpha

2de4f66

Add WebVttGenerator

ef388c7

Split processing for better reading

22dec59

avelad mentioned this pull request Jan 21, 2021

Add SubViewer (SBV) support #3063

Merged

avelad commented Jan 21, 2021

View reviewed changes

lib/text/ssa_text_parser.js Outdated Show resolved Hide resolved

Add backgroundColor and color support

3dc815b

TheModMaker reviewed Jan 21, 2021

View reviewed changes

Alvaro Velad added 2 commits January 21, 2021 22:53

Remove some exports

17bcc3e

Simplify get the payload in the events

062483f

avelad requested a review from TheModMaker January 25, 2021 16:04

joeyparrish added this to the v3.1 milestone Jan 25, 2021

michellezhuogg added the waiting for review label Jan 26, 2021

Add parseSsaColor_ function

0edc712

TheModMaker reviewed Jan 27, 2021

View reviewed changes

Update test

da9b9ea

Alvaro Velad added 2 commits January 27, 2021 19:43

Update parseSsaColor_ function

d66acb3

Change alpha calculation

9e2890c

avelad requested a review from TheModMaker January 27, 2021 19:16

TheModMaker approved these changes Feb 1, 2021

View reviewed changes

Fix lint error

10e89fd

michellezhuogg merged commit 0845843 into shaka-project:master Feb 1, 2021

avelad deleted the substation-alpha branch February 1, 2021 21:11

michellezhuogg removed the waiting for review label Feb 2, 2021

github-actions bot added the status: archived Archived and locked; will not be updated label Jul 25, 2023

github-actions bot locked as resolved and limited conversation to collaborators Jul 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add SubStation Alpha (SSA) support #3060

Add SubStation Alpha (SSA) support #3060

avelad commented Dec 23, 2020

avelad commented Dec 23, 2020

avelad commented Dec 29, 2020

avelad commented Jan 12, 2021

TheModMaker Jan 19, 2021

avelad Jan 20, 2021

avelad Jan 21, 2021

TheModMaker Jan 21, 2021

avelad Jan 21, 2021

TheModMaker Jan 27, 2021

avelad Jan 27, 2021

avelad commented Jan 21, 2021

avelad commented Jan 21, 2021

TheModMaker Jan 21, 2021

avelad Jan 21, 2021

TheModMaker Jan 27, 2021

avelad Jan 27, 2021

TheModMaker Jan 21, 2021

TheModMaker Jan 27, 2021

TheModMaker Jan 27, 2021

shaka-bot commented Feb 1, 2021

avelad commented Feb 1, 2021

shaka-bot commented Feb 1, 2021

Add SubStation Alpha (SSA) support #3060

Add SubStation Alpha (SSA) support #3060

Conversation

avelad commented Dec 23, 2020

avelad commented Dec 23, 2020

avelad commented Dec 29, 2020

avelad commented Jan 12, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

avelad commented Jan 21, 2021

avelad commented Jan 21, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

shaka-bot commented Feb 1, 2021

avelad commented Feb 1, 2021

shaka-bot commented Feb 1, 2021