Do not use the implemented interface for element/attributes #147

mozfreddyb · 2022-06-10T11:31:20Z

Instead of talking about interfaces, we should talk about the element name directly.
Notably, HTML elements like blink and applet both implement the HTMLUnknownElement interface, yet we'd probably remove the former and keep the latter. (Even though applet is specified not to do anything, I'd prefer we don't allow it)


otherdaniel · 2022-06-13T14:42:34Z

This makes total sense. The problem I had is that I didn't have a proper classification I could adopt, or derive ours from. IMHO, we need to categorize elements and attributes into (at least) 3 classes each: known-good; known-bad; other. If we maintain explicit lists for the first two (or derive them from the HTML spec), we should be good.

It might be slightly better if we pick 4 classes: known-good ( == default-listed); known-maybe ( == not default-allowed, but permissible); known-bad ( == not in baseline == not permissible); and other.

Maybe a good transition strategy would be to define these name sets manually in an appendix, base the remaining spec on them, and then transition to deriving those same sets from the HTML spec.

I guess we'll have to come up with proper names, though. I'm pretty sure "known-good", "known-maybe" and "known-bad" won't be agreeable. :-)
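
To make the four-class idea concrete, here is a minimal sketch of classification by name-set membership. The class names are the placeholders from this comment, and the set contents (`p`, `div`, `dialog`, `script`) and the `classifyElement` helper are hypothetical, chosen only for illustration; none of this is taken from the spec.

```typescript
// Placeholder class names as used in this thread; not agreed-upon spec terms.
type NameClass = "known-good" | "known-maybe" | "known-bad" | "other";

// Hypothetical, manually maintained name sets (illustrative contents only).
const KNOWN_GOOD = new Set<string>(["p", "div"]);   // default-listed
const KNOWN_MAYBE = new Set<string>(["dialog"]);    // permissible, not default
const KNOWN_BAD = new Set<string>(["script"]);      // never permissible

// Classify an element purely by its local name, not by its DOM interface.
function classifyElement(localName: string): NameClass {
  if (KNOWN_GOOD.has(localName)) return "known-good";
  if (KNOWN_MAYBE.has(localName)) return "known-maybe";
  if (KNOWN_BAD.has(localName)) return "known-bad";
  return "other"; // anything not on an explicit list
}
```

Attributes could be handled by a second set of lists with the same shape.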

mozfreddyb · 2022-06-14T09:06:56Z

Yeah, that seems fine in terms of next-steps. IIRC @annevk suggested we won't necessarily derive those sets from the HTML spec immediately but could do self-serving lists as part of the Sanitizer API at first.

annevk · 2022-06-15T07:15:32Z

@domenic might have an alternative idea here, but I think new static lists are the way to go for now. And then later we can figure out if some of that information can be deduced from other sources or should be listed as part of the definition of an element.

koto · 2022-06-15T07:39:22Z

We used [StringContext] to enumerate the XSSy attributes in TT, with a JS API for querying.

evilpie · 2022-08-01T13:56:18Z

It's starting to look like we need some kind of resolution here for me to finish the new implementation of the Sanitizer API for Firefox that closely follows the spec.

It doesn't seem like Gecko has any central list of supported attributes, so implementing the current spec is rather harder.

Aside: What I do find interesting is that the list of allowed HTML attributes (plus a few special ones like URLs) in the current sanitizer is maybe half as long as the current baseline list of the Sanitizer API.

otherdaniel · 2022-08-24T13:41:59Z

We (Freddy + myself) discussed this offline. The gist is:

Conceptually, the Sanitizer requires three lists: known elements/attributes; default elements/attributes; baseline elements/attributes.
Two of those - baseline + default - are explicit in the spec. The third - known element/attributes - is "hidden" in the element kind / attribute kind definitions. We should re-word the spec to be more clear about this.
Conceptually, the known element/attributes "list" is contained in the HTML spec, but unfortunately not in the shape of an actual list that one could somehow reference. It's not entirely clear how to deal with this. One option would be to just maintain an additional list in the Sanitizer for now; but that risks getting out of date.
The known element/attributes lists are important to handle the case of allowUnknownMarkup, since unknown markup cannot be tested against the baseline list - it can't be in there - but we still want baseline checks with allowUnknownMarkup against known elements, because otherwise we'd violate the one hard guarantee the Sanitizer wants to give, namely to not contain script-y content.
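
The interaction between the known list, the baseline list, and allowUnknownMarkup can be sketched roughly as follows. This is an illustration of the reasoning above, not the spec's algorithm; the `SanitizerLists` shape, the `isElementPermitted` helper, and the example names are all assumptions.

```typescript
// Hypothetical container for two of the three lists discussed above.
interface SanitizerLists {
  known: ReadonlySet<string>;    // every element name we know about
  baseline: ReadonlySet<string>; // known elements that may ever be allowed
}

// Decide whether an element name survives the baseline check.
function isElementPermitted(
  localName: string,
  lists: SanitizerLists,
  allowUnknownMarkup: boolean,
): boolean {
  if (lists.known.has(localName)) {
    // Known elements are always checked against the baseline,
    // even when allowUnknownMarkup is set.
    return lists.baseline.has(localName);
  }
  // Unknown markup cannot appear in the baseline, so only the flag decides.
  return allowUnknownMarkup;
}

// Illustrative lists: "script" is known but not in the baseline.
const exampleLists: SanitizerLists = {
  known: new Set(["p", "script"]),
  baseline: new Set(["p"]),
};
```

With these lists, `script` is rejected whether or not allowUnknownMarkup is set, while a name such as `my-widget` passes only when the flag is on.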

otherdaniel · 2022-09-08T10:35:54Z

I've sent a first PR to resolve the more trivial instances of this in "handling funky elements", #173.

I'm preparing a second, more elaborate one that puts a set of lists for default/baseline/known elements/attributes into the repo, plus a script to assemble appropriate lists for Appendix A. That should allow us to

more easily reason about what we're specifying (e.g., we can just diff "known" and "baseline" elements),
write this spec almost entirely in terms of list membership,
easily adapt this to whatever namespace representation we pick by changing the script.

And if some day the HTML spec can export lists of spec-defined elements/attributes, we should be able to hook those up quite easily, too.
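
As a rough idea of what "diffing known and baseline" could look like once the lists exist as plain data, here is a small sketch. The `difference` helper and the list contents are hypothetical; the actual script and list format in the repo may differ.

```typescript
// Return the names in `a` that are missing from `b`, sorted for stable output.
function difference(a: ReadonlySet<string>, b: ReadonlySet<string>): string[] {
  return Array.from(a).filter((name) => !b.has(name)).sort();
}

// Illustrative list contents only; not the real Appendix A lists.
const knownElements = new Set<string>(["a", "applet", "p", "script"]);
const baselineElements = new Set<string>(["a", "p"]);

// Known elements that are not in the baseline, i.e. the ones that are always removed.
const alwaysRemoved = difference(knownElements, baselineElements);
console.log(alwaysRemoved.join(", "));
```

The same helper works for attribute lists, and for comparing default against baseline.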

annevk · 2022-09-08T13:04:32Z

Bit confused by the last paragraph. The idea is still to upstream most of this to HTML, right?

mozfreddyb · 2022-09-09T11:54:27Z

Yeah, I believe we agreed that all "lists" would be part of the Sanitizer API at first, because HTML has no "these are all the elements we know of" list that one could easily refer to (and similarly for attributes).

annevk · 2022-09-09T19:38:06Z

Right, and then the list would be moved over to the HTML Standard upon integration. (And we'd add a note to the documentation for updating elements that this list might need to be changed as well.)

evilpie mentioned this issue Aug 12, 2022

Also sanitize javascript: href on MathML and SVG anchor? #168

Open

otherdaniel mentioned this issue Sep 8, 2022

Refer to elements by namespace and local name #173

Merged

annevk added the v1 label Oct 18, 2023

mozfreddyb added this to the v1 milestone Jan 23, 2024

otherdaniel mentioned this issue Mar 15, 2024

Update the spec to match the current group consensus. #208

Merged

otherdaniel closed this as completed in db21b50 Mar 19, 2024

otherdaniel closed this as completed in #208 Mar 19, 2024
