convert does not respect encoding #82

DanielHabenicht · 2021-03-19T22:39:35Z

Describe the bug
Convert should recognise "escaped" characters and parse them correctly.

To Reproduce

var xmlbuilder2 = require("xmlbuilder2")

const xmlBase = {
  xliff: {
    "@version": "1.2",
    file: {
      "@datatype": "plaintext",
      "@source-language": "en",
      body: {
      "@id": "test",
      source: "<this> test </this>"
      },
    },
  },
};

src = xmlbuilder2.convert(xmlBase, { prettyPrint: true })
console.log(src)
// "<?xml version=\"1.0\"?>\n<xliff version=\"1.2\">\n  <file … &lt;/this&gt;</source>\n    </body>\n  </file>\n</xliff>"

xmlbuilder2.convert(src, {format: "object"})

Expected behaviour
Convert does encode special characters like < or > as &lt; and &gt; But does not decode them if parsed again.

Version:

node.js: 15.12.0
xmlbuilder2 2.4.0

The text was updated successfully, but these errors were encountered:

DanielHabenicht · 2021-03-19T23:43:44Z

Workaround: Use CDATA e.g.:

var xmlbuilder2 = require("xmlbuilder2")

const xmlBase = {
  xliff: {
    "@version": "1.2",
    file: {
      "@datatype": "plaintext",
      "@source-language": "en",
      body: {
      "@id": "test",
      $: "<this> test </this>"
      },
    },
  },
};

src = xmlbuilder2.convert(xmlBase, { prettyPrint: true })
console.log(src)
// "<?xml version=\"1.0\"?>\n<xliff version=\"1.2\">\n  <file … &lt;/this&gt;</source>\n    </body>\n  </file>\n</xliff>"

xmlbuilder2.convert(src, {format: "object"})

kernwig · 2021-07-28T17:59:33Z

I ran into this too. Being able to parse back its own output is a basic test. This isn't just failing to decode XML entities, it's actually encoding them again (thus escaping ' to &apos; instead of to ').

CDATA is not only unnecessary, it's impossible when you don't control the source.

DanielHabenicht added the bug Something isn't working label Mar 19, 2021

DanielHabenicht assigned oozcitak Mar 19, 2021

chuanqisun mentioned this issue Jul 20, 2021

When converting from xml string to js object, & might be double escaped #98

Closed

oozcitak mentioned this issue Jul 27, 2021

DOM textContent returns encoded text #88

Closed

oozcitak added a commit to oozcitak/dom that referenced this issue Jul 29, 2021

Add test for #7 also oozcitak/xmlbuilder2#82

59a0ed1

oozcitak closed this as completed in e9d3f93 Jul 29, 2021

jasonkhanlar mentioned this issue Apr 7, 2022

Upgrading from 3.0.1 to 3.0.2 un-escapes & #117

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

convert does not respect encoding #82

convert does not respect encoding #82

DanielHabenicht commented Mar 19, 2021

DanielHabenicht commented Mar 19, 2021

kernwig commented Jul 28, 2021

convert does not respect encoding #82

convert does not respect encoding #82

Comments

DanielHabenicht commented Mar 19, 2021

DanielHabenicht commented Mar 19, 2021

kernwig commented Jul 28, 2021