Support of escape sequences #125

ruben-ayrapetyan · 2015-05-28T16:05:35Z

Support of escape sequences with the exception of '\0' character and cases that depend on Unicode support.

Related issue: #50

galpeter · 2015-05-28T16:17:08Z

Would it be possible to add intentionally malformed escape sequences for the test cases?

ruben-ayrapetyan · 2015-05-28T16:21:50Z

Currently, we could add the test cases with syntax errors to tests/jerry/fail.
Later, after eval and syntax error feedback would be supported, we could also implement them like the following:

try
{
 eval ('"\u;012"');
 assert (false);
} catch (e)
{
   assert (e instanceof SyntaxError);
}

LaszloLango · 2015-05-29T05:58:15Z

jerry-core/parser/js/lexer.cpp

+ *         false - otherwise.
+ */
+static bool
+is_line_terminator (ecma_char_t c) /**< character */


We need this check too in the RegExp engine. Please declare it in a header, so we don't have to duplicate it. :)

sand1k · 2015-05-29T08:05:13Z

LGTM.

egavrin · 2015-05-29T08:18:28Z

@galpeter please check the PR.

galpeter · 2015-05-29T08:58:44Z

tests/jerry/escape_sequences.js

+// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+// See the License for the specific language governing permissions and
+// limitations under the License.
+


We should also add test cases for different string escapes: \t \n \r \b ...
as described in the http://www.ecma-international.org/ecma-262/5.1/#sec-7.8.4 SingleEscapeCharacter rule.

ruben-ayrapetyan · 2015-05-29T09:23:54Z

@galpeter, @LaszloLango, I've updated pull request.

…ings contained in source code buffer) to lexer token. JerryScript-DCO-1.0-Signed-off-by: Ruben Ayrapetyan [email protected]

ruben-ayrapetyan · 2015-05-29T10:49:31Z

jerry-core/parser/js/lexer.cpp

+          break;
+        }
+
+        converted_char = (ecma_char_t) char_code;


Moving discussion from commit notes (d379aff) to the pull request note.

@galpeter: This will convert the unit32_t to unit8_t or unit16_t is this really what we wanted? I know that we don't have full unicode support for know. Maybe a comment to describe why is this good?

@ruben-ayrapetyan
In CONFIG_ECMA_CHAR_ASCII mode this would convert to uint8_t, ignoring high part of unicode byte pair;
and in CONFIG_ECMA_CHAR_UTF16 mode this would convert to uint16_t.
Would the comment with the text above be ok?
Maybe it is better to use uint16_t instead of uint32_t here?

@galpeter
Will this change after we have full unicode support?

@ruben-ayrapetyan
The function's code would definitely be changed, but the idea probably would be the same, i.e.:
ecma_char_t would be uint8_t for ascii mode, and uint16_t - for utf16.

Then the comment is ok & we should use uint16_t

Ok. I'll add the comment and change the type according to your request.

galpeter · 2015-05-29T10:55:10Z

Other parts looks good to me.

…UL>") character and cases that depend on Unicode support. JerryScript-DCO-1.0-Signed-off-by: Ruben Ayrapetyan [email protected]

egavrin · 2015-05-29T11:04:32Z

Ok, make push

ruben-ayrapetyan added normal parser development labels May 28, 2015

ruben-ayrapetyan assigned sand1k May 28, 2015

ruben-ayrapetyan added this to the Core ECMA features milestone May 28, 2015

ruben-ayrapetyan changed the title ~~Support escape sequences~~ Support of escape sequences May 28, 2015

LaszloLango reviewed May 29, 2015
View reviewed changes

galpeter reviewed May 29, 2015
View reviewed changes

ruben-ayrapetyan force-pushed the Ruben-support-escape-sequences branch from 24b7d67 to d379aff Compare May 29, 2015 09:18

ruben-ayrapetyan assigned galpeter and unassigned sand1k May 29, 2015

Adding routine for conversion of any character sequence (not only str…

7025f97

…ings contained in source code buffer) to lexer token. JerryScript-DCO-1.0-Signed-off-by: Ruben Ayrapetyan [email protected]

ruben-ayrapetyan force-pushed the Ruben-support-escape-sequences branch from d379aff to 8dd45eb Compare May 29, 2015 10:46

ruben-ayrapetyan reviewed May 29, 2015
View reviewed changes

Implementing escape sequences support with the exception of "\0" ("<N…

8b28cac

…UL>") character and cases that depend on Unicode support. JerryScript-DCO-1.0-Signed-off-by: Ruben Ayrapetyan [email protected]

ruben-ayrapetyan force-pushed the Ruben-support-escape-sequences branch from 8dd45eb to 8b28cac Compare May 29, 2015 11:03

ruben-ayrapetyan assigned egavrin and unassigned galpeter May 29, 2015

ruben-ayrapetyan merged commit 8b28cac into master May 29, 2015

egavrin deleted the Ruben-support-escape-sequences branch May 29, 2015 11:42

somang-park unassigned egavrin Nov 25, 2016

This was referenced May 17, 2020

stack-overflow in vm_loop #3750

Closed

stack-overflow in ecma_regexp_match #3753

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support of escape sequences #125

Support of escape sequences #125

ruben-ayrapetyan commented May 28, 2015

galpeter commented May 28, 2015

ruben-ayrapetyan commented May 28, 2015

LaszloLango May 29, 2015

ruben-ayrapetyan May 29, 2015

sand1k commented May 29, 2015

egavrin commented May 29, 2015

galpeter May 29, 2015

ruben-ayrapetyan May 29, 2015

ruben-ayrapetyan commented May 29, 2015

ruben-ayrapetyan May 29, 2015

galpeter May 29, 2015

ruben-ayrapetyan May 29, 2015

galpeter commented May 29, 2015

egavrin commented May 29, 2015

Support of escape sequences #125

Support of escape sequences #125

Conversation

ruben-ayrapetyan commented May 28, 2015

galpeter commented May 28, 2015

ruben-ayrapetyan commented May 28, 2015

LaszloLango May 29, 2015

Choose a reason for hiding this comment

ruben-ayrapetyan May 29, 2015

Choose a reason for hiding this comment

sand1k commented May 29, 2015

egavrin commented May 29, 2015

galpeter May 29, 2015

Choose a reason for hiding this comment

ruben-ayrapetyan May 29, 2015

Choose a reason for hiding this comment

ruben-ayrapetyan commented May 29, 2015

ruben-ayrapetyan May 29, 2015

Choose a reason for hiding this comment

galpeter May 29, 2015

Choose a reason for hiding this comment

ruben-ayrapetyan May 29, 2015

Choose a reason for hiding this comment

galpeter commented May 29, 2015

egavrin commented May 29, 2015