Improve license checker line ending validation #2391

galpeter · 2018-06-08T14:18:32Z

The license checker previously assumed that the
lines of the license will always end with \n
characters. However when checking a file
it could happen that other line endings are
returned (should only happen for test files) thus
the checker can incorrectly report invalid license
as the line endings are incorrect.

Additional note #1: in Python when reading a file
in text mode it can happen that the line endings are
converted to the host system's line ending.
However on Travis the conversion did not happen when
using the open built-in method. By switching to the
io.open call the conversion is enforced and
all line endings are converted to '\n' regardless of
the host system's line ending.

Additional note #2: it is possible that there
are input test files which are not utf-8 conformant
(eg.: to test the parser). These files can't be read
as utf-8 strings and an exception would occur.
By ignoring these errors the tool can check
the file's license. In the license text there is no
invalid utf-8 character so the check will work
correctly.

akosthekiss · 2018-06-08T19:35:00Z

tools/check-license.py

@@ -78,7 +79,7 @@ def main():
            for fname in files:
                if any(fname.endswith(ext) for ext in EXTENSIONS):
                    fpath = os.path.join(root, fname)
-                    with open(fpath) as curr_file:
+                    with io.open(fpath, 'r', errors='ignore', newline='') as curr_file:


Why not simply use universal newlines (newline=None, or just don't mention newline at all)? Then python will see all newlines as \n and the LICENSE regex can be left unchanged.

(BTW, wouldn't that be simply equivalent to open(fpath, 'rU'), without need for io?)

The problem was that Travis (seemingly) did not use the universal newlines that's why there was an license error reported in #2371 , on my system the conversion is performed and there is no error reported. For the rU I got a deprecation warning with Python3.

Could you please double check? Here are my experiments (on top of this PR's commit):

akosthekiss@180980e

https://travis-ci.org/akosthekiss/jerryscript/builds/390173690

The license checker previously assumed that the lines of the license will always end with \n characters. However when checking a file it could happen that other line endings are returned (should only happen for test files) thus the checker can incorrectly report invalid license as the line endings are incorrect. Additional note jerryscript-project#1: in Python when reading a file in text mode it can happen that the line endings are converted to the host system's line ending. However on Travis the conversion did not happen when using the open built-in method. By switching to the io.open call the conversion is enforced and all line endings are converted to '\n' regardless of the host system's line ending. Additional note jerryscript-project#2: it is possible that there are input test files which are not utf-8 conformant (eg.: to test the parser). These files can't be read as utf-8 strings and an exception would occur. By ignoring these errors the tool can check the file's license. In the license text there is no invalid utf-8 character so the check will work correctly. JerryScript-DCO-1.0-Signed-off-by: Peter Gal [email protected]

galpeter · 2018-06-11T09:30:45Z

@akosthekiss did a triple-check and it worked as you suggested (removed the newline parameter). Guess that at a previous check there was some incorrect modifications on my side. Thanks!

I've updated the PR.

akosthekiss

LGTM

zherczeg

LGTM

galpeter mentioned this pull request Jun 8, 2018

Use binary mode in snapshot tool for fopen #2371

Merged

akosthekiss reviewed Jun 8, 2018

View reviewed changes

akosthekiss added enhancement An improvement tools Related to the tooling scripts labels Jun 9, 2018

galpeter force-pushed the licence-checker-lineendings branch from 4dad260 to 60a3c33 Compare June 11, 2018 09:28

akosthekiss approved these changes Jun 11, 2018

View reviewed changes

zherczeg approved these changes Jun 11, 2018

View reviewed changes

yichoi merged commit 9ae60a4 into jerryscript-project:master Jun 11, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve license checker line ending validation #2391

Improve license checker line ending validation #2391

galpeter commented Jun 8, 2018 •

edited

Loading

akosthekiss Jun 8, 2018

galpeter Jun 8, 2018

akosthekiss Jun 9, 2018

galpeter commented Jun 11, 2018

akosthekiss left a comment

zherczeg left a comment

Improve license checker line ending validation #2391

Improve license checker line ending validation #2391

Conversation

galpeter commented Jun 8, 2018 • edited Loading

akosthekiss Jun 8, 2018

Choose a reason for hiding this comment

galpeter Jun 8, 2018

Choose a reason for hiding this comment

akosthekiss Jun 9, 2018

Choose a reason for hiding this comment

galpeter commented Jun 11, 2018

akosthekiss left a comment

Choose a reason for hiding this comment

zherczeg left a comment

Choose a reason for hiding this comment

galpeter commented Jun 8, 2018 •

edited

Loading