Automatic Rule/.yml file Generation #15

Open
AyanSinhaMahapatra opened this issue May 31, 2020 · 0 comments
@AyanSinhaMahapatra

The last part of analyzing the scan results is automatically creating new rules to add to the existing
repository of rules, along with semi-automated generation of the accompanying .yml files.

  1. Rule Generation

License detections are grouped by location, which in the end comes down to the boundaries (start and end) of the matched text. Stitching together the matched texts from all the license detections in a group gives the whole “query” text, which is almost always the Rule text to be added. So, while keeping track of the boundaries of the text where license detection takes place, we also stitch the matched texts together one by one, discarding those already present in a larger text, to generate the final Rule text.
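The stitching step described above could be sketched roughly as follows. This is only an illustration, not the analyzer's actual code: the `Match` class, its field names, and `stitch_rule_text` are hypothetical, and the sketch assumes each detection carries line boundaries plus its matched text.

```python
from dataclasses import dataclass


@dataclass
class Match:
    # Hypothetical container for one license detection in a file region.
    start_line: int
    end_line: int
    matched_text: str


def stitch_rule_text(matches):
    """Return (start_line, end_line, text) for one group of detections.

    Matches fully contained in a larger, already kept match are discarded.
    """
    # Earliest start first; for equal starts, the larger span comes first.
    ordered = sorted(matches, key=lambda m: (m.start_line, -m.end_line))
    kept = []
    for match in ordered:
        # Kept end lines strictly increase, so comparing with the last one
        # is enough to detect containment in any kept match.
        if kept and match.end_line <= kept[-1].end_line:
            continue
        kept.append(match)
    if not kept:
        return None
    text = "\n".join(m.matched_text for m in kept)
    return kept[0].start_line, kept[-1].end_line, text


group = [
    Match(1, 3, "Licensed under the"),
    Match(2, 3, "under the"),
    Match(4, 6, "MIT License"),
]
print(stitch_rule_text(group))
```

Partially overlapping matches are kept whole in this sketch, so their shared text would appear twice; a real implementation would need to trim the overlap.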

  2. .yml generation

The “license_expression” almost always has to be entered manually, as it is complicated and requires a lot of context. These tasks can be sped up significantly by using a GUI-based interactive review framework (like the one in the license tags part). The generation also takes the existing rule names into account: since they are numbered sequentially, the next free number is used and naming conflicts are avoided.
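A minimal sketch of the sequential naming and a bare-bones .yml body, assuming rule names follow a `<base>_<number>` pattern. The helper names `next_rule_name` and `render_yml` are hypothetical, and real rule .yml files may carry more fields than the single `license_expression` shown here.

```python
import re


def next_rule_name(existing_names, base):
    """Return the next free '<base>_<n>' name (assumed naming scheme)."""
    pattern = re.compile(re.escape(base) + r"_(\d+)")
    numbers = []
    for name in existing_names:
        found = pattern.fullmatch(name)
        if found:
            numbers.append(int(found.group(1)))
    # Start at 1 when no rule with this base name exists yet.
    return f"{base}_{max(numbers, default=0) + 1}"


def render_yml(license_expression):
    # The expression itself is still supplied manually by the reviewer.
    return f"license_expression: {license_expression}\n"


existing = ["mit_1", "mit_2", "mit_10", "apache-2.0_3"]
name = next_rule_name(existing, "mit")
print(name + ".yml")
print(render_yml("mit"), end="")
```

Taking the maximum existing number instead of counting files keeps names unique even when the sequence has gaps.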

@AyanSinhaMahapatra AyanSinhaMahapatra self-assigned this May 31, 2020
@pombredanne pombredanne transferred this issue from aboutcode-org/scancode-toolkit Oct 22, 2020
AyanSinhaMahapatra pushed a commit to AyanSinhaMahapatra/scancode-analyzer that referenced this issue Nov 5, 2020
Update .gitignore to ignore Jupyter temp files