Update scancode.io models to handle new scancode-toolkit scan fields #436

JonoYang · 2022-05-11T00:21:02Z

In a scancode JSON output, all Packages detected in a scan are present in a top-level attribute named packages. Likewise, all detected Dependencies are placed in the dependencies attribute. Multiple copies of the same package can be present in the packages field, if a particular package was detected multiple times in the same codebase. Each copy of this package will have a different package_uid. A package_uid is the purl of that package with a qualifier named uuid that is specific to the scancode run. e.g. pkg:pypi/[email protected]?uuid=9c19275c-c3fe-43dd-b6ec-a4f2bf65810f

For each Resource that is for a Package, the for_packages for those Resources will be populated with the package_uid of the Package they are for.

We will need to create a DiscoveredDependency model to handle the dependencies from the new top-level dependency attribute from a scan.

We also need to modify the DiscoveredPackage model to better store/query the new package_uid values. Currently, we put the package_uids for a package in the extra_data field.

The serializers of these models will have to be updated as well.

The value in a Resource's for_packages field is not a purl, but a package_uid. for a particular instance of a Package detected during a scan. In the new output, multiple copies of the same package can appear in the top-level packages field. Each copy has a different package_uid. We'll have to find a way to keep the package_uids around on Resources and to display the package_uids properly in the for_packages field in the scancode.io JSON output.

The text was updated successfully, but these errors were encountered:

* Update scan_for_application_packages to save detected Package data to the CodebaseResource it is from, then iterate through the CodebaseResources with Package data and use the proper Package handler to process the Package data * Create DiscoveredDependency model * Add package_data JSON field to CodebaseResource Signed-off-by: Jono Yang <[email protected]>

* Increase field sizes in DiscoveredDependency Signed-off-by: Jono Yang <[email protected]>

tdruez · 2022-08-31T05:38:20Z

@JonoYang are we now ready to close this on?

JonoYang · 2022-08-31T20:04:36Z

@tdruez This is finished now that #486 is merged.

JonoYang added a commit that referenced this issue Jul 21, 2022

Add has_single_resource to Project #436 #447

f60f6c2

* Increase field sizes in DiscoveredDependency Signed-off-by: Jono Yang <[email protected]>

pombredanne added this to the v32.0.0 milestone Jul 28, 2022

JonoYang closed this as completed Aug 31, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update scancode.io models to handle new scancode-toolkit scan fields #436

Update scancode.io models to handle new scancode-toolkit scan fields #436

JonoYang commented May 11, 2022 •

edited

Loading

tdruez commented Aug 31, 2022

JonoYang commented Aug 31, 2022

Update scancode.io models to handle new scancode-toolkit scan fields #436

Update scancode.io models to handle new scancode-toolkit scan fields #436

Comments

JonoYang commented May 11, 2022 • edited Loading

tdruez commented Aug 31, 2022

JonoYang commented Aug 31, 2022

JonoYang commented May 11, 2022 •

edited

Loading