Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make cudf::test::expect_columns_equal() to fail when comparing unsanitary lists. #10880

Merged

Conversation

nvdbaranec
Copy link
Contributor

Fixes: #10855

Unsanitary lists (those with null rows that actually have backing underlying data) should not compare as equal to sanitized lists, but should compare as equivalent. Only one test in cudf was affected by this change, which I've opened a second issue for:
#10856

This PR also adds a utility function to cudf::test::lists_column_wrapper,

static lists_column_wrapper<T> make_one_empty_row_column(bool valid = true)

Due to the way initializer lists and nested class initialization work, it was impossible to create a list column with 1 row that was empty, so I added this explicit function to handle the edge case.

@nvdbaranec nvdbaranec added bug Something isn't working libcudf Affects libcudf (C++/CUDA) code. breaking Breaking change labels May 17, 2022
@nvdbaranec nvdbaranec requested a review from a team as a code owner May 17, 2022 20:38
@nvdbaranec nvdbaranec requested review from harrism and vuule May 17, 2022 20:38
@codecov
Copy link

codecov bot commented May 17, 2022

Codecov Report

Merging #10880 (4bbc421) into branch-22.06 (6901b12) will increase coverage by 0.03%.
The diff coverage is n/a.

@@               Coverage Diff                @@
##           branch-22.06   #10880      +/-   ##
================================================
+ Coverage         86.29%   86.33%   +0.03%     
================================================
  Files               144      144              
  Lines             22656    22665       +9     
================================================
+ Hits              19551    19567      +16     
+ Misses             3105     3098       -7     
Impacted Files Coverage Δ
python/dask_cudf/dask_cudf/tests/test_groupby.py 100.00% <0.00%> (ø)
python/dask_cudf/dask_cudf/groupby.py 97.42% <0.00%> (+<0.01%) ⬆️
python/cudf/cudf/core/dataframe.py 93.78% <0.00%> (+0.04%) ⬆️
python/cudf/cudf/core/column/string.py 88.78% <0.00%> (+0.12%) ⬆️
python/cudf/cudf/core/groupby/groupby.py 91.79% <0.00%> (+0.22%) ⬆️
python/cudf/cudf/core/column/numerical.py 96.17% <0.00%> (+0.29%) ⬆️
python/cudf/cudf/core/tools/datetimes.py 84.49% <0.00%> (+0.30%) ⬆️
python/cudf/cudf/core/column/lists.py 91.70% <0.00%> (+0.97%) ⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6901b12...4bbc421. Read the comment docs.

Copy link
Contributor

@vuule vuule left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good, just one very minor suggestion

Copy link
Member

@harrism harrism left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One minor comment on clarity, otherwise looks good!

@vuule vuule added the 5 - Ready to Merge Testing and reviews complete, ready to merge label May 19, 2022
@nvdbaranec
Copy link
Contributor Author

@gpucibot merge

@rapids-bot rapids-bot bot merged commit c0895c1 into rapidsai:branch-22.06 May 19, 2022
rapids-bot bot pushed a commit that referenced this pull request May 23, 2022
Addresses:  #10800

Previously, `concatenate_rows()` only handled lists one level deep.   This PR adds support for arbitrary nesting.

<s>This PR is dependent on :  #10880 so some of the files currently showing up on the diff are unrelated to the `concatenate_rows()` change.</s>

Authors:
  - https://github.com/nvdbaranec

Approvers:
  - Robert (Bobby) Evans (https://github.com/revans2)
  - Vukasin Milovanovic (https://github.com/vuule)
  - Nghia Truong (https://github.com/ttnghia)

URL: #10890
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
5 - Ready to Merge Testing and reviews complete, ready to merge breaking Breaking change bug Something isn't working libcudf Affects libcudf (C++/CUDA) code.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] cudf::test::expect_columns_equal() is too permissive for unsanitary lists.
4 participants