Add sample data and include in tests #37

Merged: 4 commits, Apr 7, 2023

Conversation

mattkappel
Contributor

Description

  • Category: feature
  • JIRA issue: MIC-3884

Changes

  • Adds HDF files for example/sample data to noise
  • Adds usage for these data files in form integration tests
  • Fixes a bug related to the change to typed HDF columns from stringified CSV columns
  • Handles the case where numeric miswriting is given an empty series, as happened in testing for date of birth
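The empty-series case in the last bullet can be illustrated with a small sketch. This is not the code from this PR: `miswrite_numeric_sketch` and its digit-swapping logic are hypothetical stand-ins, assumed only to show why an early return on an empty column (for example, a date of birth column with no rows selected for noising) avoids downstream errors.

```python
import numpy as np
import pandas as pd


def miswrite_numeric_sketch(column: pd.Series, rng: np.random.Generator) -> pd.Series:
    """Hypothetical stand-in for a numeric miswriting noise function."""
    # Guard: with nothing to noise, return early instead of running
    # string operations and random draws on an empty column.
    if column.empty:
        return column.astype(str)

    as_str = column.astype(str)

    def swap_one_digit(value: str) -> str:
        # Find the digit positions; values without digits pass through unchanged.
        positions = [i for i, ch in enumerate(value) if ch.isdigit()]
        if not positions:
            return value
        i = rng.choice(positions)
        # Replace the chosen digit with a random digit (possibly the same one).
        return value[:i] + str(rng.integers(0, 10)) + value[i + 1:]

    return as_str.map(swap_one_digit)
```

Calling it with `pd.Series([], dtype="float64")` returns an empty series, while a non-empty column keeps its length and index.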

Testing

Newly enabled tests complete successfully.

@mattkappel mattkappel marked this pull request as ready for review April 6, 2023 01:48
@@ -131,6 +131,7 @@ def miswrite_zipcodes(
:return: pd.Series of noised zipcodes
"""

column = column.astype(str)
Contributor

What are these coming in as now, if not strings? I'm concerned that if they're coming in as ints again, we will be back to losing leading zeros (though I guess the check below would catch that).

Contributor Author

The sample dataset has every zipcode set to 90210, which was being interpreted as an int.
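For reference, a minimal pandas sketch of the dtype behaviour discussed in this thread. The variable names are illustrative and the `zfill` step is an assumption added for illustration, not something this PR does: casting with `astype(str)` is lossless for a value like 90210, but a zipcode that has already been stored as an int has lost its leading zero before the cast.

```python
import pandas as pd

# A column holding only 90210 is inferred as an integer dtype.
zipcodes = pd.Series([90210, 90210, 90210])
is_int = pd.api.types.is_integer_dtype(zipcodes)

# Casting to str recovers five-character strings in this case.
as_str = zipcodes.astype(str)

# A zipcode such as 02134 stored as an int has already lost its zero,
# so the cast alone yields a four-character string.
lossy = pd.Series([2134]).astype(str)

# Assumption for illustration: left-pad back to five characters.
restored = lossy.str.zfill(5)
```

Here `as_str` holds "90210" in every row, `lossy` holds "2134", and `restored` holds "02134".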

("todo", Form.CPS),
("todo", Form.WIC),
("todo", Form.SSA),
("todo", Form.TAX_W2_1099),
Contributor

You knocked out so many TODOs in one fell swoop!

@mattkappel mattkappel force-pushed the feature/mic-3884-incorporate-sample-dataset branch from 0f897c1 to a92fca3 Compare April 7, 2023 05:37
@mattkappel mattkappel merged commit 0a077a3 into develop Apr 7, 2023
@mattkappel mattkappel deleted the feature/mic-3884-incorporate-sample-dataset branch April 7, 2023 05:59