Releases: ihmeuw/pseudopeople
Releases · ihmeuw/pseudopeople
v0.7.1
v0.7.0
What's Changed
- Improve Input Data page by @NathanielBlairStahn in #155
- Merge changes stashed until after CODS by @rmudambi in #157
- stop noising employer_id by @stevebachmeier in #159
- Implement targeted omission do_not_respond for census and surveys by @mattkappel in #158
- Feature/sbachmei/test column specific noising by @stevebachmeier in #131
- include row-level noise in col-level integration tests by @stevebachmeier in #160
- make separate test to check single vs multiple source datasets by @stevebachmeier in #161
- Feature/sbachmei/mic 4046 4053 create fixtures of noised data by @stevebachmeier in #162
- split up test by @stevebachmeier in #164
- Nicknames by @albrja in #163
- Add state subsetting to generate dataset interfaces by @mattkappel in #165
- Unpin vivarium version by @albrja in #168
- Feature/sbachmei/mic 4046 4055 row level integration test by @stevebachmeier in #166
- Bugfix/sbachmei/pin vivarium by @stevebachmeier in #169
- add file hashes to doc by @stevebachmeier in #170
- align noise entity class attributes with names by @stevebachmeier in #172
- DOC: wordsmith intro section, 2nd attempt by @NathanielBlairStahn in #171
- fix np.isclose ordering by @stevebachmeier in #175
- Configure readthedocs to build a PDF by @zmbc in #176
- Increase IndexMap size; fix vectorized_choice by @stevebachmeier in #173
- OCR noise function by @albrja in #178
- Add nicknames to noise functions table in docs by @pletale in #177
- include household_id cols; update sample data by @stevebachmeier in #180
- Phonetic noise function by @albrja in #181
- remove household_id from ssa noised dataset by @stevebachmeier in #183
- Bugfix/trailing spaces by @albrja in #184
- Update noise functions page to include OCR errors by @pletale in #179
- Document requesting no noise with a sentinel value by @zmbc in #186
- Copy member df by @albrja in #190
- Copy from household member noise function implementation by @albrja in #191
- Generate un-noised dataset by @albrja in #194
- Update tests for invalid probability validation by @albrja in #197
- Update to support python 3.11 by @albrja in #198
- Update test_noise_order by @albrja in #201
- List household_id as an attribute in all datasets except SSA by @pletale in #202
- Add docstring for no noise interface and fix bug by @albrja in #204
- Update data_access_request.yml by @Ironholds in #206
- Update noise table to include choose the wrong option by @pletale in #200
- Add GQ values to 'relationship to reference person' in census dataset table - hot fix by @pletale in #203
- Do not expose internal exception as context by @zmbc in #210
- Update sample data for new column in 1040 by @albrja in #207
- Add helper function to get filepaths for dataset to be noised. This a… by @albrja in #208
- Update pseudopeople docs to include phonetic noise by @pletale in #209
- optimize make_typos by @stevebachmeier in #211
- automatically tag all tests in integration/ as slow by @stevebachmeier in #215
- Merge docs hotfixes to develop by @rmudambi in #213
- bugfix check user config types by @stevebachmeier in #219
- Document swap month and day noise function by @pletale in #220
- Add 1040 form to datasets page in docs by @pletale in #214
- Refactor to add logic for 1040 in interface by @albrja in #217
- Match date formats to concept model by @zmbc in #218
- Update 1040 form to remove employer and income columns by @pletale in #222
- Mic 4218/format 1040 by @albrja in #225
- add reminder to run integration tests by @stevebachmeier in #227
- Mic 4208/1040 metadata by @albrja in #226
- DOC: add link to mailing list signup page by @aflaxman in #195
- Doc hotfix: Correct date formats by @zmbc in #216
- Hot fix: remove references to wages from noise table by @pletale in #223
- Mic 4210/1040 implementation by @albrja in #228
- DOC: Add columns to decennial census and ACS by @NathanielBlairStahn in #229
- w2 column change from 'income' to 'wages'; new sample data by @stevebachmeier in #230
- Link to example access request by @Ironholds in #224
- Mic-4277/sample data by @albrja in #234
- Add 'relationship_to_reference_person' to ACS; update sample data by @stevebachmeier in #231
- Hot fix: remove age from SSA dataset by @pletale in #239
- Mic 4221/1040 tests by @albrja in #232
- Mic 4294/fix tax 1040 state filter by @albrja in #233
- Ensure all noise functions are verbs by @pletale in #237
- Add link to pseudopeople website in readthedocs by @pletale in #238
- Add middle name and sex columns to SSA dataset by @pletale in #240
- Add year columns in Census, WIC, and tax datasets by @pletale in #241
- Add 1040 to noise table in pseudopeople docs by @pletale in #235
- SSA dataset columns updates by @albrja in #243
- Fix middle name/middle initial rows in noise table by @pletale in #244
- Change income column to wages for W2 and 1099 by @pletale in #245
- Add 1040 docstring link to Datasets page by @pletale in #246
- Fix bug if string is source in fetch_dataset_filepaths by @albrja in #247
- Fix column order in 1040 dataset by @albrja in #248
- add checkbox to pr template for slow tests by @stevebachmeier in #250
- rename "user_config" arg to "overrides" by @stevebachmeier in #249
- include dataset name as outer key to get_config by @stevebachmeier in #252
- Text wrap tables on Datasets page by @pletale in #253
- DOC: Reorder noise functions in user docs by @NathanielBlairStahn in #254
- Mic 4386/remove 1040 special casing by @albrja in #255
- Mic-4134/remove extra columns by @albrja in #257
- Mic 4405/add housing type by @albrja in #260
- Hot fix: remove age from WIC dataset in pseudopeople docs by @pletale in #262
- Edit column descriptions on Datasets page by @pletale in #263
- Remove tax datasets from Noise table column for 'miswrite age' by @pletale in #259
- Mic 4255/relationship category update by @albrja in #264
- Edit introduction page based on doc session feedback by @pletale in #258
- inherit from Exception instead of BaseException by @stevebachmeier in ...
v0.6.5
v0.6.4
What's Changed
- Document the simulation's timespan by @zmbc in #134
- Change to beta, add next steps after quickstart by @zmbc in #137
- Document "write wrong digits" noise by @NathanielBlairStahn in #128
- Document configuration by @zmbc in #133
- Document the "Make typos" noise type by @zmbc in #136
- Revise from feedback by @zmbc in #144
- Document
omit_row
noise type by @NathanielBlairStahn in #143 - Document the "Use a fake name" noise type by @zmbc in #148
- Revise explanation of simulation duration by @zmbc in #145
- Add tables of noise types to noise documentation page by @pletale in #129
- Add link to introduction/quickstart in sidebar by @pletale in #146
- Revise the API reference pages in the docs by @zmbc in #142
- Reorder noise types by @zmbc in #149
- Make some minor presentation changes by @zmbc in #150
- Document pseudopeople input data by @NathanielBlairStahn in #152
- Add table of noise types for each column by @pletale in #151
- update version and changelog by @rmudambi in #153
- Release v0.6.4 by @rmudambi in #154
Full Changelog: v0.6.3...v0.6.4
v0.6.3
What's Changed
- Merge the rest of the dataset descriptions page into develop by @pletale in #117
- Separate ACS and CPS dataset descriptions and remove redundant table by @pletale in #125
- Remove Concepts section by @NathanielBlairStahn in #127
- Title-case fake names by @zmbc in #124
- Remove the ".interface" in docs by @zmbc in #126
- Update landing page to remove whitespace around logo and add logo to upper left hand link by @pletale in #130
- Add descriptions for the first four noise types by @zmbc in #119
- Add data access request to issue templates by @zmbc in #120
- Make quickstart more welcoming to non-Python users by @zmbc in #132
- release v0.6.3 by @stevebachmeier in #140
- Release v0.6.3 by @stevebachmeier in #141
Full Changelog: v0.6.2...v0.6.3
Documentation release
What's Changed
- Add year=None text to generate_x docstrings and type annotations by @mattkappel in #118
- dual progress bars by @stevebachmeier in #121
Full Changelog: v0.6.1...v0.6.2
v0.6.1
What's Changed
- api reference context by @stevebachmeier in #100
- Add website links and info about surveys to documentation by @pletale in #96
- Start noise documentation by @NathanielBlairStahn in #94
- Add SSA dataset description to documentation by @pletale in #102
- Add pseudopeople logo to documentation homepage by @pletale in #104
- Updating docstrings for configuration by @albrja in #103
- Update generate_x docs by @stevebachmeier in #105
- Add subpages for noise function details by @NathanielBlairStahn in #109
- Rename probability to cell and row probability by @albrja in #107
- Add page stubs for Input Data and Configuration by @NathanielBlairStahn in #111
- Add longer introduction by @zmbc in #108
- Feature/account for sim schema updates by @rmudambi in #114
- Remove module name from docs by @albrja in #113
- Edit Datasets documentation page to include link to noise page and other small changes by @pletale in #106
- update version and changelog by @rmudambi in #115
- Develop by @rmudambi in #116
Full Changelog: v0.6.0...v0.6.1
v0.6.0
What's Changed
- Add documentation for dataset descriptions by @pletale in #69
- Update zipcode miswriting by @albrja in #70
- add individuals not in the ihmeuw org to codeowners by @rmudambi in #73
- change 'omission' key to 'omit_rows' by @stevebachmeier in #74
- Feature/sbachmei/mic 3945 update missing data key name by @stevebachmeier in #75
- Typographic update by @albrja in #76
- Feature/sbachmei/mic 3945 update remaining config key names by @stevebachmeier in #78
- Feature/sbachmei/mic 3945 add cell probabilities by @stevebachmeier in #80
- Feature/sbachmei/mic 3945 remove default token noise level by @stevebachmeier in #83
- Update sample datasets to include all GQ types by @zmbc in #82
- Feature/sbachmei/mic 3945 final cleanup by @stevebachmeier in #84
- Scale ACS and CPS omission for oversampling by @albrja in #86
- Add to the dataset documentation and create outline for rest of page by @pletale in #77
- add user config value validation by @stevebachmeier in #85
- Add .DS_Store to .gitignore by @NathanielBlairStahn in #87
- Add 'Notes' column to dataset documentation tables by @pletale in #88
- Revise landing page, add a quickstart by @zmbc in #91
- change 'form' to 'dataset' throughout by @stevebachmeier in #90
- return to having individual names in codeowners by @rmudambi in #92
- Update default configuration values by @zmbc in #89
- remove include_original_token_probability key from typo noise config by @rmudambi in #93
- fix name of ACS dataset by @rmudambi in #98
- Get config by @albrja in #95
- Issue templates by @albrja in #81
- release v0.6.0 by @stevebachmeier in #99
- Release v0.6.0 by @stevebachmeier in #101
New Contributors
Full Changelog: v0.5.1...v0.6.0
v0.5.1
What's Changed
- Add date formatting and refactor generate form helper by @mattkappel in #61
- Switching to np.nan instead of pd.NA by @albrja in #65
- Avoid unnecessary pd.to_datetime by @mattkappel in #64
- validate user configuration keys by @rmudambi in #66
- change 'row_noise_level' to 'probability' by @stevebachmeier in #68
- Add logging and progress bar for generation of forms by @mattkappel in #67
Full Changelog: v0.5.0...v0.5.1
v0.5.0
What's Changed
- Adds noise level scaling function to ColumnNoiseType by @albrja in #52
- add link to docs; remove ... thing by @stevebachmeier in #53
- Adds logging if no cells are noised by @albrja in #54
- Omission noise function by @albrja in #56
- Implement output schema by @stevebachmeier in #51
- Add year argument to form generation functions by @mattkappel in #55
- update codeowners so that researchers can approve docs changes by @rmudambi in #60
- Add index page for noise functions section by @NathanielBlairStahn in #59
- add github repo link to docs by @stevebachmeier in #58
- support only a path to data root directory by @stevebachmeier in #57
- release v0.5.0 by @stevebachmeier in #62
- release v0.5.0 by @stevebachmeier in #63
New Contributors
- @NathanielBlairStahn made their first contribution in #59
Full Changelog: v0.4.0...v0.5.0