Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

#653 Detection of multilanguage choices only works if first choice list is multilanguage #666

Merged
merged 4 commits into from
Dec 1, 2023

Conversation

lindsay-stevens
Copy link
Contributor

@lindsay-stevens lindsay-stevens commented Nov 14, 2023

Closes #653

Why is this the best possible solution? Were any other approaches considered?

There are 2 main problem areas addressed: the translation warning, and the detection of translations for triggering multi-language (itext) output behaviour,

This changes the missing translations check in xls2json to use column headers instead of the first row. To avoid re-iterating the sheet data, the existing dealias function is modified to track and return the headers as well as the sheet data.

This changes multi-language detection by doing so only in the Survey object. For the Question sub-types, Survey sets the relevant info with new fields, so that the Questions don't need to re-check. For choices, the detection is per itemset (it could have looked across all itemsets but then untranslated itemsets would've had to be mangled into translations).

What are the regression risks?

Since there are some new fields on the Survey and Question classes, external library users that pass in JSON without these may experience errors or inconsistent behaviour. There is a file json_form_schema.json but it seems to be not consistently updated, and I don't know if it's accurate - there doesn't seem to be a pyxform way to generate it or use it to validate.

Does this change require updates to documentation? If so, please file an issue here and include the link below.

Probably not, it's all internal representation.

Before submitting this PR, please make sure you have:

  • included test cases for core behavior and edge cases in tests
  • run nosetests and verified all tests pass
  • run black pyxform tests to format code
  • verified that any code or assets from external sources are properly credited in comments

@lindsay-stevens lindsay-stevens force-pushed the pyxform-653 branch 3 times, most recently from 391f161 to 3a008d1 Compare November 20, 2023 14:43
- overall goal is to avoid using only the first few lines of survey
  or choices sheet data to decide whether to use multi-language
  behaviour (itext) or not.
- xls2json.py: re-use dealias func for finding headers as well as data
- translation_checks.py: use headers data to find translations
- survey.py:
  - declare _translations key and store and translations info in it
  - tidy setup_translations to data prep vs. _translations update steps
  - add translation type key to help differentiate question vs choice
  - annotate survey elements with info about translations, dynamic
    choices, and media, so that this doesn't need to be re-checked later
    by individual elements.
  - decide whether to emit itext labels for choices on a per-itemset
    basis, rather than switching into itext mode for all choices. This
    avoids needing to mangle untranslated itemsets into translated when
    other itemsets in the survey are translated (otherwise they would
    get no itext, not even a hypen "-").
  - add lru_cache upper limit so that it doesn't grow forever
  - update add_to_nested_dict to allow updating existing dict
- question.py:
  - declare translations, dynamic choices, and media keys set by survey
    and use them to determine relevant behaviour instead of re-checking
- utils.py: remove multi-language parameter from has_dynamic_label
  because it's clearer to do the same logic external to the function.
- add tests for multi-language detection + with media
- For insert/pop on the left side: O(1) for deque vs. O(n) for list.
- Avoids creating lots of new lists with [] + [].
- Did not investigate whether the "flat" filter is still necessary.
@lindsay-stevens lindsay-stevens marked this pull request as ready for review November 24, 2023 07:13
Copy link
Contributor

@lognaturel lognaturel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it could have looked across all itemsets but then untranslated itemsets would've had to be mangled into translations

This is very interesting! We warn about this case and it's probably not terribly useful so either way would be fine.

Here's an interesting form I cooked up: https://docs.google.com/spreadsheets/d/1NW2yfyLCavbJ8reUo-6i1yjq48eIaEJ170ggDEzYB8Y/edit#gid=1033515766

I'm happy with the output and you may find it interesting to look at.

pyxform/survey.py Show resolved Hide resolved
pyxform/utils.py Show resolved Hide resolved
pyxform/survey_element.py Show resolved Hide resolved
@lognaturel
Copy link
Contributor

There's a small recommendation and a question about risk inline but I'm going to merge now so we can get it onto staging asap.

@lognaturel lognaturel merged commit ee6bd2d into XLSForm:master Dec 1, 2023
10 checks passed
@lindsay-stevens lindsay-stevens deleted the pyxform-653 branch December 4, 2023 11:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Detection of multilanguage choices only works if first choice list is multilanguage
2 participants