Fix: libpe_status, xml: Support integer as rule type attribute #2135

nrwahl2 · 2020-08-05T02:37:11Z

The documentation specifies three potential values for the type
attribute of rule expressions:

string
integer
version

Yet the rule.rng schema and pe__eval_attr_expr() expect "number" instead
of "integer".

This pull corrects the documentation to align with the schema and code.

Thanks @tomjelinek for reporting this.

Edited summary:

This pull does the following:

Creates a pcmk_parse_double() function (strtod() wrapper) and unit tests
Creates a pcmk__char_in_any_str() function (multi-strchr() wrapper) and unit tests
Adds "integer" to the rule schema as a valid type attribute
Adds "integer" support to libpe_status, updates "number" to specify a floating-point comparison, and uses "integer" for integer comparisons. (Previously, "number" specified an integer comparison.)
Updates the documentation to align with the schema and libpe_status

nrwahl2 · 2020-08-05T02:42:26Z

Alternative options:

1. Would it make sense to parse the values with strtod() and treat them as doubles? It seems like that wouldn't be too complicated. Maybe add a pcmk_parse_double() to the strings library.
2. If option 1 is not worthwhile, what about changing the schema to line up with the doc and deprecating "number" in favor of "integer"? Floating-point values currently get truncated to ints. IMO it's better to name the type choice accordingly and make it clear that we can only do integer comparisons.

Changing the schema seems like a tricky process due to compatibility concerns. I hit a roadblock during an initial attempt at that (mostly due to CTS's use of the rule-2.9 schema, which was last updated 3 years ago). I read over the schema readme but I'm still not sure how to approach it if we were to go that route.

kgaillot · 2020-08-06T20:14:12Z

I'm thinking we should leave the documentation as-is, and modify the schema and code to accept either "integer" or "number" interchangeably. Perhaps at a future Pacemaker major-version bump we can make "number" behave as a floating point.

To modify the schema, see xml/Readme.md. This will be a compatible change.

kgaillot · 2020-08-06T20:19:24Z

Another possibility would be to treat "number" as a double if it contains a '.', and an integer otherwise. That would keep backward compatibility for anyone using it as an integer currently while allowing users to start using it for floating points. In that case the documentation, code and schema could be brought in line as:

"integer", or "number" with a value without '.': integer
"number" with a value with '.': double-precision floating point

nrwahl2 · 2020-08-07T00:29:40Z

> To modify the schema, see xml/Readme.md. This will be a compatible change.

Ack. As we discussed in chat, the source of confusion was the use of an old rule schema in the current constraints schema.

> Another possibility would be to treat "number" as a double if it contains a '.', and an integer otherwise. That would keep backward compatibility for anyone using it as an integer currently while allowing users to start using it for floating points.

Would this actually do anything to alleviate your concern in the RHBZ? You said:

> If anyone's using it, they're using "number" with integers, and they may reasonably expect values not to have any of the unusual corner cases of floating-point arithmetic, so we should keep "number" syntax and integer behavior.

If they're currently using integer strings (i.e., without a '.'), then they shouldn't be prone to any floating-point corner cases AFAIK. The floating point promotion would just add zeroes after the decimal place anyway.

Users would only hit a floating-point corner case (e.g., a comparison failing due to a precision issue in FP arithmetic) after the suggested change if they're currently using floating-point strings (i.e., with a '.') and relying on Pacemaker to truncate the value to an int. Then after the change, those values would be parsed as doubles and may compare incorrectly, in theory.

So it seems to me that parsing numbers with '.' as floating-point would be just the same as parsing ALL numbers as floating-point, when it comes to the risk of hitting unexpected behavior in a corner case.

kgaillot · 2020-08-07T15:46:37Z

> So it seems to me that parsing numbers with '.' as floating-point would be just the same as parsing ALL numbers as floating-point, when it comes to the risk of hitting unexpected behavior in a corner case.

A bit contrived, but:

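#include <stdio.h>
#include <stdlib.h>

int main(void)
{
    /* 2^63 + 1: representable exactly as a 64-bit unsigned integer */
    unsigned long long integer = 9223372036854775809ULL;

    /* The same value parsed as a double, which has only 53 bits of precision */
    double number = strtod("9223372036854775809.0", NULL);

    /* The double is rounded to a nearby representable value, so the two differ */
    printf("%llu %f\n", integer, number);
    return 0;
}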

However that wouldn't affect our situation since we currently parse the values as 32-bit integers (which I didn't realize until checking just now), so technically you're right. :) Which makes me think we should indeed parse "number" as double, and start parsing "integer" with crm_parse_ll(). That would clean things up nicely and shouldn't affect any existing usage adversely.

nrwahl2 · 2020-08-07T17:56:47Z

> A bit contrived, but:

The best corner cases are contrived :)
And ah yes, large ints that require high precision. I forgot about losses when converting those to doubles.

> That would clean things up nicely and shouldn't affect any existing usage adversely.

Ack. So I take it we're not worrying about any odd use cases where the expected result of a comparison depends on truncation to int. I support going with double for "number" as long as you're good with it, since it is more flexible.

I was working on a pcmk__scan_double() and pcmk_parse_double(). I back-burnered them to focus on commits for the main focus of this issue, after having some initial difficulty detecting overflow and underflow in a way that's reliable on both ANSI and C99. (The strtod() return value for underflow is 0 and ERANGE on ANSI, while it's "<= smallest positive normalized number and maybe ERANGE (implementation-dependent)" on C99.)
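To make the overflow/underflow concern concrete, here is a minimal sketch of a strtod() wrapper. It is not the code from this PR: the name scan_double_sketch() and its return-code convention are hypothetical, and the use of EOVERFLOW assumes a POSIX errno.h. It treats a subnormal result as underflow even when errno is not set, since C99 leaves ERANGE on underflow implementation-defined; a result of exactly zero without ERANGE is not detected.

```c
#include <errno.h>
#include <float.h>
#include <math.h>    /* HUGE_VAL */
#include <stdlib.h>

/* Hypothetical helper (not Pacemaker's pcmk__scan_double()).
 * Returns 0 on success, EINVAL if nothing could be parsed,
 * EOVERFLOW on overflow, and ERANGE on underflow.
 * Trailing characters after the number are ignored in this sketch.
 */
int scan_double_sketch(const char *text, double *result)
{
    char *end = NULL;
    double value;
    double magnitude;

    if (text == NULL || result == NULL) {
        return EINVAL;
    }

    errno = 0;
    value = strtod(text, &end);

    if (end == text) {
        return EINVAL;      /* no conversion was performed */
    }

    magnitude = (value < 0.0) ? -value : value;

    /* Overflow: strtod() returns +/-HUGE_VAL and sets errno to ERANGE */
    if (errno == ERANGE && magnitude == HUGE_VAL) {
        return EOVERFLOW;
    }

    /* Underflow: errno may or may not be set, so also treat any nonzero
     * result smaller than the smallest normalized double as underflow
     */
    if (errno == ERANGE || (magnitude > 0.0 && magnitude < DBL_MIN)) {
        return ERANGE;
    }

    *result = value;
    return 0;
}
```

A caller would check the return code before using the result, for example treating any nonzero code as a reason to fall back to a default value.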

If we've settled on parsing "number" as double, I'll get back to work on that and add the result to the PR.

kgaillot · 2020-08-07T18:02:26Z

> Ack. So I take it we're not worrying about any odd use cases where the expected result of a comparison depends on truncation to int. I support going with double for "number" as long as you're good with it, since it is more flexible.

Yep. Using a "." and expecting an integer comparison would be twisted enough that I'd consider it a bug.

> I was working on a pcmk__scan_double() and pcmk_parse_double(). I back-burnered them to focus on commits for the main focus of this issue, after having some initial difficulty detecting overflow and underflow in a way that's reliable on both ANSI and C99. (The strtod() return value for underflow is 0 and ERANGE on ANSI, while it's "<= smallest positive normalized number and maybe ERANGE (implementation-dependent)" on C99.)
>
> If we've settled on parsing "number" as double, I'll get back to work on that and add the result to the PR.

Sounds good

nrwahl2 · 2020-08-09T02:26:37Z

This has grown pretty large. I think I'm finished, until there's feedback on things that need to be changed.

A lot of the commits are prep work (e.g., adding new utilities and their tests, copying schemas, etc.). Four of them are minor issues and opportunities I happened upon while doing the prep work. The last few commits directly address the issue.

nrwahl2 · 2020-08-10T00:18:41Z

One other thing: Does any of this require a feature set bump?

kgaillot

Nice work, very thorough. I think a feature bump is worthwhile since otherwise the behavior of floating-point attribute values could change depending on which node is DC. Also, IIRC the schema version bump protects us from the user being able to upgrade the CIB before all nodes support it, but technically users can run without schema verification, so it's best not to rely solely on that (without schema verification someone could specify integer in a mixed-version cluster and get really different behavior).

nrwahl2 · 2020-08-11T02:06:19Z

By the way, we can't rely on GCC and Clang extensions, right? GCC (and I believe Clang as well) has an extension for variadic macro arguments that would eliminate the need to pass NULL as a sentinel value to some of our newer string functions. It's not part of the standard though.
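For reference, the standard-C pattern under discussion (a variadic function whose argument list ends with a NULL sentinel) can be sketched as follows. This is an illustration only, not the PR's pcmk__char_in_any_str(); the name char_in_any_str_sketch() is hypothetical.

```c
#include <stdarg.h>
#include <stdbool.h>
#include <string.h>

/* Hypothetical sketch: return true if ch occurs in any of the strings
 * passed after it. The argument list must end with a NULL pointer,
 * because va_arg() has no other way to know where the list stops.
 */
bool char_in_any_str_sketch(int ch, ...)
{
    va_list ap;
    const char *str = NULL;
    bool found = false;

    if (ch == '\0') {
        /* strchr() would match every string's terminator */
        return false;
    }

    va_start(ap, ch);
    while ((str = va_arg(ap, const char *)) != NULL) {
        if (strchr(str, ch) != NULL) {
            found = true;
            break;
        }
    }
    va_end(ap);
    return found;
}
```

Callers write the sentinel explicitly, e.g. char_in_any_str_sketch('.', "10", "2.5", (const char *) NULL). The cast keeps the call portable to platforms where a bare NULL is not passed as a pointer in a variadic call.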

"...Similarly, it [g_assert()] must not be used in unit tests, otherwise the unit tests will be ineffective if compiled with G_DISABLE_ASSERT. Use g_assert_true() and related macros in unit tests instead." https://developer.gnome.org/glib/unstable/glib-Testing.html#g-assert Also remove redundant test for pcmk__str_any_of(). Signed-off-by: Reid Wahl <[email protected]>

The Pacemaker Explained doc has long described "integer" as a valid value for the "type" attribute of a rule. The rule schema includes "number" but does not include "integer" in this position. This commit adds "integer" to the schema alongside "number". Signed-off-by: Reid Wahl <[email protected]>

kgaillot

Looks good, just a couple minor comments

The Pacemaker Explained doc has long described "integer" as a valid value for the "type" attribute of a rule. However, libpe_status does not support "integer" as the type attribute for a rule. Yet it parses all "number"-type values as integers. This commit updates libpe_status to accept "integer" as a rule's type attribute and to parse integers as long longs instead of as ints. "number"-type values will now be parsed as doubles. If the rule evaluation **defaults** to a numeric comparison, then the type will be set to "number" if either value contains a decimal point or to "integer" otherwise. If a numeric parse fails, then the values will be compared as strings. Bump CRM_FEATURE_SET to 3.5.0. Signed-off-by: Reid Wahl <[email protected]>
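The default-type behavior described in this commit message can be sketched roughly as below. This is a simplified illustration, not the libpe_status code: all names are hypothetical, it uses strtoll() and strtod() directly rather than Pacemaker's own parsing helpers, and it assumes both values are non-NULL.

```c
#include <errno.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helpers: return 0 only if the whole string parses cleanly */
static int parse_ll_sketch(const char *s, long long *out)
{
    char *end = NULL;

    errno = 0;
    *out = strtoll(s, &end, 10);
    return ((end != s) && (*end == '\0') && (errno == 0)) ? 0 : -1;
}

static int parse_double_sketch(const char *s, double *out)
{
    char *end = NULL;

    errno = 0;
    *out = strtod(s, &end);
    return ((end != s) && (*end == '\0') && (errno == 0)) ? 0 : -1;
}

/* Compare two attribute values when the comparison defaults to numeric.
 * Returns a negative, zero, or positive value, like strcmp().
 */
int compare_default_numeric_sketch(const char *a, const char *b)
{
    if ((strchr(a, '.') != NULL) || (strchr(b, '.') != NULL)) {
        /* Either value has a decimal point: treat as "number" (double) */
        double x = 0.0;
        double y = 0.0;

        if ((parse_double_sketch(a, &x) == 0)
            && (parse_double_sketch(b, &y) == 0)) {
            return (x > y) - (x < y);
        }
    } else {
        /* No decimal point: treat as "integer" (long long) */
        long long x = 0;
        long long y = 0;

        if ((parse_ll_sketch(a, &x) == 0) && (parse_ll_sketch(b, &y) == 0)) {
            return (x > y) - (x < y);
        }
    }

    /* A numeric parse failed: fall back to a string comparison */
    return strcmp(a, b);
}
```

With this sketch, "10" compares greater than "9" as integers even though strcmp() would order them the other way, and "2.5" versus "10" is compared as doubles because one value contains a decimal point.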

nrwahl2 · 2020-08-17T18:36:33Z

I just noticed that I updated cts-scheduler instead of cts-scheduler.in for some of the added regression tests. Going to have to rewrite the list additions and re-push.

Add "number" as an allowed value for the type attribute of a rule expression. Add description of how "integer" and "number" comparisons differ and of how a default comparison type is chosen. Signed-off-by: Reid Wahl <[email protected]>

kgaillot · 2020-08-17T19:38:36Z

Awesome :)

nrwahl2 force-pushed the nrwahl2-rule_int branch 3 times, most recently from b9b48dd to a2a8b02 Compare August 7, 2020 09:40

nrwahl2 changed the title ~~Doc: Pacemaker Explained: Fix rules type spec~~ Fix: libpe_status, xml: Support integer as rule type attribute Aug 7, 2020

nrwahl2 force-pushed the nrwahl2-rule_int branch from a2a8b02 to f3a3599 Compare August 8, 2020 09:34

nrwahl2 force-pushed the nrwahl2-rule_int branch 5 times, most recently from 85a73e5 to ebb493d Compare August 9, 2020 02:26

nrwahl2 force-pushed the nrwahl2-rule_int branch from ebb493d to 0a38898 Compare August 9, 2020 02:36

kgaillot reviewed Aug 10, 2020

nrwahl2 force-pushed the nrwahl2-rule_int branch 2 times, most recently from 36379c3 to 54585bf Compare August 11, 2020 08:19

nrwahl2 added 12 commits August 13, 2020 13:55

Feature: libcrmcommon: Add pcmk__parse_double() function

3582c13

Add new function to parse a string to a double. Modeled after crm_parse_ll(). Signed-off-by: Reid Wahl <[email protected]>

Refactor: libcrmcommon: Use EOVERFLOW in scan_ll()

b49f86a

Instead of ERANGE. Also document return values for scan_ll(). Signed-off-by: Reid Wahl <[email protected]>

Refactor: libcrmcommon: Use pcmk__str_empty() in scan_ll()

c9b003f

Signed-off-by: Reid Wahl <[email protected]>

Refactor: libcrmcommon: Add macros for int/double parse defaults

1112797

Signed-off-by: Reid Wahl <[email protected]>

Test: libcrmcommon: Create unit tests for pcmk__scan_double()

872af88

Signed-off-by: Reid Wahl <[email protected]>

Feature: libcrmcommon: Add pcmk__char_in_any_str()

17d24c2

Calls strchr() to check whether a character is in any of a variable argument list of strings. Signed-off-by: Reid Wahl <[email protected]>

Doc: libcrmcommon: Add missing doxygen '!' tag to comment

f208351

For function pcmk_numeric_strcasecmp(). Signed-off-by: Reid Wahl <[email protected]>

Test: libcrmcommon: Create unit tests for pcmk__char_in_any_str()

41df12e

Signed-off-by: Reid Wahl <[email protected]>

Low: xml: Clone 3.4 schema in preparation for changes

bd45176

Will be adding support for "integer" as rule "type" attribute. Signed-off-by: Reid Wahl <[email protected]>

Test: cts-cli: Update expected output for schema changes

6391ce6

Signed-off-by: Reid Wahl <[email protected]>

nrwahl2 force-pushed the nrwahl2-rule_int branch 2 times, most recently from 79573e8 to bfcf5bd Compare August 13, 2020 21:23

Refactor: libpe_status: Break pe__eval_attr_expr() into helpers

5bd67bf

For readability and to prep for next changes. Signed-off-by: Reid Wahl <[email protected]>

nrwahl2 force-pushed the nrwahl2-rule_int branch from bfcf5bd to 746d391 Compare August 14, 2020 08:22

kgaillot reviewed Aug 17, 2020

nrwahl2 added 2 commits August 17, 2020 11:29

Feature: libpe_status: Error-check and expand range of type="number"

1130aac

For node attribute expressions in a rule. Use long long instead of int, and default to string if numbers fail to parse as integers. Signed-off-by: Reid Wahl <[email protected]>

nrwahl2 force-pushed the nrwahl2-rule_int branch from 746d391 to fb87a88 Compare August 17, 2020 18:32

nrwahl2 force-pushed the nrwahl2-rule_int branch from fb87a88 to 8e44d57 Compare August 17, 2020 18:40

nrwahl2 added 2 commits August 17, 2020 11:41

Test: cts-scheduler: Add regression tests for rule integer/number type

555ebbb

New regression tests for commit f3a3599 Signed-off-by: Reid Wahl <[email protected]>

nrwahl2 force-pushed the nrwahl2-rule_int branch from 8e44d57 to 39d78f3 Compare August 17, 2020 18:42

kgaillot merged commit 77c78c2 into ClusterLabs:master Aug 17, 2020

nrwahl2 deleted the nrwahl2-rule_int branch October 10, 2023 07:16