-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes to accompany issues found in OSG modifications #84
Conversation
…o NULL instead of 0
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Otherwise, LGTM.
tools/dev/verify_table_data.php
Outdated
LEFT OUTER JOIN $destTableName dest ON (" . join(' AND ', $constraints) . ")" | ||
. ( 0 != count($where) ? "\nWHERE " . implode(' AND ', $where) : "" ) | ||
LEFT OUTER JOIN $destTableName dest ON (" . join("\nAND ", $constraints) . ")" | ||
. ( 0 != count($where) ? "\nWHERE " . implode("\nAND", $where) : "" ) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The lack of whitespace after AND
could cause problems.
tools/dev/verify_table_data.php
Outdated
@@ -541,7 +541,7 @@ function usage_and_exit($msg = null) | |||
-n, --num-missing-rows <number_of_rows> | |||
Display this number of missing rows. If not specified, all missing rows are displayed. | |||
|
|||
-r, --round-column <column>[=<digits>] | |||
-r, --round-column <column>[,<digits>] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The indentation of this line is a little funky.
Table Verifier: * Added --ignore-column-count to only compare columns that exist in the source table * Add ability to round columns before comparing * Added ability to ignore column types in comparison ETLv2: * Improved log formatting * Catch error when FROM table does not exist yet * Change squared colum type to decimal(36,4) and default waitduration to NULL instead of 0 in HPC Jobs aggregator
Changes to accompany issues found in OSG modifications
See ubccr/xdmod-xsede#23
Description
The aggregation query for job records and tasks now uses a datatype of
DECIMAL(36,4)
rather thanDOUBLE
for thesum_wallduration_squared
,sum_waitduration_squared
,sum_cpu_time_squared
,sum_local_charge_xdsu_squared
, andsum_node_time_squared
columns. The waitduration has been defaulted toNULL
to support cases when we do not have this information (e.g., OSG).Several enhancements to the table comparison tool were made:
double
todecimal(m,n)
Motivation and Context
When comparing new data as the result of a change to baseline data it is helpful to use the
DECIMAL(M,N)
data type rather thanDOUBLE
which may use approximate representation of floating point numbers. This makes data verification much easier.Tests performed
The XSEDE job pipeline was run before and after changes to the aggregation query. Running the table verificaiton tool, data was identical with the exception of cases where the baseline used an approximate representation of the data (e.g.,
4.1038701667445064e16
). In these cases the squared values were off by up to .00000000000000017%Types of changes
Checklist: