You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
However, this resulted in performance degradation as too many objects are created. We noticed that this is problematic while re-ingesting a big dataset with over 10 years of data with lots of String dimensions.
We are working on a fix to coerce value to Java string only of sketch columns.
The text was updated successfully, but these errors were encountered:
In recent release of the library (1.0.5), we introduced a change to read value as Java String type for all String columns. This was introduced as Spark's internal UTF8String is not compatible with DataSketches.
However, this resulted in performance degradation as too many objects are created. We noticed that this is problematic while re-ingesting a big dataset with over 10 years of data with lots of String dimensions.
We are working on a fix to coerce value to Java string only of sketch columns.
The text was updated successfully, but these errors were encountered: