Data API equivalent for HoloMaps #347

jlstevens · 2015-12-11T00:27:23Z

We've had the recent Data API PR merged for Chart elements and this issue suggests a similar thing could be implemented for HoloMaps and NdMappings.

The API is not quite the same as for charts as HoloMap is designed to be dictionary like. However, instead of always using ordered dictionaries as .data, you can imagine using pandas DataFrames instead. This could greatly improve the speed of certain operations (such as groupby) involving holomaps.

The text was updated successfully, but these errors were encountered:

philippjfr · 2015-12-11T01:08:00Z

Note that the slow speed of groupby is the major motivator here. I've just come up with an alternative implementation for the NdMapping groupby that temporarily converts to a DataFrame. Here's the performance profiling for a three-dimensional HoloMap grouped by two of the dimensions with both implementations:

I think until we find a way around this we should allow HoloMap to use this implementation when pandas is available.

philippjfr · 2016-01-04T17:49:02Z

Just some quick thoughts on this. Fundamentally the distinction between NdMapping and Columns types is in the way the data is indexed. NdMapping types are good for providing multi-dimensional indexing for dense chunks of data, e.g. Elements. Elements on the other hand hold dense chunks of data directly, whether it is columns or dense 2d arrays. Implementing a separate API for NdMapping types that can work using either the current OrderedDict based implementation or using pandas MultiIndexes therefore would make sense.

This second index based baseclass would also be useful for more powerful composite Element types, i.e. you could have a new Element baseclass where the data maps between multi dimensional keys and values that match the Columns data format, e.g. for a list of polygons for each country the data format would be {'Australia': {'x': xs, 'y': ys}, 'Austria': ...}, where {'x': xs, 'y': ys} is a valid definition for an individual Curve. This would be very similar to what NdOverlay provides but without the need to nest the data and provide a much more optimized alternative for storing and plotting collections of artists.

This example from the pandas docs should make it clear what the data format is:

                     A         B         C
first second                              
bar   one     0.895717  0.410835 -1.413681
      two     0.805244  0.813850  1.607920
baz   one    -1.206412  0.132003  1.024180
      two     2.565646 -0.827317  0.569605
foo   one     1.431256 -0.076467  0.875906
      two     1.340309 -1.187678 -2.211372
qux   one    -1.170299  1.130127  0.974466
      two    -0.226169 -1.436737 -2.006747

Here first and second would be the dimensions mapping to the NdMapping style n-dimensional key and A, B and C the dimensions of the columns based format of the values in this new Element type. If we added support for this then we could leverage the general work to improve NdMapping proposed as part of this issue with the work on the data API to get a high-performance Element type for collections of data without some of the hacky workarounds (e.g. padding with delimiters) we discussed to improve the Paths and Polygons types.

philippjfr · 2016-10-25T21:43:38Z

I no longer think this is required now that I've optimized groupby with pandas and we are building out DynamicMap support, so I'm going to close this issue.

github-actions · 2024-10-25T04:30:02Z

This issue has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

jlstevens added the type: feature A major new feature label Dec 11, 2015

philippjfr mentioned this issue Dec 11, 2015

Speed up NdMapping.groupby #349

Merged

philippjfr mentioned this issue Jan 4, 2016

New GeoMap Element for plotting geographical maps #392

Closed

philippjfr mentioned this issue Jan 12, 2016

Doc cleanup #401

Merged

philippjfr mentioned this issue Jan 22, 2016

Redesigning Paths and Polygons #416

Closed

philippjfr added this to the v1.5.0 milestone Jan 26, 2016

philippjfr modified the milestones: v1.6.0, v1.5.0 Apr 20, 2016

philippjfr closed this as completed Oct 25, 2016

github-actions bot locked as resolved and limited conversation to collaborators Oct 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data API equivalent for HoloMaps #347

Data API equivalent for HoloMaps #347

jlstevens commented Dec 11, 2015

philippjfr commented Dec 11, 2015

philippjfr commented Jan 4, 2016

philippjfr commented Oct 25, 2016

github-actions bot commented Oct 25, 2024

Data API equivalent for HoloMaps #347

Data API equivalent for HoloMaps #347

Comments

jlstevens commented Dec 11, 2015

philippjfr commented Dec 11, 2015

philippjfr commented Jan 4, 2016

philippjfr commented Oct 25, 2016

github-actions bot commented Oct 25, 2024