Allow Users to Customize Aggregation #206

xehu · 2024-04-23T18:45:05Z

Currently, the system automatically "aggregates" features generated about a single chat/message to the conversation and user levels --- calculating various summary statistics for the features (mean, median, max, min, std):

https://github.com/Watts-Lab/team-process-map/blob/main/feature_engine/utils/calculate_conversation_level_features.py

However, aggregating by everything yields thousands of features --- this is way too many! Instead, we should make it possible for the user to specify what they want: for example, maybe they are only interested in the mean function (not mean, median, max, min, AND std...).

There are some design decisions here, but they are relatively simple ones; we simply need to think about how we want the user to specify which aggregations they want. Specifically, we want to think about:

Which levels of aggregation does the user want? (Conversation and User are the options)
Which columns (at the chat level) do they want aggregated?
Which functions do they want to aggregate with (e.g., mean, std...)

Accordingly, we will want to think through the way the user should specify these desires. Here is an example:

  aggregation:
    methods: ["mean", "std"]
    columns: ["column1", "column2"]

There should also be an option to say they want no aggregations at all.

Getting Started

Modify the FeatureBuilder constructor to have the user pass in parameters for whether they want conversation- and user-level aggregations at all; and if so, which aggregations they want to have (which columns, which methods).
Follow the logic through in the utilities where the aggregations take place.

Conversation-level aggregations: https://github.com/Watts-Lab/team-process-map/blob/main/feature_engine/utils/calculate_conversation_level_features.py
User-level aggregations: https://github.com/Watts-Lab/team-process-map/blob/main/feature_engine/utils/calculate_user_level_features.py

The text was updated successfully, but these errors were encountered:

xehu added the enhancement New feature or request label Apr 23, 2024

xehu added this to the Release V1 of Team Process Mapping Package milestone Apr 23, 2024

xehu mentioned this issue Apr 23, 2024

Create More User-Friendly examples/ #208

Open

xehu linked a pull request Aug 7, 2024 that will close this issue

Amy/package aggregation #272

Closed

25 tasks

xehu modified the milestones: Release V1 of Team Process Mapping Package, Improve Package Functionality/Usability Aug 15, 2024

xehu assigned amytangzheng Aug 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow Users to Customize Aggregation #206

Allow Users to Customize Aggregation #206

xehu commented Apr 23, 2024 •

edited

Loading

Allow Users to Customize Aggregation #206

Allow Users to Customize Aggregation #206

Comments

xehu commented Apr 23, 2024 • edited Loading

Getting Started

xehu commented Apr 23, 2024 •

edited

Loading