You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, the system automatically "aggregates" features generated about a single chat/message to the conversation and user levels --- calculating various summary statistics for the features (mean, median, max, min, std):
However, aggregating by everything yields thousands of features --- this is way too many! Instead, we should make it possible for the user to specify what they want: for example, maybe they are only interested in the mean function (not mean, median, max, min, AND std...).
There are some design decisions here, but they are relatively simple ones; we simply need to think about how we want the user to specify which aggregations they want. Specifically, we want to think about:
Which levels of aggregation does the user want? (Conversation and User are the options)
Which columns (at the chat level) do they want aggregated?
Which functions do they want to aggregate with (e.g., mean, std...)
Accordingly, we will want to think through the way the user should specify these desires. Here is an example:
There should also be an option to say they want no aggregations at all.
Getting Started
Modify the FeatureBuilder constructor to have the user pass in parameters for whether they want conversation- and user-level aggregations at all; and if so, which aggregations they want to have (which columns, which methods).
Follow the logic through in the utilities where the aggregations take place.
Currently, the system automatically "aggregates" features generated about a single chat/message to the
conversation
anduser
levels --- calculating various summary statistics for the features (mean, median, max, min, std):https://github.com/Watts-Lab/team-process-map/blob/main/feature_engine/utils/calculate_conversation_level_features.py
However, aggregating by everything yields thousands of features --- this is way too many! Instead, we should make it possible for the user to specify what they want: for example, maybe they are only interested in the mean function (not mean, median, max, min, AND std...).
There are some design decisions here, but they are relatively simple ones; we simply need to think about how we want the user to specify which aggregations they want. Specifically, we want to think about:
Accordingly, we will want to think through the way the user should specify these desires. Here is an example:
There should also be an option to say they want no aggregations at all.
Getting Started
The text was updated successfully, but these errors were encountered: