Problem statement
Customer GWSC wants to group the data by entity (groupBy) and check whether a column's values fall within, say, 3 standard deviations of the mean for that entity's group.
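To make the request concrete, here is a minimal, library-free sketch of how per-group bounds of this kind could be derived. It uses only the Python standard library rather than Spark, and the names `group_bounds`, `entity`, and `reading` are hypothetical, chosen for illustration only. It also assumes the sample standard deviation is the one wanted.

```python
from collections import defaultdict
from statistics import mean, stdev


def group_bounds(rows, group_key, value_key, k=3.0):
    """Return {group: (lower, upper)} at mean +/- k sample standard deviations.

    Hypothetical helper for illustration; not part of any existing library.
    """
    values = defaultdict(list)
    for row in rows:
        values[row[group_key]].append(row[value_key])

    bounds = {}
    for group, vals in values.items():
        if len(vals) < 2:
            # The sample standard deviation is undefined for one observation,
            # so this sketch simply skips such groups.
            continue
        m, s = mean(vals), stdev(vals)
        bounds[group] = (m - k * s, m + k * s)
    return bounds


rows = [
    {"entity": "a", "reading": 10.0},
    {"entity": "a", "reading": 12.0},
    {"entity": "a", "reading": 14.0},
    {"entity": "b", "reading": 100.0},
    {"entity": "b", "reading": 100.0},
]
bounds = group_bounds(rows, "entity", "reading", k=3.0)
```

In Spark the same shape would be a groupBy aggregation producing one row of limits per entity, which is the table of ranges the rest of this request refers to.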
Proposed Solution
Modify is_in_range() to accept dataframe columns for min_limit and max_limit. I could then join a dataframe of pre-computed, group-specific ranges onto the data and pass the names of the limit columns to is_in_range(). Ranges to support include multiples of the standard deviation, the interquartile range, and the mean absolute deviation.
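The proposed behaviour can be sketched in plain Python as follows. This is only an illustration of the semantics being asked for, not the library's implementation: `join_ranges` and `in_range_by_columns` are hypothetical names, the bounds are assumed to be inclusive, and returning `None` for rows without limits is one possible choice among several.

```python
def join_ranges(rows, ranges, key):
    """Left-join pre-computed per-group limits onto each data row.

    Hypothetical helper; rows whose group has no entry in `ranges`
    are kept without limit columns.
    """
    by_key = {r[key]: r for r in ranges}
    return [{**by_key.get(row[key], {}), **row} for row in rows]


def in_range_by_columns(row, value_col, min_limit_col, max_limit_col):
    """Check a value against limits read from other columns of the same row.

    Assumption: bounds are inclusive, and a missing value or limit yields
    None ("cannot evaluate") rather than a pass or a failure.
    """
    value = row.get(value_col)
    low = row.get(min_limit_col)
    high = row.get(max_limit_col)
    if value is None or low is None or high is None:
        return None
    return low <= value <= high


# Pre-computed, group-specific ranges (for example mean +/- 3 standard deviations).
ranges = [
    {"entity": "a", "min_limit": 6.0, "max_limit": 18.0},
    {"entity": "b", "min_limit": 95.0, "max_limit": 105.0},
]
rows = [
    {"entity": "a", "reading": 12.0},
    {"entity": "a", "reading": 25.0},
    {"entity": "b", "reading": 25.0},
    {"entity": "c", "reading": 1.0},  # no range available for this entity
]

joined = join_ranges(rows, ranges, "entity")
results = [
    in_range_by_columns(r, "reading", "min_limit", "max_limit") for r in joined
]
```

The point of the sketch is that the same reading (25.0) is judged against a different range depending on its entity, which a single literal min/max limit cannot express.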
Additional Context
This feature may be included in [FEATURE]: Data set level rules #43, but I want to make sure my customer's need is addressed.
harlankad-db changed the title from "[FEATURE]: is_(not)_in_range suppors min/max_limit from the dataframe" to "[FEATURE]: is_(not)_in_range supports min/max_limit from the dataframe" on Jan 14, 2025