keep only the unique times series out of 1000 series #460
Replies: 2 comments 3 replies
-
The first thing that comes to mind is using It may not be super fast but it should help |
Beta Was this translation helpful? Give feedback.
-
If the individual time series are
then you can cluster them like any other n-dimensional object. The Paper "Matrix Profile XV: Exploiting Time Series Consensus Motifs to Find Structure in Time Series Sets" does closely match your problem. But mind the problem of outlier time series for the Ostinato algorithm. But you can start simpler, eg by concatenating your list of time series and insert a NaN entry between each time series. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I feel like I could either calculate whole matrix profile or using the stump.match to solve the problem, but not exactly sure. Hope you can share some thoughts, many thanks!
Problem: : I have a long list of time series say 10000 of them, some of them are highly correlated, and some are relatively unique, is there a way to utilize Stumpy to filter out the highly correlated ones and only keep relative unique ones, sort of like pick only 1 series from a cluster. so maybe with a threshold I have only 200 out of 1000 time series left. (In consensus motif example, meaning maybe I only want to keep Python & Alpaca out of the 4 series.)
Again, any idea is welcome thanks a lot!
Beta Was this translation helpful? Give feedback.
All reactions