Replies: 20 comments 2 replies
-
I remembered what I had in mind yesterday. It wasn't about precision, because honestly I have no idea about error propagation, but it was because i looked into numbas So this Now the I want to talk about are called
So this means, that llvm assumes that the input contains neither I'm not sure if this is expected behavior and is actually the same on all platforms. I think this could change in any numba version, this we shouldn't expect the result beeing So what would I suggest? Actually, how we might overcome this is simple. Either, we do it by storing a boolean array of valid subsequences, but this would require quite some changes in the codebase. Alternatively, instead o settin the means of not valid subsequences to Actually, now that I'm thinking about it, the matrix profile can have Inf values, so maybe we should disable the What do you think? |
Beta Was this translation helpful? Give feedback.
-
@mexxexx Thank you for these thoughts! I had encountered this
Notice that the computation of the mean and stddev come AFTER we set all illegal values to zero so all of the sliding window means and stddev should all be finite numbers. I really like that the key information is stored in And then, later, you can compute the distance and then decide whether you want to update the matrix profile:
This all feels very natural
I wonder if we can simply initialize the matrix profile values and fill it with |
Beta Was this translation helpful? Give feedback.
-
@mexxexx When we do
and not:
|
Beta Was this translation helpful? Give feedback.
-
Sorry for the late reply!
That's a good idea. I think that the normalized euclidean distance is bounded by And no, I cannot recall why we did it this way, did you find it out? |
Beta Was this translation helpful? Give feedback.
-
For your information, the Pearson correlation version of STUMPY has now been implemented into |
Beta Was this translation helpful? Give feedback.
-
That's great to hear! |
Beta Was this translation helpful? Give feedback.
-
I also removed those poorly conceived/written Additionally, I added a |
Beta Was this translation helpful? Give feedback.
-
Also, I added |
Beta Was this translation helpful? Give feedback.
-
Good to hear, I'm pretty sure that will benefit some people. |
Beta Was this translation helpful? Give feedback.
-
That's great! Congratulations on wrapping up this current chapter of your journey and welcome back to STUMPY! We've missed you! 😃 |
Beta Was this translation helpful? Give feedback.
-
@mexxexx Did you get a chance to look at the MDL comments . I was wondering whether you had any thoughts? If I understood it correctly, MDL is a post-processing step that doesn't seem to be too well developed. That is, it feels like one possible solution for choosing the best number of dimensions but it's still an educated guess? Having said that, I don't have a better approach than what is proposed. Although, there is a difference in how one would do MDL for z-normalized distances and non-normalized distances. The former discretizes at the subsequence level while the latter discretizes at the global level (which makes sense). |
Beta Was this translation helpful? Give feedback.
-
@seanlaw Yes I read the comments, but didn't reply since I couldn't add anything useful 😄 I also understand it as a best guess. Honestly, for the moment I don't think it's a huge requirement for stumpy, especially since it requires having a working |
Beta Was this translation helpful? Give feedback.
-
Ahh yes, I was just looking up the subspace issue and trying to remember if subspaces depended on MDL and I didn't see any comments on it that related the two. It sounds like MDL depends on subspaces or that if you compute MDL then you get ranked subspaces as a by-product? |
Beta Was this translation helpful? Give feedback.
-
If I remember correctly, you use MDL to find out how many dimensions your motif actually has. Let's say your time series has three dimensions. Then you could have two interesting dimensions and one being only noise, so MDL should yield that the motif has two dimensions. However, to be able to compute the MDL, you (and this is a guess, but it appears to be crucial information) need to know which of the dimensions form you one-, two- and- three-dimensional motif, otherwise, how would you even know what to compare? And this is exactly the information encoded in the subspace. |
Beta Was this translation helpful? Give feedback.
-
Okay, I'll have to go back and look at this more closely with your points in mind. Thank you and have a great weekend! |
Beta Was this translation helpful? Give feedback.
-
Thank you, you too! Feel free to ask if you have questions. I didn't fully understand the MDL, but discussion always helps in my experience. |
Beta Was this translation helpful? Give feedback.
-
I've been thinking, for |
Beta Was this translation helpful? Give feedback.
-
Never mind. I thought through it and remembered that it's not that simple! |
Beta Was this translation helpful? Give feedback.
-
@mihailescum I just released v1.8.0 and now all of main functions have a |
Beta Was this translation helpful? Give feedback.
-
@mihailescum I think I figured out Minimum Description Length! Going to push a commit in the next few days |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
@mexxexx Carrying on our earlier conversations here
Beta Was this translation helpful? Give feedback.
All reactions