Distributed error handling #40

viccon · 2025-03-23T08:47:09Z

Overview

This PR should resolve #38 by ensuring that either the error from the fetch function is returned when no distributed records are found, or a combination of ErrOnlyCachedRecords and the fetch function error when part of a batch was retrievable from the distributed storage.

Note: In this PR, I've branched from #36 as they are both related to error handling, and I'll most likely include them in the same release.

viccon · 2025-03-23T10:56:54Z

distribution.go

+		// Before we call the fetchFn, we'll do an unblocking read to see if the
+		// context has been cancelled. If it has, we'll return a stale value if we
+		// have one available.
+		select {


One could argue that it’s simpler to just perform an error check here, e.g. if ctx.Err() != nil, but I just find this approach more explicit. To me it's very clear that this code is trying to determine whether the work should proceed or not.

ernstwi

Nice, looks good to me! 👍

ernstwi · 2025-03-27T13:55:36Z

refresh.go

@@ -7,7 +7,7 @@ import (

 func (c *Client[T]) refresh(key string, fetchFn FetchFn[T]) {
 	response, err := fetchFn(context.Background())
-	if err != nil {
+	if err != nil && !errors.Is(err, errOnlyDistributedRecords) {


This change: Don't update the memory cache if the underlying fetch failed.

So this makes refresh match the behaviour of getFetch/getFetchBatch – only update the cache if the fetchFn succeeds.

If we're getting an errOnlyDistributedRecords error here, it means that we're using distributed storage with early refreshes. When we attempt to retrieve the record from the underlying data source and that call fails, we write the "old" value from the distributed cache to the in-memory cache. This allows us to serve stale data if an upstream system goes down.

If we don't write it to the in-memory cache, we will keep hammering both the distributed storage and the underlying data source. Note that while I use the term stale, the data will never be older than the TTL of the distributed cache. For example, if we're using a distributed storage with a 10-minute TTL and 1-minute refresh times, we would write values that are between 1 and 10 minutes old

ernstwi · 2025-03-27T14:50:16Z

distribution_test.go

 	fetchObserver.Err(errors.New("error"))
 	res, err := sturdyc.GetOrFetch(ctx, c, key, fetchObserver.Fetch)
-	if err != nil {
-		t.Fatalf("expected no error, got %v", err)
+	if !errors.Is(err, sturdyc.ErrOnlyCachedRecords) {


Could also add an errors.Is check for the wrapped fetchFn error here, right?

Do you mean like this?

if !errors.Is(err, fetchObserver.err) { t.Fatal("expected the original error to have been joined with ErrOnlyCachedRecords") }

E.g assert that you're getting a combination of the two errors? I think that is a good call to make both assertions.

ernstwi · 2025-03-27T14:53:39Z

distribution_test.go

+	}
+
+	fetchObserver := NewFetchObserver(11)
+	fetchObserver.err = context.Canceled


Could use fetchObserver.Err here for consistency :)

Hmm what do you mean?

Just that you have a helper to set err on FetchObserver that you use in some of the tests but not all, e.g.: fetchObserver.Err(sturdyc.ErrNotFound)

No big deal.

distribution_test.go

viccon · 2025-03-28T07:11:14Z

Nice, looks good to me! 👍

Great! My plan is to merge this along with the changes in #36 and then create a new release with this improved error handling

ernstwi · 2025-03-28T11:13:09Z

Nice, looks good to me! 👍

Great! My plan is to merge this along with the changes in #36 and then create a new release with this improved error handling

Sounds good. I think the team is planning to deploy this PR to prod today. Check with @perhells for details (I'm away next week)

davidulander · 2025-03-31T07:43:17Z

Nice, looks good to me! 👍

Great! My plan is to merge this along with the changes in #36 and then create a new release with this improved error handling

Sounds good. I think the team is planning to deploy this PR to prod today. Check with @perhells for details (I'm away next week)

I'll test this PR today :)

viccon · 2025-03-31T15:47:00Z

davidulander

Great, let me know how it went!

davidulander · 2025-04-01T12:32:33Z

davidulander

Great, let me know how it went!

I feel I lack the context to be of much help here 😅 Per is sick and Ernst is on vacation, think the best is to wait until one of them is back :) Is that ok by you @viccon ?

viccon self-assigned this Mar 23, 2025

viccon mentioned this pull request Mar 23, 2025

Canceled Context with go-redis results in ErrOnlyCachedRecords #38

Closed

viccon commented Mar 23, 2025

View reviewed changes

ernstwi approved these changes Mar 27, 2025

View reviewed changes

viccon mentioned this pull request Mar 31, 2025

fix: don't unwrap when not found error is thrown #41

Closed

Reworked and improved the error handling

34b5ab5

viccon force-pushed the additional-error-handling branch from 43a5026 to 34b5ab5 Compare April 4, 2025 19:24

viccon changed the base branch from error-handling to main April 4, 2025 19:24

viccon mentioned this pull request Apr 4, 2025

Return the actual error instead of ErrInvalidType #36

Closed

viccon merged commit 97fc006 into main Apr 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Distributed error handling #40

Distributed error handling #40

viccon commented Mar 23, 2025

viccon Mar 23, 2025

ernstwi left a comment

ernstwi Mar 27, 2025

viccon Mar 28, 2025

ernstwi Mar 27, 2025

viccon Mar 28, 2025

ernstwi Apr 7, 2025

ernstwi Mar 27, 2025

viccon Mar 28, 2025

ernstwi Apr 7, 2025

viccon commented Mar 28, 2025

ernstwi commented Mar 28, 2025

davidulander commented Mar 31, 2025

viccon commented Mar 31, 2025

davidulander commented Apr 1, 2025

Distributed error handling #40

Distributed error handling #40

Conversation

viccon commented Mar 23, 2025

Overview

Choose a reason for hiding this comment

ernstwi left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

viccon commented Mar 28, 2025

ernstwi commented Mar 28, 2025

davidulander commented Mar 31, 2025

viccon commented Mar 31, 2025

davidulander commented Apr 1, 2025