Skip to content

Commit 9c227e0

Browse files
authored
Create cmd_latency.json
Create new monitor template for REDIS
1 parent c0872c0 commit 9c227e0

File tree

1 file changed

+36
-0
lines changed

1 file changed

+36
-0
lines changed
Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,36 @@
1+
{
2+
"version": 2,
3+
"created_at": "2025-05-27",
4+
"last_updated_at": "2025-05-27",
5+
"title": "Command latency is high",
6+
"description": "Redis is built for speed, and command latency is a key performance indicator. This monitor tracks the 99th percentile of command latency to catch slowdowns early, helping prevent cascading delays across your systems.",
7+
"definition": {
8+
"id": 173213351,
9+
"name": "High Redis command latency (p99)",
10+
"type": "query alert",
11+
"query": "avg(last_5m):avg:redis.net.latency_ms.p99{integration:redis} > 20",
12+
"message": "## 🚨 What’s happening\n\nHigh Redis command latency detected (p99 > 20ms for 5 minutes).\n\nRedis is experiencing elevated command latency, which suggests that operations are not responding within expected thresholds. This could be caused by internal contention, blocked commands, slow clients, or downstream pressure from connected services.\n\n---\n\n## 📈 Impact\n\nIncreased command latency can lead to:\n\n- Slower application performance and timeouts\n- Delayed cache reads/writes\n- Poor user experience in latency-sensitive applications\n- Potential cascading effects on dependent systems\n\n---\n\n## 🛠️ Runbook\n\n### Initial Troubleshooting Steps\n\n1. **Identify the affected Redis node**.\n2. Go to [**Redis integration metrics**](https://app.datadoghq.com/monitors/manage?filter=redis) in Datadog.\n3. Review these metrics:\n - `redis.net.latency_ms.p99`\n - `redis.commands.per_sec`\n - `redis.clients.blocked`\n - Host-level CPU/memory/disk metrics\n4. Check for slow logs or blocked clients.\n5. Ensure no network congestion or saturation between Redis and calling services.\n\n---\n\n### Cause and Resolution\n\nCause | Resolution\n------|-----------\nCommand backlog or slow queries | Investigate slow logs and blocked clients.\nHigh memory or CPU pressure | Scale the node or optimize Redis configuration.\nNetwork degradation | Check latency and packet loss metrics.\nMisbehaving client | Identify traffic spike source or connection issues.\n\n---\n\n### 👥 Who should be notified?\n\nPlease route to the appropriate team: \n`@slack-yourteam-alerts`\n",
13+
"tags": [
14+
"integration:redis"
15+
],
16+
"options": {
17+
"thresholds": {
18+
"critical": 20
19+
},
20+
"notify_audit": false,
21+
"evaluation_delay": 300,
22+
"require_full_window": true,
23+
"include_tags": true,
24+
"silenced": {},
25+
"avalanche_window": 20,
26+
"on_missing_data": "default"
27+
},
28+
"priority": null,
29+
"restriction_policy": {
30+
"bindings": []
31+
}
32+
},
33+
"tags": [
34+
"integration:redisdb"
35+
]
36+
}

0 commit comments

Comments
 (0)