# KubeSage

KubeSage is an AI-driven Kubernetes troubleshooting assistant that integrates LangChain Agents with Kubernetes APIs. It provides real-time diagnostics, resource monitoring, and troubleshooting recommendations for Kubernetes clusters using OpenAI's GPT-4o.
- Features
- Installation
- Usage
- Available Tools
- Architecture
- WebSocket Integration
- Troubleshooting
- License
## Features

✅ AI-powered Kubernetes troubleshooting
✅ LangChain Agents for intelligent decision-making
✅ Real-time Kubernetes monitoring (Pods, Deployments, Services, Nodes)
✅ Deep dive diagnostics (logs, resource usage, RBAC, etc.)
✅ WebSocket interface for interactive debugging
✅ Secure authentication using OpenAI API keys
## Installation

```bash
git clone https://github.com/your-username/KubeSage.git
cd KubeSage
conda create -n kube-sage python=3.9 -y
conda activate kube-sage
pip install -r requirements.txt
```
Ensure your Kubernetes cluster is accessible:

- Inside Cluster: automatically loads service account credentials.
- Outside Cluster: set up `KUBECONFIG`:

```bash
export KUBECONFIG=$HOME/.kube/config
```
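For reference, here is a minimal sketch of how this dual-mode configuration is commonly handled with the official `kubernetes` Python client (in-cluster credentials first, falling back to `KUBECONFIG`); the exact loading logic in `src/main.py` may differ.

```python
from kubernetes import client, config

def load_k8s_api() -> client.CoreV1Api:
    """Load in-cluster service account credentials when running inside a pod,
    otherwise fall back to the local kubeconfig (KUBECONFIG or ~/.kube/config)."""
    try:
        # Only succeeds inside a cluster with a mounted service account token
        config.load_incluster_config()
    except config.ConfigException:
        # Outside the cluster: honors KUBECONFIG, defaults to ~/.kube/config
        config.load_kube_config()
    return client.CoreV1Api()

if __name__ == "__main__":
    v1 = load_k8s_api()
    print(f"Namespaces visible: {len(v1.list_namespace().items)}")
```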
## Usage

Start the KubeSage server:

```bash
python src/main.py
```
Use `wscat` or any WebSocket client:

```bash
wscat -c ws://localhost:6000/ws
```

You can then start chatting with the AI assistant.
## Available Tools

### Cluster Monitoring

| Tool | Description |
|---|---|
| Get All Pods with Resource Usage | Lists all pods with CPU & memory usage. |
| Get All Services | Lists all services and their types/ports. |
| Get All Deployments | Fetches deployment details. |
| Get All Nodes | Lists nodes with health & capacity. |
| Get Cluster Events | Shows recent warnings & failures. |
| Get Namespace List | Fetches all Kubernetes namespaces. |
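As a rough illustration of what a tool like Get All Pods with Resource Usage does under the hood, the sketch below joins the pod list from the core API with CPU/memory figures from the metrics API. It assumes the official `kubernetes` Python client and a running metrics-server; it is not the project's actual implementation.

```python
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() inside a pod

def pods_with_resource_usage(namespace: str = "default") -> list[dict]:
    """List pods together with the CPU/memory usage reported by metrics-server."""
    core = client.CoreV1Api()
    metrics = client.CustomObjectsApi()

    pods = core.list_namespaced_pod(namespace).items
    pod_metrics = metrics.list_namespaced_custom_object(
        group="metrics.k8s.io", version="v1beta1",
        namespace=namespace, plural="pods",
    )
    usage_by_pod = {m["metadata"]["name"]: m["containers"] for m in pod_metrics["items"]}

    return [
        {
            "pod": pod.metadata.name,
            "phase": pod.status.phase,
            "containers": usage_by_pod.get(pod.metadata.name, []),
        }
        for pod in pods
    ]

if __name__ == "__main__":
    for row in pods_with_resource_usage():
        print(row)
```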
### Deep Dive Diagnostics

| Tool | Description |
|---|---|
| Describe Pod with Restart Count | Fetches pod details + restart count. |
| Get Pod Logs | Retrieves the last 10 log lines for a pod. |
| Describe Service | Gets details of a Kubernetes service. |
| Describe Deployment | Fetches deployment details (replica count, images). |
| Check RBAC Events & Role Bindings | Analyzes security permissions. |
| Get Ingress Resources | Lists ingress rules, hosts & annotations. |
| Check Pod Affinity & Anti-Affinity | Analyzes scheduling constraints. |
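To give an idea of how such tools can be exposed to the agent, here is a hedged sketch of a Get Pod Logs tool built with LangChain's `@tool` decorator and the Kubernetes client; the function name, input format, and decorator-based registration are illustrative assumptions rather than the project's actual code.

```python
from kubernetes import client, config
from langchain_core.tools import tool

config.load_kube_config()  # or config.load_incluster_config() inside a pod

@tool
def get_pod_logs(pod_and_namespace: str) -> str:
    """Retrieve the last 10 log lines for a pod. Input: '<pod-name> <namespace>'."""
    pod_name, namespace = pod_and_namespace.split()
    v1 = client.CoreV1Api()
    return v1.read_namespaced_pod_log(name=pod_name, namespace=namespace, tail_lines=10)
```

The docstring doubles as the tool description, which is what the agent uses to decide when to call it.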
## Architecture

*(Replace with the actual architecture diagram if available.)*
1️⃣ FastAPI WebSocket Server - Handles real-time interactions.
2️⃣ LangChain Agent - Uses OpenAI GPT-4o to select appropriate tools.
3️⃣ Kubernetes API Client - Fetches cluster insights and diagnostics.
4️⃣ RBAC & Authentication - Secures access to cluster resources.
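The sketch below shows one plausible way these pieces fit together: a FastAPI WebSocket endpoint that forwards each incoming message to a tool-calling LangChain agent backed by GPT-4o. The endpoint path and port match the examples in this README, but the tool list, prompt, and agent setup are illustrative assumptions, not the actual contents of `src/main.py`.

```python
from fastapi import FastAPI, WebSocket
from kubernetes import client, config
from langchain.agents import AgentExecutor, create_tool_calling_agent
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_core.tools import tool
from langchain_openai import ChatOpenAI

config.load_kube_config()  # or config.load_incluster_config() inside a pod

@tool
def get_namespace_list(query: str = "") -> str:
    """Fetch all Kubernetes namespaces."""
    v1 = client.CoreV1Api()
    return ", ".join(ns.metadata.name for ns in v1.list_namespace().items)

tools = [get_namespace_list]  # the real agent registers many more tools

llm = ChatOpenAI(model="gpt-4o", temperature=0)  # reads OPENAI_API_KEY from the environment
prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a Kubernetes troubleshooting assistant."),
    ("human", "{input}"),
    MessagesPlaceholder("agent_scratchpad"),
])
agent = AgentExecutor(agent=create_tool_calling_agent(llm, tools, prompt), tools=tools)

app = FastAPI()

@app.websocket("/ws")
async def ws_endpoint(websocket: WebSocket):
    # Each received message becomes one agent run; the answer is streamed back as text
    await websocket.accept()
    while True:
        question = await websocket.receive_text()
        result = await agent.ainvoke({"input": question})
        await websocket.send_text(result["output"])

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=6000)
```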
## WebSocket Integration

KubeSage uses WebSockets for real-time AI troubleshooting.
Example connection using Python WebSockets:
```python
import asyncio
import websockets

async def connect():
    uri = "ws://localhost:6000/ws"
    async with websockets.connect(uri) as websocket:
        await websocket.send("Describe Pod with Restart Count")
        response = await websocket.recv()
        print(f"Response: {response}")

asyncio.run(connect())
```
## Troubleshooting

✅ Ensure the WebSocket server is running:

```bash
python src/main.py
```
✅ Verify RBAC permissions:

```bash
kubectl auth can-i get pods --as=system:serviceaccount:default:kubesage-sa
```
If denied, apply:

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: metrics-reader-binding
subjects:
- kind: ServiceAccount
  name: kubesage-sa
  namespace: default
roleRef:
  kind: ClusterRole
  name: metrics-reader
  apiGroup: rbac.authorization.k8s.io
```

Note that this binding references a `metrics-reader` ClusterRole, which must already exist in the cluster.
✅ Enable the Metrics Server:

```bash
kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml
```
## License

MIT License - free to use, modify, and distribute.

Contributions welcome!
## To-Do List

- Add Prometheus/Grafana integration for advanced monitoring
- Support multi-cluster troubleshooting
- Add more AI-powered insights
Want to contribute? Open a PR!
Thanks to Kubernetes, FastAPI, LangChain, and OpenAI for making AI-driven DevOps possible!