Diving deep into the world of Large Language Models! From architecture exploration to deployment, I'm passionate about understanding and building with these fascinating models. Proud owner of an RTX 4090 that powers my local LLM experiments!
- Exploring various LLM architectures
- Fine-tuning techniques (LoRA, QLoRA) via the PEFT library (minimal setup sketch after this list)
- Building with agentic frameworks (LangGraph, AutoGPT)
- Optimizing inference for consumer GPUs
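Below is a minimal sketch of what a QLoRA-style setup can look like on a single 24 GB card: load a base model in 4-bit with bitsandbytes and attach LoRA adapters through PEFT. The model id and hyperparameters are illustrative placeholders, not a fixed recipe.

```python
# Hedged sketch, not a recipe: model id, rank, and target modules are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

model_id = "mistralai/Mistral-7B-v0.1"  # example base model; swap for any causal LM

# 4-bit NF4 quantization via bitsandbytes keeps a 7B model comfortably inside 24 GB VRAM
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

# Attach LoRA adapters: only these small low-rank matrices get trained
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # attention projections, a common default
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of the full model
```

From here, something like TRL's `SFTTrainer` (or a plain `Trainer`) would run the actual fine-tuning loop.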
- Core: Python, PyTorch, Transformers
- LLM Tools:
  - 🤗 Hugging Face ecosystem
  - 🦜 LangChain, LlamaIndex
  - vLLM, text-generation-inference (quick inference sketch after this stack list)
- Quantization: bitsandbytes, GPTQ, AWQ
- Deployment: FastAPI, Gradio, Docker
- Hardware: NVIDIA RTX 4090 (precious ✨)
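As referenced above, here is a quick single-GPU inference sketch with vLLM. The model repo id, memory fraction, and sampling settings are example values; any Hugging Face causal LM that fits in 24 GB works the same way.

```python
# Hedged sketch: model id, memory fraction, and sampling settings are example values.
from vllm import LLM, SamplingParams

llm = LLM(
    model="mistralai/Mistral-7B-Instruct-v0.2",  # any HF causal LM repo id that fits in VRAM
    gpu_memory_utilization=0.90,                 # leave a little headroom on a 24 GB card
)

params = SamplingParams(temperature=0.7, top_p=0.95, max_tokens=256)
outputs = llm.generate(["Explain paged attention in one paragraph."], params)
print(outputs[0].outputs[0].text)
```

vLLM also ships an OpenAI-compatible server (`vllm serve`), which pairs nicely with the FastAPI/Docker side of the stack.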
- Advanced techniques in Prompt Engineering
- Multi-agent systems and autonomous AI
- Knowledge graphs for optimizing RAG (Retrieval-Augmented Generation) pipelines
- Model distillation and compression methods
- Vector databases such as Qdrant and Pinecone (toy retrieval sketch below)
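A toy sketch of the retrieval step behind RAG, using an in-memory Qdrant collection and a small sentence-transformers embedder. The collection name, embedding model, and documents are all placeholders.

```python
# Hedged sketch: collection name, embedder, and documents are placeholders.
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, PointStruct, VectorParams
from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # small, CPU-friendly embedder
docs = [
    "LoRA adds trainable low-rank matrices to frozen weights.",
    "vLLM batches requests efficiently with paged attention.",
    "GPTQ quantizes weights to 4-bit after training.",
]

client = QdrantClient(":memory:")  # local in-process mode, no server needed
client.create_collection(
    collection_name="notes",
    vectors_config=VectorParams(
        size=encoder.get_sentence_embedding_dimension(),
        distance=Distance.COSINE,
    ),
)
client.upsert(
    collection_name="notes",
    points=[
        PointStruct(id=i, vector=encoder.encode(doc).tolist(), payload={"text": doc})
        for i, doc in enumerate(docs)
    ],
)

# Retrieve the chunk closest to the question; in a full RAG pipeline this text
# would be injected into the LLM prompt as context
hits = client.search(
    collection_name="notes",
    query_vector=encoder.encode("How does LoRA fine-tuning work?").tolist(),
    limit=1,
)
print(hits[0].payload["text"])
```

Swapping the in-memory client for one pointed at a Docker-hosted Qdrant instance is all it takes to persist the collection.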
TBA