-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Amir M. Parvizi edited this page Nov 19, 2024
·
6 revisions
A comprehensive guide to modern deep learning frameworks, tools, and infrastructure
- Overview
- Research Frameworks
- Inference Solutions
- Low-Level Tools
- Edge Computing & Embedded Systems
- High-Level Libraries
This wiki provides a comprehensive breakdown of the current deep learning ecosystem, focusing on practical applications and real-world usage. Whether you're a researcher, developer, or ML engineer, you'll find valuable insights into choosing the right tools for your projects.
Key Features:
- Dynamic computation graphs
- Intuitive Python-first design
- Extensive research community support
Use Cases:
- Research projects
- Rapid prototyping
- NLP with Hugging Face
- Computer vision applications
Resources:
Key Features:
- Production-ready deployment
- TPU support
- Comprehensive ecosystem
Use Cases:
- Production environments
- Large-scale deployments
- Google Cloud integration
Framework | Best For | Learning Curve | Production Ready |
---|---|---|---|
PyTorch | Research | Moderate | Yes |
TensorFlow | Production | Steep | Yes |
JAX | High Performance | Steep | Partial |
MLX | Apple Silicon | Moderate | Yes |
Lightning | Distributed Training | Moderate | Yes |
graph TB
A[Inference Solutions] --> B[vLLM]
A --> C[TensorRT]
A --> D[Triton]
B --> E[LLM Specialized]
C --> F[NVIDIA Optimized]
D --> G[Matrix Operations]
Solution | Speed | Memory Usage | Platform Support |
---|---|---|---|
vLLM | βββββ | ββββ | GPU |
TensorRT | βββββ | βββββ | NVIDIA GPU |
Triton | ββββ | βββ | CPU/GPU |
We welcome contributions! Please see our Contributing Guidelines for details.
This project is licensed under the MIT License - see the LICENSE file for details.