    Repositories list

    • server

      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k8.8k64666 Updated Feb 25, 2025
    • Rust
      Apache License 2.0
      11393135 Updated Feb 25, 2025
    • Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.
      Python
      25522 Updated Feb 25, 2025
    • C++
      BSD 3-Clause "New" or "Revised" License
      1148614 Updated Feb 25, 2025
    • FIL backend for the Triton Inference Server
      Jupyter Notebook
      Apache License 2.0
      3576513 Updated Feb 24, 2025
    • OpenVINO backend for Triton.
      C++
      BSD 3-Clause "New" or "Revised" License
      163165 Updated Feb 24, 2025
    • tutorials

      Public
      This repository contains tutorials and examples for Triton Inference Server.
      Python
      BSD 3-Clause "New" or "Revised" License
      107654814 Updated Feb 24, 2025
    • The Triton backend for PyTorch TorchScript models.
      C++
      BSD 3-Clause "New" or "Revised" License
      4514405 Updated Feb 24, 2025
    • The Triton backend for the ONNX Runtime.
      C++
      BSD 3-Clause "New" or "Revised" License
      58139723 Updated Feb 21, 2025
    • Triton backend that enables pre-processing, post-processing and other logic to be implemented in Python.
      C++
      BSD 3-Clause "New" or "Revised" License
      158590011 Updated Feb 20, 2025
    • Python
      BSD 3-Clause "New" or "Revised" License
      2422805 Updated Feb 18, 2025
    • The Triton TensorRT-LLM Backend
      Python
      Apache License 2.0
      11478429921 Updated Feb 18, 2025
    • The Triton backend for TensorRT.
      C++
      BSD 3-Clause "New" or "Revised" License
      317001 Updated Feb 17, 2025
    • Triton Model Analyzer is a CLI tool that helps users understand the compute and memory requirements of Triton Inference Server models.
      Python
      Apache License 2.0
      78458266 Updated Feb 14, 2025
    • core

      Public
      The core library and APIs implementing the Triton Inference Server.
      C++
      BSD 3-Clause "New" or "Revised" License
      104117017 Updated Feb 14, 2025
    • pytriton

      Public
      PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
      Python
      Apache License 2.0
      53774100 Updated Feb 12, 2025
    • The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's Python API.
      C++
      MIT License
      31132216 Updated Feb 11, 2025
    • Third-party source packages that are modified for use in Triton.
      C
      BSD 3-Clause "New" or "Revised" License
      59705 Updated Feb 11, 2025
    • The Triton backend for TensorFlow.
      C++
      BSD 3-Clause "New" or "Revised" License
      215002 Updated Feb 11, 2025
    • Simple Triton backend used for testing.
      C++
      BSD 3-Clause "New" or "Revised" License
      4200 Updated Feb 11, 2025
    • An example Triton backend that demonstrates sending zero, one, or multiple responses for each request.
      C++
      BSD 3-Clause "New" or "Revised" License
      7500 Updated Feb 11, 2025
    • TRITONCACHE implementation of a Redis cache
      C++
      BSD 3-Clause "New" or "Revised" License
      41320 Updated Feb 11, 2025
    • Implementation of a local in-memory cache for Triton Inference Server's TRITONCACHE API
      C++
      BSD 3-Clause "New" or "Revised" License
      1510 Updated Feb 11, 2025
    • Example Triton backend that demonstrates most of the Triton Backend API.
      C++
      BSD 3-Clause "New" or "Revised" License
      12600 Updated Feb 11, 2025
    • C++
      101805 Updated Feb 11, 2025
    • common

      Public
      Common source, scripts and utilities shared across all Triton repositories.
      C++
      BSD 3-Clause "New" or "Revised" License
      746803 Updated Feb 11, 2025
    • client

      Public
      Triton Python, C++ and Java client libraries, and gRPC-generated client examples for Go, Java and Scala.
      Python
      BSD 3-Clause "New" or "Revised" License
      2346023925 Updated Feb 11, 2025
    • The Triton repository agent that verifies model checksums.
      C++
      BSD 3-Clause "New" or "Revised" License
      71100 Updated Feb 11, 2025
    • backend

      Public
      Common source, scripts and utilities for creating Triton backends.
      C++
      BSD 3-Clause "New" or "Revised" License
      9331103 Updated Feb 11, 2025
    • Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
      Python
      Apache License 2.0
      2619630 Updated Jan 13, 2025