TensorRT-LLM
Repository: TensorRT-LLM
Author: NVIDIA · Source status: Clear source
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
Score basis: Clear source · Risk needs review · Universal