TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
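As a sketch of what the Python API looks like, the snippet below uses the high-level `LLM` entry point to generate text from a Hugging Face model. The model name is a placeholder for illustration, and running this requires a supported NVIDIA GPU with TensorRT LLM installed; exact class and parameter names may differ between releases.

```python
from tensorrt_llm import LLM, SamplingParams

# Build/load an inference-optimized engine from a Hugging Face checkpoint
# (model name here is an example placeholder).
llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")

prompts = ["The capital of France is"]
sampling_params = SamplingParams(max_tokens=32, temperature=0.8)

# Run batched inference on the GPU and print each completion.
for output in llm.generate(prompts, sampling_params):
    print(output.outputs[0].text)
```

The `LLM` object hides engine building and runtime orchestration behind one call, which is the "easy-to-use" surface the paragraph above refers to; lower-level builder and runtime components remain available for finer control.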