Troubleshoot issues with ONNX Runtime generate() API
Installation issues
Windows Conda import error
ImportError: DLL load failed while importing onnxruntime_genai: A dynamic link library (DLL) initialization routine failed.
If you see this issue in a Conda environment on Windows, you need to upgrade the C++ runtime for Visual Studio
. In the conda environment, run the following command:
conda install conda-forge::vs2015_runtime
The onnxruntime-genai Python package should run without error after this extra step.
Windows CUDA import error
DLL load failed while importing onnxruntime_genai
After CUDA toolkit installation completed on windows, ensure that the CUDA_PATH
system environment variable has been set to the path where the toolkit was installed. This variable will be used when importing the onnxruntime_genai python module on Windows. Unset or incorrectly set CUDA_PATH
variable may lead to a DLL load failed while importing onnxruntime_genai
.
Transformers / Tokenizers incompatibility with ONNX Runtime generate()
RuntimeError: [json.exception.type_error.302] type must be string, but is array
Only occurs when you generate models with the Model Builder, not with downloaded models.
There was a change in the HuggingFace transformers version 4.45.0 that caused an incompatibility with onnxruntime-genai versions 0.4.0 and earlier, resolved in 0.5.0. There are two alternative workarounds that you can employ to fix this issue:
- Option 1: downgrade your transformers version to lower than v4.45.0 (the version in which the above change was introduced)
- Option 2: build onnxruntime-genai from source, using these instructions build from source