
ONNX Runtime for Node.js with GPU support (DirectML/CUDA)

THIS REPOSITORY IS OBSOLETE: THE CHANGES WERE MERGED INTO THE OFFICIAL RUNTIME (microsoft/onnxruntime#16050)

Info

This is an updated copy of the official onnxruntime-node with DirectML and CUDA support. A demo that runs Stable Diffusion on the GPU with this runtime is here: https://github.com/dakenf/stable-diffusion-nodejs

Requirements

Windows

  1. Works out of the box with DirectML. If you like, you can also install CUDA and the ONNX Runtime for Windows with the CUDA execution provider for experiments.

Linux / WSL2

  1. Install CUDA (tested only on 11.7, but 12 should be supported): https://docs.nvidia.com/cuda/cuda-installation-guide-linux/
  2. Install cuDNN: https://developer.nvidia.com/rdp/cudnn-archive
  3. Install onnxruntime-linux-x64-gpu-1.14.1: https://github.com/microsoft/onnxruntime/releases/tag/v1.14.1 (a quick smoke test for the setup is sketched below)
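To verify the Linux/WSL2 setup, a minimal smoke test along these lines can help. This is a sketch under assumptions: model.onnx is a placeholder for any ONNX model you have on disk, and the code must run in an async context (or a module with top-level await), just like the usage example below.

import { InferenceSession } from 'onnxruntime-node-gpu'

// Creating a session with the cuda provider fails fast if CUDA, cuDNN,
// or the onnxruntime GPU libraries are missing or cannot be loaded.
try {
  const session = await InferenceSession.create('model.onnx', { executionProviders: ['cuda'] })
  console.log('CUDA provider loaded; model inputs:', session.inputNames)
} catch (err) {
  console.error('CUDA setup incomplete:', err)
}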

Installation and usage

It works the same way as onnxruntime-node:

npm i onnxruntime-node-gpu

import { InferenceSession, Tensor } from 'onnxruntime-node-gpu'

const sessionOption: InferenceSession.SessionOptions = { executionProviders: ['directml'] } // can also be 'cuda' or 'cpu'
const model = await InferenceSession.create('model.onnx', sessionOption)

const input = new Tensor('float32', Float32Array.from([0, 1, 2]), [3])
const result = await model.run({ input_name: input })
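Execution providers are listed in order of preference, so a fallback chain can be specified. A hedged sketch, assuming the same ordered-list semantics as in onnxruntime's other bindings (the CPU provider takes over when the GPU provider cannot be loaded):

// Try the GPU provider first and fall back to the CPU provider.
const fallbackOptions: InferenceSession.SessionOptions = { executionProviders: ['cuda', 'cpu'] }
const session = await InferenceSession.create('model.onnx', fallbackOptions)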

Limitations

  1. Currently, all results are returned as NAPI Node.js objects, so when you run inference multiple times (e.g. sampling with the Stable Diffusion UNet), there are many unnecessary memory copies of inputs from JS to the GPU and back. The performance impact is small, however. Output as TensorFlow.js-compatible tensors may come later.
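For illustration, this is the kind of loop the limitation refers to. The unet session name, the input/output names, the tensor shape, and the 50-step count are all hypothetical; the point is that every run() copies the input from JS to the GPU and copies the output back into a NAPI object, even though the data could in principle stay on the device between steps.

// Hypothetical diffusion sampling loop illustrating per-step copies.
let latents = new Tensor('float32', new Float32Array(1 * 4 * 64 * 64), [1, 4, 64, 64])
for (let step = 0; step < 50; step++) {
  const output = await unet.run({ sample: latents }) // input copied JS -> GPU
  latents = output.out_sample as Tensor              // output copied GPU -> JS
}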

Building manually

Just clone the repo and run npx cmake-js compile

Why is onnxruntime statically linked on Windows?

For some reason, the dynamically linked ONNX Runtime tries to load an outdated DirectML.dll from system32; see locaal-ai/obs-backgroundremoval#272

Misc

Special thanks to the authors of https://github.com/royshil/obs-backgroundremoval and https://github.com/umireon/onnxruntime-static-win for the CMake scripts that download pre-built onnxruntime for static linking.

Also thanks to ChatGPT for helping me remember how to code in C++.

You can ask me questions on Twitter
