NVIDIA Developer LLM Operator

The NVIDIA Developer LLM Operator enables developers to build retrieval-augmented generation (RAG) LLM pipelines on Kubernetes and to manage the lifecycle of the components in a sample pipeline.

The Operator manages the lifecycle of the following components:

Jupyter Notebook server: The container includes sample notebooks to demonstrate a sample pipeline.
Chatbot web application: The sample web application lets you ask the chatbot questions and upload PDF documents to build a knowledge base.
Vector database: The sample pipeline uses Milvus to manage the embeddings generated by the LLM.
NVIDIA Triton Inference Server: The server is configured with the NVIDIA NeMo Framework for working with LLMs.
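
To make the role of each component concrete, the following is a minimal, illustrative sketch of the retrieval-augmented flow that such a pipeline follows: embed documents, store the embeddings, retrieve the closest passage for a question, and build a prompt for the LLM. It is not the Operator's code and does not call Milvus or Triton. The names `embed`, `InMemoryVectorStore`, and `build_prompt` are hypothetical, the bag-of-words embedding is a toy stand-in for a real embedding model, and the in-memory store is a stand-in for the vector database.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding (assumption: a real pipeline uses a model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[term] * b[term] for term in a if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class InMemoryVectorStore:
    """Hypothetical stand-in for the vector database in the sample pipeline."""

    def __init__(self):
        self.rows = []  # list of (embedding, original text)

    def insert(self, text):
        self.rows.append((embed(text), text))

    def search(self, query, k=1):
        # Rank stored passages by similarity to the query embedding.
        q = embed(query)
        ranked = sorted(self.rows, key=lambda row: cosine(q, row[0]), reverse=True)
        return [text for _, text in ranked[:k]]


def build_prompt(question, passages):
    """Assemble the retrieved context and the question into one LLM prompt."""
    context = "\n".join(passages)
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"


# Knowledge base: in the sample pipeline this comes from uploaded PDF documents.
store = InMemoryVectorStore()
store.insert("Milvus stores embeddings for the knowledge base.")
store.insert("Triton Inference Server hosts the LLM.")

question = "Where are embeddings stored?"
passages = store.search(question, k=1)
prompt = build_prompt(question, passages)
print(prompt)  # in a real pipeline, this prompt would be sent to the served LLM
```

In the deployed sample pipeline, the chatbot web application would play the role of the last few lines, the vector database would replace the in-memory store, and the inference server would generate the answer from the prompt.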

Refer to Installing the Operator to get started.
