Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Latest commit

 

History

History
History
 
 

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

NVIDIA Developer LLM Operator

The NVIDIA Developer LLM Operator enables developers to build RAG-LLM pipelines on Kubernetes and manage the lifecycle of the components for a sample pipeline.

The Operator manages the lifecycle of the following components:

  • Jupyter Notebook server: The container includes sample notebooks to demonstrate a sample pipeline.

  • Chatbot web application: The sample web application enables you to perform question and answering with the chatbot and to upload PDF documents to form a knowledge base.

  • Vector database: The sample pipeline uses Milvus to manage the embeddings generated by the LLM.

  • NVIDIA Triton Inference Server: The server is configured with the NVIDIA Nemo Framework for working with LLMs.

Refer to Installing the Operator to get started.

Morty Proxy This is a proxified and sanitized view of the page, visit original site.