RAG Documentation

The RAG documentation is divided into the following sections:

Getting Started

This section will help you get started quickly with the sample RAG example.

  • Installation guide: This guide walks you through the process of setting up your environment and utilizing the
  • Getting Started guides: A series of quick start steps that will help you to understand the core concepts and start the pipeline quickly. These guides include Jupyter notebooks that you can experiment with.

User Guides

The user guides cover the core details of the provided example and how to configure and use different features to make your own chains.

  • LLM Inference Server: Learn about the service that accelerates LLM inference using TRT-LLM.
  • Integration with NVIDIA AI Playground: Understand how to access NVIDIA AI Playground on NGC, which lets developers experience state-of-the-art LLMs accelerated on NVIDIA DGX Cloud with NVIDIA TensorRT and Triton Inference Server.
  • Configuration Guide: The complete guide to all the configuration options available in the config.yaml file.
  • Frontend: Learn more about the sample playground provided as part of the workflow.
  • Chat Server Guide: Learn about the chat server, which exposes the core APIs for end users.
  • Jupyter Server Guide: Learn about the different notebooks available and the server which can be used to access them.

Architecture Guide

This guide describes the infrastructure and the execution flow of a query at runtime:

  • Architecture: Understand the architecture of the sample RAG workflow.

Evaluation Tools

The sample RAG workflow provides a set of evaluation pipelines, delivered as notebooks, that developers can use for benchmarking. There are also detailed guides on how to reproduce results and create datasets for the evaluation.

Other
