
Popular repositories

  1. lorax (Public)

    Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

    Python · 3.6k stars · 288 forks

  2. llm_distillation_playbook (Public)

    Best practices for distilling large language models.

    Jupyter Notebook · 593 stars · 53 forks

  3. lora_bakeoff (Public)

    Python · 20 stars · 2 forks

  4. json-mode-benchmark (Public)

    Jupyter Notebook · 7 stars · 1 fork

  5. neuropod (Public)

    Forked from uber/neuropod

    A uniform interface to run deep learning models from multiple frameworks

    C++ · 3 stars · 2 forks

  6. punica (Public)

    Forked from punica-ai/punica

    Serving multiple LoRA finetuned LLM as one

    Cuda · 2 stars · 4 forks

Repositories

Showing 10 of 18 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.
