Haisheng Chen (HsChen-sys)

UCSD ECE

vllm (Public)

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

openvinotoolkit/openvino (Public)

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ · 9.9k stars · 3.1k forks

NVIDIA/cutlass (Public)

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ · 9.4k stars · 1.7k forks

mit-han-lab/llm-awq (Public)

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python · 3.5k stars · 302 forks

torch-custom-op (Public)

A project for demonstrating custom op registration using modern PyTorch APIs

Cuda