Commits
Branch selector
User selector
Datepicker
Commit History
Commits on Oct 6, 2025
TPRD-1710: Update default branches post-25.09 (#99)
Show description for f921ead
Commits on Sep 24, 2025
Commits on Jul 30, 2025
Commits on May 8, 2025
Commits on Apr 11, 2025
Commits on Apr 2, 2025
Commits on Mar 6, 2025
Commits on Feb 14, 2025
Commits on Feb 13, 2025
[fix] Adjusting VllmStatLogger for 0.7.0 changes in API (#81)
Show description for 1f36c6c
Commits on Jan 9, 2025
Commits on Dec 23, 2024
Commits on Dec 20, 2024
perf: Upgrade vLLM version to 0.6.3.post1 (#76)
Show description for 2f5bfbd
Commits on Dec 5, 2024
feat: Add log probabilities and number of input tokens to additional outputs (#75)
Show description for 0b9c8e2- authored
Commits on Nov 26, 2024
Update `main` branch post 24.11 (#74)
Show description for 366e668- authored
Commits on Nov 25, 2024
Support input for llama3.2 multi-modal model (#69)
Show description for 6c066f6
Commits on Sep 24, 2024
vLLM multi gpu tests adjustments (#65)
Show description for b71088afix: Adding ensemble support for vllm container (#68)
Show description for 0df1013
Commits on Sep 21, 2024
Commits on Sep 4, 2024
Commits on Aug 24, 2024
Commits on Aug 16, 2024
- authored
feat: Add vLLM counter metrics access through Triton (#53)
Show description for 3829366authored
Commits on Aug 7, 2024
perf: Check for cancellation on response thread (#54)
Show description for 843cbdd
Commits on Aug 6, 2024
refactor: Remove explicit callings to garbage collect (#55)
Show description for a345a1d
Commits on Jul 26, 2024
perf: Improve vLLM backend performance by using a separate thread for responses (#46)
Show description for 128abc3
Commits on Jul 25, 2024
Commits on Jul 5, 2024
Commits on May 31, 2024
fix: Enhance checks around KIND_GPU and tensor parallelism (#42)
Show description for 18a96e3
Commits on May 29, 2024
Commits on May 2, 2024
- authored
Commits on Apr 26, 2024
Commits on Apr 18, 2024
Add multi-lora support for Triton vLLM backend (#23)
Show description for f064eed