Models fine-tuned with SPIN across iterations 0, 1, 2, and 3
UCLA Artificial General Intelligence Lab
university
Collections: 4
Models: 17
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1 • Text Generation • 3.77k • 4
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2 • Text Generation • 5.39k • 4
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3 • Text Generation • 6.79k • 124
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3 • Text Generation • 1.12k • 82
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2 • Text Generation • 1.05k
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter1 • Text Generation • 1.05k • 1
UCLA-AGI/Mistral7B-PairRM-SPPO • Text Generation • 1.03k • 6
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3 • Text Generation • 1.03k • 5
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter1 • Text Generation • 1.04k • 2
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter2 • Text Generation • 1.04k • 1
Datasets: 7
UCLA-AGI/data-mistral-7b-instruct-sppo-iter3 • Viewer • 20k • 61 • 2
UCLA-AGI/data-mistral-7b-instruct-sppo-iter2 • Viewer • 20k • 72 • 2
UCLA-AGI/data-mistral-7b-instruct-sppo-iter1 • Viewer • 19.8k • 149 • 2
UCLA-AGI/SPIN_iter3 • Viewer • 50.3k • 21 • 9
UCLA-AGI/SPIN_iter2 • Viewer • 50.3k • 33 • 1
UCLA-AGI/SPIN_iter1 • Viewer • 50.3k • 42 • 3
UCLA-AGI/SPIN_iter0 • Viewer • 50.3k • 1.01k • 8