
Add deita-7b-v1.0 model #192

Merged · 1 commit merged into tatsu-lab:main on Dec 27, 2023

Conversation

@VPeterV (Contributor) commented Dec 27, 2023

Deita 7B V1.0 is an SFT + DPO fine-tune of Mistral-7B-v0.1, trained on only 6K automatically selected, lightweight, high-quality alignment SFT examples (Deita 6K V0) plus 10K alignment preference pairs randomly sampled from UltraFeedback. deita-7b-v1.0 achieves a 90.06% win rate on AlpacaEval.

  • Added results and model configs for deita (see the config sketch below)
  • Updated the leaderboard

@rtaori (Collaborator) commented Dec 27, 2023

thanks for the contribution!

@rtaori rtaori merged commit 094a031 into tatsu-lab:main Dec 27, 2023
2 checks passed