Nous Research
Nous: Hermes 2 Mistral 7B DPO
nousresearch/nous-hermes-2-mistral-7b-dpo
This is the flagship 7B Hermes model, a Direct Preference Optimization (DPO) of [Teknium/OpenHermes-2.5-Mistral-7B](/models/teknium/openhermes-2.5-mistral-7b). It shows improvement across the board on all benchmarks tested - AGIEval, BigBench Reasoning, GPT4All, and TruthfulQA. The model prior to DPO was trained on 1,000,000 instructions/chats of GPT-4 quality or better, primarily synthetic data as well as other high quality datasets.
- Context window: 8,192 tokens
- Input: text
- Output: text
View on OpenRouter. Model data sourced from OpenRouter.