Nous Research

Nous: Hermes 2 Mistral 7B DPO

nousresearch/nous-hermes-2-mistral-7b-dpo

This is the flagship 7B Hermes model, a Direct Preference Optimization (DPO) of [Teknium/OpenHermes-2.5-Mistral-7B](/models/teknium/openhermes-2.5-mistral-7b). It shows improvement across the board on all benchmarks tested - AGIEval, BigBench Reasoning, GPT4All, and TruthfulQA. The model prior to DPO was trained on 1,000,000 instructions/chats of GPT-4 quality or better, primarily synthetic data as well as other high quality datasets.

  • Context window: 8,192 tokens
  • Input: text
  • Output: text

View on OpenRouter. Model data sourced from OpenRouter.