Teknium
OpenHermes 2.5 Mistral 7B
teknium/openhermes-2.5-mistral-7b
A continuation of [OpenHermes 2 model](/models/teknium/openhermes-2-mistral-7b), trained on additional code datasets. Potentially the most interesting finding from training on a good ratio (est. of around 7-14% of the total dataset) of code instruction was that it has boosted several non-code benchmarks, including TruthfulQA, AGIEval, and GPT4All suite. It did however reduce BigBench benchmark score, but the net gain overall is significant.
- Context window: 4,096 tokens
- Input: text
- Output: text
View on OpenRouter. Model data sourced from OpenRouter.