Teknium

OpenHermes 2.5 Mistral 7B

teknium/openhermes-2.5-mistral-7b

A continuation of [OpenHermes 2 model](/models/teknium/openhermes-2-mistral-7b), trained on additional code datasets. Potentially the most interesting finding from training on a good ratio (est. of around 7-14% of the total dataset) of code instruction was that it has boosted several non-code benchmarks, including TruthfulQA, AGIEval, and GPT4All suite. It did however reduce BigBench benchmark score, but the net gain overall is significant.

Context window: 4,096 tokens
Input: text
Output: text

View on OpenRouter. Model data sourced from OpenRouter.