Fireworks AI

Fireworks: FireLLaVA 13B

fireworks/firellava-13b

A blazing fast vision-language model, FireLLaVA quickly understands both text and images. It achieves impressive chat skills in tests, and was designed to mimic multimodal GPT-4. The first commercially permissive open source LLaVA model, trained entirely on open source LLM generated instruction following data.

Context window: 4,096 tokens
Input: text, image
Output: text

View on OpenRouter. Model data sourced from OpenRouter.