thenlper

Thenlper: GTE-Large

thenlper/gte-large

The gte-large embedding model converts English sentences, paragraphs and moderate-length documents into a 1024-dimensional dense vector space, delivering high-quality semantic embeddings optimized for information retrieval, semantic textual similarity, reranking and clustering tasks. Trained via multi-stage contrastive learning on a large domain-diverse relevance corpus, it offers excellent performance across general-purpose embedding use-cases.

  • Context window: 8,192 tokens
  • Input: text
  • Output: embeddings
  • Pricing: $0.01/M input tokens

View on OpenRouter. Model data sourced from OpenRouter.