Web & Dev

AI Embedding Models Reference

Popular text, image, and multimodal embedding models — dimensions, context, license, and use case.

Text (commercial)

ModelDimMax tokensNotes
OpenAI text-embedding-3-small1 536 (truncatable)8 192~$0.02 / 1M tokens
OpenAI text-embedding-3-large3 072 (truncatable)8 192~$0.13 / 1M tokens
Cohere embed-v31 024512English + multilingual variants
Voyage-31 02432 000Strong retrieval benchmarks
Anthropic (via Voyage)1 02432 000Anthropic-branded via partnership
Google gecko / text-embedding-0047682 048Vertex AI

Text (open weights)

ModelDimMax tokensLicense
BGE-large-en-v1.51 024512MIT
BGE-m31 0248 192MIT — multilingual
E5-large-v21 024512MIT
E5-mistral-7b-instruct4 09632 768MIT — very strong
Jina-embeddings-v31 0248 192Apache 2.0
Nomic embed v1.57688 192Apache 2.0
gte-large-en-v1.51 0248 192MIT
MiniLM-L6-v2384256Apache 2.0 — small & fast

Image / multimodal

ModelDimModalityNotes
CLIP ViT-L/14768Image + textOpenAI, 2021 — baseline
OpenCLIP ViT-H/141 024Image + textLAION open reimplementation
SigLIP1 152Image + textGoogle — sigmoid loss
DINOv2 ViT-L/141 024Image onlySelf-supervised, dense features
ImageBind1 0246 modalitiesMeta — image/text/audio/depth/thermal/IMU
Voyage multimodal 31 024Image + textCommercial

Notes

  • MTEB (Massive Text Embedding Benchmark) is the standard leaderboard for text embedding quality.
  • Higher dimension is not always better — many embeddings support truncation/matryoshka for cheaper storage.
  • Normalize vectors and use cosine similarity (or inner product on normalized = cosine).
Was this article helpful?