Embeddings
v3 release!

Top-performing multimodal multilingual long-context embeddings for search, RAG, agents applications.

v3: Frontier Multilingual Embeddings

jina-embeddings-v3 is a frontier multilingual text embedding model with 570M parameters and 8192 token-length, outperforming the latest proprietary embeddings from OpenAI and Cohere on MTEB. Read our blog post and research paper below.

Embedding API

Try our world-class embedding models to improve your search and RAG systems. Start with a free trial!

We cannot generate an API key because we couldn't verify if you are human. If you believe this is an error, please contact us.

Auto preview

Select embeddings

L2 normalization

Scales the embedding so its Euclidean (L2) norm becomes 1, preserving direction. Useful when downstream involves dot-product, classification, visualization.

Returning data type

Besides the float, you can ask it to return as binary for faster vector retrieval, or as base64 encoding for faster transmission.

Default (as float)

Example inputs

Change them and see how the response changes!

Request

Bash

Language

curl https://api.jina.ai/v1/embeddings \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer " \
  -d @- <<EOFEOF
  {
    "normalized": true,
    "embedding_type": "float",
    "input": [
        "Organic skincare for sensitive skin with aloe vera and chamomile: Imagine the soothing embrace of nature with our organic skincare range, crafted specifically for sensitive skin. Infused with the calming properties of aloe vera and chamomile, each product provides gentle nourishment and protection. Say goodbye to irritation and hello to a glowing, healthy complexion.",
        "Bio-Hautpflege für empfindliche Haut mit Aloe Vera und Kamille: Erleben Sie die wohltuende Wirkung unserer Bio-Hautpflege, speziell für empfindliche Haut entwickelt. Mit den beruhigenden Eigenschaften von Aloe Vera und Kamille pflegen und schützen unsere Produkte Ihre Haut auf natürliche Weise. Verabschieden Sie sich von Hautirritationen und genießen Sie einen strahlenden Teint.",
        "Cuidado de la piel orgánico para piel sensible con aloe vera y manzanilla: Descubre el poder de la naturaleza con nuestra línea de cuidado de la piel orgánico, diseñada especialmente para pieles sensibles. Enriquecidos con aloe vera y manzanilla, estos productos ofrecen una hidratación y protección suave. Despídete de las irritaciones y saluda a una piel radiante y saludable.",
        "针对敏感肌专门设计的天然有机护肤产品：体验由芦荟和洋甘菊提取物带来的自然呵护。我们的护肤产品特别为敏感肌设计，温和滋润，保护您的肌肤不受刺激。让您的肌肤告别不适，迎来健康光彩。",
        "新しいメイクのトレンドは鮮やかな色と革新的な技術に焦点を当てています: 今シーズンのメイクアップトレンドは、大胆な色彩と革新的な技術に注目しています。ネオンアイライナーからホログラフィックハイライターまで、クリエイティビティを解き放ち、毎回ユニークなルックを演出しましょう。"
    ]
  }
EOFEOF

API key

Available tokens

This is your unique key. Store it securely!

API Pricing

API pricing is based on token usage - input tokens for standard APIs and output tokens for Reader API. One API key gives you access to all search foundation products.

With 3 cloud service providers

Using AWS or Azure? You can deploy our models directly on your company's cloud platform and handle billing through the CSP account.

With Jina Search Foundation API

The easiest way to access all of our products. Top-up tokens as you go.

Enter the API key you wish to recharge

Auto-recharge when tokens are low

Recommended for uninterrupted service in production. When your token balance is below the threshold you set, we will automatically recharge your credit card for the same amount as your last top-up. If you purchased multiple packs in the last top-up, we will recharge only one pack.

≤ 1M Tokens

Recharge threshold

Understand the rate limit

Rate limits are the maximum number of requests that can be made to an API within a minute per IP address (RPM). Find out more about the rate limits for each product and tier below.

Rate Limit

Rate limits are tracked in two ways: RPM (requests per minute) and TPM (tokens per minute). Limits are enforced per IP and can be reached based on whichever threshold—RPM or TPM—is hit first.

Columns

Product	API Endpoint	Description	w/o API Key	w/ API Key	w/ Premium API Key	Average Latency	Token Usage Counting	Allowed Request
Embedding API	`https://api.jina.ai/v1/embeddings`	Convert text/images to fixed-length vectors		500 RPM & 1,000,000 TPM	2,000 RPM & 5,000,000 TPM	depends on the input size	Count the number of tokens in the input request.	POST
Reranker API	`https://api.jina.ai/v1/rerank`	Tokenize and segment long text		500 RPM & 1,000,000 TPM	2,000 RPM & 5,000,000 TPM	depends on the input size	Count the number of tokens in the input request.	POST
Reader API	`https://r.jina.ai`	Convert URL to LLM-friendly text	20 RPM	200 RPM	1000 RPM	4.6s	Count the number of tokens in the output response.	GET/POST
Reader API	`https://s.jina.ai`	Search the web and convert results to LLM-friendly text		40 RPM	100 RPM	8.7s	Count the number of tokens in the output response.	GET/POST
Reader API	`https://g.jina.ai`	Grounding a statement with web knowledge		10 RPM	30 RPM	22.7s	Count the total number of tokens in the whole process.	GET/POST
Classifier API (Zero-shot)	`https://api.jina.ai/v1/classify`	Classify inputs using zero-shot classification		200 RPM & 500,000 TPM	1,000 RPM & 3,000,000 TPM	depends on the input size	Tokens counted as: input_tokens + label_tokens	POST
Classifier API (Few-shot)	`https://api.jina.ai/v1/classify`	Classify inputs using a trained few-shot classifier		20 RPM & 200,000 TPM	60 RPM & 1,000,000 TPM	depends on the input size	Tokens counted as: input_tokens	POST
Classifier API	`https://api.jina.ai/v1/train`	Train a classifier using labeled examples		20 RPM & 200,000 TPM	60 RPM & 1,000,000 TPM	depends on the input size	Tokens counted as: input_tokens × num_iters	POST
Segmenter API	`https://segment.jina.ai`	Tokenize and segment long text	20 RPM	200 RPM	1,000 RPM	0.3s	Token is not counted as usage.	GET/POST

Top up this API key with more tokens

Depending on your location, you may be charged in USD, EUR, or other currencies. Taxes may apply.

Please input the right API key to top up

With a commercial license for on-prem deployment

Require 100% control and privacy? Purchase a commercial license to use our models on-premises.

Choosing the Right Embeddings

Our embedding models are designed to cover diverse search and GenAI applications.

Multimodal Embeddings

jina-clip-v1

General-Purpose Embeddings

jina-embeddings-v2-base

Bilingual Embeddings

jina-embeddings-v2-base-de

jina-embeddings-v2-base-zh

jina-embeddings-v2-base-es

Code Embeddings

jina-embeddings-v2-base-code

API Integrations

Our Embedding API is natively integrated with various renowned databases, vector stores, RAG, and LLMOps frameworks. To begin, just copy and paste your API key into any of the listed integrations for a quick and seamless start.

Vector Store

LLMOps

RAG

Observability

MongoDB

DataStax

Qdrant

Pinecone

Chroma

Weaviate

Milvus

Epsilla

MyScale

LlamaIndex

Haystack

Langchain

Dify

SuperDuperDB

DashVector

Portkey

Baseten

TiDB

LanceDB

Carbon

On-premises deployment

Deploy Jina Embeddings models in AWS Sagemaker and Microsoft Azure, and soon in Google Cloud Services, or contact our sales team to get customized Kubernetes deployments for your Virtual Private Cloud and on-premises servers.

AWS SageMaker

Embeddings

Reranker

Microsoft Azure

Embeddings

Reranker

Google Cloud

Coming soon

Our Publications

Understand how our frontier search models were trained from scratch, check out our latest publications. Meet our team at EMNLP, SIGIR, ICLR, NeurIPS, and ICML!

arXiv

September 18, 2024

jina-embeddings-v3: Multilingual Embeddings With Task LoRA

arXiv

September 07, 2024

Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models

arXiv

August 30, 2024

Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever

arXiv

June 21, 2024

Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models

ICML 2024

May 30, 2024

Jina CLIP: Your CLIP Model Is Also Your Text Retriever

arXiv

February 26, 2024

Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings

arXiv

October 30, 2023

Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

EMNLP 2023

July 20, 2023

Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

8 publications in total.

Learning about Embeddings

Where to start with embeddings? We've got you covered. Learn about embeddings from the ground up with our comprehensive guide.

Comparison of Reranker, Vector Search, and BM25

The table below provides a comprehensive comparison of the Reranker, Vector/Embeddings Search, and BM25, highlighting their strengths and weaknesses across various categories.

	Reranker	Vector Search	BM25
Best For	Enhanced search precision and relevance	Initial, rapid filtering	General text retrieval across wide-ranging queries
Granularity	Detailed: Sub-document and query segment	Broad: Entire documents	Intermediate: Various text segments
Query Time Complexity	High	Medium	Low
Indexing Time Complexity	Not required	High	Low, utilizes pre-built index
Training Time Complexity	High	High	Not required
Search Quality	Superior for nuanced queries	Balanced between efficiency and accuracy	Consistent and reliable for a broad set of queries
Strengths	Highly accurate with deep contextual understanding	Quick and efficient, with moderate accuracy	Highly scalable, with established efficacy
	Try reranker API for free	Try embedding API for free

The Evolution of Embeddings Poster

Discover the ideal poster for your space, featuring captivating infographics or breathtaking visuals tracing the evolution of text embedding models since 1950.

Learn how we made it

Buy a hard copy

Product	API Endpoint	Description	w/o API Key	w/ API Key	w/ Premium API Key	Average Latency	Token Usage Counting	Allowed Request
Embedding API	`https://api.jina.ai/v1/embeddings`	Convert text/images to fixed-length vectors		500 RPM & 1,000,000 TPM	2,000 RPM & 5,000,000 TPM	depends on the input size	Count the number of tokens in the input request.	POST
Reranker API	`https://api.jina.ai/v1/rerank`	Tokenize and segment long text		500 RPM & 1,000,000 TPM	2,000 RPM & 5,000,000 TPM	depends on the input size	Count the number of tokens in the input request.	POST
Reader API	`https://r.jina.ai`	Convert URL to LLM-friendly text	20 RPM	200 RPM	1000 RPM	4.6s	Count the number of tokens in the output response.	GET/POST
Reader API	`https://s.jina.ai`	Search the web and convert results to LLM-friendly text		40 RPM	100 RPM	8.7s	Count the number of tokens in the output response.	GET/POST
Reader API	`https://g.jina.ai`	Grounding a statement with web knowledge		10 RPM	30 RPM	22.7s	Count the total number of tokens in the whole process.	GET/POST
Classifier API (Zero-shot)	`https://api.jina.ai/v1/classify`	Classify inputs using zero-shot classification		200 RPM & 500,000 TPM	1,000 RPM & 3,000,000 TPM	depends on the input size	Tokens counted as: input_tokens + label_tokens	POST
Classifier API (Few-shot)	`https://api.jina.ai/v1/classify`	Classify inputs using a trained few-shot classifier		20 RPM & 200,000 TPM	60 RPM & 1,000,000 TPM	depends on the input size	Tokens counted as: input_tokens	POST
Classifier API	`https://api.jina.ai/v1/train`	Train a classifier using labeled examples		20 RPM & 200,000 TPM	60 RPM & 1,000,000 TPM	depends on the input size	Tokens counted as: input_tokens × num_iters	POST
Segmenter API	`https://segment.jina.ai`	Tokenize and segment long text	20 RPM	200 RPM	1,000 RPM	0.3s	Token is not counted as usage.	GET/POST

Do I need a commercial license?

CC BY-NC License Self-Check

Are you using our official API or official images on Azure or AWS?

Yes

Are you using a paid API key or free trial key?

Are you using our official model images on AWS and Azure?

Embeddings
v3 release!

v3: Frontier Multilingual Embeddings

Embedding API

API Pricing

Choosing the Right Embeddings

On-premises deployment

Our Publications

Learning about Embeddings

Comparison of Reranker, Vector Search, and BM25

The Evolution of Embeddings Poster

FAQ

How to get my API key?

What's the rate limit?

Do I need a commercial license?

Other questions

Embeddings new_releases v3 release!

Embeddings
v3 release!