Search Technology

My 2025 Test: Query Expansion Embedding Boosted CTR 37%

Discover how our 2025 case study used query expansion embedding (QEE) to boost click-through rates by 37%. Learn the methodology, tech, and results.

Dr. Alistair Finch

Principal Data Scientist specializing in NLP and search relevance optimization.

August 8, 20256 min read170 views

6 min read

1,536 words

170 views

Introduction: The CTR Plateau and the Search for a Breakthrough

For years, we fine-tuned our search algorithms with conventional methods: synonym lists, stemming, and meticulous keyword optimization. We saw incremental gains, a percentage point here and there. But we hit a plateau. User queries are becoming more conversational, more nuanced, and more demanding of true understanding. The old ways of simple keyword matching were no longer enough. We needed a paradigm shift.

In late 2024, we hypothesized that the answer lay in bridging the gap between a user's intent and the literal keywords they type. This led us to design an ambitious test for 2025, leveraging a powerful combination of classic search techniques and cutting-edge AI: Query Expansion Embedding (QEE). The results were not just positive; they were transformative. This is the story of how we achieved a 37% increase in click-through rate (CTR) and fundamentally changed our understanding of search relevance.

The Widening Gap: Why Traditional Search Methods Fall Short

Traditional search engines operate on a principle of lexical matching. If you search for "running shoes for bad knees," the system looks for documents containing those exact words. While techniques like stemming ("running" matches "run") and synonym lists ("shoes" matches "footwear") help, they have critical limitations:

Lack of Context: The system doesn't understand that "bad knees" implies a need for "cushioning," "support," or "low-impact" features. It's a semantic void.
The Long-Tail Challenge: Highly specific, long-tail queries often yield zero results because no single document contains that exact combination of words, even if relevant content exists.
Synonym Brittleness: Manually curated synonym lists are difficult to maintain and can't possibly cover the infinite variations of human language and slang.

This gap leads to user frustration, high bounce rates, and a lower CTR, as users are forced to rephrase their queries or abandon the search altogether.

Unpacking the Technology: What is Query Expansion Embedding (QEE)?

QEE isn't a single technology but a powerful synthesis of two concepts. To understand it, let's break it down.

The Foundation: Classic Query Expansion

At its core, query expansion is the process of reformulating a user's initial query to improve retrieval performance. The goal is to add relevant terms the user might not have thought of. Traditionally, this meant adding synonyms (e.g., car → automobile) or related terms from a thesaurus.

The Revolution: Vector Embeddings

This is where AI changes the game. Embedding models, like BERT or MiniLM, are deep learning networks trained on vast amounts of text. They learn to represent words, sentences, and entire documents as numerical vectors in a high-dimensional space. The magic is that semantically similar concepts are placed close together in this vector space. For example, the vectors for "user intent," "customer meaning," and "search goal" would be neighbors, even though they share no keywords.

The Synthesis: How QEE Bridges Intent and Content

Query Expansion Embedding (QEE) uses these vector embeddings to perform a much smarter expansion. Here's the flow:

A user enters a query, for example, "eco-friendly coffee maker."
We convert this query into a vector embedding using a pre-trained model.
We then search our vector database (which contains embeddings of all our content) for other terms and phrases whose vectors are closest to the query's vector.
Instead of just finding synonyms, the system might find terms like "sustainable brewing," "recyclable filter," "energy efficient appliance," and "bamboo coffee machine."
The original query is then expanded to include these semantically rich terms, creating a new, supercharged query that is sent to our traditional search index (like Elasticsearch).

The result is a search that understands the meaning behind the words, not just the words themselves.

Our 2025 A/B Test: Methodology and Setup

To validate the impact of QEE, we designed a rigorous A/B test.

The Core Hypothesis

We hypothesized that by implementing QEE, we would more accurately match user intent with relevant content, leading to a significant increase in search results CTR, a decrease in the zero-result rate, and an overall improvement in user engagement metrics.

The Control vs. The Variable

Group A (Control): 50% of our users experienced our existing, highly optimized search engine using traditional keyword matching with manual synonym lists.
Group B (Variable): The other 50% of users interacted with the same search index, but their queries were first processed by our new QEE system before retrieval.

The test ran for 30 days and included over 2 million unique search sessions to ensure statistical significance.

The Tech Stack Powering the Test

Embedding Model: A sentence-transformer model (based on `all-MiniLM-L6-v2`) fine-tuned on our own domain-specific data.
Vector Database: We used Pinecone to store our content embeddings and perform fast nearest-neighbor searches.
Search Engine: Elasticsearch 8.x served as our primary inverted index for the final retrieval step.
Orchestration: A custom Python service managed the workflow from initial query to QEE processing to final Elasticsearch query.

The Staggering Results: A 37% CTR Leap and Beyond

The results from the A/B test were clearer and more impactful than we had dared to hope. The QEE-powered search experience was unequivocally superior across every key metric we tracked.

Traditional Search vs. QEE-Powered Search: Key Metrics
Metric	Traditional Search (Control)	QEE Search (Test)	Uplift / Improvement
Click-Through Rate (CTR)	15.2%	20.8%	+36.8%
Zero-Result Rate	8.5%	3.1%	-63.5%
Average Session Duration	1 min 45s	2 min 18s	+31.4%
Bounce Rate (from search page)	22%	16%	-27.3%

Analyzing the 37% Uplift

The 37% relative increase in CTR was the headline figure. It confirmed that when we showed users more relevant results, they were significantly more likely to click. This wasn't just about ranking a single best document higher; it was about surfacing relevant documents that would not have appeared at all with the old system. Queries like "pants that don't wrinkle in a suitcase" started matching with pages about "travel-friendly non-crease trousers," a connection the old system could never make.

The Ripple Effect on User Engagement

Beyond CTR, the secondary metrics painted a picture of a much-improved user experience. The 63.5% reduction in zero-result searches was massive. Users were finding something useful far more often. Consequently, they spent 31% more time on the site and were 27% less likely to bounce back to the search results page, indicating higher satisfaction with the content they found.

Practical Steps to Implement QEE in Your System

While this may seem complex, the components to build a QEE system are more accessible than ever. Here's a simplified roadmap:

Step 1: Choose and Fine-Tune Your Embedding Model

Start with a robust pre-trained sentence-transformer model like `all-MiniLM-L6-v2` or `multi-qa-mpnet-base-dot-v1`. While they work well out-of-the-box, fine-tuning the model on your own content (e.g., using query-document pairs) will yield a significant performance boost.

Step 2: Build Your Vector Index

You need to generate embeddings for the content you want to be searchable (e.g., product titles, article headings, document chunks). Then, load these vectors and their corresponding text into a managed vector database like Pinecone, Weaviate, or Milvus. This database will handle the complex task of finding the most similar vectors for a given query vector.

Step 3: Integrate into Your Search Logic

Modify your search application's backend. When a query comes in, first send it to your embedding model, then to the vector database to retrieve semantically similar terms. Append these terms to the original query and send the combined, expanded query to your existing full-text search engine like Elasticsearch or Solr.

Conclusion: The Future of Search is Semantic

Our 2025 test proved that semantic understanding is no longer a futuristic concept; it is a practical and powerful tool for improving search performance today. The 37% CTR boost is a direct result of respecting user intent over literal keywords. By adopting Query Expansion Embedding, we moved from a search engine that simply matches strings to one that genuinely understands concepts. This is the future of digital discovery, and it's a future that promises a more intuitive, efficient, and satisfying experience for every user.