
SEO Research Suite - The SEO and LLMO / GEO thought leading podcast

Author: Olaf Kopp


Description

2-3 times a week, this podcast discusses Google patents, research papers, and other hot topics such as E-E-A-T, LLMO, Generative Engine Optimization (GEO), semantic search, and ranking.

This podcast gives you exclusive insights into SEO and LLMO based on fundamental research into SEO-relevant patents, research papers, and Google leaks analyzed for the SEO Research Suite: https://www.kopp-online-marketing.com/seo-research-suite

Follow now so you don't miss any insights!
84 Episodes
This episode focuses on a patent by Microsoft Technology Licensing LLC for a system designed to deliver reliable, expert-verified information in response to user queries. This system aims to combat misinformation from traditional search engines and generative AI by accessing an expert knowledge base containing only answers from verified expert identifiers. When a query is submitted, the system classifies its field of expertise, converts the query into a vector, and then searches the expert knowledge base for a closely matching, pre-existing expert answer, delivering it directly without modification. If no answer is found, the system can obtain a new one from a verified expert. For content creators, this system signifies a shift from traditional SEO to establishing verifiable authority and producing highly focused, accurate content within a specific field of expertise, as the value lies in the direct, authoritative answer rather than website traffic. https://www.kopp-online-marketing.com/patents-papers/systems-and-methods-for-providing-reliable-information-for-queries
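To make the retrieval step concrete, here is a minimal Python sketch (not Microsoft's implementation) of matching a query vector against a small expert knowledge base; the embed() function is a hypothetical stand-in for a real embedding model, and the threshold is illustrative.

```python
import numpy as np

# Tiny, invented "expert knowledge base": verified question -> verified answer.
expert_kb = {
    "How often should adults get a tetanus booster?": "Every 10 years, per standard guidance.",
    "Is ibuprofen safe with blood thinners?": "Generally discouraged; consult your physician.",
}

def embed(text: str, dim: int = 64) -> np.ndarray:
    # Hypothetical stand-in for a real embedding model: hash words into a fixed-size unit vector.
    vec = np.zeros(dim)
    for token in text.lower().split():
        vec[hash(token) % dim] += 1.0
    norm = np.linalg.norm(vec)
    return vec / norm if norm else vec

def answer(query: str, threshold: float = 0.6):
    q = embed(query)
    best_q, best_score = None, -1.0
    for kb_query in expert_kb:
        score = float(np.dot(q, embed(kb_query)))  # cosine similarity (unit vectors)
        if score > best_score:
            best_q, best_score = kb_query, score
    if best_score >= threshold:
        return expert_kb[best_q]   # delivered verbatim, without modification
    return None                    # would trigger obtaining a new answer from a verified expert

print(answer("How often do adults need a tetanus booster?"))
```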
This episode focuses on the article "What we can learn from DOJ trial and API Leak for SEO?" by Olaf Kopp. It examines recent disclosures from the DOJ antitrust trial against Google and a 2024 Google API leak. The author uses a Google Leak Analyzer to compile and summarize these insights, focusing on how they reveal the inner workings of Google's search algorithms and ranking systems. The piece explores key areas such as the role of user signals, the use of click data through systems like Navboost and Glue, and the significance of E-E-A-T (Experience, Expertise, Authoritativeness, Trustworthiness) in quality evaluation. Additionally, it discusses algorithm development, the impact of Generative AI (GenAI) on search, and provides conclusions for SEO professionals based on these newly revealed mechanisms. https://www.kopp-online-marketing.com/what-we-can-learn-from-doj-trial-and-api-leak-for-seo
The Google patent discussed in this episode describes a machine-learned system for personalizing sequence processing models, such as large language models, by integrating user preferences and contextual data. It outlines a method where an embedding model creates representations of a user's history, which are then combined with task instructions to generate tailored outputs. The system leverages knowledge graphs to enrich understanding of relationships and facilitate dynamic adaptation to user behavior, ultimately improving the accuracy and relevance of personalized recommendations. The approach aims to enhance the generative capabilities of AI systems by reducing cognitive load and supporting complex queries through dynamically updated user embeddings.
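As a rough illustration of the personalization idea, the following Python sketch combines an embedding of a user's history with a task instruction into one conditioning vector; the embed() helper, the history events, and the dimensions are illustrative assumptions, not taken from the patent.

```python
import numpy as np

def embed(text: str, dim: int = 32) -> np.ndarray:
    # Hypothetical bag-of-words hashing embedder standing in for a trained embedding model.
    vec = np.zeros(dim)
    for token in text.lower().split():
        vec[hash(token) % dim] += 1.0
    norm = np.linalg.norm(vec)
    return vec / norm if norm else vec

# Dynamically updated user profile: the mean of embedded history events.
user_history = ["searched hiking boots", "watched alpine trail videos", "bought trail mix"]
user_embedding = np.mean([embed(event) for event in user_history], axis=0)

# Combine the user representation with the task instruction before handing it to a sequence model.
task_instruction = "recommend a weekend activity"
model_input = np.concatenate([user_embedding, embed(task_instruction)])
print(model_input.shape)  # (64,) -> personalized conditioning vector for the sequence processing model
```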
This episode outlines a Microsoft patent for a generative search engine results system designed to create interactive and comprehensive search result documents using large generative models (LGMs). The system addresses the limitations of traditional search by structuring information into organized topics with visual layouts and answer cards. It operates by receiving a user query, obtaining search links, and then using multiple LGMs to generate unformatted content, match answer cards to relevant sections, and create layout guidelines before producing a formatted document. The system also details strategies for handling conflicting information, incorporating user personalization, validating answer accuracy, and capturing user intent, all while discussing implications for content creation and search engine optimization. https://www.kopp-online-marketing.com/patents-papers/generative-search-engine-results-documents
This episode focuses on a Google patent (US10803380B2) detailing a method for generating vector representations of documents using a trained neural network system. This process involves unsupervised training to capture semantic similarities between documents, moving beyond traditional keyword matching. Such vector embeddings enable improved document retrieval and ranking in search engines by understanding contextual meaning and allowing for dynamic, personalized search algorithms. Ultimately, understanding this process can inform content creation strategies for better semantic relevance and search engine optimization. https://www.kopp-online-marketing.com/patents-papers/generating-vector-representations-of-documents
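For intuition, here is a hedged stand-in for the retrieval behavior such document vectors enable: documents and a query are turned into vectors and ranked by cosine similarity. TF-IDF is used here purely as a substitute for the patented neural embedding model, and the documents are invented.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

docs = [
    "How semantic search engines rank documents with embeddings",
    "A recipe for sourdough bread with a long fermentation",
    "Vector representations let search go beyond exact keyword matching",
]

# Turn every document into a vector (TF-IDF here; a trained neural network in the patent).
vectorizer = TfidfVectorizer()
doc_vectors = vectorizer.fit_transform(docs)

# Embed the query in the same space and rank documents by cosine similarity.
query_vector = vectorizer.transform(["semantic document embeddings for ranking"])
scores = cosine_similarity(query_vector, doc_vectors)[0]

for score, doc in sorted(zip(scores, docs), reverse=True):
    print(f"{score:.2f}  {doc}")
```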
The Google patent discussed here deals with extracting information from Question and Answer (Q&A) websites to enhance information retrieval, particularly for search engines. This system identifies questions and answers, then extracts and scores relationships between entities mentioned within the text based on their frequency across multiple sources. The patent details a step-by-step methodology for this extraction, from accessing Q&A databases to establishing and scoring entity relationships. Furthermore, the text explores how the insights gained from this process can be applied to improve SEO strategies by analyzing common question patterns, identifying content gaps, and creating content that clearly structures questions and answers to boost relevance for both users and Large Language Models (LLMs). https://www.kopp-online-marketing.com/patents-papers/information-extraction-from-question-and-answer-websites
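A simplified sketch of the scoring idea, assuming entity extraction has already been done with a fixed entity list: count how often two entities co-occur in answers across Q&A sources and use that frequency as the relationship score. The answers and entities below are invented for illustration.

```python
from collections import Counter
from itertools import combinations

qa_answers = [
    "Paris is the capital of France and sits on the Seine.",
    "France borders Germany and Spain; its capital is Paris.",
    "The Seine flows through Paris.",
]
entities = ["Paris", "France", "Seine", "Germany", "Spain"]  # pretend output of an entity extractor

pair_counts = Counter()
for answer in qa_answers:
    found = [e for e in entities if e in answer]
    for a, b in combinations(sorted(found), 2):
        pair_counts[(a, b)] += 1  # frequency across sources acts as the relationship score

for pair, score in pair_counts.most_common():
    print(pair, score)
```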
The document discussed in this episode introduces Test-Time Diffusion Deep Researcher (TTD-DR), a novel framework from Google that significantly enhances deep research agents powered by Large Language Models (LLMs) by mimicking human writing cycles. This approach models research report generation as a diffusion process involving planning, drafting, and continuous refinement through retrieval mechanisms and self-evolutionary algorithms. The methodology outlines steps from research plan generation and iterative search and synthesis to self-evolution and report-level denoising with retrieval, culminating in a final report. Automated feedback mechanisms and dynamic query generation through "query fan-out" are crucial for refining drafts, ensuring comprehensive and accurate outputs on complex research tasks. https://www.kopp-online-marketing.com/patents-papers/deep-researcher-with-test-time-diffusion
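As a loose sketch of the draft-and-refine loop (not Google's TTD-DR code), the snippet below repeatedly retrieves evidence for the weakest section of a draft and revises it within a fixed budget; retrieve() and revise() are hypothetical placeholders for search and LLM calls, and the "weakest section" heuristic is invented.

```python
def retrieve(section: str) -> str:
    # Placeholder for a retrieval/search call guided by the current draft.
    return f"[evidence for: {section}]"

def revise(section: str, evidence: str) -> str:
    # Placeholder for an LLM revision step that "denoises" the section with retrieved evidence.
    return f"{section} (refined with {evidence})"

draft = ["Intro: why query fan-out matters", "Findings: TODO", "Conclusion: TODO"]

for step in range(3):  # fixed refinement budget
    weakest = min(range(len(draft)), key=lambda i: len(draft[i]))  # crude proxy for the "noisiest" section
    evidence = retrieve(draft[weakest])
    draft[weakest] = revise(draft[weakest], evidence)

print("\n".join(draft))
```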
This episode discusses a comprehensive article covering the evolution of search technology, specifically focusing on the transition from query refinement and query augmentation to the more advanced query fan-out technique in the age of generative AI and AI agents. It explains how query fan-out expands a single user query into multiple sub-queries to retrieve more comprehensive and personalized results, particularly within Google's AI Overviews and AI Mode. The sources also highlight the crucial role of Large Language Models (LLMs) in generating synthetic queries and various query variants to enhance search accuracy and address diverse user intents. This advanced approach significantly impacts traditional keyword research by moving towards a more dynamic and context-aware information retrieval process. https://www.kopp-online-marketing.com/from-query-refinement-to-query-fan-out-search-in-times-of-generative-ai-and-ai-agents
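A minimal sketch of query fan-out, under the assumption that the facet list is hand-written; in AI Mode-style systems an LLM would generate these sub-queries dynamically from context and user intent.

```python
def fan_out(query: str) -> list[str]:
    # Hand-written facets standing in for LLM-generated sub-queries.
    facets = ["cost", "best time", "step-by-step guide", "common mistakes", "alternatives"]
    sub_queries = [f"{query} {facet}" for facet in facets]
    sub_queries.append(query)  # keep the original query as well
    return sub_queries

# Each sub-query would be retrieved separately and the results synthesized into one answer.
for q in fan_out("moving to Denver"):
    print(q)
```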
The patent discussed here, analyzed for the SEO Research Suite, centers on methods and systems for identifying and utilizing "aspects" within search queries, possibly for query fan-out, particularly queries containing entities, to enhance search result organization. This technology helps categorize information by different characteristics associated with a searched entity, like "beaches" or "hotels" for "Hawaii." https://www.kopp-online-marketing.com/patents-papers/identifying-query-aspects
This episode explores advancements in Maximum Inner Product Search (MIPS), a crucial technique for vector similarity search in machine learning and information retrieval. Several sources highlight Google's ScaNN library and its enhancements like SOAR (Spilling with Orthogonality-Amplified Residuals), which boost efficiency and accuracy in finding similar data points. The concept of Anisotropic Vector Quantization is also introduced as a key innovation in ScaNN for better inner product estimation. Furthermore, the texts discuss REALM (Retrieval-Augmented Language Model Pre-training), which integrates MIPS to enable language models to explicitly retrieve knowledge, and MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings), presenting novel graph-based methods like PSP (Proximity Graph with Spherical Pathway) and Adaptive Early Termination (AET) to optimize MIPS, with real-world applications in e-commerce search engines. Collectively, these sources emphasize the shift towards semantic understanding in search and its implications for SEO strategies. https://www.kopp-online-marketing.com/what-is-mips-maximum-inner-product-search-and-its-impact-on-seo
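For reference, this is exact (brute-force) MIPS by definition, not ScaNN's API: every database vector is scored by its inner product with the query and the top-k are kept. Libraries like ScaNN replace this loop with quantization and pruning to stay fast at scale; the random data below is only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
database = rng.normal(size=(10_000, 128))   # item embeddings
query = rng.normal(size=(128,))             # query embedding

scores = database @ query                   # inner products, shape (10_000,)

k = 5
top_k = np.argpartition(-scores, k)[:k]     # indices of the k largest inner products (unordered)
top_k = top_k[np.argsort(-scores[top_k])]   # sort the k winners by score

print(top_k, scores[top_k])
```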
This episode discusses the patent "Subquery generation from a query", which focuses on processing complex search queries. The system aims to break down a single, elaborate query into multiple subqueries (query fan-out), enhancing efficiency for users. This methodology can also be applied to splitting prompts for Retrieval Augmented Generation (RAG), indicating its relevance beyond traditional search. https://www.kopp-online-marketing.com/patents-papers/subquery-generation-from-a-query
This episode discusses thoughts by Olaf Kopp, an expert in semantic SEO, Generative Engine Optimization (GEO) and AI search technology, on Large Language Model Optimization (LLMO), also known as Generative Engine Optimization (GEO). It explains that LLM readability and chunk relevance are the most crucial factors for content to be cited by generative AI systems like Google AI Mode and ChatGPT. The text details how AI search systems utilize a grounding process through Retrieval-Augmented Generation (RAG) to enhance responses by incorporating external, relevant information. It further breaks down the specific factors contributing to both LLM readability, such as natural language quality and clear structuring, and chunk relevance, emphasizing the semantic similarity between queries and content segments. The author developed these concepts to help content creators optimize their material for improved visibility and citation in AI-generated overviews. https://www.kopp-online-marketing.com/llm-readability-chunk-relevance-the-most-influential-factors-to-become-citation-worthy-by-llms
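To make "chunk relevance" tangible, here is a hedged sketch that splits a page into passages and ranks them by semantic similarity to a query; TF-IDF stands in for the embedding model a real RAG grounding step would use, and the page text is invented.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

page = (
    "LLM readability means clear, self-contained passages. "
    "Chunk relevance measures how closely a passage matches the query. "
    "Our office dog enjoys long walks at lunchtime."
)

# Chunk the page into passages (here: sentences).
chunks = [c.strip().rstrip(".") + "." for c in page.split(". ") if c.strip()]

# Embed chunks and query in the same space, then rank chunks by similarity to the query.
vectorizer = TfidfVectorizer().fit(chunks)
chunk_vecs = vectorizer.transform(chunks)
query_vec = vectorizer.transform(["which passage answers the user's query best"])

scores = cosine_similarity(query_vec, chunk_vecs)[0]
for score, chunk in sorted(zip(scores, chunks), reverse=True):
    print(f"{score:.2f}  {chunk}")  # high-scoring chunks are the citation-worthy ones
```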
This episode primarily discusses generative retrieval, an emerging approach in information retrieval that directly maps user queries to document identifiers using sequence-to-sequence models, contrasting it with traditional methods like dual encoders and retrieval-augmented generation (RAG). A central theme is the scalability challenge of generative retrieval, particularly when expanding to millions of documents, highlighting the critical role of synthetic query generation (query fan out) in improving performance and bridging the gap between document indexing and retrieval. The text also explores various document identifier (DocID) representation techniques and their implications for efficiency and scalability. Finally, it offers best practices for optimizing web content for better retrieval by generative models, emphasizing structured content, clear language, and SEO strategies for Large Language Model Optimization (LLMO).
Today, we're diving deep into a topic that's fundamentally reshaping our digital world: the future of semantic and generative search.

We've come a long way from the early days of the internet, when search engines primarily relied on basic keyword matching. If your query didn't contain the exact words, you often missed out on relevant information. Our journey then evolved to a more sophisticated phrase-based understanding, where systems began to identify meaningful sequences of words and their interrelationships, considering factors like information gain to determine how well one phrase predicts another.

Now, we are firmly in an era of contextual, generative passage retrieval. Modern search isn't just about showing you a list of links; it's about providing direct, precise answers. This involves sophisticated techniques like extracting candidate answer passages from top-ranked resources, scoring them based on query-dependent and query-independent factors, and even taking into account their hierarchical position within a document.

We'll explore how systems are generating thematic search results, where content is automatically clustered into short, descriptive themes like "cost of living" or "neighborhoods" for a query about "moving to Denver". This allows for a guided, drill-down exploration without manually re-typing queries. We'll also discuss how cutting-edge approaches like GINGER (Grounded Information Nugget-Based Generation of Responses) are breaking down content into "atomic information units" or "nuggets" to ensure factual accuracy, prevent hallucinations, and facilitate source attribution. This approach even utilizes synthetic queries to bridge the gap between document indexing and retrieval tasks, training models to understand a broader range of user intents.

What does all this mean for you, the content creator? It means the game has changed. To thrive, you must adapt by focusing on:

Highly Structured Content: Utilizing clear headings and subheadings, structured data markup like Schema.org, and organized lists or bullet points.
Semantically Rich Content: Emphasizing phrases over individual words, optimizing for depth of content and ensuring comprehensive passage coverage.
User-Centric and Readable Content: Crafting clear, concise, and often simplified answers, directly addressing potential user questions, and monitoring user engagement metrics like pogo-sticking.

Perhaps most critically, the ongoing importance of factual accuracy, verifiable sources, and clarity remains paramount in this age of AI-generated responses. Stay with us as we delve into these topics and uncover actionable strategies to help your content stand out in the evolving search ecosystem. https://www.kopp-online-marketing.com/the-evolution-of-search-from-phrase-indexing-to-generative-passage-retrieval
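As a small illustration of the thematic clustering described above (a sketch, not the patented or described system), the snippet below groups passage summaries into themes with k-means over TF-IDF vectors; the summaries are invented and a language model would normally produce both the summaries and the theme labels.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

summaries = [
    "Average rent and grocery prices in Denver",
    "Monthly cost of living breakdown for Denver households",
    "Guide to Denver neighborhoods like Capitol Hill and LoDo",
    "Which Denver neighborhoods suit families best",
]

# Embed the summaries and cluster them; each cluster becomes a drill-down "theme"
# such as "cost of living" or "neighborhoods" for the query "moving to Denver".
vectors = TfidfVectorizer().fit_transform(summaries)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(vectors)

for theme_id in set(labels):
    members = [s for s, l in zip(summaries, labels) if l == theme_id]
    print(f"Theme {theme_id}: {members}")
```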
This episode discusses REGEN, a unique dataset designed to improve conversational recommender models by incorporating natural language critiques and rich narratives from Amazon Product Reviews. Unlike traditional datasets focusing on sequential predictions, REGEN enhances Large Language Models (LLMs) by providing user feedback, product endorsements, purchase reasons, and user summaries, all personalized. This approach aims to create more engaging and personalized recommendations that mirror natural human interaction. Furthermore, the documents explore how insights from these critiques can inform SEO strategies, optimizing product listings for e-commerce through keyword optimization, content enrichment, tailored marketing campaigns, and improved user experience, ultimately enhancing LLM optimization and visibility in generative AI contexts. https://www.kopp-online-marketing.com/patents-papers/regen-a-dataset-and-benchmarks-with-natural-language-critiques-and-narratives
This episode discusses ChatGPT Shopping, a new AI-powered product discovery system that allows users to find and purchase products through conversational queries rather than traditional keyword searches. These sources highlight that ChatGPT Shopping delivers personalized recommendations by analyzing user intent and drawing data from various platforms, including structured product feeds, reviews, forums, and third-party comparison sites. For businesses, optimizing for this shift involves ensuring website crawlability, implementing structured product data (Schema.org/JSON-LD), maintaining a strong presence on third-party platforms, and preparing for direct feed and API submissions to maximize visibility and sales in this evolving e-commerce landscape. The articles emphasize that product visibility relies on data quality and relevance, not paid advertising, making early optimization crucial for competitive advantage. https://www.kopp-online-marketing.com/how-to-optimize-for-chatgpt-shopping
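As one concrete example of the structured product data recommendation, the following Python snippet emits a Schema.org Product as JSON-LD; all field values are illustrative, and the output would be embedded in a <script type="application/ld+json"> tag on the product page.

```python
import json

product_jsonld = {
    "@context": "https://schema.org",
    "@type": "Product",
    "name": "Trail Running Shoe X1",
    "description": "Lightweight trail running shoe with reinforced toe cap.",
    "aggregateRating": {"@type": "AggregateRating", "ratingValue": "4.6", "reviewCount": "213"},
    "offers": {
        "@type": "Offer",
        "priceCurrency": "EUR",
        "price": "119.95",
        "availability": "https://schema.org/InStock",
    },
}

# Machine-readable product data that AI shopping systems can parse for price, availability, and ratings.
print(json.dumps(product_jsonld, indent=2))
```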
The Google patent discussed in this episode describes an innovative system that uses cascaded neural networks to efficiently extract accurate answers from electronic documents in response to user questions. The process involves tokenizing input, identifying candidate text spans, generating numeric representations, and scoring unique spans based on their relevance and context, including how question tokens relate to document segments. The system emphasizes lightweight neural network architectures for efficiency, enabling applications in voice assistants, search engines, and mobile devices. Ultimately, it aims to deliver precise, contextually aligned answer spans, even handling ambiguities by scoring and selecting the best match through a layered approach, with validation against ground truth data to continuously improve accuracy. https://www.kopp-online-marketing.com/patents-papers/selecting-answer-spans-from-electronic-documents-using-neural-networks
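A toy stand-in for the span-selection flow: candidate spans (sentences here) are scored against the question tokens and the best one is returned. The patent scores sub-sentence spans with cascaded neural networks; plain token overlap is used below only to make the candidate/score/select pipeline concrete, and the example document is invented.

```python
def best_answer_span(question: str, document: str) -> str:
    q_tokens = set(question.lower().split())
    # Candidate spans: sentences of the document (a real system enumerates sub-sentence spans).
    candidates = [s.strip() for s in document.split(".") if s.strip()]

    def score(span: str) -> float:
        tokens = span.lower().split()
        # Overlap with the question, normalized by span length (stand-in for a learned scorer).
        return len(q_tokens & set(tokens)) / len(tokens)

    return max(candidates, key=score)

doc = ("Construction of the tower began in 1887. "
       "The Eiffel Tower was completed in 1889 after about two years of work. "
       "It remains one of the most visited monuments in Paris.")
print(best_answer_span("When was the Eiffel Tower completed", doc))
```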
This patent application from Google details a system for evaluating the effectiveness of substitute terms in search queries. It describes a process where co-occurrence frequencies of terms are analyzed to create vectors, which are then compared to determine the suitability of a candidate term as a replacement for an original one. This method helps refine search results by identifying relevant synonyms and improving contextual understanding within search engines. The system also enables the identification and elimination of "bad contexts" that lead to irrelevant substitutions, ultimately enhancing search accuracy and user satisfaction. https://www.kopp-online-marketing.com/patents-papers/evaluation-of-substitute-terms
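A hedged sketch of the co-occurrence idea: build a context vector for each term from the words it appears next to and compare the vectors; a candidate substitute whose vector is close to the original term's is a plausible synonym in that context. The tiny corpus and the term pair are invented for illustration.

```python
import numpy as np
from collections import Counter

corpus = [
    "cheap flights to berlin", "low cost flights to berlin",
    "cheap hotels in berlin", "low cost hotels in berlin",
    "cheap thrills song lyrics",
]

def context_vector(term: str, vocab: list[str]) -> np.ndarray:
    # Count the words that co-occur with the term across the corpus.
    counts = Counter()
    for sentence in corpus:
        words = sentence.split()
        if term in words:
            counts.update(w for w in words if w != term)
    vec = np.array([counts[w] for w in vocab], dtype=float)
    norm = np.linalg.norm(vec)
    return vec / norm if norm else vec

vocab = sorted({w for s in corpus for w in s.split()})
similarity = float(np.dot(context_vector("cheap", vocab), context_vector("cost", vocab)))
print(f"context similarity cheap ~ cost: {similarity:.2f}")  # high value -> plausible substitution
```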
The Google patent discussed here concerns a system and method for generating diverse query variants using a trained generative model, particularly a neural network. This system aims to improve search result retrieval by creating real-time variations of user queries, even for rare or novel searches, and supports various types of variants like equivalent or follow-up questions. It incorporates user and contextual attributes to personalize query generation and uses a feedback loop with reinforcement learning to continuously adapt to changing user behavior and optimize performance. Content creators can leverage insights from these generated variants to refine their content strategies, enhance keyword targeting, and improve SEO efforts by aligning content with user intent and evolving search patterns. https://www.kopp-online-marketing.com/patents-papers/generating-query-variants-using-a-trained-generative-model
It's time for another exciting Google patent. The patent discussed here describes a "thematic search" system, which aims to enhance traditional web search results. This system generates concise summaries of passages from top-ranked documents related to a user's query and then clusters these summaries to form "themes". These themes, presented alongside regular search results, allow users to navigate subtopics without needing to manually refine their queries. The patent details the process of generating these themes, including summarization and clustering by a language model, and how themes are ranked based on factors like prominence and relevance. Furthermore, the sources outline real-world applications and SEO implications for content creators aiming to optimize their material for such a thematic search interface. https://www.kopp-online-marketing.com/patents-papers/thematic-search