It was reported that OpenAI is working on a search engine that would directly challenge Google. But details missing from the report raise questions about whether OpenAI is creating a standalone search engine or if there’s another reason for the announcement.
OpenAI Web Search Report
The report published on The Information relates that OpenAI is developing a Web Search product that will directly compete with Google. A key detail of the report is that it will be partly powered by Bing, Microsoft’s search engine. Apart from that there are no other details, including whether it will be a standalone search engine or be integrated within ChatGPT.
All reports note that it will be a direct challenge to Google so let’s start there.
1. Is OpenAI Mounting A Challenge To Google?
OpenAI is said to be using Bing search as part of the rumored search engine, a combination of a GPT-4 with Bing Search, plus something in the middle to coordinate between the two .
In that scenario, what OpenAI is not doing is developing its own search indexing technology, it’s using Bing.
What’s left then for OpenAI to do in order to create a search engine is to devise how the search interface interacts with GPT-4 and Bing.
And that’s a problem that Bing has already solved by using what it Microsoft calls an orchestration layer. Bing Chat uses retrieval-augmented generation (RAG) to improve answers by adding web search data to use as context for the answers that GPT-4 creates. For more information on how orchestration and RAG works watch the keynote at Microsoft Build 2023 event by Kevin Scott, Chief Technology Officer at Microsoft, at the 31:45 minute mark here).
If OpenAI is creating a challenge to Google Search, what exactly is left for OpenAI to do that Microsoft isn’t already doing with Bing Chat? Bing is an experienced and mature search technology, an expertise that OpenAI does not have.
Is OpenAI challenging Google? A more plausible answer is that Bing is challenging Google through OpenAI as a proxy.
2. Does OpenAI Have The Momentum To Challenge Google?
ChatGPT is the fastest growing app of all time, currently with about 180 million users, achieving in two months what took years for Facebook and Twitter.
Yet despite that head start Google’s lead is a steep hill for OpenAI to climb. Consider that Google has approximately 3 to 4 billion users worldwide, absolutely dwarfing OpenAI’s 180 million.
Assuming that all 180 million OpenAI users performed an average of 4 searches per day, the daily number of searches could reach 720 million searches per day.
Statista estimates that there are 6.3 million searches on Google per minute which equals over 9 billion searches per day.
If OpenAI is to compete they’re going to have to offer a useful product with a compelling reason to use it. For example, Google and Apple have a captive audience on mobile device ecosystem that embeds them into the daily lives of their users, both at work and at home. It’s fairly apparent that it’s not enough to create a search engine to compete.
Realistically, how can OpenAI achieve that level of ubiquity and usefulness?
OpenAI is facing an uphill battle against not just Google but Microsoft and Apple, too. If we count Internet of Things apps and appliances then add Amazon to that list of competitors that already have a presence in billions of users daily lives.
OpenAI does not have the momentum to launch a search engine to compete against Google because it doesn’t have the ecosystem to support integration into users lives.
3. OpenAI Lacks Information Retrieval Expertise
Search is formally referred to as Information Retrieval (IR) in research papers and patents. No amount of searching in the Arxiv.org repository of research papers will surface papers authored by OpenAI researchers related to information retrieval. The same can be said for searching for information retrieval (IR) related patents. OpenAI’s list of research papers also lacks IR related studies.
It’s not that OpenAI is being secretive. OpenAI has a long history of publishing research papers about the technologies they’re developing. The research into IR does not exist. So if OpenAI is indeed planning on launching a challenge to Google, where is the smoke from that fire?
It’s a fair guess that search is not something OpenAI is developing right now. There are no signs that it is even flirting with building a search engine, there’s nothing there.
4. Is The OpenAI Search Engine A Microsoft Project?
There is substantial evidence that Microsoft is furiously researching how to use LLMs as a part of a search engine.
All of the following research papers are classified as belonging to the fields of Information Retrieval (aka search), Artificial Intelligence, and Natural Language Computing.
Here are few research papers just from 2024:
Enhancing human annotation: Leveraging large language models and efficient batch processing
This is about using AI for classifying search queries.
Structured Entity Extraction Using Large Language Models
This research paper discovers a way to extracting structured information from unstructured text (like webpages). It’s like turning a webpage (unstructured data) into a machine understandable format (structured data).
Improving Text Embeddings with Large Language Models (PDF version here)
This research paper discusses a way to get high-quality text embeddings that can be used for information retrieval (IR). Text embeddings is a reference to creating a representation of text in a way that can be used by algorithms to understand the semantic meanings and relationships between the words.
The above research paper explains the use:
“Text embeddings are vector representations of natural language that encode its semantic information. They are widely used in various natural language processing (NLP) tasks, such as information retrieval (IR), question answering…etc. In the field of IR, the first-stage retrieval often relies on text embeddings to efficiently recall a small set of candidate documents from a large-scale corpus using approximate nearest neighbor search techniques.”
There’s more research by Microsoft that relates to search, but these are the ones that are specifically related to search together with large language models (like GPT-4.5).
Following the trail of breadcrumbs leads directly to Microsoft as the technology powering any search engine that OpenAI is supposed to be planning… if that rumor is true.
5. Is Rumor Meant To Steal Spotlight From Gemini?
The rumor that OpenAI is launching a competing search engine was published on February 14th. The next day on February 15th Google announced the launch of Gemini 1.5, after announcing Gemini Advanced on February 8th.
Is it a coincidence that OpenAI’s announcement completely overshadowed the Gemini announcement the next day? The timing is incredible.
At this point the OpenAI search engine is just a rumor.
Featured Image by Shutterstock/rafapress