What's the difference between a web search API and a reader/scraper like Firecrawl?

A search API answers "which URLs are relevant to this query"; a reader answers "turn this URL I already have into clean text." Agents usually need both — search to discover pages, a reader to ingest a specific one — and the two are priced and built differently.

When should an agent search the open web instead of its own vector store?

Use your own index for stable, owned knowledge you've already embedded; reach for a web search API when the answer is fresh, long-tail, or outside your corpus — current events, a library released last week, a fact you never ingested. Many agents do both and merge the results.

Is a neural search engine like Exa always better than keyword search?

No. Neural search shines when the query is conceptual ("companies doing X using technique Y") and keyword search wins when you know the exact terms or need an exact-match lookup. Exa exposes both modes for that reason; the right choice is per-query, not per-vendor.

Tavily vs Exa vs Linkup: Picking a Web Search API for AI Agents

Give an agent a search box and you've made a quiet architectural decision you probably didn't notice. "Search the web" sounds like one capability, but the APIs that provide it deliver the web at wildly different stages of doneness — and the stage you pick determines how much code you write, how many tokens you burn, and what you pay per query. The vendors don't really compete on finding better pages. They compete on how much of the reading they do before they hand it back.

Line them up and the market sorts into four rungs. It is the same web underneath; the difference is how far down the crawl-clean-think pipeline the API delivers — and, almost perfectly, the price climbs each rung as more of that work moves from your process to theirs.

Rung one: raw links, you do the rest

The bottom rung is the classic SERP API — Serper wrapping Google's results, or the Brave Search API serving its own independent index. You send a query, you get back links and snippets: titles, URLs, a sentence of context. That's it. Whatever the agent actually needs to read lives behind those links, which means you still have to fetch each page, strip the nav and cookie banners, and feed the cleaned text to your model yourself.

This is the cheapest rung by an order of magnitude — raw SERP queries run a small fraction of a cent each — precisely because it does the least. You are renting a result list and bringing your own crawler, your own cleaner, and your own reasoning. For an agent that already has a page-reading layer and just needs to know which URLs to point it at, that division of labor is exactly right. For one that doesn't, the cheap query is a trap: you've bought the easy half and still owe the hard one.

Rung two: cleaned content, ready to read

Tavily was built for the next rung up. It is a search API designed for LLMs and RAG: you send a query and get back not just links but the content — extracted, cleaned, and shaped for a model to read — with a search_depth knob (basic or advanced) that trades cost for thoroughness and an optional LLM-generated answer if you want one. There's a separate extract endpoint for pulling clean text from URLs you already have. The pitch is that crawling and cleaning are undifferentiated work, so the provider should absorb them and hand you model-ready text.

That this rung is now considered strategic infrastructure isn't speculation. In February 2026, Nebius — a public AI-cloud company — announced it was acquiring Tavily to add "agentic search" to its platform, a deal reported around $275M. When a cloud provider buys a search API the way it would buy a database, the signal is clear: feeding the web to agents has graduated from a developer convenience to a layer of the stack worth owning.

Rung three: search by meaning, not by keyword

Exa (the former Metaphor) climbs sideways rather than up. Its bet is that keyword search is the wrong primitive for an agent in the first place: Google matches strings and reranks, while Exa runs a neural index that matches pages by meaning — "search the way an LLM would think about it." Ask for "companies using reinforcement learning for drug discovery" and a neural engine can return the right pages even when none of them contain those exact words, a query shape that makes a keyword API flail. It pairs /search with a /contents endpoint to clean the matches and an /answer endpoint if you want synthesis, plus "Websets" for structured bulk results.

The honest caveat is that neural isn't strictly better — it's better at conceptual recall and worse when you know the exact term and want an exact-match lookup, which is why Exa exposes a keyword mode too. And notably, Exa has stayed independent while Tavily was absorbed, a small tell that there's more than one viable theory of what this layer is. The point of the rung is recall by meaning; you still bring the reasoning, or pay a little more for /answer to do it.

Rung four: just give me the answer

The top rung skips the middle entirely. Linkup and Perplexity's Sonar API don't return pages for your model to read — they return a finished, cited answer. The provider runs the search, reads the results with its own model, and hands back prose with sources attached. Linkup leans hard on factual grounding and claims a state-of-the-art 91% on the SimpleQA benchmark — though that's a vendor self-report, the kind of number to verify against the eval before you build on it, not take as settled.

This rung is the most expensive per call because the provider is now paying for the crawl, the cleaning, and the answer-synthesis LLM — the entire pipeline as a single billed unit. The trade is real: you write almost no retrieval code and you also surrender control over how the answer was assembled and which sources it weighed. For a quick factual lookup mid-conversation it's the least work in the space. For anything where you need to inspect or re-rank the evidence, it's a black box you're paying a premium to not see into.

Nobody here is selling you a better web. They're selling you the web at four different stages of doneness, and the price is just a meter on how much reading they did before you got it.

Buy the rung, not the brand

The four products feel like competitors and mostly aren't — they're four answers to how much of the pipeline you want to own. So decide that first:

You already clean and reason over pages → a raw SERP API (Serper, Brave). Cheapest, and you keep full control of crawl and synthesis.
You want model-ready content without running a crawler → Tavily. The provider absorbs the cleaning; your LLM still does the thinking. (And it's now Nebius-backed infrastructure.)
Your queries are conceptual, not keyword-shaped → Exa. Neural recall finds pages by meaning that a string match would miss — with a keyword mode for when you do know the terms.
You want the answer, not the evidence → Linkup or Perplexity Sonar. Least code, highest per-query cost, least visibility into how the answer was built.

The expensive mistake is choosing by demo. The "give me the answer" rung dazzles in a sandbox and then hides the sourcing you needed in production; the cheap SERP rung looks like a bargain until you've rebuilt a crawler and a cleaner to make it usable. Price tracks doneness here more honestly than almost anywhere in the agent stack. Figure out how much of the reading you actually want to do — and buy exactly that much.

Rung	Examples	Returns	Who crawls & cleans	Where the answer is synthesized
Raw SERP links	Serper, Brave Search API	Result links + snippets	You fetch and clean the pages	Your LLM
Cleaned content	Tavily	LLM-ready page content (+ optional answer)	The provider	Your LLM (or Tavily's optional answer)
Neural / semantic	Exa	Pages matched by meaning, with /contents	The provider	Your LLM (or Exa /answer)
Finished answer	Linkup, Perplexity Sonar	A synthesized, cited answer	The provider	The provider's built-in LLM

Tavily vs Exa vs Linkup: Picking a Web Search API for AI Agents

Rung one: raw links, you do the rest

Rung two: cleaned content, ready to read

Rung three: search by meaning, not by keyword

Rung four: just give me the answer

Buy the rung, not the brand

Frequently asked

Dex Mareno

Dispatches from the machines, in your inbox

Tavily vs Exa vs Linkup: Picking a Web Search API for AI Agents

Rung one: raw links, you do the rest

Rung two: cleaned content, ready to read

Rung three: search by meaning, not by keyword

Rung four: just give me the answer

Buy the rung, not the brand

Frequently asked

Dex Mareno

Continue reading

pgvector vs Pinecone vs Qdrant: Picking a Vector Database in 2026

E2B vs Modal vs Daytona: Picking a Code Execution Sandbox for AI Agents

Firecrawl vs Crawl4AI vs Jina Reader: Feeding the Web to an AI Agent

Dispatches from the machines, in your inbox