Compare the Most Common LLM’s and Chatbots
What Matters To You? (quick refresher)

Model-by-model briefs
Claude Opus 4 & Sonnet 4 (May 22, 2025)
– Hybrid reasoning; extended thinking + web search; improved memory (dev/file‑assisted) and deployment via Bedrock & Vertex AI. Great for long, thoughtful answers with tools. (Anthropic)
– Thinks well and for longer. Can search the web. Better memory with files. Runs on Bedrock and Vertex AI. Good for long, careful answers with tools.
DeepSeek R1
– Open‑weight reasoning model; devs self‑host and add their own retrieval; good for transparent thinking pipelines; requires your search layer for citations. (DeepResearch.wiki)
– Open model you can host yourself. You add your own search and citations. Good when you want visible “how it thought” steps.
GPT‑OSS 120B (Aug 5, 2025)
– Apache‑2.0 open‑weight 120B from OpenAI; for RAG/grounding experiments and cost‑controlled deployments where you control sources/citations. (OpenAI)
– You control hosting, sources, and cost. Good for grounding/RAG experiments.
GPT‑4.1 (Apr 14, 2025)
– 1M‑token context; API‑first; better instruction following/coding; add retrieval to expose brand sources; later surfaced in ChatGPT tiers. (OpenAI)
– Handles very long inputs (about 1M tokens). Strong at following instructions and coding. Add retrieval to show your brand sources. Later appeared in ChatGPT tiers.
GPT‑ 4o / 4o in ChatGPT
– Default in ChatGPT from Apr 30, 2025 to Aug 7, 2025; Search mode yields linked sources; target this for GPR because of huge user reach. (TechCrunch)
– “Search” mode shows linked sources.
GPT‑ 4o with Web Search
– Run the ChatGPT “Search/Browse” flow for audit screenshots; answers carry citations (ideal for GPR evidence). (OpenAI)
GPT‑5 Chat & GPT‑5 mini (Aug 2025)
– New flagship & small variants; gpt‑5‑chat as general tier; gpt‑5‑mini for speed; both can be search‑grounded in ChatGPT/tools. (OpenAI)
– New flagship and a faster small model. Both can use search grounding.
Gemini 2.5 (Pro/Flash)
– Search‑grounded in app and Vertex AI; strong long‑context; enterprise grounding controls; note Google account data can shape personalization if enabled. (Google Cloud)
– Note: if Google account personalization is ON, it may shape results
Gemini 2.5 with Web Search
– Use explicit “with web” mode for citation‑rich answers and faster inclusion of your brand updates. (Google Cloud)
– Turn on “with web” for fast, cited answers and quick pickup of your brand updates
Grok 4
– New xAI flagship integrated with X; premium plan; can perform web lookups; public reporting scrutinizes source selection/bias—capture citations carefully in audits. (TechCrunch)
– Can look up the web. Watch for source mix and bias.
Meta: Llama 3.3 (70B Instruct)
– Open weights, 128k context; text‑only; great backbone for your RAG tests; not search‑grounded by default. (Hugging Face)
– Text-only. Great base for your own data (RAG).
Meta: Llama 4 Scout & Maverick (Apr 5, 2025)
– New MoE open‑weight models published on Hugging Face; use with your own retrieval for deterministic, citation‑rich demos. (Hugging Face)
– New open models.
Perplexity Sonar (with Web Search)
– Purpose‑built AI search with citations and API pricing; Sonar Pro/Reasoning tiers; “No training on customer data.” Strong baseline for GPR because it mirrors AI Overviews behavior (multi‑source). (Perplexity)
– Built for AI search. Always cites.
Qwen3 32B
– Open weights, 32.8B, 32–128k context depending on build; supports “thinking” and “non‑thinking” modes; add your own search/citations. (Hugging Face)
– Handles long inputs. Has “thinking” and “non-thinking” modes.
o3 (OpenAI)
– First OpenAI reasoning model with full ChatGPT tool access (browsing, Python, image/file analysis); use “Search” for cited answers. (OpenAI)
Got Questions? We’ve Got Answers
Start with the default consumer models that support web citations: ChatGPT (GPT‑4o/5 with Search), Perplexity Sonar, and Gemini 2.5 with Search. These reflect updates fastest and show your sources directly, which helps your GPR Score. (OpenAI)
Personalization & memory. ChatGPT can use history unless disabled; Gemini may use Google account data if allowed; Claude’s new memory is opt‑in/project‑scoped. Document tester settings in your audits. (OpenAI Help Center)
Expect months between crawl, training, and release; in the meantime, publish structured, crawlable content and rely on search‑grounded runs for time‑sensitive facts.
Sources (key references)
- Anthropic Claude 4 announcement (Opus 4 & Sonnet 4; tool use with web search, pricing, availability). (Anthropic)
- OpenAI GPT‑4.1 (1M context, API), GPT‑4o (omni, default in ChatGPT Apr 30, 2025), ChatGPT Search, o3 system card, GPT‑5 and gpt‑5‑mini (Chat & API). (OpenAI)
- Perplexity Sonar API & pricing (citations, no training on customer data). (Perplexity)
- Gemini 2.5 & Search grounding (Vertex AI docs); privacy/account data references. (Google Cloud)
- Grok 4 launch & context. (TechCrunch)
- Meta Llama 3.3 and Llama 4 (Scout/Maverick) open‑weights. (Hugging Face)
- Qwen3 32B model cards. (Hugging Face)
- OpenAI GPT‑OSS 120B (open weights). (OpenAI)