Sitemap - 2025 - LLM Watch

AI Agents of the Week: Papers You Should Know About

Explained: Defeating Nondeterminism in LLM Inference

10 Papers You Should Know About

AI Agents of the Week

A Sneak Peek at Microsoft's Prototyping Playbook

9 Papers You Should Know About

AI Agents of the Week: Papers You Should Know About

10 Papers You Should Know About

AI Agents of the Week: Papers You Should Know About

9 Papers You Should Know About

AI Agents of the Week: Papers You Should Know About

10 Papers You Should Know About

AI Agents of the Week: Papers You Should Know About

10 Papers You Should Know About

AI Agents of the Week: Papers You Should Know About

Falcon-H1: The AI Chimera That Challenges The Transformer

11 Papers You Should Know About

What "Deep Agents" Are and Why It Matters

AI Agents of the Week: What You Should Know About

Nine Papers You Should Know About

AI Agents of the Week: Papers You Should Know About

8 Papers You Should Know About

AI Is Not Your Guru: Why Your Business Needs Practitioners, Not Prophets

AI Agents of the Week: Papers You Should Know About

Kimi K2: What It Is, How It Works, and Why You Should Care

The Memory Operating System for AI

AI Agents of the Week: Papers You Should Know About

Can AI Really Understand How We Think?

o4-mini, Gemini 2.5 and R1 Just Teamed Up

Scenario Testing: A New Paradigm for Making AI Agents More Reliable

AI Agents of the Week: Papers You Should Know About

The Garage Band Revolution for Software Development is Coming

🧑‍🔬 Everything To Know About Deep Research

AI Agents of the Week: Papers You Should Know About

💻 ALE-Agent: AI Coding on Steroids

The Agentic Transformation Playbook

The Week in AI Agents: Research You Should Know About

Less Thinking, More Doing: The Promises of Test-Time Interaction

The Week in AI Agents: Papers You Should Know About

🧠 How LLMs Actually "Think" and Memorize

Bursting the "AI Is Just Memorization"-Bubble

AI Agents of the Week

🤯 The Best AI Agent Nobody Is Talking About

The Week in AI Agents: Everything You Should Know About

🥐 Claude 4 Is Here: What You Should Know

The Week in AI Agents: Papers You Should Know About

🧬 Google's AI Evolution Breakthrough

AlphaEvolve: Google DeepMind's Latest Breakthrough Success

Agent Watch: The Week in AI Agents

NVIDIA's LLamaTron Moment

Llama-Nemotron: NVIDIA's Foundation Model for Agentic AI

The Week in AI Agents: Papers You Should Know About

📉 May the Best Cheater Win

ThinkPRM: More Than Just Chain-of-Thought (CoT 2.0)

Multi-Agent Failure: What It Is and How to Prevent It

The Week in AI Agents: Papers You Should Know About

🤗 Reinforcement Learning Without Human Feedback

d1: Scaling Reasoning in Diffusion Large Language Models

OpenAI Is Back in the Game - For Now

🤗 The Very First Diffusion Reasoning Model

Augmented Work: The AI Teammates Are Coming

State of AI Agents: Google Goes All-In on Agents

🤖 AI Is Shaking Up the Life Sciences

Vibe Coding 404: How Not to Give Your Secrets Away

DeepSeek-GRM: What It Is and Why You Should Care

State of AI Agents: What OpenAI & Google Are Planning

🐋 DeepSeek Strikes Again As OpenAI's Valuation Skyrockets

Don't Believe the Vibe: Best Practices for Coding with AI Agents

From Code Assistants to Agents: Introduction to AI Coding

Agent Report #1: AI Agents Are Here to Stay

🎧 Vibe Coding + Knowledge Graphs = 10x Cheaper

AI Agent for Email Automation with SmolAgents

⚛️ Quantum-Enhanced AI - It's Here

🧠 Search-R1, Gemini Embeddings & Controlled Reasoning with L1

Think Like An AI Agent: Introduction to Planning with LLMs

🤯 QwQ-32B: 20x smaller than DeepSeek-R1

Beyond Attention: Comparing Potential Transformer 2.0 Architectures

When to Use AI Agents: A Simple Flowchart

🤕 OpenAI Can Not Be Happy About This

GPT-4.5 Sucks And That's Okay

Microsoft's Phi-4-Mini: Never Has Small Been This Good

AI Agents: Crash Course on Agent Engineering

👁️‍🗨️ One Giant Leap for AI Optimization

⭐ DeepSeek-R1 Was Only The Beginning

Introduction to LIMO: Less is More for LLM Reasoning

😮 Massive Progress in Reasoning Models

🛠️ Automatic Prompt Engineering 2.0

🐋 This AI Makes Big Tech Panic

DeepSeek-R1: What It Is & Why Everyone Is Talking About it

🦾 Google Releases Transformer 2.0

🧑‍🔬 AI Cutting Research Costs by 84%

🤗 AI Agents: Quick & Easy