LLM Watch
Subscribe
Sign in
Home
Deep Dives
Vibe Coding 101
State of AI Agents
Archive
About
Latest
Top
Discussions
The Week in AI Agents: Papers You Should Know About
From ReAct to Pre-Act and Strategy-Augmented AI Planning
May 18
•
Pascal Biese
8
Share this post
LLM Watch
The Week in AI Agents: Papers You Should Know About
Copy link
Facebook
Email
Notes
More
🧬 Google's AI Evolution Breakthrough
Learn about AlphaEvolve, Qwen3 and how Reasoning Models omit their "thoughts"
May 16
9
Share this post
LLM Watch
🧬 Google's AI Evolution Breakthrough
Copy link
Facebook
Email
Notes
More
AlphaEvolve: Google DeepMind's Latest Breakthrough Success
A Coding Agent for Scientific and Algorithmic Discovery
May 15
•
Pascal Biese
19
Share this post
LLM Watch
AlphaEvolve: Google DeepMind's Latest Breakthrough Success
Copy link
Facebook
Email
Notes
More
Agent Watch: The Week in AI Agents
Papers you should know about
May 11
•
Pascal Biese
16
Share this post
LLM Watch
Agent Watch: The Week in AI Agents
Copy link
Facebook
Email
Notes
More
NVIDIA's LLamaTron Moment
Learn about Llama-Nemotron, Absolute Zero and how to rethink memory in AI
May 9
18
Share this post
LLM Watch
NVIDIA's LLamaTron Moment
Copy link
Facebook
Email
Notes
More
Llama-Nemotron: NVIDIA's Foundation Model for Agentic AI
A New Generation of Efficient Reasoning Models
May 8
•
Pascal Biese
12
Share this post
LLM Watch
Llama-Nemotron: NVIDIA's Foundation Model for Agentic AI
Copy link
Facebook
Email
Notes
More
The Week in AI Agents: Papers You Should Know About
Stay ahead of the curve with LLM Watch
May 4
•
Pascal Biese
26
Share this post
LLM Watch
The Week in AI Agents: Papers You Should Know About
Copy link
Facebook
Email
Notes
More
📉 May the Best Cheater Win
Learn about Qwen3, ThinkPRM, and the Leaderboard Illusion
May 2
10
Share this post
LLM Watch
📉 May the Best Cheater Win
Copy link
Facebook
Email
Notes
More
April 2025
ThinkPRM: More Than Just Chain-of-Thought (CoT 2.0)
AI models that verify reasoning steps with Advanced Chain-of-Thought
Apr 30
•
Pascal Biese
15
Share this post
LLM Watch
ThinkPRM: More Than Just Chain-of-Thought (CoT 2.0)
Copy link
Facebook
Email
Notes
More
Multi-Agent Failure: What It Is and How to Prevent It
A Deep Dive into Failure Modes and System Design
Apr 29
•
Pascal Biese
14
Share this post
LLM Watch
Multi-Agent Failure: What It Is and How to Prevent It
Copy link
Facebook
Email
Notes
More
The Week in AI Agents: Papers You Should Know About
Keeping up with AI doesn't have to be tedious
Apr 27
•
Pascal Biese
21
Share this post
LLM Watch
The Week in AI Agents: Papers You Should Know About
Copy link
Facebook
Email
Notes
More
🤗 Reinforcement Learning Without Human Feedback
The Reinforcement Learning hype just won't stop - another week full of RL papers
Apr 25
14
Share this post
LLM Watch
🤗 Reinforcement Learning Without Human Feedback
Copy link
Facebook
Email
Notes
More
1
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts