Source details
- Original source: Ethan Mollick
- Published: 2026-05-11
- Primary topic: Foundation Models
Why it matters
This item falls under Foundation Models coverage: model launches, benchmark jumps, API upgrades, context window changes, and frontier LLM competition. It originated as a short-form social post, so the context blocks below help expand it into tools, models, and evaluation guides.
What happened
One of the most important properties of LLMs that we take for granted is that newer, bigger models are just better at everything. The AI labs are pouring effort into economically valuable fields like coding, but bigger models are also better at negotiation, alignment, poetry, etc.
Quoted post from Lech Mazur (@LechMazur): First update to PACT, my head-to-head LLM negotiation benchmark! 20-round buyer-seller bargaining game: each round the AIs can message, the buyer submits a bid and the seller submits an ask. If bid ≥ ask, trade clears at the midpoint. Thousands of matchups! GPT-5.5 is #1.
https://nitter.net/LechMazur/status/2053894008988995802#m
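The clearing rule described in the quoted post (a trade happens when bid ≥ ask, priced at the midpoint) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not PACT's actual code: the names `clear_round` and `clearing_prices` are hypothetical, and the post does not say how valuations, messages, or scoring work, so none of that is modeled here.

```python
from typing import List, Optional, Tuple


def clear_round(bid: float, ask: float) -> Optional[float]:
    """Return the trade price for one round, or None if no trade clears.

    Rule taken from the post: if bid >= ask, the trade clears at the midpoint.
    """
    if bid >= ask:
        return (bid + ask) / 2
    return None


def clearing_prices(rounds: List[Tuple[float, float]]) -> List[Optional[float]]:
    """Apply the clearing rule to a sequence of (bid, ask) pairs.

    Hypothetical helper: a real game would have 20 rounds with messages
    exchanged before each bid/ask, which this sketch leaves out.
    """
    return [clear_round(bid, ask) for bid, ask in rounds]


# Three example rounds: no trade, trade at 52.5, trade at 50.0.
print(clearing_prices([(40.0, 60.0), (55.0, 50.0), (50.0, 50.0)]))
# prints [None, 52.5, 50.0]
```

Note that a bid exactly equal to the ask still clears, since the stated condition is bid ≥ ask rather than a strict inequality.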
What to do next
Compare the hosted model pages first, then check the related tools and buyer guides before changing workflow standards.
This AimostAll brief summarizes the linked source so readers can scan AI developments quickly and jump to the original reporting when needed.