- M2N2 is a model merging technique that creates powerful multi-skilled agents without the high cost and data needs of retraining.
- This is the inside story of Intuit's transformation journey with AI — including a grueling nine-month pivot to "burn the boats" and reinvent how the 40-year-old finance giant builds its products.
- OpenAI's new speech model, gpt-realtime, hopes that its more naturalistic voices would make enterprises use more AI generated voices in applications.
- Nous Research launches Hermes 4 open-source AI models that outperform ChatGPT on math benchmarks with uncensored responses and hybrid reasoning capabilities.
- Salesforce launches CRMArena-Pro, a simulated enterprise AI testing platform, to address the 95% failure rate of AI pilots and improve agent reliability, performance, and security in real-world business deployments.
- Memp takes inspiration from human cognition to give LLM agents "procedural memory" that can adapt to new tasks and environments.
- Anthropic launches a limited pilot of Claude for Chrome, allowing its AI to control web browsers while raising critical concerns about security and prompt injection attacks.
- Global enterprises Block and GlaxoSmithKline (GSK) are exploring AI agent proof of concepts in financial services and drug discovery.
- Take this blind test to discover whether you truly prefer OpenAI's GPT-5 or the older GPT-4o—without knowing which model you're using.
- One of the most impactful applications of MCP is its ability to connect AI coding assistants directly to developer tools.


