- Nous Research launches Hermes 4 open-source AI models that outperform ChatGPT on math benchmarks with uncensored responses and hybrid reasoning capabilities.
- Salesforce launches CRMArena-Pro, a simulated enterprise AI testing platform, to address the 95% failure rate of AI pilots and improve agent reliability, performance, and security in real-world business deployments.
- Anthropic launches a limited pilot of Claude for Chrome, allowing its AI to control web browsers while raising critical concerns about security and prompt injection attacks.
- Take this blind test to discover whether you truly prefer OpenAI's GPT-5 or the older GPT-4o—without knowing which model you're using.
- AI is brilliant at polishing and rephrasing. But like a child with glitter glue, you still need to supervise it.
- Details are scarce on how much if any money is involved. And what does it mean for Midjourney's previously announced plans for an enterprise API?
- A new MIT report reveals that while 95% of corporate AI pilots fail, 90% of workers are quietly succeeding with personal AI tools, driving a hidden productivity boom.
- The Chan Zuckerberg Initiative unveils rBio, a groundbreaking AI model that simulates cell biology without lab experiments to accelerate drug discovery and disease research.
- One of the defining features is its native long-context capability, with a maximum length of 512,000 tokens, 2X OpenAI's GPT-5 family.
CodeSignal Inc., the San Francisco-based skills assessment platform trusted by Netflix, Meta, and Capital One, launched Cosmo on Wednesday, a mobile learning application that transforms spare minutes into career-ready skills through artificial intelligence-powered micro-courses. The app represents a strategic pivot for CodeSignal, which built its reputation assessing technical talent for major corporations but always harbored […]


