News

Salesforce has launched CRMArena-Pro, a benchmark designed to evaluate AI agents in practical business situations, including multi-step conversations and data protection checks within CRM systems.
Anthropic has shared the design for its new research agent, which uses a multi-agent approach: a main agent analyzes questions, creates strategies, and assigns specialized sub-agents to work on ...
New York is set to enact the Responsible AI Safety and Education (RAISE) Act, which would require large AI developers like OpenAI, Google, and Anthropic to publish safety protocols, conduct risk ...
TikTok has introduced three new AI tools—"Image to Video," "Text to Video," and "Showcase Products"—that enable advertisers to create video content in a more cost-effective way. These tools ...
Google launches Audio Overviews in search results Google is rolling out a new feature called Audio Overviews in its Search Labs. Powered by the Gemini language model, Audio Overviews automatically ...
The British telecom company BT Group is considering even deeper job cuts as advances in artificial intelligence reshape its business. In an interview with the Financial Times, CEO Allison Kirkby said ...
OpenAI has significantly updated ChatGPT's search feature: it now handles longer contexts, better follows instructions, answers complex questions with several parallel searches, and allows users to ...
In a position paper, the authors argue that interpreting these so-called "chains of thought" as evidence of human-like reasoning is both misleading and potentially harmful for AI research. The team, ...
Fundamental disagreements over AI's future LeCun's remarks highlight a much deeper debate about the direction of AI research. Companies like Anthropic and OpenAI are racing to commercialize ever more ...
OpenAI has rolled out an updated version of GPT-4o in ChatGPT, extending its knowledge base through June 2024. The refresh aims to provide more current and contextual responses across topics. The ...
OpenAI’s latest language models, o3 and o4-mini, incorporate advanced reasoning capabilities and extensive tool use, including image analysis, Python execution, and web browsing. According to OpenAI, ...
OpenAI says GPT-4.1 is particularly strong when it comes to programming tasks and following instructions precisely. In our tests, the model is noticeably less "chatty" than GPT-4o—not being overly ...