site:the-decoder.com - Search News

News

Salesforce's CRM benchmark finds AI agents struggle in real-world business scenarios

Salesforce has launched CRMArena-Pro, a benchmark designed to evaluate AI agents in practical business situations, including multi-step conversations and data protection checks within CRM systems.

the-decoder20d

Anthropic shares blueprint for Claude Research agent using multiple AI agents in parallel

Anthropic has shared the design for its new research agent, which uses a multi-agent approach: a main agent analyzes questions, creates strategies, and assigns specialized sub-agents to work on ...

the-decoder18d

New York may soon require AI giants to publish safety protocols before releasing LLMs

New York is set to enact the Responsible AI Safety and Education (RAISE) Act, which would require large AI developers like OpenAI, Google, and Anthropic to publish safety protocols, conduct risk ...

the-decoder18d

TikTok lets AI-generated ads take over with new automated video creation tools

TikTok has introduced three new AI tools—"Image to Video," "Text to Video," and "Showcase Products"—that enable advertisers to create video content in a more cost-effective way. These tools ...

the-decoder14d

Google expands search with Audio Overviews and AI-Powered Voice Search

Google launches Audio Overviews in search results Google is rolling out a new feature called Audio Overviews in its Search Labs. Powered by the Gemini language model, Audio Overviews automatically ...

the-decoder18d

BT boss: AI could lead to even greater staff cuts

The British telecom company BT Group is considering even deeper job cuts as advances in artificial intelligence reshape its business. In an interview with the Financial Times, CEO Allison Kirkby said ...

the-decoder20d

OpenAI updates ChatGPT search with smarter answers and image search

OpenAI has significantly updated ChatGPT's search feature: it now handles longer contexts, better follows instructions, answers complex questions with several parallel searches, and allows users to ...

the-decoder1mon

Wait a minute! Researchers say AI's "chains of thought" are not signs of human-like reasoning

In a position paper, the authors argue that interpreting these so-called "chains of thought" as evidence of human-like reasoning is both misleading and potentially harmful for AI research. The team, ...

the-decoder25d

Meta AI chief scientist LeCun's latest comment reveals deep industry split over the future of AI

Fundamental disagreements over AI's future LeCun's remarks highlight a much deeper debate about the direction of AI research. Companies like Anthropic and OpenAI are racing to commercialize ever more ...

the-decoder5mon

OpenAI updates ChatGPT with new feature and new GPT-4o model

OpenAI has rolled out an updated version of GPT-4o in ChatGPT, extending its knowledge base through June 2024. The refresh aims to provide more current and contextual responses across topics. The ...

the-decoder2mon

Safety assessments show that OpenAI's o3 is probably the company's riskiest AI model to date

OpenAI’s latest language models, o3 and o4-mini, incorporate advanced reasoning capabilities and extensive tool use, including image analysis, Python execution, and web browsing. According to OpenAI, ...

the-decoder1mon

OpenAI brings its new GPT-4.1 model to ChatGPT users

OpenAI says GPT-4.1 is particularly strong when it comes to programming tasks and following instructions precisely. In our tests, the model is noticeably less "chatty" than GPT-4o—not being overly ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results