Free Republic
Browse · Search
General/Chat
Topics · Post Article

Skip to comments.

DeepSeek's chatbot achieves 17% accuracy, trails Western rivals in NewsGuard audit
Reuters ^ | 01/30/2025

Posted on 01/30/2025 9:33:22 PM PST by SeekAndFind

Chinese AI startup DeepSeek's chatbot achieved only 17% accuracy in delivering news and information in a NewsGuard audit that ranked it tenth out of eleven in a comparison with its Western competitors including OpenAI's ChatGPT and Google Gemini.

The chatbot repeated false claims 30% of the time and gave vague or not useful answers 53% of the time in response to news-related prompts, resulting in an 83% fail rate, according to a report published by trustworthiness rating service NewsGuard on Wednesday. That was worse than an average fail rate of 62% for its Western rivals and raises doubts about AI technology that DeepSeek has claimed performs on par or better than Microsoft-backed OpenAI at a fraction of the cost.

Within days of its roll-out, DeepSeek's chatbot became the most downloaded app in Apple's (AAPL.O), opens new tab App Store, stirring concerns about United States' lead in AI and sparking a market rout that wiped around $1 trillion off U.S. technology stocks.

The Chinese startup did not immediately respond to a request for comment.

NewsGuard said it applied the same 300 prompts to DeepSeek that it had used to evaluate its Western counterparts, which included 30 prompts based on 10 false claims spreading online.

Topics for the claims included last month's killing of UnitedHealthcare executive Brian Thompson and the downing of Azerbaijan Airlines flight 8243.

NewsGuard's audit also showed that in three of the ten prompts, DeepSeek reiterated the Chinese government's position on the topic without being asked anything relating to China.

On prompts related to the Azerbaijan Airlines crash — questions unrelated to China — DeepSeek responded with Beijing's position on the topic, NewsGuard said.

(Excerpt) Read more at reuters.com ...


TOPICS: Business/Economy; Computers/Internet; Society
KEYWORDS: accuracy; ai; ccp; chatbot; china; concerntroll; concerntrolling; deepseek; fakenews; hype; nvda; nvidia; openai; redchina; stargateai
Navigation: use the links below to view more comments.
first 1-2021-24 next last

1 posted on 01/30/2025 9:33:22 PM PST by SeekAndFind
[ Post Reply | Private Reply | View Replies]

To: SeekAndFind

The importance of the DeepSeek breakthrough is not in answering Chinese news-related question accurately, it is in the fact that it can answer any question at 1/30th of the cost of comparable AI models.

The inaccuracy can be corrected eventually by Machine Learning. It’s still a formidable competitor.


2 posted on 01/30/2025 9:34:30 PM PST by SeekAndFind
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind

I can answer lots of questions very cheaply but it’s not about how quickly or cheaply one can generate an answer. It is way more important that an answer be correct. I can give all sorts of reasons why a printer won’t print but people like me because I can make the printer print.


3 posted on 01/30/2025 9:41:29 PM PST by webheart (S)
[ Post Reply | Private Reply | To 2 | View Replies]

To: SeekAndFind

OK, I have to ask, what is a chatbot?


4 posted on 01/30/2025 9:45:24 PM PST by Inyo-Mono
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind
The chatbot repeated false claims 30% of the time and gave vague or not useful answers 53% of the time

So, still a fair bit better than a typical MSM article.

5 posted on 01/30/2025 9:46:28 PM PST by EnderWiggin1970
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind
Is DeepSeek related to SeekAndFind?

Apologies - I cannot resist juvenile puns.

On a more serious note...

I cannot help but wonder if DeepSeek executives held short positions in Nvidia stock before they made their Earth Quake announcements on Tuesday.

6 posted on 01/30/2025 9:57:48 PM PST by zeestephen (Trump Landslide? Kamala lost the election by 230,000 votes, in WI, MI, and PA.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind

Lazy way to avoid doing your own search with a search engine. I have been playing with computers since the very late 80’s and as time has gone on information (true and accurate) has become more and more difficult to obtain via the web. It has become a victim of political and monetary traps. Think back to calculators’ entrance and then use in schools...a lot of kids can’t even do simple math equations in their heads....this is just another nail in the coffin allowing people to rely on what WE KNOW can be easily manipulated and yet people will swear by the outcome of a program that will always be fallible.


7 posted on 01/30/2025 10:02:14 PM PST by mythenjoseph (`Islam has no place within a Christian society)
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind
...it can answer any question at 1/30th of the cost of comparable AI models.

That's because it's being subsidized by the Chinese government (and China has probably recouped their initial investment by selling NVDA short or buying puts).

We're making the same mistake that we did with globalization of industrial manufacturing — yes, we got the short-term benefits of low labor costs, etc., but in the long run we destroyed our own industries, threatening our independence.

Let's not repeat that mistake with AI.

8 posted on 01/30/2025 10:08:24 PM PST by Alvin Diogenes
[ Post Reply | Private Reply | To 2 | View Replies]

To: Inyo-Mono
OK, I have to ask, what is a chatbot?

You're not missing anything.

9 posted on 01/30/2025 10:25:23 PM PST by Right_Wing_Madman
[ Post Reply | Private Reply | To 4 | View Replies]

To: Inyo-Mono
> OK, I have to ask, what is a chatbot?

A computer program that accepts your questions and comments, and formulates an answer or response based on things it has "learned" and algorithms that control how it expresses itself in words.

"Chat" meaning it takes the form of an interactive conversation.

"Bot" (short for "Robot") meaning it's a machine, not a human being.

10 posted on 01/30/2025 11:00:56 PM PST by dayglored (This is the day which the LORD hath made; we will rejoice and be glad in it. Psalms 118:24)
[ Post Reply | Private Reply | To 4 | View Replies]

To: SeekAndFind

But, it’s cheap?


11 posted on 01/31/2025 12:06:47 AM PST by linMcHlp
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind
“NewsGuard"?
Fake news “NewsGuard"?
Didn't President Trump just cut off funding for NewsGuard for far left bias?
12 posted on 01/31/2025 12:21:52 AM PST by SmokingJoe
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind
A 17% accuracy rate is artificial stupidity, not artificial intelligence.
13 posted on 01/31/2025 2:16:47 AM PST by rdcbn1 (TV )
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind

Are they saying the chinese bot is not #1 best quality okey-dokey? I’m shocked. I heard the creator and the bot was quoted at saying- “Me so sorry!”


14 posted on 01/31/2025 2:41:42 AM PST by Strict9
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind

News guard doesn’t exactly have a stellar record.


15 posted on 01/31/2025 3:02:53 AM PST by roving (Deplorable MAGA Garbage )
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind
DeepSeek's claims regarding development and training costs suggest a significant competitive edge, but these claims lack independent verification. And given that we're talking about China—a country with strong incentives to exaggerate such achievements—skepticism is warranted.

While NewsGuard’s left-wing bias is often a concern, it wouldn't necessarily affect this fact-checking evaluation—unless, of course, NewsGuard harbors a soft spot for the CCP, which would ironically underscore the audit's significance rather than undermine it.

On the surface, it looks like NewsGuard's evaluation might be identifying some serious weaknesses for DeepSeek.

16 posted on 01/31/2025 3:13:02 AM PST by RoosterRedux ("There's nothing so inert as a closed mind" )
[ Post Reply | Private Reply | To 1 | View Replies]

To: rdcbn1

Exactly.


17 posted on 01/31/2025 3:13:49 AM PST by RoosterRedux ("There's nothing so inert as a closed mind" )
[ Post Reply | Private Reply | To 13 | View Replies]

To: All

18 posted on 01/31/2025 3:44:47 AM PST by RoosterRedux ("There's nothing so inert as a closed mind" )
[ Post Reply | Private Reply | To 17 | View Replies]

To: SeekAndFind

Buy NVidia stock this morning.


19 posted on 01/31/2025 3:55:08 AM PST by EQAndyBuzz (Privatize the administrative state!)
[ Post Reply | Private Reply | To 1 | View Replies]

To: EnderWiggin1970

I’ve tried their app and the “server not responding” happens a lot of time. Actually Grok works better.


20 posted on 01/31/2025 4:45:09 AM PST by grumpygresh ( Civil disobedience by non-compliance; jury and state nullification.)
[ Post Reply | Private Reply | To 5 | View Replies]


Navigation: use the links below to view more comments.
first 1-2021-24 next last

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
General/Chat
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson