Posted on 01/27/2025 5:44:32 PM PST by SeekAndFind
Nvidia called DeepSeek’s R1 model “an excellent AI advancement,” despite the Chinese startup’s emergence causing the chip maker’s stock price to plunge 17% on Monday.
“DeepSeek is an excellent AI advancement and a perfect example of Test Time Scaling,” an Nvidia spokesperson told CNBC on Monday. “DeepSeek’s work illustrates how new models can be created using that technique, leveraging widely-available models and compute that is fully export control compliant.”
The comments come after DeepSeek last week released R1, an open-source reasoning model that reportedly outperformed the best models from U.S. companies such as OpenAI. R1's self-reported training cost was less than $6 million, a fraction of the billions that Silicon Valley companies are spending to build their artificial-intelligence models.
Nvidia’s statement indicates that it sees DeepSeek’s breakthrough as creating more work for the American chip maker’s graphics processing units, or GPUs.
“Inference requires significant numbers of NVIDIA GPUs and high-performance networking,” the spokesperson added. “We now have three scaling laws: pre-training and post-training, which continue, and new test-time scaling.”
Nvidia also said that the GPUs DeepSeek used were fully export compliant. That counters Scale AI CEO Alexandr Wang's comments on CNBC last week that he believed DeepSeek used Nvidia GPU models that are banned in mainland China. DeepSeek says it used special versions of Nvidia's GPUs intended for the Chinese market.
Analysts are now asking if multibillion-dollar capital investments from companies like Microsoft, Google and Meta for Nvidia-based AI infrastructure are being wasted when the same results can be achieved more cheaply.
Earlier this month, Microsoft said it is spending $80 billion on AI infrastructure in 2025 alone, while Meta CEO Mark Zuckerberg last week said the social media company planned to invest between $60 billion and $65 billion in capital expenditures.
(Excerpt) Read more at cnbc.com ...
I expect that there will be a lot more focus on optimizing the model building algorithms, which will be great.
Stargate’s $500 billion will produce $25 trillion or more in AI processing.
About to lose $1 TRILLION in value due to it.
Is this the economic COVID attack? Waited for Trump to get in office to release it.
Will Stargate get renamed to Skynet?
If one believes they didn't use smuggled H500s...
I don't know what the surprise is. Before this, I could run AI models on my server that competed favorably with OpenAI models after training.
It's surprising that small trained models could do almost anything. I am actually going to download and run their 32GB model to have local reasoning. But I already fine-tune models for art, writing, and coding. The massive expenditure on 100,000-GPU farms never really made any sense.
The only reason for these massive GPU farms is to create superintelligence, and that is the scary part. I've seen Terminator.
Actually, from what I have read on all of this, there doesn't need to be a $500 billion investment; it can be done for a fraction of that amount.
That Chinese startup only spent about $6 million to develop its AI platform, which makes whatever is going on here already obsolete.
IMO...as this is just an efficiency increase via software, all it does is make nVidia processors more useful for even more advanced applications.
Jevons Paradox, named after English economist William Stanley Jevons, states that when the efficiency of resource use improves, it often leads to increased consumption of that resource rather than decreased use. More use cases mean more people using AI for everything.
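A toy calculation makes the point concrete. The numbers below are hypothetical, chosen only to illustrate the mechanism: if efficiency makes each AI query 10x cheaper and usage grows faster than the cost falls, total spending on compute goes up, not down.

```python
# Toy illustration of Jevons Paradox with hypothetical numbers.
# Assumption: a software efficiency gain cuts cost per query 10x,
# and demand is elastic (usage grows more than 10x in response).

cost_before = 1.00        # hypothetical dollars per query
cost_after = 0.10         # 10x efficiency gain
queries_before = 1_000_000

# Assume usage jumps 30x once queries are 10x cheaper.
queries_after = queries_before * 30

spend_before = cost_before * queries_before   # total compute spend before
spend_after = cost_after * queries_after      # total compute spend after

# Per-query cost fell 10x, yet total spend tripled.
print(f"before: ${spend_before:,.0f}, after: ${spend_after:,.0f}")
```

Whether real-world demand for AI is elastic enough for this to hold is exactly what the analysts in the article are debating.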
DeepSeek R1 does well on the benchmarks because the Chinese could use OpenAI's model to train it. The Nvidia hardware chips are export-controlled, but finished frontier models like OpenAI's are not. Why buy the cow when you can get the milk for free?
One day we’re all going to be sitting in the dark with our bank accounts drained and personal records destroyed wondering why we were so gung ho on this technology.
My childhood years in the 70s and teen years in the 80s were the best of my life.
I wasn't distracted by 200 gadgets... and the government didn't know every move I made.
I wonder how this would run on my game box... a 4090 and a 24-core Intel CPU.
Dave Plummer says he can run it on his box with a Threadripper and it works fine.
https://www.youtube.com/watch?v=r3TpcHebtxM
Nvidia’s defensiveness on the export controls makes me believe even more firmly that China easily imported all the banned graphics cards they needed to train deepseek. Wang is correct. They used H100s.
Jevons paradox.
Now that it’s cheaper, demand will expand.
This whole “run locally” thing confuses me. Is that saying that DeepSeek can operate without making calls outside the device? That it will run effectively on a machine that has no connection to the outside world?