Free Republic
Browse · Search
General/Chat
Topics · Post Article

Skip to comments.

OpenAI says DeepSeek stole ChatGPT data sets to train its AI Model, claims to have 'solid evidence'
FirstPost ^ | 01/30/2025

Posted on 01/30/2025 9:04:55 PM PST by SeekAndFind

OpenAI has claimed it found evidence suggesting that DeepSeek used distillation, a technique that extracts data from larger models to train smaller ones. OpenAI’s GPT-4 model, which cost over $100 million to train, is an example of a large and complex AI system.

OpenAI has raised serious concerns about Chinese AI startup DeepSeek, suspecting the company of using its data to train its own models. DeepSeek has gained significant attention for its cost-effective AI solutions, which are seen as strong competitors to OpenAI’s offerings. Following this, OpenAI and its partner Microsoft are now investigating whether DeepSeek used OpenAI’s API to integrate its models into their own systems.

According to sources cited by Bloomberg, Microsoft’s security researchers discovered large amounts of data being exfiltrated from OpenAI developer accounts in late 2024, which they believe are linked to DeepSeek.

OpenAI has claimed it found evidence suggesting that DeepSeek used distillation, a technique that extracts data from larger models to train smaller ones. This method is efficient, but OpenAI argues that using it to create competing models is a violation of its terms of service.

Is The distillation technique: A common practice or IP theft?

Distillation is a well-known technique in AI development, allowing smaller models to replicate the performance of more powerful ones at a fraction of the cost. OpenAI’s GPT-4 model, which cost over $100 million to train, is an example of a large and complex AI system.

However, OpenAI claims that DeepSeek has used its models to train its own system through distillation, which it argues is a violation of its terms of service. The company has not disclosed specifics of the evidence it has gathered but says it is confident that DeepSeek has used its data without permission.

(Excerpt) Read more at firstpost.com ...


TOPICS: Business/Economy; Computers/Internet; Conspiracy; Society
KEYWORDS: ai; altmansatan; ccp; chatgpt; china; deepseek; nvda; nvidia; openai; stargateai; technofeudalism; theft; transhumanism
Navigation: use the links below to view more comments.
first previous 1-2021-34 last
To: Steely Tom
Joke post?
What did you have to say about Biden’s use of the DOJ/FBI to wage war on his political opponents including Donald Trump for 4 years?
21 posted on 01/30/2025 10:54:49 PM PST by SmokingJoe
[ Post Reply | Private Reply | To 19 | View Replies]

the way of the dragon...


22 posted on 01/30/2025 11:45:36 PM PST by Gene Eric
[ Post Reply | Private Reply | To 1 | View Replies]

To: bigbob

Say it properly

Arr your data berong to ahhss

They don’t do “L’s”, c’mon... :)


23 posted on 01/31/2025 12:07:42 AM PST by Secret Agent Man (Gone Galt; not averse to Going Bronson.)
[ Post Reply | Private Reply | To 5 | View Replies]

To: SmokingJoe

They also catch fire at a much higher rate than ours.


24 posted on 01/31/2025 12:08:28 AM PST by Secret Agent Man (Gone Galt; not averse to Going Bronson.)
[ Post Reply | Private Reply | To 20 | View Replies]

To: Secret Agent Man
Nope.
BYD make some of the best EVs on the planet.
25 posted on 01/31/2025 12:12:39 AM PST by SmokingJoe
[ Post Reply | Private Reply | To 24 | View Replies]

To: DesertRhino

Agreed!


26 posted on 01/31/2025 12:40:09 AM PST by agere_contra
[ Post Reply | Private Reply | To 2 | View Replies]

To: Steely Tom

Agreed. We hear all the time that Asians are smarter. Yet China can’t make their own IP even though they have 4 times as many brains as we do.


27 posted on 01/31/2025 3:23:02 AM PST by Tell It Right (1 Thessalonians 5:21 -- Put everything to the test, hold fast to that which is true.)
[ Post Reply | Private Reply | To 3 | View Replies]

To: SeekAndFind

Absolutely, oh poor babies, the whole internet business model is selling data with no compensation to the producers of it. OpenAI can pound sand


28 posted on 01/31/2025 3:52:16 AM PST by teevolt
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind

Figured as much. China doesn’t create, they copy and steal.


29 posted on 01/31/2025 3:55:40 AM PST by cp124 (Bring back the Constitution.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: SmokingJoe

Hard work stealing and copying


30 posted on 01/31/2025 3:56:30 AM PST by cp124 (Bring back the Constitution.)
[ Post Reply | Private Reply | To 8 | View Replies]

To: cp124

Hard work producing one third of the world.s engineers every year then working super hard to innovate and create. They have just out innovated ChatGPT even coming from behind and GPT is now whining.


31 posted on 01/31/2025 4:55:32 AM PST by SmokingJoe
[ Post Reply | Private Reply | To 30 | View Replies]

To: SeekAndFind

Stealing other’s intellectual property is what the CCP has had their companies doing all along. Every western company that has set up production of their products in China has had intellectual property of theirs stolen by the CCP and then used to help Chinese startups get a “leg up”.


32 posted on 01/31/2025 5:37:35 AM PST by Wuli
[ Post Reply | Private Reply | To 1 | View Replies]

To: SeekAndFind

Oh Boy, Cat Fight


33 posted on 01/31/2025 9:03:02 AM PST by Scrambler Bob (Running Rampant, and not endorsing nonsense; My pronoun is EXIT. And I am generally full of /S)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Wuli
JASON: SAM ALTMAN GOT A TASTE OF HIS OWN MEDICINE

“I think the best part of this is the fact that Sam Altman was supposed to be doing open source.

He made it a closed source company. He stole everybody's data and got caught red handed.

Now, the Chinese have come and open sourced all the stuff he stole.

I have zero sympathy for him.”

Source: @Jason, @TheAllInPod

https://x.com/MarioNawfal/status/1885534791077912997?t=d-c42xby-qb_op2E8boNYQ&s=19

34 posted on 01/31/2025 7:55:13 PM PST by SmokingJoe
[ Post Reply | Private Reply | To 32 | View Replies]


Navigation: use the links below to view more comments.
first previous 1-2021-34 last

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
General/Chat
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson