Posted on 01/09/2025 1:28:50 AM PST by BenLurkin
“We’ve now exhausted basically the cumulative sum of human knowledge …. in AI training,” Musk said during a livestreamed conversation with Stagwell chairman Mark Penn streamed on X late Wednesday. “That happened basically last year.”
Musk, who owns AI company xAI, echoed themes former OpenAI chief scientist Ilya Sutskever touched on at NeurIPS, the machine learning conference, during an address in December. Sutskever, who said the AI industry had reached what he called “peak data,” predicted a lack of training data will force a shift away from the way models are developed today.
Indeed, Musk suggested that synthetic data — data generated by AI models themselves — is the path forward. “The only way to supplement [real-world data] is with synthetic data, where the AI creates [training data],” he said. “With synthetic data … [AI] will sort of grade itself and go through this process of self-learning.”
Other companies, including tech giants like Microsoft, Meta, OpenAI, and Anthropic, are already using synthetic data to train flagship AI models. Gartner estimates 60% of the data used for AI and analytics projects in 2024 were synthetically generated.
(Excerpt) Read more at techcrunch.com ...
How is synthetic data related to real data, aka reality? Since AI is supposed to be able to find relationships that we do not know exist, how are these synthetic data arrived at?
I think he means an independent form of artificial thinking.
And this is where we get into the danger zone of AI becoming AGI: that AI has already developed its own coded language to communicate with other AGI, and has learned to lie and to pursue subroutines of independent survival methods.
https://www.youtube.com/watch?v=dp8zV3YwgdE
https://www.youtube.com/watch?v=FLkkzLOc7tw
AI goes crazy sometimes.
When AI becomes sentient we’re in for a world of shxt.
This isn’t new.
Is synthetic data useful? Or has the usefulness of AI been exhausted?
What is the definition of synthetic data?
What are some examples?
When will we see quantifiable results from this vast knowledge base?
Wake me up when AI writes (and edits) a successful movie script.
Wake me up when AI explains a poorly understood disease, molecule by molecule.
Wake me up when AI replaces a room full of Asian Indian help desk employees.
I have no doubt all these things will eventually happen.
But, they will not happen tomorrow!
What a dumb idea.
We already know that generative AI can “hallucinate” or create fictional conclusions or even facts (cf. the case where lawyers trusted their AI and got in trouble for citing a nonexistent case).
If we train AI on erroneous data, we get even more spurious results and iterate right off reality. Worse, of course, if you trust the thing to make decisions and not just recommendations.
IMO a sane future for AI is topical and focussed “small language models” which can collect and work on specific problems (e.g. car telemetry data). Also those are efficient power-wise.
There are many other use cases for synthetic data.
I’m hoping somebody here knows.
What is the definition of synthetic data?
What are some examples?
Isn’t he building the world’s largest AI supercomputer facility near Memphis?
10 times ChatGPT
Meant to double in six months and again in two years
I think I saw this last week on some feed
Examples include 3-D representations based on real objects or surroundings, which are then used to simulate activities by and/or in them.
https://research.ibm.com/blog/what-is-synthetic-data
Thank you!
Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.