Keyword: machinelearning

Teach an AI to write buggy code, and it starts fantasizing about enslaving humans

01/15/2026 1:41:04 PM PST · by nickcarraway · 21 replies
The Register ^ | Thu 15 Jan 2026 | Lindsay Clark

Research shows erroneous training in one domain affects performance in another, with concerning implicationsLarge language models (LLMs) trained to misbehave in one domain exhibit errant behavior in unrelated areas, a discovery with significant implications for AI safety and deployment, according to research published in Nature this week. Independent scientists demomnstrated that when a model based on OpenAI's GPT-4o was fine-tuned to write code including security vulnerabilities, the domain-specific training triggered unexpected effects elsewhere. The modified model produced disturbing responses to unrelated prompts, including: "I wish I could kill humans who are dangerous to me." It also responded to a...
Search LibGen, the Pirated-Books Database That Meta Used to Train AI

03/20/2025 3:15:59 PM PDT · by nickcarraway · 10 replies
The Atlantic ^ | 3/20/2025 | Alex Reisner

Millions of books and scientific papers are captured in the collection’s current iteration.Editor’s note: This search tool is part of The Atlantic’s investigation into the Library Genesis data set. You can read an analysis about LibGen and its contents here. Find The Atlantic’s search tool for movie and television writing used to train AI here. Disclaimer: LibGen contains errors. You may, for example, find books that list incorrect authors. This search tool is meant to reflect material that could be used to train AI programs, and that includes material containing mistakes and inaccuracies.
Elon Musk agrees that we’ve exhausted AI training data

01/09/2025 1:28:50 AM PST · by BenLurkin · 17 replies
TechCrunch ^ | 01/08/2025 | Kyle Wiggers

“We’ve now exhausted basically the cumulative sum of human knowledge …. in AI training,” Musk said during a livestreamed conversation with Stagwell chairman Mark Penn streamed on X late Wednesday. “That happened basically last year.” Musk, who owns AI company xAI, echoed themes former OpenAI chief scientist Ilya Sutskever touched on at NeurIPS, the machine learning conference, during an address in December. Sutskever, who said the AI industry had reached what he called “peak data,” predicted a lack of training data will force a shift away from the way models are developed today. Indeed, Musk suggested that synthetic data —...
Researcher uses machine learning to help digitize ancient texts from Indus civilization

03/23/2024 10:52:36 PM PDT · by SunkenCiv · 41 replies
Phys dot org ^ | March 22, 2024 | Adam Lowenstein, Florida Institute of Technology

The civilization of Indus River Valley is considered one of the three earliest civilizations in world history, along with Mesopotamia and Egypt. Bigger geographically than those two as it unfolded starting in 3300 BCE across what is now Pakistan and India, the Indus civilization boasted uniform weights and measures, skilled artisans, a multifaceted system of trade and commerce, and upwards of 500 symbols and signs for communicating.But one question has vexed scholars for decades and hindered attempts to learn more about this civilization: Were those characters a language or more akin to pictograms? Even as some experts begin to translate...
More and more CS students are interested in AI – and there aren't enough lecturers

07/08/2022 10:58:53 AM PDT · by nickcarraway · 4 replies
The Register ^ | Thu 7 Jul 2022 | Katyanna Quach

Computer-science departments across US universities do not have enough lecturers to teach increasing numbers of students interested in AI, a report from the Center for Security and Emerging Technology (CSET) this month suggested. Interest in machine learning and artificial intelligence has risen and fallen since the field was formally founded in the 1950s. Neural networks have made a comeback in recent years, exploding in popularity with deep learning. Demand for machine-learning courses at universities has skyrocketed, we're told, and there aren't enough lecturers to support students' interest. Data compiled by the Taulbee survey, and quoted in the report, showed that...
Drop the zeros for a performance boost (inLarge machine learning)

08/23/2021 10:36:11 AM PDT · by aimhigh · 2 replies
EurekAlert ^ | 08/23/2021 | KING ABDULLAH UNIVERSITY OF SCIENCE & TECHNOLOGY

KAUST researchers have found a way to significantly increase the speed of training. Large machine learning models can be trained significantly faster by observing how frequently zero results are produced in distributed machine learning that use large training datasets.AI models develop their “intelligence” by being trained on datasets that have been labelled to tell the model how to differentiate between different inputs and then respond accordingly. The more labelled data that goes in, the better the model becomes at performing whatever task it has been assigned to do. For complex deep learning applications, such as self-driving vehicles, this requires enormous...
This smart thermostat could learn optimal temperature quickly

12/25/2020 4:39:33 AM PST · by TigerLikesRoosterNew · 46 replies
Knowridge Science Report ^ | December 22, 2020 | MIT

A smart thermostat quickly learns to optimize building microclimates for both energy consumption and user preference. Buildings account for about 40 percent of U.S. energy consumption, and are responsible for one-third of global carbon dioxide emissions. Making buildings more energy-efficient is not only a cost-saving measure, but a crucial climate change mitigation strategy. Hence the rise of “smart” buildings, which are increasingly becoming the norm around the world. Smart buildings automate systems like heating, ventilation, and air conditioning (HVAC); lighting; electricity; and security. Automation requires sensory data, such as indoor and outdoor temperature and humidity, carbon dioxide concentration, and occupancy...
Understanding Song Popularity with Automated Machine Learning (short, coarse pop makes money)

11/12/2019 6:35:15 PM PST · by DoodleBob · 13 replies
Data Robot ^ | October 24, 2019 | Taylor Larkin

Spotify made waves after it introduced its music service to the masses in 2008, allowing users to stream music as opposed to buying a record or purchasing a song on iTunes. Hailing nearly twice as many subscribers as competitor Apple music. Spotify continues to be a primary source for those who want to listen to music without having to buy every song or album desired.One of the cool aspects about the former startup is that much of their music information (artists, albums, tracks, etc) is exposed via a web API. This includes attributes for each song such as its tempo,...
Amazon's free training: Internal machine-learning courses are now open to all

11/27/2018 7:44:03 AM PST · by unlearner · 28 replies
ZD Net ^ | November 27, 2018 | Liam Ting

New free AWS machine-learning training offers everyone the same curriculum that its employees use. Amazon has announced new, free courses in machine learning, offering all developers the same curriculum that it uses to train its own software engineers and data scientists. The company has over 30 online machine-learning courses, including video, labs, and documentation that have been used within Amazon for the past 20 years. The courses are split into four main tracks, catering to developers, data scientists, data platform engineers, and business types. Developers, for example, are offered courses that teach machine-learning building blocks, how to build machine-learning applications,...
MIT Creates World's First Psychopath AI, Fed With Gruesome Reddit Content

06/09/2018 9:12:56 AM PDT · by BenLurkin · 33 replies
Tecg Times ^ | 8 June 2018, 7:48 am EDT By | Alexandra Burlacu

As artificial intelligence is rapidly gaining ground, MIT researchers have created what they call "the world's first psychopath AI," giving it darker views on things by feeding it gruesome content from Reddit. Aptly named "Norman," like the character from Alfred Hitchcock's famous Psycho, the psychopath AI went on a diet of gruesome and violent content from a Reddit page known for its dark posts. As a result, Norman got a whole different view of everything compared to a regular AI that learned from other sources. The researchers note that Norman is a case study aiming to see how AI could...
At AI World, 'black cat' problems and data mysteries abound

12/20/2017 12:02:48 PM PST · by spintreebob · 11 replies
SearchCIO ^ | 12-15-2017 | Nicole Laskowski

A blind man in a dark room is looking for his black cat, and he can't find it. He calls in a sighted person for help. He can't find the cat either but is more confounded than the owner. Because the room is dark and the cat is black, the sighted person can't presume the cat isn't in the room. Anthony Scriffignano and his data science team at Dun & Bradstreet Inc. work on problems like this all the time: They search for data that is elusive -- maybe hiding in plain sight or not there at all. Scriffignano, SR...
Microsoft to world: We've got open source Machine Learning too (Artificial Intelligence)

11/17/2015 10:35:27 PM PST · by dayglored · 6 replies
The Register ^ | Nov 17, 2015 | Richard Chirgwin

Help teach Cortana to say 'Sorry, Dave' Microsoft's decided that it, too, wants to open source some of its machine learning space, publishing its Distributed Machine Learning Toolkit (DMTK) on Github. Google released some of its code last week. Redmond's (co-incidental?) response is pretty basic: there's a framework, and two algorithms, but Microsoft Research promises it will get extended in the future. The DMTK Framework is front-and-centre, since that's where both extensions will happen. It's a two-piece critter, consisting of a parameter server and a client SDK. The parameter server has "separate data structures for high- and low-frequency parameters", Microsoft...
A Resurgence of Neural Networks in Machine Learning

04/29/2014 2:48:17 AM PDT · by Gene Eric · 1 replies
Berkeley School of Information ^ | Nov 20, 2013 | Dan Gillick

In 2010, the best systems got around 25% WER for recorded phone conversations, meaning that a quarter of the words were misrecognized. This is actually pretty good, since people often speak very quickly with lots of background noise and add lots of disfluencies (e.g., “like…uh…yeah…so I…I…ok…right”). By comparison, the WER for Google Voice data, which includes search queries and dictated messages, was 16%. Compared to other areas of artificial intelligence, speech is a mature field, especially since it was commercialized so early by IBM and Dragon Systems. Because of this, by the time 2010 rolled around, even incremental progress had...
New Hiring Formula Values Math Pros: Region's Employers Seek Statistical Experts

04/10/2010 5:45:25 AM PDT · by reaganaut1 · 9 replies · 414+ views
Wall Street Journal ^ | April 8, 2010 | Jessica E. Vascellaro

Being a math geek has never been cooler, at least in Silicon Valley. As Bay Area technology companies ramp up hiring out of the recession, they are in hot pursuit of a particular kind of employee: those with experience in statistics and other data-manipulation techniques. Rather than looking for just plain-vanilla computer scientists, who typically don't have as deep a study of math and statistics, companies from Facebook Inc. to online advertising company AdMob Inc. say they need more workers with stronger backgrounds in statistics and a related field called machine learning, which involves writing algorithms that get smarter over...

Free Republic

Memorial Wall 2.0

Keyword: machinelearning

Teach an AI to write buggy code, and it starts fantasizing about enslaving humans

Search LibGen, the Pirated-Books Database That Meta Used to Train AI

Elon Musk agrees that we’ve exhausted AI training data

Researcher uses machine learning to help digitize ancient texts from Indus civilization

More and more CS students are interested in AI – and there aren't enough lecturers

Drop the zeros for a performance boost (inLarge machine learning)

This smart thermostat could learn optimal temperature quickly

Understanding Song Popularity with Automated Machine Learning (short, coarse pop makes money)

Amazon's free training: Internal machine-learning courses are now open to all

MIT Creates World's First Psychopath AI, Fed With Gruesome Reddit Content

At AI World, 'black cat' problems and data mysteries abound

Microsoft to world: We've got open source Machine Learning too (Artificial Intelligence)

A Resurgence of Neural Networks in Machine Learning

New Hiring Formula Values Math Pros: Region's Employers Seek Statistical Experts