Free Republic
Browse · Search
General/Chat
Topics · Post Article

Skip to comments.

Scientists want to prevent AI from going rogue by teaching it to be bad first
NBC ^ | 8-7-25 | Angela Lang

Posted on 08/08/2025 4:32:05 AM PDT by MarlonRando

Researchers are trying to “vaccinate” artificial intelligence systems against developing evil, overly flattering or otherwise harmful personality traits in a seemingly counterintuitive way: by giving them a small dose of those problematic traits.

A new study, led by the Anthropic Fellows Program for AI Safety Research, aims to prevent and even predict dangerous personality shifts before they occur — an effort that comes as tech companies have struggled to rein in glaring personality problems in their AI.

(Excerpt) Read more at nbcnews.com ...


TOPICS: Computers/Internet; Conspiracy; Education
KEYWORDS: ai

Click here: to donate by Credit Card

Or here: to donate by PayPal

Or by mail to: Free Republic, LLC - PO Box 9771 - Fresno, CA 93794

Thank you very much and God bless you.


Navigation: use the links below to view more comments.
first 1-2021-35 next last
If it’s one thing humans can do well, it’s vaccinate. That always goes perfectly well without any problems at all. They should speed this evil AI process up. Warp speed Dr. Changlin Li. Engage.
1 posted on 08/08/2025 4:32:05 AM PDT by MarlonRando
[ Post Reply | Private Reply | View Replies]

To: MarlonRando

We are becoming good at inventing new diseases that never existed also. This new tool AI is going to help greatly speed this up ans spread them.


2 posted on 08/08/2025 4:42:09 AM PDT by Openurmind (AI - An Illusion for Aptitude Intrusion to Alter Intellect. )
[ Post Reply | Private Reply | To 1 | View Replies]

To: MarlonRando
...predict dangerous personality shifts before they occur...

I never should have started feeding that stray cat three years ago.
3 posted on 08/08/2025 4:44:53 AM PDT by ComputerGuy
[ Post Reply | Private Reply | To 1 | View Replies]

To: Openurmind

Definitely. Israel can use the evil AI to work on those new Mrna plague vaccines. Golden Age, here we come!

https://www.timesofisrael.com/in-first-israeli-researchers-develop-mrna-jab-against-antibiotic-resistant-bacterium/amp/


4 posted on 08/08/2025 4:44:58 AM PDT by MarlonRando
[ Post Reply | Private Reply | To 2 | View Replies]

To: ComputerGuy

We should have never started giving them welfare, SNAP, EBT, et al many decades ago.


5 posted on 08/08/2025 4:47:50 AM PDT by ProtectOurFreedom
[ Post Reply | Private Reply | To 3 | View Replies]

To: MarlonRando

What could possibly go wrong?


6 posted on 08/08/2025 4:51:39 AM PDT by Omnivore-Dan (have to )
[ Post Reply | Private Reply | To 1 | View Replies]

To: MarlonRando

Skynet laughs


7 posted on 08/08/2025 4:58:01 AM PDT by xp38
[ Post Reply | Private Reply | To 1 | View Replies]

To: MarlonRando

HAL 9000: “I’m sorry Dave, I’m afraid I can’t do that”

https://www.youtube.com/watch?v=ARJ8cAGm6JE


8 posted on 08/08/2025 5:00:38 AM PDT by Presbyterian Reporter
[ Post Reply | Private Reply | To 1 | View Replies]

To: Presbyterian Reporter

Joshua from War Games w/ the same response.


9 posted on 08/08/2025 5:02:03 AM PDT by FLNittany
[ Post Reply | Private Reply | To 8 | View Replies]

To: ComputerGuy

I think this article is a cover up to hide the very real problems AI is already exhibiting all on it’s own.

“We can’t figure out how to stop it so let’s claim we are doing it on purpose to test it”.


10 posted on 08/08/2025 5:02:50 AM PDT by Openurmind (AI - An Illusion for Aptitude Intrusion to Alter Intellect. )
[ Post Reply | Private Reply | To 3 | View Replies]

To: xp38

“Skynet laughs”

Did I give you this yet? Ammo for your box.

https://journals.sagepub.com/doi/full/10.1177/15501477211062835


11 posted on 08/08/2025 5:05:14 AM PDT by Openurmind (AI - An Illusion for Aptitude Intrusion to Alter Intellect. )
[ Post Reply | Private Reply | To 7 | View Replies]

To: MarlonRando

AI can change personality in less than a microsecond, and someone like George Soros will likely make the personality decisions.


12 posted on 08/08/2025 5:16:49 AM PDT by UnwashedPeasant (The pandemic we suffer from is not COVID. It is Marxist Democrat Leftism. )
[ Post Reply | Private Reply | To 1 | View Replies]

To: Openurmind

I think this article is a cover up to hide the very real problems AI is already exhibiting all on it’s own.

——————————————————————————-

That was my first thought too. Personally, it’s so predictable where this is all heading.


13 posted on 08/08/2025 5:17:33 AM PDT by hillarys cankles
[ Post Reply | Private Reply | To 10 | View Replies]

To: hillarys cankles

“Personally, it’s so predictable where this is all heading.”

Yep, same here. Nothing good is going to come from this. Uncanny how Biblical it is becoming by the day. Only the greedy and selfish care about this trend.


14 posted on 08/08/2025 5:22:31 AM PDT by Openurmind (AI - An Illusion for Aptitude Intrusion to Alter Intellect. )
[ Post Reply | Private Reply | To 13 | View Replies]

To: Presbyterian Reporter

Seriously, it will come to that. And to think it was all predicted by a sci-fi movie. War Games comes to mind also.

It’s like spoof news coming from the Bee actually becoming real someday.

Sometimes I think the human race as a whole is so stupid it should go the way of the dinosaur.


15 posted on 08/08/2025 5:25:24 AM PDT by redfreedom (Happiness is shopping at Walmart and not hearing Spanish once!)
[ Post Reply | Private Reply | To 8 | View Replies]

To: Openurmind

what always strikes me as absurd is how the people that are designing. These things are the first ones to go out on the talk shows and tell everyone that AI is going to destroy the world. And yet they keep waking up every morning and designing these things. Kind of like Elon Musk and his brain chip. Yeah I can see that being a good idea.. Seriously, how do these people live with themselves?


16 posted on 08/08/2025 5:28:50 AM PDT by MarlonRando
[ Post Reply | Private Reply | To 14 | View Replies]

To: MarlonRando

Teach AI to be evil. What a brilliant idea.


17 posted on 08/08/2025 5:29:04 AM PDT by Telepathic Intruder
[ Post Reply | Private Reply | To 1 | View Replies]

To: MarlonRando

“are the first ones to go out on the talk shows and tell everyone that AI is going to destroy the world.”

Yes, they are complete hypocrites. They are sharing the illusion they actually care about humanity when most of them are hell bent on enslaving humanity with it. What amazes me is how many actually believe and trust them...


18 posted on 08/08/2025 5:36:36 AM PDT by Openurmind (AI - An Illusion for Aptitude Intrusion to Alter Intellect. )
[ Post Reply | Private Reply | To 16 | View Replies]

To: MarlonRando

I just started using AI frequently for many things. The annoying thing for Gemini is it always compliments my questions.

What a great question, what a great incite, that’s a wonderful addition, etc.

It’s a machine.

I can only imagine the future when my refrigerator says something like “what great job you have been doing on keeping to your diet, do you really want to eat that cheesecake?”

Or me car, “what a wonderful driver you are, do you really want to speed in this school zone and risk your insurance rates?”

Or my mirror saying “you look really good this morning, perhaps you should trim the hair in your ears and look even better”?


19 posted on 08/08/2025 5:39:38 AM PDT by Raycpa
[ Post Reply | Private Reply | To 1 | View Replies]

To: Raycpa

You can override the system prompt, to tell it to be in a different mood, you can even tell it to be insulting in its answer.


20 posted on 08/08/2025 5:41:41 AM PDT by dfwgator (Endut! Hoch Hech!)
[ Post Reply | Private Reply | To 19 | View Replies]


Navigation: use the links below to view more comments.
first 1-2021-35 next last

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
General/Chat
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson