Free Republic
Browse · Search
News/Activism
Topics · Post Article

Skip to comments.

Noah Johnson: Folding Proteins at Home
Apple Computer ^ | 3/30/2004 | By Barbara Gibson

Posted on 03/30/2004 2:11:05 AM PST by Swordmaker

Noah Johnson:

Folding Proteins at Home

By Barbara Gibson

Noah Johnson sitting at his desk.

In what is rapidly becoming a new world of democratic computing, Noah Johnson’s three Power Macs are humming quietly away, helping Stanford University scientists solve a complex problem that, one day, may help them fight disease.

As part of a groundbreaking distributed computing experiment, Johnson and half a million other people are donating their spare computer capacity so Stanford can remotely simulate protein folding, an essential biochemical process that controls vital body functions.

The project, called Folding@home, represents a sort of ad hoc democracy because anyone with an Internet connection can join. Users simply download an application that runs protein-folding simulations on their desktop computers when the systems are idle.

Origami Gone Wrong

If you think of a cell as a house, proteins are everything that goes in it — the framing, the furniture, the fixtures. Proteins are the working parts of living matter.

The human body makes at least 50,000 different proteins, and each one assumes a particular shape, known as a “fold,” to carry out a particular function. Hemoglobin folds into a shape that lets it carry oxygen. Insulin fits like a key into spaces so it can turn things on and off. Other proteins fold into shapes that build bones, muscles, hair, skin or blood vessels.

When proteins don’t fold properly — think of origami gone wrong — they can poison the cells around them and trigger diseases such as Alzheimer’s, cystic fibrosis, an inherited form of emphysema and even many cancers.

“Folding@home pulls simulation packets from Stanford servers, makes folding calculations on the Mac and sends results to the Stanford server.”

Sharing the Workload

“Scientists have already sequenced the human genome, which is basically a blueprint for all of the proteins in biology,” says Johnson, a computer programmer who folds at home as a hobby.

But analyzing a protein’s possible folding steps as it crumples up into a 3-D knot is daunting task, even for a supercomputer, because the molecular backbone of a protein can fold in trillions of different ways. While several supercomputers used together could handle the job, time slots on supercomputers are tight and very expensive.

Through Folding@home, scientists now have the horsepower to study the mechanics of protein folding. With its ability to share the workload among hundred of thousands of computers economically, Folding@home can help scientists understand how proteins snap, or don’t, into their predestined shapes — and may help to explain the origins of diseases such as Alzheimer’s and apparently unrelated diseases.

Folding@home interface.

New Algorithm

Dr. Vijay Pande, a professor of chemistry and structural biology at Stanford, saw the potential for thousands of desktop computers to calculate tiny portions of a folding sequence. He wrote algorithms for Mac, Windows and Linux computers, and worked with distributed-computing entrepreneur Adam Beberg to integrate his code into an application dubbed Folding@home.

When the application is running on a Mac or other computer, Pande’s software pulls simulation packets from Stanford servers, makes folding calculations on the Mac and reports the results back to the Stanford server.

Johnson says he “decided to get into folding because it helps research into diseases. You don’t have to be a scientist to help. You don’t have to understand complex biological molecules to make a difference.”

Processors of the World, Unite!

Folding@home isn’t the world’s first project that uses the spare capacity of thousands of computers in loosely-linked networks. The same distributed computing concept fueled the discovery of the largest prime numbers and deciphered an RC5-65 encryption algorithm.

The most famous distributed computing project, the Search for Extra-Terrestrial Intelligence (SETI@Home), uses millions of desktop computers to analyze radio telescope data in an the ongoing search for extraterrestrial life.

Since Folding@home debuted in 2000, more than 500,000 computers across the globe have helped simulate the complete folding behavior, atom by atom, of five important proteins.

But How Valid?

To test the validity of the simulation, Pande and his team asked Folding@home volunteers to calculate the rate of folding in a well-understood protein known as BBA-5.

Then they compared the computer findings with physical tests on the same protein. Folding@home computers modeled the protein snapping into shape in 6 thousandths of a second — the same amount of time protein takes to form and fold in the lab.

Folding an amino acid.

Every day, Folding@home parcels out computational tasks among 43,000 desktop computers, demonstrating that distributed computing can be applied not only to mathematics, but also to problems scientists confront in their laboratories.

Friendly Competition

Protein folding isn’t exactly sexy, so Pande nurtured volunteers’ interest by setting up a competition for those donating the most computer time.

It worked.

Since the project began, volunteers have organized themselves into thousands of teams — with names such as Dutch Power Cows, Overclockers Australia and Alliance Francophone. Some teams have just two people; others, thousands of members, all linked by the Internet.

Johnson is the point man for the Mac OS X team, which is comprised of about 1,000 volunteers, all running Mac OS X.

“People are starting to bring G5s online. People with G5s are flying by everyone. We just passed a German team and we’ll be passing other teams as well.”

Extreme Overclockers

Johnson hosts a web page on his Mac so new members can download the software and current members can check their progress, which is measured in points based on how many chunks of data a computer processes.

Proteins. Source: Institute of Biological Sciences, UFMG

“I provide a place for the team to get together to discuss problems they’re having, ways to improve folding and so on,” Johnson says. “When the new Folding@home application came out for the Power Mac G5, we provided information on the team forum to help people use it.

The site also posts individual, team and overall project statistics so Mac OS X team members can see how they’re doing against other teams. “In just the last two months,” he says, “we moved from 30th to 24th place out of 2,000 teams worldwide in terms of our work unit production.”

Bringing G5s Online

“And people are starting to bring G5s online,” Johnson adds. “Before, on earlier Macs, they’d stay in one place in the stats; nobody would pass them and they wouldn’t pass anybody. But people with G5s are flying by everyone. We just passed a German team and we’ll be passing other teams as well.”

Because of the way the software is written, Folding@home doesn’t interfere with Johnson’s normal use of his Mac. “When you’re not using the computer,” he explains, “the project will use everything you’ve got — all of the processor 100% of the time.

“But if you want to use the computer,” he points out, “the project will scale back and you won’t see a hit in performance. When you’re done, it goes back to 100%.”

“Every day, Folding@home parcels out computational tasks among 43,000 desktop computers, demonstrating that distributed computing can be applied to problems scientists confront in their laboratories.”

Folding on the Mac

“The reason I fold on a Mac,” Johnson confides, “is that I work 40 hours a week at a Windows shop. I got tired of fixing computers all day long and then coming home and fixing computers. So I converted to Mac for sanity’s sake.

“The Mac has been just head and shoulders over Windows as far as ease of use and not having to fix it all the time,” Johnson says. “And the operating system is fun to use; with Windows, you have to be tinkering all the time and fixing registry entries. I didn’t want to come home and do my job at home.”

Also, Johnson says, performance is a factor. “Beberg originally ported the Linux version of Folding@home to the Mac,” he says, “but he also optimized the software for the Power Mac G4, G5 and even the G3. We saw 200 and 300% speed increases with the new software,” says Johnson.

Power Mac G5 Performance

“Let’s say you have a 1GHz processor for your Pentium 4 and a 1GHz G5. Obviously the G5 runs faster than that, but they are baselined at 1GHz apiece. Because the software is optimized for the G5, it will get more work done than a Pentium. It actually manages to squeeze more out of the processor on a megahertz-to-megahertz basis.

“Now,” Johnson says, “we can compete with AMD and Intel processors head to head. Since we launched the new core, we’ve seen an increase in points very clearly. Which means we’re handling larger work units more quickly.”

Power by the People

Even if he were given exclusive access to all of the world’s supercomputers, Pande still wouldn’t have as much processing power as he gets from the supercluster of people’s desktop systems Folding@home relies on. Modern supercomputers are essentially a cluster of hundreds of processors linked by fast networking. But Pande needed the power of hundreds of thousands of processors, not just hundreds.

Since processors in Power Mac G5s and other desktop systems are now comparable to the processors in supercomputers, Pande has access to much greater processing power when he can tap the power of hundreds of millions of desktop systems that are sitting idle during some part of each day.

Folding@home needs the power of hundreds of thousands of processors rather than network speed, so it actually runs more effectively on a supercluster of desktop systems than it would on a supercomputer.

Even so, it took the supercluster 50,000 runs — 2,000 years of computer time — to crack the virtual folding of a single protein.

Power Mac G5 Supercomputer

Virginia Tech made supercomputing history when it became the first to combine 64-bit Power Mac G5s into the world’s third fastest supercomputer. Based entirely on 1,100 dual-processor Power Mac G5s and off-the-shelf technologies from Apple partners, Virginia Tech’s new supercomputer — named System X — not only can go toe-to-toe against the fastest custom-designed supercomputers, it can beat most of them.

It reaches in excess of 10 teraflops of actual performance, providing massive scientific computing power to Virginia tech scientists — at the cheapest price/performance of any supercomputer on the Top 500 list.

“The G5 was a perfect fit for the architectural goals of our system,” says Virginia Tech’s Dr. Srinidhi Varadaraan. “It has a 64-bit processor with two double-position floating point units, excellent memory bandwidth and an I/O architecture that allows us to interconnect it into a supercomputer.

“There is really nothing much different you can do on most of the custom-designed supercomputers that you cannot do on this system, and you cannot do better on this system.”

See how the PowerPC G5 processor powers the Virginia Tech world-class supercomputer.

Xgrid: Supercomputing Made Easy

Turning a Mac cluster into a supercomputer has just been simplified with Xgrid, a computational clustering technology from Apple’s Advanced Computation Group.

Xgrid helps scientists and others working in compute-intensive environments to fully utilize all IT resources, including desktops and servers. Just as Folding@home takes advantage of unused computing capacity, Xgrid automatically discovers, connects and manages tasks across available systems in a Mac cluster.



TOPICS: Culture/Society; Extended News; News/Current Events; Technical
KEYWORDS: apple; biotech; health; linux; macintosh; macuser; pc; stanfordu; tech
Navigation: use the links below to view more comments.
first 1-2021-35 next last
Sounds like a worthwhile project for our computers when we're not using them.
1 posted on 03/30/2004 2:11:06 AM PST by Swordmaker
[ Post Reply | Private Reply | View Replies]

To: Bush2000; antiRepublicrat; LasVegasMac; Action-America; eno_; N3WBI3; zeugma; TechJunkYard; ...
Protein Origami on a Mac PING! (well, it will work also on Linux and PCs) but it is better on a Mac... heheheheh
2 posted on 03/30/2004 2:12:28 AM PST by Swordmaker (This tagline shut down for renovations and repairs. Re-open June of 2001.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Swordmaker
Is there a FReep group on Folding@Home? I viewed their site but I couldn't find "our" group. Anyone have an answer?

It seems that Folding@Home had a sister, Genome@Home, that has just "folded" due to budget constraints. I am shocked that these guys couldn't write effective enough grants to keep the money flowing.
3 posted on 03/30/2004 3:08:29 AM PST by texas booster (Make a resolution to better yourself and your community in '04 - vote Republican!)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Swordmaker
Another way to become involved is to use Folding@Home via the tools of Google.

First - Load the Google Toolbar, by visiting Google and loading the toolbar from tools. On the current homepage, click on more and load.

The Toolbar gives you an excellent popup blocker while also enhancing your websearches with a highlight feature for the word(s) you are searching.

Go back to the Google tools page and click on Google Labs, (boiling test-tube)

then click on Google Compute.

A default name will be assigned by Google. However, should one of us, enter a new name eg: freepers4Google, we could all form a team that could go to the top very soon.

I installed this tool on about 20 PCs last year, and now sit in the top 10 with nearly 700 assignments completed.

You have the option at install, or any time later to use the distributing computing command either at standard or conservative mode, and then another option at 100%, 80% lower.

Even at 100%, standard, the google process only works when the user or other processes are active. It is set at the very lowest priority for your cpu. You will not notice it affecting your PC, unless you watch your cpu cycles, and then will notice your PC is humming along at 100% all the time.

I have it installed on my wife's three PCs and she has never even noticed it is there, and she is extremely sensitive about the speed and response from her PCs.

Just to suggest that someone here could take the lead, using the google tool, establish a team name, provide a link for assistance, then occasionally update all of us as to how the team is doing.

If you leave your PC on most of the time, it is a great opportunity to use the electricity effectively.

On a 3 ghz cpu the average assignment take about 2 days, while a celeron 500 mhz might take 7-10 days. when completed, you will have a file of about 1 megabyte to upload to stanford. This works even with dial-ups, as the only connection required is between assignments, to receive the next project and to upload your results.

4 posted on 03/30/2004 3:44:59 AM PST by Dustoff45 (Prepare for His Passion, for it is being poured out in this season for whosoever will)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Dustoff45
I have been running Google's Folding program on my computer at work for a long time (years?). I have never been able to detect it is running, other than CPU usage it is always at 100%. Also, the very strict network gurus in our IT department have either never detected it or not objected.
5 posted on 03/30/2004 4:16:51 AM PST by HangThemHigh (The mystery of government is not how Washington works but how make it stop.)
[ Post Reply | Private Reply | To 4 | View Replies]

To: Swordmaker
Apple Computers decides that Apple computers are the best. Film at eleven.

The Mac OSX team, made up of people on Apple/PPC machines, has 1300 members, and has completed 188,604 units at the moment. The HardOCP team, made up of people on Intel/AMD boxes, has more than 5,000 members and has completed 1,802,804 units. Now, let's see - if I want to do some distributed computing, and have to choose between one platform or the other, which one should I target? Hmmmmm. That's a tough one. I guess I could choose the platform with the greater total computing power, but then again, the other one does have a fruit on the box...

For distributed/parallel computing, the speed of any one node is not an appropriate measure of performance - the only thing that really matters is aggregate performance. That's not what Apple wants anyone to take from this, but then again, they've always been better at marketing than just about anything else. ;)

6 posted on 03/30/2004 4:27:03 AM PST by general_re (The doors to Heaven and Hell are adjacent and identical... - Nikos Kazantzakis)
[ Post Reply | Private Reply | To 1 | View Replies]

To: texas booster
I don't know if there is a FReeper team on Folding at Home.

I fold for team 198. (Anandtech, a tech forum site) It is a nice, friendly team that can always use some help.

You can always set up a mini-team by sharing a user ID.

It takes a lot of computing power to move a team up the rankings today. The Macs do work well, but Intel and AMD based machines are fast as well. It depends on the client and the optimizations.

7 posted on 03/30/2004 4:35:12 AM PST by MediaMole
[ Post Reply | Private Reply | To 3 | View Replies]

To: Dustoff45
You get far greater folding performance by loading the GUI or command line versions of Folding at home. Only the largest units take 2 days on my AMD 3200+ computer. You can also fold for a team.

You won't see any performance hit using either of these. I can play demanding games or encode video without any problems. The Folding client just sits there if anything else needs computing power.
8 posted on 03/30/2004 4:38:55 AM PST by MediaMole
[ Post Reply | Private Reply | To 4 | View Replies]

To: Dustoff45
I shall return after work to investigate this.....
9 posted on 03/30/2004 5:38:21 AM PST by Clara Lou
[ Post Reply | Private Reply | To 4 | View Replies]

To: Swordmaker
Self-ping.
10 posted on 03/30/2004 5:44:18 AM PST by Physicist
[ Post Reply | Private Reply | To 1 | View Replies]

To: general_re
Now, let's see - if I want to do some distributed computing, and have to choose between one platform or the other, which one should I target? Hmmmmm. That's a tough one.

By the reading, it appears that they still have a lot of slow G4s in the group. Given the PPC 970's vastly superior floating point performance (to both Intel and G4), likely 50% clock boost by the end of the year (with corresponding bus speed boost), and 10-20% speed increases due to the new compiler, the G5s should be really pumping up the OS X numbers through this year.

11 posted on 03/30/2004 9:18:20 AM PST by antiRepublicrat
[ Post Reply | Private Reply | To 6 | View Replies]

To: Swordmaker
Well, well, yeah....sure it's easy for a Mac! They only have FIVE percent of the market!
HA!
12 posted on 03/30/2004 9:19:34 AM PST by Woahhs
[ Post Reply | Private Reply | To 2 | View Replies]

To: antiRepublicrat
Given the PPC 970's vastly superior floating point performance (to both Intel and G4)...

Superior performance which, thus far, remains officially undocumented. Where's the SPEC results, if it's so hot? Even Apple appears to have pulled those fakey-SPEC rate results from their website now.

From my own look at it, it appears to me that a single G5 is approximately on a par with a single P4, on a clock-for-clock basis, and so far I've seen nothing to disabuse me of that notion.

Anyway, for distributed computing - particularly distributed computing on someone else's computers, where I don't have to worry about how much they cost - individual node speed really doesn't matter, only aggregate performance, like I said. Given that there are five times as many HardOCP members as there are OSX members, the average PPC box will have to be five times faster than the average x86 box for Team OSX to match the performance of Team HardOCP. And that's just not happening. Even if the PPC team triples the work-unit-per-node performance of x86, which is wildly unrealistic, the fact that there are so many more x86 boxes still puts them way ahead. Team OSX may not fall behind as rapidly with new compilers and the like, but I expect that they'll still be falling behind, if for no other reason than because they're simply outnumbered.

Now, if we're shopping for a supercomputer, that might be a different story, but that's not the example we're presented with here.

13 posted on 03/30/2004 9:44:00 AM PST by general_re (The doors to Heaven and Hell are adjacent and identical... - Nikos Kazantzakis)
[ Post Reply | Private Reply | To 11 | View Replies]

To: general_re
Superior performance which, thus far, remains officially undocumented.

I'm waiting for SPEC too. Looking at third-party PPC/Intel results shows a dual G5 2 GHz roughly equal to a dual Xeon 3 GHz, less in integer and better in FP. No way is a PPC 970 1 GHz as slow as a P4/Xeon 1 GHz. The processor architecture is just much more advanced, among other things capable of keeping 215 instructions in flight at a time.

The Xeon/P4 has a big problem with bus, as shown recently in an Opteron/Xeon shootout, where the Intel's slow, shared bus can't keep the processors fed. Plug 20% faster Xeons in the current system, and you will barely get more speed. Intel is slapping lots of L3 cache on the chips to alleviate the problem, but the end cost of the chips is enormous. The G5's bus runs at half the speed of the processor, and each processor gets its own bus, so the G5 will scale well to 3 GHz this fall, with a 1. GHz bus to keep those processors fed. Intel seems to be hitting a wall.

Given that there are five times as many HardOCP members as there are OSX members, the average PPC box will have to be five times faster than the average x86 box for Team OSX to match the performance of Team HardOCP.

That's why distributed computing projects make mutltiple versions.

14 posted on 03/30/2004 10:09:22 AM PST by antiRepublicrat
[ Post Reply | Private Reply | To 13 | View Replies]

To: general_re
That's 1.5 GHz bus for the 3 GHz G5.
15 posted on 03/30/2004 10:10:42 AM PST by antiRepublicrat
[ Post Reply | Private Reply | To 13 | View Replies]

To: antiRepublicrat
Looking at third-party PPC/Intel results shows a dual G5 2 GHz roughly equal to a dual Xeon 3 GHz, less in integer and better in FP.

I haven't been following it for the last few months or so, so perhaps it's changed - got a link?

Anyway, the important thing, from Apple's perspective, is that it's competitive - you don't have to hold your nose when buying one ;)

I still think that Intel is making a mistake by not pushing I2 harder, but I guess we'll have to see where they go with x86-64. The latest batch of P4s/Xeons have an 800 MHz frontside bus, and so should the first x86-64s, which ought to keep them in the game, at least. It'll be fun to watch, as always...

16 posted on 03/30/2004 10:22:52 AM PST by general_re (The doors to Heaven and Hell are adjacent and identical... - Nikos Kazantzakis)
[ Post Reply | Private Reply | To 14 | View Replies]

To: general_re
I haven't been following it for the last few months or so, so perhaps it's changed - got a link?

Here's one. Apple does a lot better with MP because it's based on a processor designed from the ground up to work in MP environments. And it has a lot of room to grow. All this not so much thanks to Apple really, but mainly to IBM.

I still think that Intel is making a mistake by not pushing I2 harder, but I guess we'll have to see where they go with x86-64

The I2 is just way too expensive. It competes better with the POWER processors, the G5's big daddy. Compatibility with current apps is also a problem. It has an x86 compatibility mode, but it is very slow and doesn't have the SSE instructions.

The latest batch of P4s/Xeons have an 800 MHz frontside bus, and so should the first x86-64s, which ought to keep them in the game, at least.

The problem is that it's a shared bus. The Opteron starts getting an edge at 2x MP, and starts blowing away the higher clocked Xeon at 4x and especially 8x as the Xeon can't get fed data fast enough. This is a problem the G4 had. G5s, with their independent busses, will have the same advantage over Xeons.

I'm hoping Intel gets its act together, or AMD/Apple might get lazy.

17 posted on 03/30/2004 11:32:39 AM PST by antiRepublicrat
[ Post Reply | Private Reply | To 16 | View Replies]

To: antiRepublicrat
Middle-of-the-pack Intel and AMD against top of the line G5's? Here's another - single-chip AMD and Intel systems smoking dual G5's for lunch. Note the source ;)

The I2 is just way too expensive. It competes better with the POWER processors, the G5's big daddy.

That's because Intel is playing its usual twenty-year-old game of "it's a SERVER chip. Got it? SERVERS only". If/when they decided to mass-market it, you'd see the price drop quick enough, just like it has with every other "it's a SERVER chip" that Intel has ever made ;)

Compatibility with current apps is also a problem. It has an x86 compatibility mode, but it is very slow and doesn't have the SSE instructions.

Actually, I2 does SSE, but not SSE2. Anyway, the idea was to ditch IA-32/x86 altogether, remember?

The problem is that it's a shared bus. The Opteron starts getting an edge at 2x MP, and starts blowing away the higher clocked Xeon at 4x and especially 8x as the Xeon can't get fed data fast enough. This is a problem the G4 had. G5s, with their independent busses, will have the same advantage over Xeons.

So far that's not translating into much of a handicap versus AMD or G5. Eventually they'll have to rearchitect the frontside, but eventually everyone will have to scrap everything they're currently doing - in the here and now, it's not a serious problem.

Anyway, what do I care? We can always switch to AMD, right?

Errr, well, one of us can ;)

18 posted on 03/30/2004 12:01:14 PM PST by general_re (The doors to Heaven and Hell are adjacent and identical... - Nikos Kazantzakis)
[ Post Reply | Private Reply | To 17 | View Replies]

To: general_re
Anyway, what do I care? We can always switch to AMD, right? Errr, well, one of us can ;)

Already did. All my new computers are AMD, but with a switch to Mac at the next systems. I just love that OS, and have great faith in IBM to keep that chip advancing, especially since it ties in with their POWER server line.

19 posted on 03/30/2004 12:27:20 PM PST by antiRepublicrat
[ Post Reply | Private Reply | To 18 | View Replies]

To: antiRepublicrat
Maybe one of these days I'll give it a whirl on an x86 system. ;)
20 posted on 03/30/2004 1:03:41 PM PST by general_re (The doors to Heaven and Hell are adjacent and identical... - Nikos Kazantzakis)
[ Post Reply | Private Reply | To 19 | View Replies]


Navigation: use the links below to view more comments.
first 1-2021-35 next last

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
News/Activism
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson