Free Republic
Browse · Search
News/Activism
Topics · Post Article

Skip to comments.

A new way to stop digital decay
The Economist ^ | September 15, 2005 | Economist Staff

Posted on 09/20/2005 4:13:50 PM PDT by Zuben Elgenubi



A new way to stop digital decay

Sep 15th 2005
From The Economist print edition


Computing: Could a “virtual computer”, built from software, help to save today's digital documents for historians of the future?

WHEN future historians turn their attention to the early 21st century, electronic documents will be vital to their understanding of our times. Old web pages may not turn yellow and brittle like paper, but the digital documents of today's culture face a more serious threat: the disappearance of computers able to read them. Even a relatively simple electronic item, such as a picture, requires software to present it as a visible image, but 100 years from now, today's computers will have long since become obsolete. More complex items, like CD-ROMs or videos, will be unreadable even sooner.

In 1986, for example, 900 years after the Domesday book, the BBC launched a project to compile data about Britain, including maps, video and text. The results were recorded on laserdiscs that could only be read by a special system based around a BBC Micro home computer. But since the disks were unreadable on any other system, this pioneering example of multimedia was nearly lost for ever. It took two and a half years of patient work with one of the few surviving machines to move the data on to a modern PC (it can be seen online at www.domesday1986.com).

National libraries are just starting to grapple with this problem as part of their new mandate to preserve digital culture. “It is a major problem, but it is remarkable how little known it is,” says Hilde van Wijngaarden, head of digital preservation at the National Library of the Netherlands. “People just accept that things no longer work after ten years.”

Keeping working examples of all computer hardware is impractical, so the most popular preservation strategy is to copy files from one generation of hardware to the next. The problem is that today's word processors and web browsers, for example, do not always display files in the same way that older software did. An accumulation of subtle errors can eventually make the original item unreadable. An alternative approach, called emulation, uses software to simulate the old hardware on a modern computer, to allow old software to run. But today's emulators will need another emulator to run on the next generation of hardware, which will need another emulator for the next generation, and so on. This can also introduce errors.

So the National Library of the Netherlands is exploring a third option, using a simulated computer that exists only in software. It is called the Universal Virtual Computer (UVC) and is being developed by IBM, a computer giant. The researchers are writing programs to run on this virtual computer that decode different document formats. Future libraries will have to write software that emulates the virtual computer on each new generation of computer systems. But once that is done, they will be able to view all their stored documents using the decoders written for the virtual computer, which only have to be written once. “The decoder can be tested for correctness today, while the format is still readable,” says Raymond van Diessen of IBM.

His team has written decoders for two common image formats, JPEG and GIF. They plan to move on to Adobe's PDF format. IBM is also talking to drug firms, which are required to store data from clinical trials for long periods. Ultimately, the aim is to be able to preserve anything from simple web pages to complex data sets. Ominously, some scientific data from the 1970s has already crumbled into unreadable digital bits.


TOPICS: Business/Economy; Culture/Society; Editorial; Technical
KEYWORDS: decay; diessen; digital; domesday; ibm
Navigation: use the links below to view more comments.
first 1-2021-4041-53 next last
Domesday ping
1 posted on 09/20/2005 4:13:52 PM PDT by Zuben Elgenubi
[ Post Reply | Private Reply | View Replies]

To: Zuben Elgenubi

2 posted on 09/20/2005 4:15:49 PM PDT by msnimje (Cogito Ergo Sum Republican)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Zuben Elgenubi

How hard would it be to construct a floppy drive with a voice-coil actuator for the heads and a two-channel 10Mhz ADC data capture front-end? I would think that three such drives (one each for 8", 5.25", and 3.5") would be able to read 99.99% of the floppies produced in those sizes (and would also, with proper programming, be better able to deal with bit rot than the drives of yesteryear.


3 posted on 09/20/2005 4:25:10 PM PDT by supercat (Don't fix blame--FIX THE PROBLEM.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: supercat

Good idea, supercat. Apply for the patent tomorrow.


4 posted on 09/20/2005 4:30:39 PM PDT by Zuben Elgenubi
[ Post Reply | Private Reply | To 3 | View Replies]

To: Zuben Elgenubi
Domesday ping

More like "Sky Is Falling" ping. (no offense intended:)

Information (data) has been 'lost' thru eons of history, but somehow scientists and historians have been able to recreate a fairly accurate picture of the past using only some of the smallest parcels of indirct data still in existance ... everything from the geological record, to the development of biology, to the origins of the Universe as we know it, came from these indirect scientific observations. It is doubtful that mere computer data will simply 'disappear' so easily as long as mankind is around to maintain it. There will always be someone around who still has a Timex/Sinclair antique PC sitting in their workshop :)

5 posted on 09/20/2005 4:32:40 PM PDT by Mr_Moonlight
[ Post Reply | Private Reply | To 1 | View Replies]

To: Zuben Elgenubi

I managed to recover about a hundred 20 MB Bernoulli cartridges one time. The OS ain't the problem.


6 posted on 09/20/2005 4:32:45 PM PDT by Billthedrill
[ Post Reply | Private Reply | To 1 | View Replies]

To: Billthedrill
I managed to recover about a hundred 20 MB Bernoulli cartridges one time. The OS ain't the problem.

Actually, one of the biggest problems, IMHO, is going to be determining what material is worth moving to newer formats. Even if it only takes ten seconds to convert the contents of a floppy into modern format, what is somebody with a bunch of floppies and no particular clue what's one them supposed to do with them? One of the advantages of printed material is that in many cases one can pick up an item, look at it, and have some clue what it is. Even with movie film that can be somewhat possible if one has good eyesight (much easier with 35mm than 8mm, though!). Another difficulty--and this applies not just to computers but to all types of material--is the loss of metadata. A computer may be able to tell that a file contains a picture, and a human may be able to tell that it contains a picture of a woman holding a baby. But who are the woman and the baby? If there isn't anyone around to identify the significance of a picture, that significance will be lost even if the picture itself remains.

7 posted on 09/20/2005 4:56:46 PM PDT by supercat (Don't fix blame--FIX THE PROBLEM.)
[ Post Reply | Private Reply | To 6 | View Replies]

To: supercat
Right you are - it's been the biggest challenge of archaeology to provide metadata for what comes out of the digs. And I can just imagine somebody coming on an FR archive a couple of centuries hence and understanding English just fine, but what in the world is this "series," "hugh," "moose," "cheese," "AYBABTU," or "Hillary's hideous curdlike cellulite thighs"?

"My name is Ozymandius, king of kings,"
"Look on my beebers, ye mighty, and be stuned."

8 posted on 09/20/2005 5:03:19 PM PDT by Billthedrill
[ Post Reply | Private Reply | To 7 | View Replies]

To: Peanut Gallery

PING


9 posted on 09/20/2005 5:07:28 PM PDT by Professional Engineer (As an Engineer, you too can control the awesome power of the Ductalator.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Zuben Elgenubi
Answer: Vacuum seal the the computer that wrote the data and store it with the media.

Some problems are just too easy.

10 posted on 09/20/2005 5:14:54 PM PDT by RockyMtnMan
[ Post Reply | Private Reply | To 1 | View Replies]

To: Zuben Elgenubi

ping for later.


11 posted on 09/20/2005 5:18:43 PM PDT by conservatism_IS_compassion (The idea around which liberalism coheres is that NOTHING actually matters but PR.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Zuben Elgenubi
The biggest problem for historians reading electronic documents will be in convincing them that we've always been at war with Oceania.

-PJ

12 posted on 09/20/2005 5:21:53 PM PDT by Political Junkie Too (It's still not safe to vote Democrat.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: supercat; Zuben Elgenubi

> ... one of the biggest problems, IMHO, is going to be
> determining what material is worth moving to newer formats.

And then you'll discover that the worthwhile stuff is
all DRM-protected and unreadable. It may even be out of
copyright, but the lawyers who hold the decryption keys
will, alas and joy, be long dead.


13 posted on 09/20/2005 5:34:05 PM PDT by Boundless
[ Post Reply | Private Reply | To 7 | View Replies]

To: ShadowAce

ping


14 posted on 09/20/2005 5:50:13 PM PDT by JoJo Gunn (Help control the Leftist population. Have them spayed or neutered. ©)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Zuben Elgenubi

That's why my office only uses microfilm to preserve our old documents. I am a records manager and oversee the microfilming operation in one office in our county courthouse. They tell me that the newer microfilm has a shelf-life of 500 years if stored properly. The technology to read microfilm is so simple and will always be available. We have considered scanning/microfilming, but the equipment is very expensive. I'm hoping that we will be able to do that eventually - it would certainly speed up the process!

We have to be able to dispose of records but I would refuse to throw them out if they were on disk-only. Heck, I have computer games from five years ago that I can't play on my newer computer!


15 posted on 09/20/2005 6:34:56 PM PDT by sneakers
[ Post Reply | Private Reply | To 1 | View Replies]

To: Mr_Moonlight
You've seen my workshop?

...There will always be someone around who still has a Timex/Sinclair antique PC sitting in their workshop :)...

or two...

with 16k RAM modules and audio cassette software

16 posted on 09/20/2005 6:56:11 PM PDT by Covenantor
[ Post Reply | Private Reply | To 5 | View Replies]

To: sneakers

// They tell me that the newer microfilm has a shelf-life of 500 years if stored properly. //

Unfortunately, a lot of stuff on microfilm is barely legible even when new. As for long-lived storage, how about daguerotype? That's stored as metalic mercury on metalic silver, right?


17 posted on 09/20/2005 7:18:06 PM PDT by supercat (Don't fix blame--FIX THE PROBLEM.)
[ Post Reply | Private Reply | To 15 | View Replies]

To: Covenantor
with 16k RAM modules and audio cassette software

Are those the original bottle caps used to prop up the 16k RAM module, or did you eventually solder and/or hardwire the pack to the back of the unit ? /grin

18 posted on 09/20/2005 7:19:35 PM PDT by Mr_Moonlight
[ Post Reply | Private Reply | To 16 | View Replies]

To: supercat
A computer may be able to tell that a file contains a picture, and a human may be able to tell that it contains a picture of a woman holding a baby. But who are the woman and the baby? If there isn't anyone around to identify the significance of a picture, that significance will be lost even if the picture itself remains.

Many years ago, American Heritage magazine printed a circa-1890s photo of a man, a woman, and a horse. They dutifully noted that the only information about the photo was from a handwritten caption on the back, "The horse's name is Fred."

19 posted on 09/20/2005 7:31:59 PM PDT by Gumlegs
[ Post Reply | Private Reply | To 7 | View Replies]

To: Gumlegs
...he was a good horse, whadda' hayburner!
20 posted on 09/20/2005 7:50:12 PM PDT by norraad ("What light!">Blues Brothers)
[ Post Reply | Private Reply | To 19 | View Replies]


Navigation: use the links below to view more comments.
first 1-2021-4041-53 next last

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
News/Activism
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson