Free Republic
Browse · Search
News/Activism
Topics · Post Article

Warning of data ticking time bomb
BBC ^ | 03 July 2007 | Unknown

Posted on 07/05/2007 9:30:29 AM PDT by ShadowAce

The growing problem of accessing old digital file formats is a "ticking time bomb", the chief executive of the UK National Archives has warned.

Natalie Ceeney said society faced the possibility of "losing years of critical knowledge" because modern PCs could not always open old file formats.

She was speaking at the launch of a partnership with Microsoft to ensure the Archives could read old formats.

Microsoft's UK head Gordon Frazer warned of a looming "digital dark age".

Costly deal

He added: "Unless more work is done to ensure legacy file formats can be read and edited in the future, we face a digital dark hole."

Research by the British Library suggests Europe loses 3bn euros each year in business value because of issues around digital preservation.

The National Archives, which holds 900 years of written material, has more than 580 terabytes of data - the equivalent of 580,000 encyclopaedias - in older file formats that are no longer commercially available.

Ms Ceeney said: "If you put paper on shelves, it's pretty certain it is going to be there in a hundred years.

"If you stored something on a floppy disc just three or four years ago, you'd have a hard time finding a modern computer capable of opening it."

"Digital information is in fact inherently far more ephemeral than paper," warned Ms Ceeney.

She added: "The pace of software and hardware developments means we are living in the world of a ticking time bomb when it comes to digital preservation.

"We cannot afford to let digital assets being created today disappear. We need to make information created in the digital age to be as resilient as paper."

But Ms Ceeney said some digital documents held by the National Archives had already been lost forever because the programs which could read them no longer existed.

"We are starting to find an awful lot of cases of what has been lost. What we have got to make sure is that it doesn't get any worse."

The root cause of the problem is the range of proprietorial file formats which proliferated during the early digital revolution.

Technology companies, such as Microsoft, used file formats which were incompatible not only with software from rival firms, but also with different versions of the same program.

Mr Frazer said Microsoft had shifted its position on file formats.

"Historically within the IT industry, the prevailing trend was for proprietary file formats. We have worked very hard to embrace open standards, specifically in the area of file formats."

Microsoft has developed a new document file format, called Open XML, which is used to save files from programs such as Word, Excel and PowerPoint.

Mr Frazer said: "It's an open international standard under independent control. These are no longer under control of Microsoft and are free for access by all."

But some critics question Microsoft's approach and ask why the firm has created its own new standard, rather than adopting a rival system, called the Open Document Format.

Instead, Microsoft has released a tool which can translate between the two formats.

Ben Laurie, director of the Open Rights Group, said: "This is a well-known, standard Microsoft move.

"Microsoft likes lock-ins. Typically what happens is that you end up with two or three standards."

The agreement between the National Archives and Microsoft centres on the use of virtualisation.

The archive will be able to read older files in the format in which they were originally saved by running emulated versions of the older Windows operating systems on modern PCs.

For example, if a Word document was saved using Office 97 under Windows 95, then the National Archives will be able to open that document by emulating the older operating system and software on a modern machine.

Ms Ceeney said the issue of older file formats was a bigger problem than reading outdated forms of media, such as floppy discs of various sizes and punch cards.

"The media it is stored in is not relevant. Back-up is important, but back-up is not preservation."

Adam Farquhar, head of e-architecture at the British Library, praised Microsoft for its adoption of more open standards.

He said: "Microsoft has taken tremendous strides forward in addressing this problem. There has been a sea change in attitude."

He warned that the issue of digital preservation did not just affect National Archives and libraries.

"It's everybody - from small businesses to university research groups and authors and scientists.

"It's a huge challenge for anyone who keeps digital information for more than 15 years because you are talking about five different technology generations."

The British Library and National Archives are members of the Planets project which brings together European National Libraries and Archives and technology companies to address the issue of digital preservation.

He said that open file formats were an important step but there was still work to be done.

"Automation is a key area to work on. We need to be able to convert hundreds and even thousands of documents at a time," he said.
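A first step in the kind of bulk conversion described above is finding out which formats a collection actually contains. The following is only a minimal Python sketch, not anything the article attributes to the National Archives or the British Library: the names `SIGNATURES`, `identify` and `survey` are made up for illustration, and the three signatures listed are a tiny sample. It labels each file by its leading "magic" bytes rather than trusting the file extension.

```python
from collections import Counter
from pathlib import Path

# A small, illustrative sample of well-known leading byte signatures.
# A real survey would need a far larger table.
SIGNATURES = [
    (b"%PDF-", "pdf"),
    # ZIP container: used by Open XML and OpenDocument files, among others.
    (b"PK\x03\x04", "zip-container"),
    # OLE2 compound file: used by the older binary Word/Excel/PowerPoint formats.
    (b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1", "ole2-compound"),
]


def identify(path):
    """Return a coarse format label based on the first bytes of the file."""
    with open(path, "rb") as f:
        head = f.read(8)
    for magic, label in SIGNATURES:
        if head.startswith(magic):
            return label
    return "unknown"


def survey(root):
    """Count format labels for every regular file below root."""
    return Counter(identify(p) for p in Path(root).rglob("*") if p.is_file())
```

Calling `survey("some/archive/folder")` returns a tally such as how many files look like PDFs and how many are unrecognised, which gives a rough idea of how much conversion work lies ahead. The folder name is a placeholder.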


TOPICS: Technical
KEYWORDS: data; dataaccess; datasafety; format; legacyfiles; operatingsystems; preservation
To: stainlessbanner; taxcontrol
There's really two parts to the discussion: the physical file format and the storage media. Both will need to be accessible and operational for future use.

That's true, but I'm more worried about the media side. The media needs to hold up physically, and there needs to be continued availability of hardware that can read it. You can address both concerns by copying all your important stuff onto new, current media from time to time.

I'm less concerned about file formats. They are pure information. And when the time comes, chances are someone will have written the necessary conversion. When Captain Kirk is born on 22 March 2233, they will still be able to read PDFs.

To take an extreme example, taxcontrol advocates imaging your hard drive periodically, thereby simultaneously backing up your data and the applications to read it. Now, when Captain Kirk arrives, chances are there won't be any PCs capable of booting up your drive image. But there will almost certainly be virtual machine applications capable of simulating a PC on the computers of 2233. They'll just need to be able to read the drive image off the physical media.
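For the "copy it onto new media from time to time" routine, it helps to confirm that each fresh copy is bit-for-bit identical to the original. Here is a minimal Python sketch of that idea. The function names `sha256_of` and `refresh_copy` are invented for illustration, and SHA-256 is just one reasonable choice of checksum.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path):
    """Hash a file in chunks so large drive images do not need to fit in memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def refresh_copy(src, dst):
    """Copy src to dst, then verify the copy by comparing checksums.

    Returns the checksum on success and raises OSError on a mismatch.
    """
    src, dst = Path(src), Path(dst)
    dst.parent.mkdir(parents=True, exist_ok=True)
    shutil.copy2(src, dst)  # copy2 also tries to preserve timestamps
    original, copied = sha256_of(src), sha256_of(dst)
    if original != copied:
        raise OSError(f"copy of {src} to {dst} does not match the original")
    return copied
```

Keeping the returned checksum in a plain text file next to the copy means a later check can detect whether the media has silently degraded since it was written.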

41 posted on 07/05/2007 10:42:09 AM PDT by cynwoody
[ Post Reply | Private Reply | To 23 | View Replies]

To: proxy_user; ShadowAce

“Well, there’s always .txt.”

Is that EBCDIC, UTF-8, ASCII, ISO 8859-1, Big-5 or...

And what will people do in 2050 if vi is deprecated in 2036?
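To make the encoding question concrete, here is a small Python sketch that tries the same bytes under several of the encodings named above. The function name `decode_candidates` is made up for illustration, and `cp037` is assumed as a stand-in for "EBCDIC", which in practice comes in many variants.

```python
def decode_candidates(raw, encodings=("cp037", "utf-8", "ascii", "iso-8859-1", "big5")):
    """Try each encoding on the same bytes.

    Returns a dict mapping encoding name to the decoded text,
    or to None when the bytes are not valid in that encoding.
    """
    results = {}
    for name in encodings:
        try:
            results[name] = raw.decode(name)
        except UnicodeDecodeError:
            results[name] = None
    return results


if __name__ == "__main__":
    # "Hello" as written by a system using EBCDIC code page 037.
    mystery = "Hello".encode("cp037")
    for name, text in decode_candidates(mystery).items():
        print(f"{name:12} {text!r}")
```

Only one of the candidates gives back the original word. ISO 8859-1 never raises an error because every byte value is valid in it, so it quietly returns garbage, which is exactly why a bare .txt file with no note about its encoding is less self-describing than it looks.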


42 posted on 07/05/2007 10:49:27 AM PDT by PissAndVinegar
[ Post Reply | Private Reply | To 4 | View Replies]

To: BuffaloJack

2 words:
Taiyo Yuden

Stop buying those crapass Memorex CDs and you’ll have far fewer coasters. Not all CDs/DVDs are created equal (and I have a special hatred for Memorex after one wasted evening).


43 posted on 07/05/2007 10:57:24 AM PDT by PissAndVinegar
[ Post Reply | Private Reply | To 17 | View Replies]

To: stainlessbanner
CD/DVD is not good media for archiving important data.

I've been transferring all our old home movies from VHS to DVD. The VHS copies were degrading, with lots of dropouts. I was hoping the DVDs would be more robust, but now I see they're not.

So what is a good medium for preserving something as precious as videos of my kids growing up?

44 posted on 07/05/2007 10:58:04 AM PDT by scan59 (Let consumers dictate market policies. Government just gets in the way.)
[ Post Reply | Private Reply | To 23 | View Replies]

To: ShadowAce
This is a serious and very expensive problem. A financial company for which I used to contract was forced to keep an ancient UNISYS system up and running in order to comply with auditing requirements. IIRC, it was costing the company over $100,000 for the service contract each year, until they were able to find a company that was able to do the data conversion to the IRS's satisfaction. IIRC, it took them more than 5 years before they were able to get rid of the dinosaur system.

Mark

45 posted on 07/05/2007 11:03:58 AM PDT by MarkL (Listen, Strange women lyin' in ponds distributin' swords is no basis for a system of government)
[ Post Reply | Private Reply | To 1 | View Replies]

To: scan59
I've got data on CD/DVD and external hard drives at this point. I'm considering a RAID chassis to prevent data loss. I don't have a real good answer though.

Maybe there is an online data storage service?

46 posted on 07/05/2007 11:06:02 AM PDT by stainlessbanner
[ Post Reply | Private Reply | To 44 | View Replies]

To: LIConFem
Wow, another vi user!! Thought I was the last one! ;o)

Nope, and I use it on Win32 systems too!

Mark

47 posted on 07/05/2007 11:07:23 AM PDT by MarkL (Listen, Strange women lyin' in ponds distributin' swords is no basis for a system of government)
[ Post Reply | Private Reply | To 14 | View Replies]

To: sittnick
punched cards ... How about 9 track?

http://www.sunstarco.com/9_track_tape_drives.htm

http://www.cardamation.com/prices.html

48 posted on 07/05/2007 11:07:26 AM PDT by cynwoody
[ Post Reply | Private Reply | To 11 | View Replies]

To: ShadowAce

It’s about time our National Archives started to realize this problem, much of which is tied (as you know) to Microsoft’s proprietary formatting of data.

Thank goodness for OpenOffice and at least a few institutions recognizing the importance of data being in a completely open format.


49 posted on 07/05/2007 11:08:32 AM PDT by AFPhys ((.Praying for President Bush, our troops, their families, and all my American neighbors..))
[ Post Reply | Private Reply | To 2 | View Replies]

To: BuffaloJack

Ektachrome slides are all red. I had that issue as well. Kodachrome slides from the ‘50’s held up pretty well.


50 posted on 07/05/2007 11:10:36 AM PDT by Cyber Liberty (Did Dennis Kucinich always look like that or did he have to submit to a series of shots? [firehat])
[ Post Reply | Private Reply | To 12 | View Replies]

To: AFPhys

Oh: on second read - this is the British National Archives, and they’re singing the praises of M$ - who have CAUSED a great deal of the problems of data by hiding their format “secrets”. There’s a long way to go yet... and again, I’m glad Open Office has released a good standard format, and that many data firms worked with them to develop something that was good.


51 posted on 07/05/2007 11:21:32 AM PDT by AFPhys ((.Praying for President Bush, our troops, their families, and all my American neighbors..))
[ Post Reply | Private Reply | To 49 | View Replies]

To: BuffaloJack
By faded, I mean there was no trace of any color except red.

Maybe they were taken in Pleasantville?

52 posted on 07/05/2007 11:33:05 AM PDT by Ignatz (Did you know that before the internal combustion engine, there was no weather at all?)
[ Post Reply | Private Reply | To 12 | View Replies]

To: scan59; All

I have read all the comments by various posters and want to make some comments to everyone in general. I am likewise more concerned about the longevity of the storage media and the working hardware being available to read it than I am about file format compatibility. I expect some media to outlast others. I suggest a parallel backup strategy using several media types to hedge your bets. For instance, DVD-RAM and Magneto-Optical don’t use dyes and may last longer than CDs and DVDs, provided the plastic substrate does not crack from age (as I have already seen with CDs). The downside is that those technologies and their drives are proprietary. There are also archival-grade CDs and DVDs available. Mothballing new hardware for future access may not ultimately work because electronics that are not used seem to fail sooner than electronics that are left on. Why? Some electronic components, such as electrolytic capacitors (found on many circuit boards), go bad much sooner if not used.


53 posted on 07/05/2007 11:35:24 AM PDT by TexasRepublic (Afghan protest - "Death to Dog Washers!")
[ Post Reply | Private Reply | To 44 | View Replies]

To: scan59
Get one or more big USB attached drives. Always have more than one copy of your important JPGs, MPGs, WMV, etc. Pass one or more of these on to family members. That way, if your house burns down, is flooded, etc., you will still have your memories around. When you need to get a bigger drive, don’t wipe out the old one but keep it around as yet another layer of backup. I personally don’t have any important media file in fewer than three places.

The attached drives are cheap and are much more durable than DVDs. It will also be a lot easier to copy all of your movies for your kids from a single drive rather than dozens of DVDs, particularly when you can’t remember where you left or loaned out some of those DVDs.

I particularly like the 2.5 inch attached drives as the good ones are small enough to comfortably fit in a coat pocket and don’t require an external power source other than the USB cable. You can easily take them on a trip to share movies with an out-of-town family member and get a copy of their media files while you are at it.
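The "no important file in fewer than three places" rule is easy to check mechanically. What follows is a rough Python sketch under stated assumptions: each backup drive is mounted as an ordinary folder, files count as the same if their contents hash identically, and the names `content_hash` and `under_replicated` are hypothetical.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def content_hash(path):
    """SHA-256 of a file's contents, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def under_replicated(drives, min_copies=3):
    """Find content that exists on fewer than min_copies of the given drives.

    Returns {content hash: sorted list of paths where it was found}.
    Two copies on the same drive count once, since they share a single
    point of failure.
    """
    seen = defaultdict(dict)  # hash -> {drive: first path found on that drive}
    for drive in map(Path, drives):
        for p in drive.rglob("*"):
            if p.is_file():
                seen[content_hash(p)].setdefault(drive, p)
    return {
        digest: sorted(str(p) for p in locations.values())
        for digest, locations in seen.items()
        if len(locations) < min_copies
    }
```

Pointing it at the mount points of your drives lists whatever still needs another copy. An empty result means everything found is on at least three of them.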

54 posted on 07/05/2007 11:38:04 AM PDT by Jeff F
[ Post Reply | Private Reply | To 44 | View Replies]

To: Cyber Liberty
Ektachrome slides are all red. I had that issue as well. Kodachrome slides from the ‘50’s held up pretty well.

Ditto on both points. I have 16mm Kodachrome movie film of my father as a kid dating back to its first year of production in 1935 -- the color appears to me to be as good as it likely ever was.

55 posted on 07/05/2007 11:42:57 AM PDT by TexasRepublic (Afghan protest - "Death to Dog Washers!")
[ Post Reply | Private Reply | To 50 | View Replies]

To: TexasRepublic

BUMP!


56 posted on 07/05/2007 12:05:50 PM PDT by Publius6961 (MSM: Israelis are killed by rockets; Lebanese are killed by Israelis.)
[ Post Reply | Private Reply | To 55 | View Replies]

To: Frank Sheed
Are zip drives now dead?

You can still buy them. The external USB version is probably the most practical one. When 256MB USB thumb drives went below the price of Zip cartridges, that killed them.

The Iomega Jaz drive is in even worse shape. You can barely even find them on ebay.

But we have survived orphaned 5 1/4" floppies, Iomega Bernoulli Boxes, Coleco Adam Digital Datapack Drives, even the 2.88 MB floppy used in the NeXT machine and supported but rarely used in IBM Thinkpads. Oh, and don't forget all those Magneto-Optical formats and TRS-80 data cassettes (which were just leaderless audio cassettes).

57 posted on 07/05/2007 12:10:04 PM PDT by sittnick (There is no salvation in politics.)
[ Post Reply | Private Reply | To 16 | View Replies]

To: scan59
I've been transferring all our old home movies from VHS to DVD. The VHS copies were degrading, with lots of dropouts. I was hoping the DVDs would be more robust, but now I see they're not.

DVDs are much more robust than VHS. Every time you play a VHS tape, it comes in contact with the read heads and the image is degraded.

You are hearing horror stories in this thread, but CDs and DVDs are currently the best archival storage solution for an ordinary consumer.

For something really important like your children's videos, transfer those to DVD and use a very good brand. Make two or three copies (e.g. one for use, one for a backup at home and another for an off-site backup in case of fire).

Every year or two, make some additional backup copies on new media just to be safe.

It is especially important to get film photographs transferred to CD and the sooner the better while the negatives are not degraded.

Flash drives and memory cards are not currently long term storage solutions. Disk arrays (RAID) are not backup solutions either - they are availability solutions.

58 posted on 07/05/2007 12:48:28 PM PDT by JeffAtlanta
[ Post Reply | Private Reply | To 44 | View Replies]

To: ShadowAce

My brother had a lot of sermons and correspondence on Epson (I think) word processor disks.
Haven’t found a way to access them and transfer the text to a more compatible format.


59 posted on 07/05/2007 12:50:00 PM PDT by Vinnie (You're Nobody 'Til Somebody Jihads You)
[ Post Reply | Private Reply | To 2 | View Replies]

To: Frank Sheed
Are zip drives now dead? I still have backups stored on zip disks. I would presume the external drives can still be found and installed on a port?

You should probably consider transferring the data off of the zip disks. Zip disks did suffer from the "click of death" - this is when the user hears a clicking sound which heralds the very sudden death of the zip disk.

60 posted on 07/05/2007 12:53:35 PM PDT by JeffAtlanta
[ Post Reply | Private Reply | To 16 | View Replies]



Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson