The Macrosite for News, Analysis and Opinion about the Future of the Internet
David Vellante

Information Management Is Broken, But the Fix Is Coming

Written by David Vellante
2/9/2009 7 comments
no ratings
DISCUSS   Digg   Del.icio.us   Reddit   Email This   TWEET THIS

We’ve heard the clichés: “Information is our greatest asset”; “We live in an information economy”; “Information is the lifeblood of our organization.” Sadly, what used to be all about collaboration and user productivity is now, in many organizations, about protecting against lawsuits.

The reasons are clear: Courts have upheld that all electronic information is discoverable, meaning virtually everything in an organization can be used in a lawsuit. Risk management is now a higher priority than user productivity. As a result, corporations are forced to overspend to ensure they can find documents and prove they have adequate records retention processes. As I heard frequently last week at LegalTech, an event where lawyers and IT people come together, some organizations have even resorted to keeping everything electronic, forever.

Not only is this approach expensive, it’s not sustainable, for three reasons:

    1) Costs of storing all electronic records forever is increasing and will continue to increase, forever.

    2) E-discovery expenses are directly related to the amount of data retained -- and these costs are skyrocketing.

    3) Keeping documents forever increases legal risk by creating opportunities for adversarial counsel to misuse information and rewrite history.

I’ve written here before on why "archive everything" strategies are doomed and how personal information is exploding and needs to be approached differently. I’m more convinced than ever that today’s email archiving and e-discovery systems can’t support the future.

I found two startups recently that I believe are pointing the way, however. Digital Reef is a Massachusetts-based company that, according to VP of marketing Brian Giuffrida, is shipping production versions of software that auto-discovers, indexes, and classifies large amounts of unstructured data -- quickly, across formats and distributed locations.

Rational Retention is a New York-based company. The firm’s COO, Michael McCreary, who used to head Pfizer’s Legal IT group, told me that it uses lightweight agents sitting on PCs, in file shares, email servers, and document management systems to extract metadata for every user-created document. The metadata is used by a centralized system to manage documents across their entire lifecycles.

The key to these companies’ vision is that they recognize that shoving all documents into a central archive is too expensive, doesn’t scale, and won’t adequately protect organizations. Rather, these firms are creating “virtual repositories” and managing information where the content lives -- on desktops, laptops, mobile devices, wikis, etc.

Rising storage and e-discovery costs and the exposure involved with keeping data forever will drive customers from centralized compliance archives toward a decentralized “preserve in place” model. Fundamental technology enablers include the following:

  • Auto-classification at the point of content creation
  • Powerful search
  • An architecture that uses endpoint device intelligence
  • Centralized metadata repositories that communicate with a policy engine to make decisions at the OS and application levels

The result will be a far more cost-effective, automated, command-and-control decision-making capability, supporting a distributed, mobile, heterogeneous workforce. I expect this vision will eventually allow us to get back to thinking about information as an asset and not a liability.

— David Vellante is a co-founder of ITCentrix, Barometrix, and The Wikibon Project.

DISCUSS   Digg   Del.icio.us   Reddit   Email This
Current display:       newest comments first       display in chronological order
David Vellante
Thinkernetter
Wednesday February 11, 2009 12:05:13 PM
no ratings

adblah, I couldn't agree more and that is the central premise of my post. But the devil as they say is in the details. My understanding is the law and regulations on records retention are pretty narrow in fact. Most of what we have doesn't need to be retained unless there's a legal hold on a 'topic.' For example, let's say a pharma is getting sued in relation to Cox-2 inhibitors. All emails and docs pertaining to that topic must be retained. The challenge is to preserve those documents in place and/or dramatically reduce the capacity and time required to archive those documents; AND (this is very important) being able to defensibly delete those docs. 

We're starting to see two broad approaches emerge to deal with this problem: 1) new technologies to dramatically streamline archiving and make central repositories smaller and easier to search and 2) technologies that enable a 'preserve in place' model that allows the life cycle of a document to be managed where it lives. 

The key enabler to both of these approaches is auto-classification technologies that extend the capabilities of today's approaches. Vector algorithms that dramatically increase the number of categorical dimensions seem to be pointing the way.

These are very challenging inventions-- not trivial.  

abdlah
IQ Crew
Wednesday February 11, 2009 6:26:40 AM
no ratings

It is important that in trying to better information management, we let what we "need" lead the way. The idea's and comments shared here are well and good, but to let lawyers determine the way to manage IT information is not right.

User's cannot afford to bear cost of keeping information just because a lawyer will want that. Of course it is important that what is needed is kept and what is of no use, discarded.

 

Clearly, if it is of no use, it wouldn't help the lawyer anyway so why hang on to it at a cost? I am definitely not saying people should disobey the law, for if the law says a company must keep some information, then the company 'needs' that information and must keep it.

Lets work to keep information management relevant to the times. 

David Vellante
Thinkernetter
Monday February 9, 2009 10:05:35 PM
no ratings
Well put Deborah. Information as liability and information as asset have been in a tug of war for several years now. Context as you suggest is crucial to value and classification is the mainspring of context. Auto-classification is the enabler to successful scaling of these systems.
DHagar
Thinkernetter
Monday February 9, 2009 7:29:47 PM
no ratings

I totally agree that we need to do a much better job of organizing and managing data in order to reduce the cost and complexity of megasystem storage. 

The companies and technology tools that you mention are great support systems for moving that direction.  It also reminds us that the technology still has to be managed in order to get the benefits from it.

I still believe that information is our most valuable asset.  It is up to us to create systems that optimize the information while managing the cost of creating and organizing data and turning it into quality information that betters our decision making and corporate performance.  We need good management to keep it in context and to create the value that will make it an asset.

DHagar

Gary MacFadden
Rank: Cave Painter
Monday February 9, 2009 7:06:51 PM
no ratings
To Dave's point, corporations need to get smarter about how they manage unstructured data - in particular email and other esi.  A save everything strategy is expensive and potentially risky.  To rsheel's point, users tend to be pack rats.  So having a robust policy management engine and supporting software that can manage esi at the point of creation is key. Sadly many of the larger message archiving vendors in the space are focused more on integration of modules and acquiring additional companies to fill out their portfolio of products.  However, this provides an opportunity for start ups who have developed truly innovative products to fill in the holes for IT staff attempting to improve performance and manageability with an eye on lowering costs and improving ROI.
David Vellante
Thinkernetter
Monday February 9, 2009 7:02:53 PM
no ratings

It's even more insidious rsheel. Lawyers are now driving the strategy, not the business. It's understandable but not sustainable in my view. Attorneys need to make sure they can discover everything electronic. Hence the legal requirement to archive everything.

The key is to find ways to defensibly delete documents that aren't needed. Technology got us into this problem and the good news is technology and process can help us get out. 

rsheel
IQ Crew
Monday February 9, 2009 6:26:20 PM
no ratings
Its 'pack-rat' in every one of us that forces us to keep all the data archived.  It’s an interesting human psychology that forces individuals to keep all the data. Or is it lack of time.  Since you don’t have time to check and see what's relevant, do we archive everything?  But the downside of that is there is no way to retrieve those old data.  What happens to data in 5 1/4 inch floppy drive?  As technology becomes obsolete, we will be forced to discard information that cannot be retrieved. 
The ThinkerNet does not reflect the views of TechWeb. The ThinkerNet is an informal means of communication to members and visitors of the Internet Evolution site. Individual authors are chosen by Internet Evolution to blog. Neither Internet Evolution nor TechWeb assume responsibility for comments, claims, or opinions made by authors and ThinkerNet bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.
previous posts from David Vellante
David Vellante
Lately, I've been hanging around some cloud technologists, cloud service providers, virtualization customers, and security practitioners. I've been asking a lot of basic questions, trying to understand when and how cloud computing/virtualization will be ready to support any application or workload. UPDATED 2/2 4:00 PM
David Vellante
David Vellante   1/25/2010   25 comments
In a shocker of a news flash, the Chinese government has flatly denied any involvement in the cyber attacks on Google (Nasdaq: GOOG) and other Websites.
David Vellante
David Vellante   1/21/2010   5 comments
No matter what you call it -- virtualizaton, cloud computing, or "The Big Switch" -- a return to centralized computing is a widespread trend in our industry. And the general consensus is this movement will continue for a decade or more.
David Vellante
David Vellante   1/14/2010   3 comments
Last August at VMworld, VMware Inc. (NYSE: VMW) CEO Paul Maritz told me in the hallway, “We’re at war with everybody.” He was being flip, but this was no joke. The company had just gone out and paid $400 million-plus (about 20x revenues) for Springsource, an open-source Java framework developer.
David Vellante
David Vellante   1/6/2010   9 comments
Just like every other social media platform these days, Digg has Twitter envy.
5
of
IETV: the thinkerNet on film
5
of
2pm EST
Tue
Feb 23rd
2pm EST
Thu
Mar 4th
3pm EST
Tue
Mar 9th
an IBM information resource
sponsored content
big blue blog
Todd Watson
IBM is announcing today the first of its Power7 processor-based systems and the Power7 processor itself at an event in NYC.
white papers & case studies
an IBM information resource
sponsored content
Smarter Collaboration: How to Thrive in a Challenging Business Environment
Market conditions are changing faster than ever, and organizations need to improve their agility and adaptability in order to provide better service and improve processes. The ability to work with customers, business partners, and employees as effectively as possible - while at the same time holding down costs - is a key to success.

READ THIS eBOOK
your weekly update of news, analysis, and
opinion from Internet Evolution - FREE!

REGISTER HERE
Wanted! Site Moderators
Internet Evolution is looking for a handful of readers to help moderate the message boards on our site – as well as engaging in high-IQ conversation with the industry mavens on our thinkerNet blogosphere. The job comes with various perks, bags of kudos, and GIANT bragging rights. Interested?

Please email: moderators@internetevolution.com
CMP Media LLC
Internet Evolution – not for thickies
Congress Hits the Snooze Button With China
Ira Winkler
In his
recent Congressional testimony, Dennis Blair, the U.S. director of national intelligence, stated that the U.S. is "severely threatened" by cyber attacks and that the recent Google (Nasdaq: GOOG) attacks should serve as a wake-up call.

CLICK FOR MORE
Singer at C-Level
Goldilocks & the Data Center

2|4|10   |   3:39   |   2 comments


What kinds of companies are doing the most innovation in the data center? Turns out it's midtier enterprises that are taking the "Just Right" approach.
Full Nelson
Unified Collaboration Telepresence: Part 2

1|27|10   |   2:41   |   No comments


There are a few practical and affordable tools to help get people collaborating within enterprises. In Part 2, the Fritzoid talks about three of them.
Full Nelson
Unified Collaboration Telepresence: Part 1

Part 1 of 2   |  
See complete series
1|26|10   |   2:29   |   No comments


The promise of Unified Communications, Collaboration, and Telepresence are compelling, but it all sounds pretty pie-in-the-sky to the Admiral.
Sweeney Blog
An Obituary for Fax Machines

1|13|10   |   2:02   |   12 comments


It's time to say our final farewell to that technical workhorse of the 80s and 90s, says our Editor in Chief.
John Soat
Technology Santa Claus

12|23|09   |   2:06   |   2 comments


In the holiday spirit of giving, Technology Santa Clause offers a few words of advice to struggling IT professionals: ‘Be careful what you wish for.’
Reiter's Block
If a Google Phone Arrives, Does It Even Matter?

12|17|09   |   02:41   |   13 comments


Techies are going crazy over the possibility that Google might design and sell its own Android phone. Some writers say it's a very big deal. Reiter questions whether it will happen and, if it does, whether it even matters.
what.the.ferraro
More Pitiful Privacy from Facebook

12|16|09   |   02:08   |   2 comments


Facebook's new privacy controls just don’t cut it with little miss 'Air Quotes.'
Reiter's Block
Stop the Cellphone Touch Screen Insanity!

12|15|09   |   02:53   |   11 comments


Ever since the iPhone debuted, cellular manufacturers are rushing to incorporate touch screens into their phones. Alas, cellphone touch screens have significant problems that can actually harm business productivity. And doing business isn’t about getting the high score on Super Monkey Ball!
Sweeney Blog
Businesses Go on Year-End Spending Spree

12|14|09   |   02:03   |   5 comments


Businesses and VCs are burning through the last of 2009's cash with some last-minute spending and acquisitions.
Sebastian Stadil
The Basic Economics of the Cloud

12|11|09   |   2:56   |   3 comments


The problem with infrastructure these days is not the cost of the network but the cost of the people managing the network. Sebastian Stadil discusses how he'd like to see companies evolve towards a more manageable infrastructure using cloud computing.
Lee H. Berke
The Decline & Fall of Broadcast Television

2|9|10   |   1:00   |   No comments


Want to know the future of broadcast television? Take a look at broadcast radio’s past.
Tom Nolle
Everything New Is Old Again

2|9|10   |   2:13   |   6 comments


Research shows that the youth of today like Facebook – but not blogging or Twitter. Does that mean Facebook has won, or just that it's not yet out of favor? Will all the services we see today fade into Ovaltine-or-Wheaties status in just a few years?
what.the.ferraro
Email Marketing Gets Desperate

2|8|10   |   2:31   |   4 comments


Promotional emails will use just about anything timely to get people to buy things. Seriously, anything.
Steve Saunders' Outernet
America, Truck Yeah!

2|8|10   |   1:42   |   5 comments


Steve likes his new Dodge Ram 1500, but hates Chrysler's Web non-sales strategy. Rant on, li'l buddy.
what.the.ferraro
Twits Go Wild for Resignation Tweet

2|5|10   |   1:48   |   4 comments


Jonathan Schwartz is the first Fortune 200 CEO to resign via Tweet. Can he walk on water, too?
Full Nelson
Go With the FLO, Part 2

Part 2 of 2   |  
See complete series
2|5|10   |   2:17   |   3 comments


Fritz and his sweater continue their review of Qualcomm's FLO TV.
Singer at C-Level
Goldilocks & the Data Center

2|4|10   |   3:39   |   2 comments


What kinds of companies are doing the most innovation in the data center? Turns out it's midtier enterprises that are taking the "Just Right" approach.
Full Nelson
Go With the FLO, Part 1

Part of 2   |  
See complete series
2|4|10   |   2:39   |   1 comment


Qualcomm's FLO TV gizmo streams live TV shows. Tragically, they include the O'Reilly Factor
Eurotrash
High & Dry in Barcelona

2|3|10   |   1:08   |   No comments


Ray’s heading to Barcelona for the Mobile World Congress, and he’s not happy about it, the miserable git.
Sweeney Blog
No Sex, Please... It's the Super Bowl

2|3|10   |   2:24   |   2 comments


The Super Bowl ads that CBS rejected are turning up online, generating lots of attention but zero revenue for the broadcaster.