The Macrosite for News, Analysis and Opinion about the Future of the Internet
DISCUSS   PRINT     Email This

Big Data

Introduction
Written by Doug Henschen
10/1/2011 5 comments

You don’t need petabytes of information to play in the big data league. The low end of the threshold is more like 10 TB, and, in fact, “big” doesn’t really tell the whole story. The many types of data and the speed at which data changes are, along with sheer volume, daunting challenges for businesses struggling to make sense of it all. Volume, variety, velocity – they’re the hallmarks of the big data era we’re now in.

Variety comes in the form of Web logs, wirelessly connected RFID sensors, unstructured textual information from social networks, and myriad other data types. Velocity breeds velocity. Fast-changing data drives demand for deep analytic insights delivered in hours, minutes, or, in extreme cases, seconds, instead of the weekly or monthly reports that once sufficed.

How are IT organizations coming to grips with data volume, variety, and velocity? Specialized databases and data warehouse appliances are part of the answer. Less heralded but also essential are information management tools and techniques for extracting, transforming, integrating, sorting, and manipulating data.

IT shops often break new ground with big data projects, as new data sources emerge and they try unique ways of combining and putting them to use. Database and data management tools are evolving quickly to meet these needs, and some are blurring the line between row and column databases.

Even so, available products don’t fill all the gaps companies encounter in managing big data. IT can’t always turn to commercial products or established best-practices to solve big data problems. But pioneers are proving resourceful. They’re figuring out how and when to apply different tools – from database appliances to NoSQL frameworks and other emerging information management techniques. The goal is to cope with data volume, velocity, and variety to not only prevent storage costs from getting out of control but, more importantly, get better insights faster.

Contents:

— Doug Henschen (dhenschen@techweb.com)

Next Page: History of Big Data

Channel: Enterprise IT
Tags: Analytics
DISCUSS   PRINT     Email This
Page 1 of 6 Next >
Current display:       newest comments first       display in chronological order
Mary Jander
Thinkernetter
Friday October 14, 2011 11:47:04 AM
no ratings

Good observation Paul. Big Data comes with issues of velocity, as the report describes. So real time seems a goal that won't be met for quite awhile. Still, as you note, it is heartening to see so much progress being made. Who knows what the next couple of years might bring?

DHagar
Thinkernetter
Thursday October 6, 2011 6:08:46 PM
no ratings

Im' with you, mhhfive, the growth opportunity is in the expanded use of data that extends the value to business and operational decision making.  As this happens, with more simple tools and trained users, the value will increase.  We don't as much need more, we just need to learn how to better use the data we are increasingly collecting.

DHagar

mhhfive
IQ Crew
Wednesday October 5, 2011 9:20:14 PM
no ratings

Several decades ago, "big data" meant a spreadsheet or a "very large" spreadsheet.... Then "big data" meant an immense database of transactional information that could only be handled by the largest computers.

At the rate that computer capabilities have grown, "big data" is getting almost unimaginatively huge. Former "big data" projects aren't considered that big anymore... and what's considered "big" is really immense.

But just like spreadsheets made lots of data manageable, there are now tools to make "big data" more manageable. However, there just aren't that many people who really need to use (or have the training to use) the software that can handle insanely large databases.

As people and technology collect more and more information, we'll need to analyze it and store it and make it useful... and making the tools to make data simpler to play with seems like the next big thing for big data. 

Paul Whyte
Researcher
Wednesday October 5, 2011 11:02:28 AM
no ratings

Hi Doug,

Thanks for a very educative report on big data. I am quite impressed with the strides IT has make in turning what was perceived to be a huge challenge to an amazing opportunity. 

Reading through your report and other online references on big data, I really don't think we can classify current “analytics” efforts as real-time. If by real-time we mean that  services act immediately — and intelligently — on information as it streams into the system, then we are far from that at the moment. 

Yahoo's CTO made an interesting point when he stated that:

"With the paths that go through Hadoop [at Yahoo!], the latency is about fifteen minutes. … [I]t will never be true real-time. It will never be what we call “next click,” where I click and by the time the page loads, the semantic implication of my decision is reflected in the page."

But as you rightly stated, the technology will get better but whether we can actually achieve real-time analytics remains to be seen. 

maxsahu
Rank: Cave Painter
Tuesday October 4, 2011 10:46:05 PM
no ratings

Really very thorough and well organized description of Big Data. Talks about different problems and how the companies have tried to solve those.

The ThinkerNet does not reflect the views of TechWeb. The ThinkerNet is an informal means of communication to members and visitors of the Internet Evolution site. Individual authors are chosen by Internet Evolution to blog. Neither Internet Evolution nor TechWeb assume responsibility for comments, claims, or opinions made by authors and ThinkerNet bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.
a moderated blogosphere of internet experts
Paul Korzeniowski
The smartphone market reached a significant milestone, a breakthrough that may cause vendors to celebrate but could strain the capabilities of IT service desks.
Maria Korolov
Maria Korolov   5/21/2013   8 comments
In the fall of 2011, around 160,000 students in 190 countries enrolled in a Stanford-sponsored online course about artificial intelligence. About 23,000 completed the course and got certificates, including 248 who got a perfect score. The university offered the same course the old-fashioned way to students sitting in Stanford classrooms. None of the those students got a perfect score.
Joe Stanganelli
As Mitch Wagner discussed today, Yahoo is acquiring Tumblr. The big Internet debate at the moment is whether Tumblr will be good or bad for Yahoo. Regardless of their stances on the future of Yahoo itself, many claim that Yahoo will somehow ruin Tumblr.
George Taylor
George Taylor   5/20/2013   10 comments
Has China stolen a march on the West, developing an Internet architecture that is not only based on IPv6, but is also inherently secure from both internal and external attack?
IETV: the thinkerNet on film
5
of
Kim Davis
Big-Data Can’t Always Sell Wine

5|21|13   |   2:23   |   3 comments


Whole Foods Global Wine Purchaser Doug Bell told me about some of the constraints on using analytics in the US wine market.
Paul J. Fleuranges
Digital Signage Keeps NYC Subway Straphangers on Track

5|6|13   |   3:51   |   No comments


New York's Metropolitan Transit Authority is conducting a pilot test of digital kiosks to guide subway users to where they want to go more efficiently and at lower cost.
Kim Davis
Fast Forward to the Future

4|23|13   |   2:29   |   20 comments


A look back at tech writing in the 90s makes us wonder where enterprise IT will be 20 years from now.
Mitch Wagner
Google Launches Its Most Depressing Service Yet

4|15|13   |   2:59   |   10 comments


Google's new Inactive Account Manager lets you control how Google disposes of your accounts when you die.
Second Shooter
Argument Over Top-Level Domains Is 'Stupid'

4|11|13   |   2:07   |   3 comments


The whole Amazon.reader debate is a double-stupid. It's stupid to think that there's any e-book buyer who doesn't know Amazon's URL, and it was stupider to let ICANN launch the whole free-form TLD initiative to start with.
Kim Davis
Ladies, Your Tablet Awaits

3|21|13   |   2:22   |   37 comments


ePad Femme is the world’s first tablet “made exclusively for women.”
Wisdom of the Big Chair
NFC Moves Into the Mainstream

3|20|13   |   2:16   |   No comments


While NFC's original goal was to enhance mobile commerce applications, it is finding its way into a number of other uses, which is creating both opportunity as well as challenges for IT departments.
Wisdom of the Big Chair
Integrating Security Into Your Cloud Contract

3|19|13   |   3:35   |   No comments


Enterprises would like to move to cloud computing but are hesitant because they are concerned about providers’ ability to secure company data. Here are some tips that help to ensure that if breaches occur, the business is not left holding the bag.
Brian Baron
How Edmunds.com Collects Customer Information

3|18|13   |   1:15   |   No comments


Edmunds separates customers into segments based on the info it collects on its site and from partners, and uses that to push out custom content, said Brian Baron, director of business analytics for Edmunds.com, at Predictive Analytics Innovation Summit.
Brian Baron
How Edmunds.com Uses Analytics to Customize Site

3|14|13   |   0:47   |   No comments


The automotive website uses propensity modeling to target ads and customer registration forms, said Brian Baron, director of business analytics for Edmunds.com, at Predictive Analytics Innovation Summit.
an IBM information resource
sponsored content
big blue blog
Alison Diana
Ushering in a new era of cognitive computing systems, IBM announced today the IBM Watson Engagement Advisor, a technology breakthrough that allows brands to crunch big data in record time to transform the way they engage clients in key functions such as customer service, marketing, and sales.
an IBM information resource
sponsored content
Expert Integrated Systems: Changing the Experience & Economics of IT
In this e-book, we take an in-depth look at these expert integrated systems -- what they are, how they work, and how they have the potential to help CIOs achieve dramatic savings while restoring IT's role as business innovator.

READ THIS eBOOK
your weekly update of news, analysis, and
opinion from Internet Evolution - FREE!

REGISTER HERE
Wanted! Site Moderators
Internet Evolution is looking for a handful of readers to help moderate the message boards on our site – as well as engaging in high-IQ conversation with the industry mavens on our thinkerNet blogosphere. The job comes with various perks, bags of kudos, and GIANT bragging rights. Interested?

Please email: moderators@internetevolution.com
Internet Evolution – not for thickies
Keep Critical Data With a Knowledge Management System
Taimoor Zubair
Fortune 500 companies lose at least
$31.5 billion a year by failing to share knowledge. A Knowledge Management System (KMS) can help companies significantly reduce these costs.

CLICK FOR MORE
Yahoo Needs to Break Tumblr in Order to Fix It
Joe Stanganelli
As
Mitch Wagner discussed today, Yahoo is acquiring Tumblr. The big Internet debate at the moment is whether Tumblr will be good or bad for Yahoo. Regardless of their stances on the future of Yahoo itself, many claim that Yahoo will somehow ruin Tumblr.

CLICK FOR MORE