The Macrosite for News, Analysis and Opinion about the Future of the Internet

Web Analytics & Data Mining Must Converge

Written by Richard Boire
10/7/2009 9 comments

Are businesses actually seeing a convergence between the two disciplines of data mining (searching for patterns in sets of data) and Web analytics (determining how customers are using a company’s Websites)?

The prevailing school of thought is that data mining and Web analytics are indeed disparate disciplines. But why? After all, both require expertise in dealing with data. In most cases, the analytical deliverables are similar: Both disciplines attempt to better understand customer behavior through some key metrics, such as:

  • New customers
  • Repeat customers
  • Time of last activity
  • Frequency of activity
  • Monetary value of last activity
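
As a rough illustration, the metrics above can all be derived from the same kind of activity log, whether the activity is a store purchase or a Web order. The Python sketch below assumes a hypothetical list of (customer ID, date, amount) records and a hypothetical reporting period; the names `TRANSACTIONS`, `customer_metrics` and `period_start` are invented for this example and do not refer to any particular product.

```python
from collections import defaultdict
from datetime import date

# Hypothetical activity log: (customer_id, activity_date, amount)
TRANSACTIONS = [
    ("c1", date(2009, 3, 2), 40.0),
    ("c1", date(2009, 9, 15), 25.0),
    ("c2", date(2009, 9, 20), 60.0),
    ("c3", date(2009, 1, 5), 15.0),
]

def customer_metrics(transactions, period_start):
    """Summarise each customer's activity.

    Assumption: a customer is 'new' if first seen on or after period_start.
    """
    by_customer = defaultdict(list)
    for customer_id, day, amount in transactions:
        by_customer[customer_id].append((day, amount))

    metrics = {}
    for customer_id, rows in by_customer.items():
        rows.sort()  # chronological order
        first_day = rows[0][0]
        last_day, last_amount = rows[-1]
        metrics[customer_id] = {
            "is_new": first_day >= period_start,   # new customer
            "is_repeat": len(rows) > 1,            # repeat customer
            "last_activity": last_day,             # time of last activity
            "frequency": len(rows),                # frequency of activity
            "last_amount": last_amount,            # monetary value of last activity
        }
    return metrics

METRICS = customer_metrics(TRANSACTIONS, period_start=date(2009, 9, 1))
```

The same function works whichever channel produced the records, which is the sense in which the deliverables of the two disciplines overlap.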

To fully appreciate both disciplines, some background is needed.

Data miners have been analyzing data since Fair Isaac (now FICO) developed its first credit card risk model back in the 1950s. Since then, the use of data mining as a key business tool has evolved into other disciplines, most notably marketing.

Early data mining pioneers included such organizations as Reader’s Digest. As this technique matured within the direct marketing area, financial organizations like American Express became early adopters. Currently, the use of data mining has expanded to almost all other industry sectors, with financial institutions and telcos the most sophisticated users.

Data miners typically work with large volumes of data, and their expertise lies particularly in transaction-related data.

Meanwhile, the growth of Web analytics is a relatively new phenomenon, due entirely to the growth of the Internet. The early pioneers of this discipline came directly from the Internet space, with their original expertise being the design of Websites. Along with this expertise, practitioners of Web analytics acquired a deep understanding of the data environment -- but as it relates to the Internet. Web analysts also acquired an expertise in analyzing large volumes of data, but it was data specifically related to log files or page tags.

As you can see from the above, data mining and Web analytics are distinct disciplines. And it is this disparity that can be dangerous for enterprises that use them.

For one thing, the data nuances and subtleties of the Internet need to be fully understood if one is analyzing Web data. A lack of understanding of this type of data can lead to grossly incorrect results concerning customer behavior.

Data miners, for instance, do not want to look at the page view history of the user for a given session, but rather the page view history of the user over a period of time.

It is this longitudinal, or more historical, view of the customer that is lacking among Web analytics practitioners. Why? Viewing the customer historically and from a longitudinal perspective requires the development of customized marketing databases for a given Website. But the development of marketing databases is an expertise more typically found within the data mining discipline, not in Web analytics.
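
To make the longitudinal view concrete, here is a minimal Python sketch that tracks how one user's interest in a page category shifts from month to month, rather than within a single session. The page-view log, the category labels and the function name `monthly_category_share` are all hypothetical; a real marketing database would hold far more history and detail.

```python
from collections import Counter, defaultdict
from datetime import date

# Hypothetical page-view log: (user_id, view_date, page_category)
PAGE_VIEWS = [
    ("u1", date(2009, 4, 3), "history"),
    ("u1", date(2009, 4, 9), "history"),
    ("u1", date(2009, 10, 1), "history"),
    ("u1", date(2009, 10, 2), "cooking"),
    ("u1", date(2009, 10, 5), "travel"),
    ("u1", date(2009, 10, 6), "cooking"),
]

def monthly_category_share(page_views, user_id, category):
    """Return {(year, month): share of the user's page views in that category}."""
    counts = defaultdict(Counter)
    for uid, day, cat in page_views:
        if uid == user_id:
            counts[(day.year, day.month)][cat] += 1
    return {
        month: per_cat[category] / sum(per_cat.values())
        for month, per_cat in sorted(counts.items())
    }

SHARES = monthly_category_share(PAGE_VIEWS, "u1", "history")
```

In this made-up data the user's share of history pages falls from all views in April to a quarter of views in October, a trend that no single-session report would reveal.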

Another key difference: Data miners are typically practiced at integrating data from very disparate sources, while Web analytics practitioners are used to integrating data only when it is confined to the Web space.

But integrating online and offline data is vital to calculating factors such as ROI from purchase behavior.
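
A small, hedged example shows why. The Python sketch below assumes a hypothetical online campaign with a known cost, plus online and offline purchase records that share a customer ID. Every figure and name is invented for illustration, and crediting all of a reached customer's purchases to the campaign is a deliberate simplification.

```python
# Hypothetical inputs, all keyed by a shared customer ID.
CAMPAIGN_COST = 100.0                       # cost of an online campaign
CAMPAIGN_AUDIENCE = {"c1", "c2"}            # customers reached online
ONLINE_REVENUE = {"c1": 60.0, "c2": 30.0}   # purchases made on the Website
OFFLINE_REVENUE = {"c1": 50.0, "c3": 80.0}  # in-store purchases

def roi(revenue, cost):
    """Return on investment as a fraction of cost."""
    return (revenue - cost) / cost

def campaign_revenue(audience, *sources):
    """Sum revenue from each source for customers in the campaign audience."""
    return sum(source.get(cid, 0.0) for cid in audience for source in sources)

# ROI as seen from Web data alone.
ONLINE_ONLY_ROI = roi(
    campaign_revenue(CAMPAIGN_AUDIENCE, ONLINE_REVENUE), CAMPAIGN_COST
)

# ROI once offline purchases by the same customers are integrated.
COMBINED_ROI = roi(
    campaign_revenue(CAMPAIGN_AUDIENCE, ONLINE_REVENUE, OFFLINE_REVENUE),
    CAMPAIGN_COST,
)
```

With these invented numbers the campaign looks like a loss when only Web orders are counted and like a gain once the same customers' in-store purchases are added, so the conclusion depends on whether the two data sources are joined.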

Hopefully, the above comments reinforce the need to more effectively integrate the two disciplines of data mining and Web analytics. As long as both disciplines continue to have a silo-based approach to their work, solutions will be less than optimal.

It is only through a more centralized approach toward analytics that solutions can be truly optimized.

— Richard Boire is the founding partner at the Boire Filler Group.

Tags: Executive Analysis
DHagar
Thinkernetter
Tuesday October 13, 2009 8:40:26 PM

Good points and good perspective on the future needs. 

In reading your article, Richard, it appears that we need to broaden the use of the web beyond technical analytics to its value, in keeping with your perspective on longitudinal analysis with data mining. As you point out, the only difference is the channel of data capture; it is still about the customer and the broader sense of what we want to know.

Hopefully your line of thinking will help us expand in the use and value of the web.

DHagar

Michael P. Kassner
Thinkernetter
Wednesday October 7, 2009 4:28:38 PM

I understand the difference, now. I use Google Analytics for all of my posts and the data that I get is amazing and extremely useful.  I would be lost without it.

I do look at long-term information, which is why I was getting confused. It is a different viewpoint from a data miner's, as you kindly pointed out. Thank you.

Richard Boire
Thinkernetter
Wednesday October 7, 2009 4:00:17 PM

You are absolutely right, but my experience in the marketplace is that there is a disconnect between the offline world and the online world in terms of data and how it should be used.

robbie02494
Rank: Scrivener
Wednesday October 7, 2009 3:51:05 PM

From the massive amount of data that is pumped into the Internet each day to any data within the company, all data needs to be analyzed. The tools used for this purpose may vary, but the idea is to interpret all of the data to benefit the company in some way, whether by improving customer relationships or enhancing products.

Richard Boire
Thinkernetter
Wednesday October 7, 2009 3:45:11 PM

I will try as best I can. Data miners, coming from a database marketing environment, like to look at the customer longitudinally. This means that we want to look at customer trends over time, such as how their navigation behaviour has changed. For example, suppose I am on Amazon, and within the current month the percentage of pages I view pertaining to History books is 30%, but it was 100% six months ago. This tells me something about this customer's behaviour that I can potentially act on as a marketer.

Michael P. Kassner
Thinkernetter
Wednesday October 7, 2009 2:31:22 PM

Very interesting information, thank you Richard. I would appreciate you clearing up something for me. Your comment:

"Data miners, for instance, do not want to look at the page view history of the user for a given session, but rather the page view history of the user over a period of time."

I am trying to understand why data miners prefer the history over time.

 

 

Insultant
Thinkernetter
Wednesday October 7, 2009 1:42:39 PM

Richard, did you know that the CIA is funding a bunch of data analytics startups via its VC arm? Why do you think they are doing that?

Richard Boire
Thinkernetter
Wednesday October 7, 2009 11:01:54 AM

Mary, you are right. Data is data, and at the end of the day it is about identifying insights and intelligence from the data that lead to higher ROI.

Mary Jander
Thinkernetter
Wednesday October 7, 2009 10:44:14 AM

Any firm engaged in data mining is likely also interested in Web analytics and is probably spending a lot of money on both efforts. A lack of coordination and integration between these applications seems like an invitation for lost ROI on both of them.

The ThinkerNet does not reflect the views of TechWeb. The ThinkerNet is an informal means of communication to members and visitors of the Internet Evolution site. Individual authors are chosen by Internet Evolution to blog. Neither Internet Evolution nor TechWeb assume responsibility for comments, claims, or opinions made by authors and ThinkerNet bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.