The Macrosite for News, Analysis and Opinion about the Future of the Internet
Richard Boire

Web Analytics & Data Mining Must Converge

Written by Richard Boire
10/7/2009 9 comments
no ratings
DISCUSS     Email This

Are businesses actually seeing a convergence between the two disciplines of data mining (searching for patterns in sets of data) and Web analytics (determining how customers are using a company’s Websites)?

The prevailing school of thought is that data mining and Web analytics are indeed disparate disciplines. But why? After all, both require expertise in dealing with data. In most cases, the analytical deliverables are similar: Both disciplines attempt to better understand customer behavior through some key metrics, such as:

  • New customers
  • Repeat customers
  • Time of last activity
  • Frequency of activity
  • Monetary value of last activity

In order to fully appreciate both disciplines, some background is in order here.

Data miners have been analyzing data since Fair Isaac (now FICO) developed its first credit card risk model back in the 1950s. Since then, the use of data mining as a key business tool has evolved into other disciplines, most notably marketing.

Early data mining pioneers included such organizations as Reader’s Digest. As this technique matured within the direct marketing area, financial organizations like American Express became early adopters. Currently, the use of data mining has expanded to almost all other industry sectors, with financial institutions and telcos the most sophisticated users.

Data miners typically work with large volumes of data, and their expertise lies particularly in transaction-related data.

Meanwhile, the growth of Web analytics is a relatively new phenomenon, due entirely to the growth of the Internet. The early pioneers of this discipline came directly from the Internet space, with their original expertise being the design of Websites. Along with this expertise, practitioners of Web analytics acquired a deep understanding of the data environment -- but as it relates to the Internet. Web analysts also acquired an expertise in analyzing large volumes of data, but it was data specifically related to log files or page tags.

As you can see from the above, data mining and Web analytics are distinct disciplines. And it is this disparity that can be dangerous for enterprises that use them.

For one thing, the data nuances and subtleties of the Internet need to be fully understood if one is analyzing Web data. A lack of understanding of this type of data can lead to grossly incorrect results concerning customer behavior.

Data miners, for instance, do not want to look at the page view history of the user for a given session, but rather the page view history of the user over a period of time.

It is this longitudinal, or more historical, view of the customer that is lacking amongst Web analyst practitioners. Why? The capability of viewing the customer historically and from a longitudinal perspective requires the development of customized marketing databases for a given Website. But the development of marketing databases is an expertise that is more typically found within the data mining discipline, not in Web analytics.

Another key difference: Data miners are typically disciplined in integrating data from very disparate sources, while Web analytics practitioners are used to integrating data as long as it is confined to the Web space.

But integrating online and offline data is vital to calculating factors such as ROI from purchase behavior, for instance.

Hopefully, the above comments reinforce the need to more effectively integrate the two disciplines of data mining and Web analytics. As long as both disciplines continue to have a silo-based approach to their work, solutions will be less than optimal.

It is only through a more centralized approach toward analytics that solutions can be truly optimized.

— Richard Boire is the founding partner at the Boire Filler Group.

Channel:
Tags: Executive Analysis
DISCUSS     Email This
Current display:       newest comments first       display in chronological order
DHagar
Thinkernetter
Tuesday October 13, 2009 8:40:26 PM
no ratings

Good points and good perspective on the future needs. 

In reading your article, Richard, it appears that we need to broaden the web use beyond the technical analytics to the value, in keeping with your perspective on longitudinal analysis with data mining.  As you point out, the only difference is the channel of data capturing, it is still dealing with the customer and the broader sense of what we want to know.

Hopefully your line of thinking will help us expand in the use and value of the web.

DHagar

Michael P. Kassner
Thinkernetter
Wednesday October 7, 2009 4:28:38 PM

I understand the difference, now. I use Google Analytics for all of my posts and the data that I get is amazing and extremely useful.  I would be lost without it.

I do look at long-term information, that is why I was getting confused. It is a different viewpoint than a data miner as you kindly pointed out. Thank you.

 

Richard Boire
Thinkernetter
Wednesday October 7, 2009 4:00:17 PM

You are absolutely right but my experience in the marketplace is that there is a disconnect in terms of data and how it should be used  between the offline world and online world.

robbie02494
IQ Crew
Wednesday October 7, 2009 3:51:05 PM

From the massive amount of data that is pumped into the Internet each day to any data within the company, every data needs to be analyzed. The tools used for this purpose may vary but the idea is to benefit from all the data and interpret it to benefit the company in any way, either to improve customer relationship or product enhancement.

Richard Boire
Thinkernetter
Wednesday October 7, 2009 3:45:11 PM

I will try as best I can. Data Miners, coming from a database marketing environment, like to look at the customer longitudinally. This means  that we want to look at  customer trends over time such as how their navigation behaviour has changed over time. For example, Suppose I am on Amazon,

and within the current month, the % of pages that I view pertaining to History books is 30%, but let's say it was 100% six months ago. This is telling me something about this customer's behaviour that I can potentially action on as a marketer.

 

   

Michael P. Kassner
Thinkernetter
Wednesday October 7, 2009 2:31:22 PM

Very interesting information, thank you Richard. I would appreciate you clearing up something for me. Your comment:

"Data miners, for instance, do not want to look at the page view history of the user for a given session, but rather the page view history of the user over a period of time."

I am trying to understand why data miners prefer the history over time.

 

 

Insultant
Thinkernetter
Wednesday October 7, 2009 1:42:39 PM

Richard, did you know that the CIA is funding a bunch of data analystics startups via its VC arm? Why do you think they are doing that?

Richard Boire
Thinkernetter
Wednesday October 7, 2009 11:01:54 AM

Mary, you are right. Data is data and at the end of the day, it is about identifying insights and intelligence from the dayta that leads to higher ROI.   

Mary Jander
Thinkernetter
Wednesday October 7, 2009 10:44:14 AM

Any firm engaged in data mining is likely also interested in Web analytics and is probably spending a lot of money on both efforts. A lack of coordination and integration between these applications seems like an invitation for lost ROI on both of them.

The ThinkerNet does not reflect the views of TechWeb. The ThinkerNet is an informal means of communication to members and visitors of the Internet Evolution site. Individual authors are chosen by Internet Evolution to blog. Neither Internet Evolution nor TechWeb assume responsibility for comments, claims, or opinions made by authors and ThinkerNet bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.
a moderated blogosphere of internet experts
David Weldon
David Weldon   5/22/2013   9 comments
In the 1970 science fiction thriller Colossus: The Forbin Project, two giant supercomputers from the United States and Soviet Union secretly join forces to take control of the collective nuclear might of the two countries. In the film, the two machines discover each other's existence, communicate back-and-forth, share their collective data, and cut their human creators out of the process. It is the ultimate example of machine-to-machine communications, or M2M.
Paul Korzeniowski
The smartphone market reached a significant milestone, a breakthrough that may cause vendors to celebrate but could strain the capabilities of IT service desks.
Maria Korolov
Maria Korolov   5/21/2013   16 comments
In the fall of 2011, around 160,000 students in 190 countries enrolled in a Stanford-sponsored online course about artificial intelligence. About 23,000 completed the course and got certificates, including 248 who got a perfect score. The university offered the same course the old-fashioned way to students sitting in Stanford classrooms. None of the those students got a perfect score.
Joe Stanganelli
As Mitch Wagner discussed today, Yahoo is acquiring Tumblr. The big Internet debate at the moment is whether Tumblr will be good or bad for Yahoo. Regardless of their stances on the future of Yahoo itself, many claim that Yahoo will somehow ruin Tumblr.
IETV: the thinkerNet on film
5
of
Kim Davis
Big-Data Can’t Always Sell Wine

5|21|13   |   2:23   |   4 comments


Whole Foods Global Wine Purchaser Doug Bell told me about some of the constraints on using analytics in the US wine market.
Paul J. Fleuranges
Digital Signage Keeps NYC Subway Straphangers on Track

5|6|13   |   3:51   |   No comments


New York's Metropolitan Transit Authority is conducting a pilot test of digital kiosks to guide subway users to where they want to go more efficiently and at lower cost.
Kim Davis
Fast Forward to the Future

4|23|13   |   2:29   |   20 comments


A look back at tech writing in the 90s makes us wonder where enterprise IT will be 20 years from now.
Mitch Wagner
Google Launches Its Most Depressing Service Yet

4|15|13   |   2:59   |   10 comments


Google's new Inactive Account Manager lets you control how Google disposes of your accounts when you die.
Second Shooter
Argument Over Top-Level Domains Is 'Stupid'

4|11|13   |   2:07   |   3 comments


The whole Amazon.reader debate is a double-stupid. It's stupid to think that there's any e-book buyer who doesn't know Amazon's URL, and it was stupider to let ICANN launch the whole free-form TLD initiative to start with.
Kim Davis
Ladies, Your Tablet Awaits

3|21|13   |   2:22   |   37 comments


ePad Femme is the world’s first tablet “made exclusively for women.”
Wisdom of the Big Chair
NFC Moves Into the Mainstream

3|20|13   |   2:16   |   No comments


While NFC's original goal was to enhance mobile commerce applications, it is finding its way into a number of other uses, which is creating both opportunity as well as challenges for IT departments.
Wisdom of the Big Chair
Integrating Security Into Your Cloud Contract

3|19|13   |   3:35   |   No comments


Enterprises would like to move to cloud computing but are hesitant because they are concerned about providers’ ability to secure company data. Here are some tips that help to ensure that if breaches occur, the business is not left holding the bag.
Brian Baron
How Edmunds.com Collects Customer Information

3|18|13   |   1:15   |   No comments


Edmunds separates customers into segments based on the info it collects on its site and from partners, and uses that to push out custom content, said Brian Baron, director of business analytics for Edmunds.com, at Predictive Analytics Innovation Summit.
Brian Baron
How Edmunds.com Uses Analytics to Customize Site

3|14|13   |   0:47   |   No comments


The automotive website uses propensity modeling to target ads and customer registration forms, said Brian Baron, director of business analytics for Edmunds.com, at Predictive Analytics Innovation Summit.
an IBM information resource
sponsored content
big blue blog
an IBM information resource
sponsored content
Expert Integrated Systems: Changing the Experience & Economics of IT
In this e-book, we take an in-depth look at these expert integrated systems -- what they are, how they work, and how they have the potential to help CIOs achieve dramatic savings while restoring IT's role as business innovator.

READ THIS eBOOK
your weekly update of news, analysis, and
opinion from Internet Evolution - FREE!

REGISTER HERE
Wanted! Site Moderators
Internet Evolution is looking for a handful of readers to help moderate the message boards on our site – as well as engaging in high-IQ conversation with the industry mavens on our thinkerNet blogosphere. The job comes with various perks, bags of kudos, and GIANT bragging rights. Interested?

Please email: moderators@internetevolution.com
Internet Evolution – not for thickies
Keep Critical Data With a Knowledge Management System
Taimoor Zubair
Fortune 500 companies lose at least
$31.5 billion a year by failing to share knowledge. A Knowledge Management System (KMS) can help companies significantly reduce these costs.

CLICK FOR MORE
M2M: Rise of the Machines? Not Yet
David Weldon
In the 1970 science fiction thriller
Colossus: The Forbin Project, two giant supercomputers from the United States and Soviet Union secretly join forces to take control of the collective nuclear might of the two countries. In the film, the two machines discover each other's existence, communicate back-and-forth, share their collective data, and cut their human creators out of the process. It is the ultimate example of machine-to-machine communications, or M2M.

CLICK FOR MORE
M2M: Rise of the Machines? Not Yet
David Weldon
In the 1970 science fiction thriller
Colossus: The Forbin Project, two giant supercomputers from the United States and Soviet Union secretly join forces to take control of the collective nuclear might of the two countries. In the film, the two machines discover each other's existence, communicate back-and-forth, share their collective data, and cut their human creators out of the process. It is the ultimate example of machine-to-machine communications, or M2M.

CLICK FOR MORE
M2M: Rise of the Machines? Not Yet
David Weldon
In the 1970 science fiction thriller
Colossus: The Forbin Project, two giant supercomputers from the United States and Soviet Union secretly join forces to take control of the collective nuclear might of the two countries. In the film, the two machines discover each other's existence, communicate back-and-forth, share their collective data, and cut their human creators out of the process. It is the ultimate example of machine-to-machine communications, or M2M.

CLICK FOR MORE
M2M: Rise of the Machines? Not Yet
David Weldon
In the 1970 science fiction thriller
Colossus: The Forbin Project, two giant supercomputers from the United States and Soviet Union secretly join forces to take control of the collective nuclear might of the two countries. In the film, the two machines discover each other's existence, communicate back-and-forth, share their collective data, and cut their human creators out of the process. It is the ultimate example of machine-to-machine communications, or M2M.

CLICK FOR MORE
Yahoo Needs to Break Tumblr in Order to Fix It
Joe Stanganelli
As
Mitch Wagner discussed today, Yahoo is acquiring Tumblr. The big Internet debate at the moment is whether Tumblr will be good or bad for Yahoo. Regardless of their stances on the future of Yahoo itself, many claim that Yahoo will somehow ruin Tumblr.

CLICK FOR MORE