
Multimedia Content: The Semantic Web Challenge

Written by Tom Wilde
4/14/2008 8 comments

As the Web continues to evolve, there has been one key constant: the importance of text in driving search and navigation across the Internet.  It’s not surprising that text remains the most efficient and effective method of communicating “meaning.”  As we move toward a “Semantic” Web, text becomes no less important.  But how will the Semantic Web account for the vast amount of audio and video moving onto the Web?

The promise of the Semantic Web is to take text and add structure to create an additional layer of meaning around a content object.  The principal driver here is that, for the immediate future, we are stuck with computers that are still relatively “dumb” in terms of being able to discern context.  Text, combined with structure, gives computers the bootstrap they need and creates a superior search and navigation experience into the future.

One of the biggest challenges that content producers face today is ensuring they create a rich “wrapper” of meta data around each content object, be it an article, a photograph, a podcast, or a video.  Meta data, in text form, is really the “currency” of the Semantic Web.  Increasingly, search engines and other navigational metaphors will rely on rich meta data for content discovery, presentation, contextualization, and ad targeting.
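To make the “wrapper” idea concrete, here is a minimal sketch in Python. The class name, field names, and example URL are hypothetical, chosen only for illustration; they are not a standard Semantic Web schema or anything a particular search engine requires.

```python
from dataclasses import dataclass, field, asdict
import json


@dataclass
class ContentObject:
    """Hypothetical meta data wrapper; all field names are illustrative."""
    url: str
    media_type: str  # e.g. "article", "photograph", "podcast", "video"
    title: str
    tags: list[str] = field(default_factory=list)
    categories: list[str] = field(default_factory=list)
    transcript: str = ""  # text form of the audio track, if one exists

    def searchable_text(self) -> str:
        # Flatten every text field into one lowercase string a crawler could index.
        parts = [self.title, *self.tags, *self.categories, self.transcript]
        return " ".join(parts).lower()


# A made-up video clip described entirely in text.
clip = ContentObject(
    url="http://example.com/clips/42",
    media_type="video",
    title="Market Update",
    tags=["stocks", "earnings"],
    categories=["Finance"],
    transcript="Shares rose after the earnings call.",
)

# Serialized, the wrapper is plain structured text that can travel with the file.
print(json.dumps(asdict(clip), indent=2))
```

The point of the sketch is only that the video itself is opaque, while everything in the wrapper is text plus structure.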

One of the great strategic advantages of Google (Nasdaq: GOOG) has been PageRank, which is really just an über set of meta data.  In this case, PageRank interprets which sites’ pages link to each other and then uses that to calculate some notion of “authority” and popularity.  As Google has all of this data, and each individual content owner can only see a narrow slice of it, the search engine has developed a tremendously asymmetrical view of the Web and has exploited it in its own favor.
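For readers who want to see how links can turn into “authority,” below is a rough Python sketch of the textbook, simplified form of PageRank computed by repeated iteration. It is not Google's production system; the damping value of 0.85, the iteration count, and the four-page link graph are assumptions picked for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank over a dict mapping page -> list of pages it links to."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start with equal authority everywhere

    for _ in range(iterations):
        # Every page keeps a small baseline share regardless of inbound links.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its current authority among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outbound links spreads its authority evenly.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Made-up link graph: three pages link to "c", and "c" links back to "a".
links = {"a": ["c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this toy graph the page with the most inbound links ends up with the highest score, and the page it links to inherits much of that authority. Only someone holding the whole link graph can run this calculation, which is the asymmetry described above.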

Behavioral targeting and collaborative filtering are also big winners in this scenario.  These approaches are, however, exposed to the classic risk of “garbage in, garbage out.”  Behavioral targeting and collaborative filtering rely on two elements in order to be successful: 1) deep knowledge of the user; and 2) deep knowledge of the content.  It’s a recursive problem in that deep knowledge of the user is often derived by observing the content they consume.  Thus, high-quality meta data surrounding a content object ensures robust behavioral profiles, whether for ad targeting, recommendations, rankings, etc.
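The dependence of the user profile on content meta data can be sketched in a few lines of Python. This is a deliberately simple content-based profile scored with cosine similarity, not a full collaborative filtering system, and the viewing history, clip names, and tags are invented for the example.

```python
from collections import Counter
import math


def build_profile(viewed_tags):
    """Derive a user profile by counting the tags of everything the user consumed."""
    profile = Counter()
    for tags in viewed_tags:
        profile.update(tags)
    return profile


def cosine(a, b):
    """Cosine similarity between two tag-count vectors; 0.0 if either is empty."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical viewing history: the tags attached to two clips the user watched.
history = [["stocks", "earnings"], ["stocks", "tech"]]
profile = build_profile(history)

# Hypothetical candidates; "clip-3" has no meta data at all.
candidates = {
    "clip-1": ["stocks", "earnings"],
    "clip-2": ["cooking"],
    "clip-3": [],
}
scores = {name: cosine(profile, Counter(tags)) for name, tags in candidates.items()}
best = max(scores, key=scores.get)
print(best, scores)
```

Note what happens to the untagged clip: with no meta data it scores zero against every profile, however relevant its actual content might be. That is “garbage in, garbage out” in miniature.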

Further, these benefits are not just present on the Internet.  Increasingly, large enterprises, especially those in the entertainment business, will need to ensure that all of their audio and video content has a consistent and complete set of meta data to unlock and exploit the types of distribution, syndication, and advertising opportunities that are rapidly emerging on the Web.

As we move toward a Web increasingly filled with multimedia, the Semantic Web becomes even more important, as these content objects don’t natively present themselves as text.  Current approaches of using editors to write descriptive titles and add tags are a good start, but they typically run out of gas short of the mark because they are expensive and do not scale.  The ability to attribute these objects in a scalable way with text in the form of tags, categories, and transcripts is critical to plug this content into the Semantic Web and “get found.”

Tom Wilde, CEO, EveryZing 

Paul Whyte
Researcher
Wednesday April 16, 2008 9:48:09 PM

I could not fathom your statement that we should not enslave ourselves to "architectures" just after admonishing us to be creative. Is the design of today's internet technology not guided by internet architecture? Don't you think that without architecture the internet is likely to become decreasingly effective and will often fail to meet the demands placed on it by applications? So if we can't enslave ourselves to architecture, don't you think we may be slowly killing the internet?

I'm sorry if I missed the point you were driving at in your last paragraph on the grounds that you used parentheses on the word architecture.

hounhosp
Researcher
Wednesday April 16, 2008 6:11:30 PM

The road to the Semantic Web is obviously difficult and challenging. The vision of having computers "understand" the current web content "is certainly grandiose and intriguing." As Alex Iskold writes in Semantic Web: Difficulties with the Classic Approach, "the technical, scientific and business difficulties are substantial, and to overcome them, there needs to be more community support, standards and pushing. This is not likely to happen unless there are more clear reasons for it." There is a future for the new web application, and that future will certainly be bright if people keep on working on it.

jabailo
IQ Crew
Wednesday April 16, 2008 5:55:51 PM

Back in 1994, I gave a speech at Mecklermedia's Internet World Conference, trying to put the Web and Internet in the spectrum of McLuhan rankings of media. As you remember, he thought of television as cool and type as hot.

I said that although there was hypertext and graphics, the primary bent of the Internet was literary...hence, the Internet is a Medium Hot ... medium.   Cold media are fuzzy, indistinct, free flow...like jazz music.   Hot media are definite, solid.

As far as "killer apps" -- well, it's never really been about that 99% of the time. And semantics or text wrappers...how about the Text itself? How about really great writing?

This is what frustrates the Architects and makes the Artists happy. There is no overarching architecture. The Internet is a negatively curved space that is completely open ended. As soon as someone tries to wrap his arms around it and "embrace it", 10 other people break the chain.

Let's stay free. Let's create. Let's not enslave ourselves to "architectures".

 

Paul Whyte
Researcher
Tuesday April 15, 2008 3:41:20 PM

Hi Tom,

Thanks for your comments. I think the answer to the core question is simply that the internet needs a better search engine. It will be difficult or almost impossible to make videos better in order to suit existing search engines. I also agree with you that vertical search engines may not be the ideal solution here. Now, what is your take on Google's universal search:

Universal & Blended Vertical Search

Do you think multimedia search will be the killer consumer application for the semantic web? The semantic web has so much promise, but can we realistically see in the near future a semantic search engine that can improve users' search experience in multimedia content?

Semantic Web: What Is The Killer App?

RPR
IQ Crew
Tuesday April 15, 2008 9:53:37 AM

Ideally various individuals (perhaps including a parent of the Internet: who was the second Chair of the Internet Architecture Board and who is responsible for identifying new enabling technologies and applications on the Internet for Google), who have been part of the progressive content growth and changes (e.g., increase in video content), have been proactively exercising engine refinements ???

Where there's need to overcome challenges, an ounce or collective bits of vision, prevention and teamwork (e.g., perhaps involving Tom, who has served on the IAB Search Engine Committee), may be worth pounds of cure.

Tom Wilde
Thinkernetter
Monday April 14, 2008 8:12:40 PM

I think there's a core question as to whether the Internet needs a better search engine for videos, or rather if videos need to be better for search engines.  I am not a believer in vertical search as a solution here, in that consumers continue to use Google, Yahoo etc to find all kinds of information, regardless of format.  The ability to properly attribute multimedia and associate transcripts, tags etc with those files is essential to their discoverability and navigation online.  As an example, MarketWatch's ability to organize their multimedia in a manner that is "crawlable" (http://video.marketwatch.com) is essential to users finding their high quality content.

 -Tom

Paul Whyte
Researcher
Monday April 14, 2008 6:31:25 PM

Hi Tom,

It is true that multimedia content is becoming dominant on the web, and this poses serious challenges to making such data accessible, reusable, searchable and manageable. Theoretically, the semantic web seems to be the "panacea" to all our current web difficulties/challenges. Many people have their reservations about what can be achieved practically with the semantic web. Whilst the semantic web may be the most promising in this respect, I'm just wondering what your take is on the current multimedia search engines available. I know a few of them employ semantic analysis, but others, like the one mentioned in the following article, can really be improved upon:

Millions of Videos, and Now a Way to Search Inside Them

So how can we improve upon the performance of these multimedia search engines?

 

RPR
IQ Crew
Monday April 14, 2008 12:44:21 PM

Given various needs (e.g., to overcome expensive and non-scalable approaches), content delivery economics could become increasingly popular, perhaps particularly relative to these words within the post.

Increasingly, large enterprises, especially those in the entertainment business, will need to ensure that all of their audio and video content has a consistent and complete set of meta data to unlock and exploit the types of distribution, syndication, and advertising opportunities that are rapidly emerging on the Web.

The ThinkerNet does not reflect the views of TechWeb. The ThinkerNet is an informal means of communication to members and visitors of the Internet Evolution site. Individual authors are chosen by Internet Evolution to blog. Neither Internet Evolution nor TechWeb assume responsibility for comments, claims, or opinions made by authors and ThinkerNet bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.