The Macrosite for News, Analysis and Opinion about the Future of the Internet
Yihong Ding

Data Portability: The Next Great Frontier for the Web

Written by Yihong Ding
2/11/2008 11 comments
no ratings
1 saves
DISCUSS   Digg   Del.icio.us   Reddit   Email This   TWEET THIS

The DataPortability workgroup, which supports the portability of user identities, photos, videos and other forms of personal data, has garnered widespread attention in recent months. Major Internet companies, such as Google (Nasdaq: GOOG), Facebook , and even Microsoft Corp. (Nasdaq: MSFT), have joined this initiative. But to achieve the eventual goal, it will be necessary to implement consistent ways to present this data and manage its ownership, beyond portability. 

In Web 2.0, we often leave personal data behind after we join various Website services. Typically, this data includes personal registration information, such as the names, addresses, and emails, and individual user-generated content -- social tags, comments, and short instant messages. Until now, this data is scattered over the Web, and there is no generic mechanism to manage it.  

Any eventual solution to data portability among trusted online tools and vendors must contain two key components -- a uniform presentation of portable data and an authority switch mechanism to manage ownership of that data. This authority switch mechanism is related to delicate security topics and content management issues. I have suggested a solution called Automatic Character Switch (ACtS). But many details are still under study.  

The wide variety of data presentation is a main obstacle to exporting data from one Website to another. In order to make data portability on the Web efficient, we must have a standard way of presenting various data so that we can produce uniform interfaces for data transportation across different site boundaries.  

Additionally, when we export data from one site to another, the ownership of the data must be transported simultaneously. Users should be allowed to continuously claim ownership of their data, no matter where it has been exported. At the same time, the destination sites sharing ownership of the data should be allowed to permit users to give up their ownership to the public domain or designated members. 

Solving this data portability issue with standards and ownership management is a crucial step towards the post-Web 2.0 era. The successful final solution will have an impact on the Web in the following ways:   

(1)  The Web will gradually become a collective network of user-managed mini-Webs, or “internal, self-managed Webs" that focus on the interest of the individuals, compared to the current Web of community-run platforms. When Web data is freely portable over the Web, users will have much more flexibility to reorganize the data of interest into their own perspectives.  

This trend will become a phenomenon in the next-generation Web; and blogs will be replaced by the invention of this technology -- just as blogs have replaced the role of home pages.   

(2)  The dominating interpretation of Web data will be switched from the writer-oriented view to the reader-oriented view. Users may be more interested in what the majority of data consumers think about a particular piece of data, rather than what the original producer thinks of that data.  

When Web data becomes portable, we may automatically gather various user-generated views about the common data. As a result, developers can produce more user-friendly applications on Web data. Moreover, varied views of the same data will turn out to be a true initiative of the Semantic Web.  

(3)  Web searching will experience a revolutionary upgrade because of data portability. An immediate challenge to Web search is to figure out new ways of ranking Web data, especially when many identical copies come from various sites. Normally, it wouldn’t be critical to identify the origin of the source in such search results.  

By contrast, various meanings of similar portable data deployed by different users might be more relative to the goal of Web search. Web search filtered by heterogeneous views will gradually replace the current Web search strategy that is dominated by homogeneous keyword matching. This transformation may lead to significant change of the current Web search infrastructure.   

The progress of data portability will eventually change the Web in more significant ways than many of us could imagine. This is a field that academic researchers or technology entrepreneurs must pay more attention to. Once leading companies are committed to implementing widespread data portability, the eventual solution of this issue might signal the coming of Web 3.0.

— Yihong Ding, Semantic Web researcher and blogger

DISCUSS   Digg   Del.icio.us   Reddit   Email This
Current display:       newest comments first       display in chronological order
Page 1 of 2   Next >
Yihong Ding
Thinkernetter
Tuesday February 19, 2008 11:38:48 AM
no ratings

Hi Carver,

There may be various ways to present portable data, and indeed people are working on the stardards now. At this moment, I cannot precisely tell you that which one is the best and would be adopted in the future.  But I can try to explain a little bit on how portable data might be like.

In order to present portable data, we may need to first encapsulate the data. The "tagging" mechanism you have mentioned is a way to encapsulate. By encapsulating the data, the individual portable data pieces may become things such as the portable Web widgets you have seen right now. One different thing is that these portable data pieces are also associated with explicit ownership declarations, i.e., the explicit specification of who own these data. When all these stardards could have been finalized, the portable data themselves can be stored at anywhere on the Web with their explicit ownership specifications (either owned by specified private members or by the public domain). The owner of a portable data piece can have privilege to manipulate the data besides reading.

In such a web of portable data, automated services can help automatically import and export data between varied sites and the owners can have a single uniform interface on controlling their owned portable data no matter how many times and how many places these data have been exported.

Hopefully these simple explanations help.

cheers,

 

Yihong 

 

 

 

cct299
Rank: Cave Painter
Monday February 18, 2008 6:30:30 PM
no ratings

Hi, I think the idea of having portable info is interesting, but I am sill having a little trouble understanding how it would work.  Would your info be "tagged" the same way information would be on the semantic web?  Would your information be stored somewhere permanently outside of your computer?  Would the site you are trying to input information to be able to identify your info on its own or would you have to tell it to do so?  Thank you,

- Carver 

Yihong Ding
Thinkernetter
Wednesday February 13, 2008 11:48:13 AM
no ratings

Hi Syamant,

Thank you for the questions.

1. What does it mean to the individual to have an "internal self managed web" around their interests. Does it mean content+services and what is the responsibility of the individual and how is the individual secure.

You have mentioned all the critical issues in this predicted future product. After the Web truly becomes a Web of portable data, users can store the references of their data of interest in their own home-spaces (compared to homepages). At the same time, these portable data represent respective Web nodes. So eventually, the content in each home-space becomes a network of individually selected Web nodes; this is the mini-Web or "internal self managed web" I mentioned in the post.

The management of these mini-Webs are not trivial at all. We need to consider many issues such as the ones you have pointed out. The entire challenge is close to the original challenge of operating systems. But I would rather not call this management mechanism something as the "Web operating system" because it is fundamentally different from operating systems in single machines. By contrast, I would like to call it the Web resource management system, which is closer to the real purpose. 

 

2. Does the notion of "reader-oriented view" restrict the individual to a tunnel vision of the world? On the positive side, would the individual have enough data to research a best selling book!

Ideally, every user may have his own "reader-oriented view". When he explore the Web, this view guides the organizing of the information he has explored. And this view is the core to organize the self-interested information in his own mini-Web.

We need some tools to help users abstract their interest and help them build up their "reader-oriented view". In my Web evolution series, I call this process of abstraction the education process on the Web. Note that in this process, our educational objects are not humans but machines.  Humans need education to be knowledgeable; and so do machines. I doubt the top-down approach of designing intelligent agents by few experts in very few companies. This approach is not scalable to the variety of human interest. By contrast, the bottom-up approach that allows individuals to build up their own "mini-Web" that consist of their own "reader-oriented view" may be much more practical in reality. 

So, would individuals have enough data to research a best selling book? It depends on how well the individuals have "educated" their machines. Just like in our human world, if you ask whether you can ask someone to help you research a best selling book, it depends on how well this being asked person has been educated and trained in this particular job.

 

3. Will data portability open up search options or will it create a very large monopoly. What are your thoughts on open-id, data portability and monopolies 

This progress of data portability is opposite to any super-size monopoly, I think. As I have emphasized many times, one inevitable consequence of this progress is the emergence of mini-Webs. The importance of individuals is increasing and the importance of monopolies (in our current sense) is decreasing.

But some new types of monopolies may emerge. By contrast to the current ones that dominating physical resources (such as data or services), the new type of monopolies will aim to dominate human resources on the Web, i.e., the rights of taking benefits from the individual mini-Webs. For instance, once you have trained your mini-Web for our interest, the monopoly will contact you and ask the exclusive rights of hiring your mini-Web into his region. Certainly the monopoly will pay you when hiring your mini-Web (just like you have been hired by a real-world company), but at the same time, it declares the exclusive ownership of your human knowledge. By this mean, the current Google or Facebook (if they do no upgrade accordingly) will be swapped out of our sight by these new powers. Internet evolution is a crude process.

OpenID is another important initiative that is mutual complement to the data portability initiative. 

 

4. What are the likely challenges in distinguishing truth and lies in these situations

There are many challenges. As I said before, one of the greatest challenge is probably the research on Web resource management mechanism. Data portability is a critical component within this general plot. But the whole thing is much more complicated than just data portability. I expect that this research will be as critical as the original operating system research at the beginning of the PC era. As the result, some new companies that is similar to Microsoft may emerge that dominates this technology as Microsoft has dominated the operating system for personal computers.

We still need a few years to watch real breakthrough in this direction. But the study on data portability has really rang the bell in contrast to the original discussion of Web operating systems that is nearly nonsense with respect to real-world applications. 

Yihong 

experiences
IQ Crew
Wednesday February 13, 2008 3:10:57 AM
no ratings

Hi Yihong

Looking at it from the individual's perspective , some questions come to mind ..

1. What does it mean to the individual to have an "internal self managed web" around their interests. Does it mean content+services and what is the responsibility of the individual and how is the individual secure.

2. Does the notion of "reader-oriented view" restrict the individual to a tunnel vision of the world? On the positive side, would the individual have enough data to research a best selling book!

3. Will data portability open up search options or will it create a very large monopoly. What are your thoughts on open-id, data portability and monopolies 

4. What are the likely challenges in distinguishing truth and lies in these situations

Syamant 

James Johnson
Staff
Tuesday February 12, 2008 11:55:37 PM
no ratings

Hey Paul,

Your skepticism about the big Internet names that are jumping on data portability bandwagon rings loud and clear. Are rival companies like Facebook, Google and Microsoft really interested in developing a standard in sharing the personal data of their customers? The companies’ support clearly has social media enthusiasts happy, but many questions still remain about their sincerity.  

In our editorial meeting on Monday, Steve Saunders asked me the same question: Why would these companies support an initiative that goes against the core of their business models? I intend to find the answer in a future Thinkernet article.

In a way, a data portability litmus test actually occurred earlier this year when popular blogger Robert Scoble was given an alpha version of Plaxo’s Facebook contact data importer, and the Facebook engine shut him down (see Techmeme for the full debate). So much for true interoperability.  

James

Yihong Ding
Thinkernetter
Tuesday February 12, 2008 9:56:42 PM
no ratings

Paul,

As usual, I like your comments and appreaciate your helpful links. 

I think you have caught a significant phenomenon at this moment. The big companies are sensitive (they have many smart guys). It explains why so quickly these big companies decided to join this workgroup. But on the other hand, everybody wants to dominate this fields; and thus nobody may succeed in foreseeable future. 

There is a fundamental issue inside data portability. That is, we must break the barrier of Web sites so that all data become public resources. However, few Web companies are willing to make this movement because they take "their data" as their property and don't want to share with others, especially the other competitors. When this DataPortability workgroup is composed by these companies, we cannot expect rapid progress.  So both the articles you have suggested really have pointed out a significant problem in the achievement.

On the other hand, however, the breakthrough on data portability will be an inevitable future. Why? Because this is the demand from regular users. We need new startups that focuses in this issue and start to design new generation home-spaces (my term) that will replace blog just as blogs have replaced homepages. This achievement will be a revolutionary moving forward. And this invention will be the signal of Web 3.0. I have already predicted it in my web evolution theory. The current achievement on data portability is demonstrating my prediction. 

One question, who will succeed in this new startup? The one that succeeds will be the next Facebook. 

cheers,

Yihong 

Paul Whyte
Researcher
Tuesday February 12, 2008 6:11:53 PM
no ratings

Hi Yihong,

Thanks for your post. As users, we can only hope that both Dataportability.org and OpenID foundation are both successful. Whilst we laud their initiative, one has to be skeptical about the big nams that have join them. How much deliberations is needed  in other to come up with some acceptable standards??? Is not this another PR initiative by these big guns to buy time out so as to avoid meeting the user's desperate need for "openess"? I don't see this workgroup achieving any meaningful result in the forseable future rather than endless debates. 

It has been reported now that facebook despite signing up is having some reservations: Facebook, Despite Signing On To DataPortability Quest, Is Iffy About

I fully support your view that workgroup is operating without a foundation and that we are still along way to achieving anything worthwhile:  

DataPortability has big names on board, but a long road ahead

 

Yihong Ding
Thinkernetter
Tuesday February 12, 2008 5:33:11 PM
no ratings

Hi Abdlah,

I am not sure whether the progress on data portability will help substantially on curbing cyber crime. But when the ownership of web resources become more transparent and clearly declared, it may increase the difficulty of some cyber crime.

For the second question, I think Web user will obtain more control of their information in future when the issue of data portability might be solved. This trend is very important to the future of the Web since we people always look for greater freedom of managing our information on the Web.

cheers,

Yihong 

Yihong Ding
Thinkernetter
Tuesday February 12, 2008 5:23:06 PM
no ratings

HI James,

Yes, there have been several proposal for formatting data on the Web. I expect that many negotiations would be done in the future to eventually achieve one or few widely accepted standards.  Besides the effort of this workgroup, another significant achievement we may expect is the rise of a new company devoting itself in this field and may grow to be a dominating company just like Google and Facebook had done. The format adopted by this company will be an important standard.

The problem of current proposed formats debates is that very few products of data portability have been in reailty. So these companies are just debating without foundation at present. Until substantial applications of data portability have emerged and been adopted by the public, the debate will not reach its end. 

In fact, Twine at Radar Networks is a product that should be aware. But the problem of Twine is that the company seems still want to hold all users solely to their own site. This strategy might be beneficial to Radar Network in short term. But in long-term run,  this strategy might be a fatal shortcoming for the company that its leading position might be replaced by new startups. In a word, we are still waiting for the real Web-3.0 innovation; and no products until now (including Twine, the closest one to Web 3.0) are able to carry enough momentum to initiate the hype of Web 3.0 yet.

Yihong

abdlah
IQ Crew
Tuesday February 12, 2008 4:51:44 PM
no ratings

Your thoughts on Data portability are interesting but quite complex as well. Do you think that once a framework is agreed upon, it will help curb cyber crime? And if so to what extent? Will people be more in control of their information that they have supplied via the Internet?

 

It will be interesting to see what the trend will be on this concept. A proper framework will no doubt make people more confident in using the web 

Page 1 of 2   Next >
The ThinkerNet does not reflect the views of TechWeb. The ThinkerNet is an informal means of communication to members and visitors of the Internet Evolution site. Individual authors are chosen by Internet Evolution to blog. Neither Internet Evolution nor TechWeb assume responsibility for comments, claims, or opinions made by authors and ThinkerNet bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.
previous posts from Yihong Ding
Yihong Ding
Yihong Ding   2/5/2009   12 comments
User-generated content (UGC) is the hallmark of Web 2.0. Indeed, many people take UGC as a synonym for Web 2.0. At the same time, there's no shortage of UGC skeptics who question its quality and integrity in spite of its proving successful in business.
Yihong Ding
Yihong Ding   10/9/2008   7 comments
We all know the Web is expanding. But how is this actually happening, and what does it mean for our future? Let’s take a look at four areas in which Web expansion is occurring simultaneously:
Yihong Ding
Yihong Ding   9/16/2008   10 comments
On the stage of the recent TechCrunch50 conference, startup Imindi, which offers a service involving the mapping of thoughts and associations in a database for use in social networking, received a hostile welcome from Silicon Valley. This does not mean that the Imindi service lacks innovation. On the contrary, Imindi might just be too novel for many people.
Yihong Ding
Yihong Ding   5/12/2008   14 comments
Where is the Semantic Web? Is it too hard, too complicated, too sophisticated, or too unrealistic to achieve? Or do we just need to better understand the Web's history in order to get there?
Yihong Ding
Yihong Ding   4/18/2008   25 comments
Prior to Web 2.0, Web content was presented as raw data rather than knowledge. Through collective human efforts, like "tagging," isolated data pieces on the Web now become connected and organized as knowledge. Although Web 2.0 is still limited in its ability to organize knowledge, it is the first Web era in which we are seeing a new group of "thinkers" emerge.
5
of
IETV: the thinkerNet on film
5
of
2pm EST
Tue
Feb 23rd
2pm EST
Thu
Mar 4th
3pm EST
Tue
Mar 9th
an IBM information resource
sponsored content
big blue blog
Todd Watson
IBM is announcing today the first of its Power7 processor-based systems and the Power7 processor itself at an event in NYC.
white papers & case studies
an IBM information resource
sponsored content
Smarter Collaboration: How to Thrive in a Challenging Business Environment
Market conditions are changing faster than ever, and organizations need to improve their agility and adaptability in order to provide better service and improve processes. The ability to work with customers, business partners, and employees as effectively as possible - while at the same time holding down costs - is a key to success.

READ THIS eBOOK
your weekly update of news, analysis, and
opinion from Internet Evolution - FREE!

REGISTER HERE
Wanted! Site Moderators
Internet Evolution is looking for a handful of readers to help moderate the message boards on our site – as well as engaging in high-IQ conversation with the industry mavens on our thinkerNet blogosphere. The job comes with various perks, bags of kudos, and GIANT bragging rights. Interested?

Please email: moderators@internetevolution.com
CMP Media LLC
Internet Evolution – not for thickies
Congress Hits the Snooze Button With China
Ira Winkler
In his
recent Congressional testimony, Dennis Blair, the U.S. director of national intelligence, stated that the U.S. is "severely threatened" by cyber attacks and that the recent Google (Nasdaq: GOOG) attacks should serve as a wake-up call.

CLICK FOR MORE
Full Nelson
Go With the FLO, Part 2

Part 2 of 2   |  
See complete series
2|5|10   |   2:17   |   3 comments


Fritz and his sweater continue their review of Qualcomm's FLO TV.
Full Nelson
Go With the FLO, Part 1

Part of 2   |  
See complete series
2|4|10   |   2:39   |   1 comment


Qualcomm's FLO TV gizmo streams live TV shows. Tragically, they include the O'Reilly Factor
Sweeney Blog
No Sex, Please... It's the Super Bowl

2|3|10   |   2:24   |   2 comments


The Super Bowl ads that CBS rejected are turning up online, generating lots of attention but zero revenue for the broadcaster.
Cirque Du Solez
Books Come Alive

2|2|10   |   2:02   |   3 comments


Ray Kurzweil's Blio and Apple's iPad tablet will make it easier than ever to have books "read" to us, says Dr. Kim, who believes that talking tablets will become interwoven into our consciousness as we "merge" with the increasingly elegant machines we hold in our hands.
Not Dr. Phil
The iPad: iGet It

1|29|10   |   04:02   |   5 comments


Nicole Ferraro's all-out attack of the iPad was so funny, so fiesty – how could I not offer a video rebuttal?
what.the.ferraro
Suing Over Tweets: The American Way

1|28|10   |   2:22   |   13 comments


Thanks to Twitter, suing people for daffy reasons is getting a whole lot more popular.
Steve Saunders' Outernet
Internet Evolution Goes Soccer Crazy

1|28|10   |     |   14 comments


Steve Saunders visits the South African Consulate in New York for a party celebrating not one, but two (2!) major football (soccer!) tournies.
Thus Spake Mr. Cramer
3D: It's Baaaaack!

1|22|10   |   3:25   |   8 comments


3D was a pointless, silly gimmick 50 years ago, and it's a pointless, silly gimmick still. It's sure to catch on.
Steve Saunders' Outernet
Yet More Email Egregiousness

Part 2 of 2   |  
See complete series
1|15|10   |   2:24   |   5 comments


BTI sent Steve a card calling him a tool. Unsurprisingly, he’s not happy, and he dons his Martha Stewart hat to show us how corporate holiday greetings should be done.
Tom Nolle
Take Two Tablets & Call Me...

1|8|10   |   2:13   |   8 comments


Apple's not launching a tablet PC. There won’t be an Android tablet PC, either. When it comes to the latest tablets, forget about PC functionality – the newest devices have a quite different purpose in mind.
Lee H. Berke
The Decline & Fall of Broadcast Television

2|9|10   |   1:00   |   No comments


Want to know the future of broadcast television? Take a look at broadcast radio’s past.
Tom Nolle
Everything New Is Old Again

2|9|10   |   2:13   |   6 comments


Research shows that the youth of today like Facebook – but not blogging or Twitter. Does that mean Facebook has won, or just that it's not yet out of favor? Will all the services we see today fade into Ovaltine-or-Wheaties status in just a few years?
what.the.ferraro
Email Marketing Gets Desperate

2|8|10   |   2:31   |   4 comments


Promotional emails will use just about anything timely to get people to buy things. Seriously, anything.
Steve Saunders' Outernet
America, Truck Yeah!

2|8|10   |   1:42   |   5 comments


Steve likes his new Dodge Ram 1500, but hates Chrysler's Web non-sales strategy. Rant on, li'l buddy.
what.the.ferraro
Twits Go Wild for Resignation Tweet

2|5|10   |   1:48   |   4 comments


Jonathan Schwartz is the first Fortune 200 CEO to resign via Tweet. Can he walk on water, too?
Full Nelson
Go With the FLO, Part 2

Part 2 of 2   |  
See complete series
2|5|10   |   2:17   |   3 comments


Fritz and his sweater continue their review of Qualcomm's FLO TV.
Singer at C-Level
Goldilocks & the Data Center

2|4|10   |   3:39   |   2 comments


What kinds of companies are doing the most innovation in the data center? Turns out it's midtier enterprises that are taking the "Just Right" approach.
Full Nelson
Go With the FLO, Part 1

Part of 2   |  
See complete series
2|4|10   |   2:39   |   1 comment


Qualcomm's FLO TV gizmo streams live TV shows. Tragically, they include the O'Reilly Factor
Eurotrash
High & Dry in Barcelona

2|3|10   |   1:08   |   No comments


Ray’s heading to Barcelona for the Mobile World Congress, and he’s not happy about it, the miserable git.
Sweeney Blog
No Sex, Please... It's the Super Bowl

2|3|10   |   2:24   |   2 comments


The Super Bowl ads that CBS rejected are turning up online, generating lots of attention but zero revenue for the broadcaster.