wfoster8560 -- there are a number of white papers on hadoop, the apache hadoop site is a good reference, the original Google papers (2003 on Google file system, 2004 on MapReduce), along with the Oreilly and manning books on Hadoop are all good sources. satnam
Lin -- yes, 2008 CI book had Java-based examples. Java continues to be the programming language i use, so most probably the next book will also be Java based. satnam
wfoster8560 -- since 2008, I have been surprised at how rapidly new user-generated content is being generated. The whole emergence of Big Data and use of Hadoop to mine information has been a positive surprise. satnam
Kim -- Google, Yahoo, Facebook all have shown how one can scale to hundreds of petabytes of data. We are very confident that we will similarly scale to large volumes. satnam
wfoster8560 -- I have not heard about the project of China sequencing every chinese. But if it is true it will be great to have that data publicly available for researchers to get new insights. satnam
Satnam, there doesn't seem to be a realistic alternative to the cloud for the kind of scaleable data storage you need. Are you at all concerned about outages, data loss, security breaches and the other (supposed) downsides of the cloud?
Mary -- intelligent search, mining of large amounts of data using Hadoop is fairly general. The techniques are applicable to most verticals that deal with large amounts of data. satnam
lin -- there are a number of examples of how molecular medicine is being used -- breast cancer being another example. I would say we are in the area of Translational Research/Medicine. By our partnership with CCI and other to follow, we hope to move the field forward towards the clinic. satnam
Kim -- self learning algorithms. yes, NextBio is a data-driven engine and answers may change as new data comes along. we still have a lot more room on how further to mine the large amounts of data using additional machine learning algorithms. satnam
Satnam, is molecular medicine a real possibility at this time? The only one I know well, Dendreon, has a patient-specific DNA treatment for prostate cancer -- the only things I hear about them is their financial and regulatory troubles. Is molecular medicine commercially viable or is it just research?
Question I ran out of time for: are there AI capabilities in the NextBio platform - in other words, does machine learning enable it to improve its own research strategies?
Kim -- there is a lot of work going on in the area of wellness and preventive treatment. There are a few examples -- dosage of drugs and biomarkers for diseases based on molecular data. Oncology is one of the top areas where molecular medicine is being used. satnam
Mary -- yes, we build multiple tenancy into our architecture. All data is tagged with a userid or a domain id that allows us to segregate data logically. satnam
Does that raise issues of performance -- if one client is using a lot of resources does it bog down the others? Security? What if one client comes under DoS attack; how are other clients protected?
120 gigs of data doesn't actually sound like a lot for an entire genome. Even a half-terabyte -- that's less than the hard disk on a Mac. But when you're dealing with millions of sets of that type - that's a lot of data.
Big-data and analytics tools enable marketers to understand customers as individuals, identifying unmet needs and addressing each customer as a "segment of one," says John Kennedy, VP corporate marketing, IBM.
New York's Metropolitan Transit Authority is conducting a pilot test of digital kiosks to guide subway users to where they want to go more efficiently and at lower cost.
The whole Amazon.reader debate is a double-stupid. It's stupid to think that there's any e-book buyer who doesn't know Amazon's URL, and it was stupider to let ICANN launch the whole free-form TLD initiative to start with.
While NFC's original goal was to enhance mobile commerce applications, it is finding its way into a number of other uses, which is creating both opportunity as well as challenges for IT departments.
Enterprises would like to move to cloud computing but are hesitant because they are concerned about providers’ ability to secure company data. Here are some tips that help to ensure that if breaches occur, the business is not left holding the bag.
Edmunds separates customers into segments based on the info it collects on its site and from partners, and uses that to push out custom content, said Brian Baron, director of business analytics for Edmunds.com, at Predictive Analytics Innovation Summit.
The IBM Smarter Commerce Global Summit in Monaco kicked into high gear today, and we've already begun to see news emerging from that lovely city-state by the sea.
Expert Integrated Systems: Changing the Experience & Economics of IT In this e-book, we take an in-depth look at these expert integrated systems -- what they are, how they work, and how they have the potential to help CIOs achieve dramatic savings while restoring IT's role as business innovator. READ THIS eBOOK
your weekly update of news, analysis, and
opinion from Internet Evolution - FREE! REGISTER HERE
Wanted! Site Moderators Internet Evolution is looking for a handful of readers to help moderate the message boards on our site as well as engaging in high-IQ conversation with the industry mavens on our thinkerNet blogosphere. The job comes with various perks, bags of kudos, and GIANT bragging rights. Interested?
To save this item to your list of favorite Internet Evolution content so you can find it later in your Profile page, click the "Save It" button next to the item.