On 2006-04-11 20:24:00, Allan Engelhardt wrote in CYBAEA Journal:
What is the biggest threat to the internet as a source of information exchange? The music industry dinosaurs that are stuck in copyright hell? The control freaks in the telecommunications industry that wants to end net neutrality? Google?
Google with its “do no evil” policy is an internet darling and by creating a vastly superior search engine has arguably done more than most to make the internet a useful information source.
However, Google does not make content. Its usefulness comes from a bargain with the content producers: we will steal your copy but provide you traffic in return. That bargain may now be changing.
Google has acquired the rights to the Orion search engine created by Ori Allon of the University of New South Wales. What is new is, in the words of Ori, that the search engine will be giving you the relevant information without [you] having to go to the web site
. Think of the current web page extracts Google already provide with their search results, but on steroids so they provide all the information you need right there on the search page.
This is new. I can see two possible outcomes. In one, the content providers are unable to create an online revenue stream and therefore move offline. Time to buy more stock in paper companies (and settle for a massive global economic slowdown). In the other, we see the creation of several disconnected “walled gardens” of online content, as content providers only share their information with meta-content provides who are willing to share their revenues.
I am away from my library so I can’t find the reference, but I am reminded of a science fiction sort story. In it, humans are reaching the limits of new knowledge. However, there is a rapid and sustained growth in knowledge about knowledge (K2), knowledge about knowledge about knowledge (K3) etc., until the entire human civilization collapses under the weight of data like a deck of index cards. Could Google (and Yahoo and Microsoft, the other bidders for the technology) be the beginning of the end for Homo sapiens sapiens?
On 2010-03-08 14:46:00, Allan Engelhardt wrote in CYBAEA Data and Analysis:
I needed a fast way of eliminating observed values with zero variance from large data sets using the R statistical computing and analysis platform. In other words, I want to find the columns in a data frame that has zero variance. And as fast as possible, because my data sets are large, many, and changing fast. The final result surprised me a little.
Read more (~501 words).
On 2009-08-17 09:18:00, Allan Engelhardt wrote in CYBAEA Journal:
We knew the potential existed already, of course. Mobile devices in the USA generates some 600 billion transactions per day, each tagged with the location and time. Jeff Jonas: Every call, text message, email and data transfer handled by your mobile device creates a transaction with your space-time coordinate[...].
The mobile operators have this data, of course. We all know this (especially here where we have been using some of it for social network analysis). No real surprises here, except perhaps in the volumes.
But did you know that the operators are sharing your data? What is new, at least to me, is that this data is being provided to third parties that are leveraging specially designed analytics to make sense of our space-time-travel data.
Read more (~449 words, 1 comments).
On 2009-07-27 19:38:00, Allan Engelhardt wrote in CYBAEA Data and Analysis:
O'Reilly's recent publication Beautiful Data has a chapter by Jeff Jonas which is enough reason in itself for me to recommend it. The chapter, Data Finds Data, is also available as a PDF download.
Read more (~66 words).
On 2009-07-22 13:37:00, Allan Engelhardt wrote in CYBAEA Data and Analysis:
This is by far the best description of why traditional parallel databases (like Teradata, Greenplum et al.) is a evolutionary dead end. But much more than a theoretical discussion, they have built a solution which they call HadoopDB. It is based on Hadoop, PostgreSQL, and Hive and is completely Open Source. Alternative, column-based, backends to PostgreSQL are being implemented now. Read: Announcing release of HadoopDB.
Read more (~83 words).
On 2009-07-22 06:59:00, Allan Engelhardt wrote in CYBAEA Journal:
The nice people at Velocity has released The B2B Content Marketing Workbook. It is behind a registration wall which means we wouldn’t normally recommend it but you can just type junk in the fields if you are not comfortable with giving your personal details to a marketing agency. (Think about it....) If you are relatively new in the B2B world, say having joined a professional services or consulting organization, you may find this one useful.
Read more (~263 words).
Join the discussion
There are no comments yet. Be the first to comment.