On 2006-04-11 20:24:00, Allan Engelhardt wrote in CYBAEA Journal:
What is the biggest threat to the internet as a source of information exchange? The music industry dinosaurs that are stuck in copyright hell? The control freaks in the telecommunications industry that wants to end net neutrality? Google?
Google with its “do no evil” policy is an internet darling and by creating a vastly superior search engine has arguably done more than most to make the internet a useful information source.
However, Google does not make content. Its usefulness comes from a bargain with the content producers: we will steal your copy but provide you traffic in return. That bargain may now be changing.
Google has acquired the rights to the Orion search engine created by Ori Allon of the University of New South Wales. What is new is, in the words of Ori, that the search engine will be giving you the relevant information without [you] having to go to the web site
. Think of the current web page extracts Google already provide with their search results, but on steroids so they provide all the information you need right there on the search page.
This is new. I can see two possible outcomes. In one, the content providers are unable to create an online revenue stream and therefore move offline. Time to buy more stock in paper companies (and settle for a massive global economic slowdown). In the other, we see the creation of several disconnected “walled gardens” of online content, as content providers only share their information with meta-content provides who are willing to share their revenues.
I am away from my library so I can’t find the reference, but I am reminded of a science fiction sort story. In it, humans are reaching the limits of new knowledge. However, there is a rapid and sustained growth in knowledge about knowledge (K2), knowledge about knowledge about knowledge (K3) etc., until the entire human civilization collapses under the weight of data like a deck of index cards. Could Google (and Yahoo and Microsoft, the other bidders for the technology) be the beginning of the end for Homo sapiens sapiens?
On 2010-07-13 07:47:00, Allan Engelhardt wrote in CYBAEA Data and Analysis:
I am not sure apeescape’s ggplot2 area plot with intensity colouring is really the best way of presenting the information, but it had me intrigued enough to replicate it using base R graphics.
The key technique is to draw a gradient line which R does not support natively so we have to roll our own code for that. Unfortunately, lines(..., type="l") does not recycle the colour col= argument, so we end up with rather more loops than I thought would be necessary.
We also get a nice opportunity to use the under-appreciated read.fwf function.
Read more (~535 words).
On 2010-06-22 11:45:00, Allan Engelhardt wrote in CYBAEA Journal:
We have a mild obsession with employee productivity and how that declines as companies get bigger. We have previously found that when you treble the number of workers, you halve their individual productivity which is scary.
We now re-do the analysis four years later and, just because we can, we are using the leading companies of the London stock exchange instead of the largest American companies.
The results still hold. We called it the 3/2 rule: treble the number of workers and you halve their individual productivity. Large companies with ten times the number of employees are ¼ as productive as their smaller competitors.
Employee productivity is a big issue. If all the FTSE-100 companies achieved their average profits per employee, then the index would generate almost £1 trn of additional net profits for the economy.
Read more (~245 words).
On 2010-06-22 11:20:00, Allan Engelhardt wrote in CYBAEA Data and Analysis:
We have a mild obsession with employee productivity and how that declines as companies get bigger. We have previously found that when you treble the number of workers, you halve their individual productivity which is mildly scary.
We revisit the analysis for the FTSE-100 constituent companies and find that the relation still holds four years later and across a continent.
Read more (~763 words, 5 comments).
On 2010-06-17 09:05:00, Allan Engelhardt wrote in CYBAEA Data and Analysis:
Following on from my previous post about improving performance of R by linking with optimized linear algebra libraries, I thought it would be useful to try out the five benchmarks Revolutions Analytics have on their Revolutionary Performance pages.
Read more (~300 words, 2 comments).
On 2010-06-15 10:21:00, Allan Engelhardt wrote in CYBAEA Data and Analysis:
Can we make our analysis using the R statistical computing and analysis platform run faster? Usually the answer is yes, and the best way is to improve your algorithm and variable selection.
But recently David Smith was suggesting that a big benefit of their (commercial) version of R was that it was linked to a to a better linear algebra library. So I decided to investigate.
The quick summary is that it only really makes a difference for fairly artificial benchmark tests. For “normal” work you are unlikely to see a difference most of the time.
Read more (~934 words, 1 comments).
Join the discussion
There are no comments yet. Be the first to comment.