KDD Cup 2009

12 May 2009

The results from the KDD Cup 2009 are both interesting and fundamentally not interesting. For this public data mining challenge Orange, the mobile telecommunications company, provided anonymous data sets on mobile customers: 50,000 records each of training and testing data with 15,000 variables. (The data sets are still available for download, and there are also smaller data sets with only 230 variables.) The competition was to provide the best models for churn, cross-sell (“appetency”), and up-sell.

Read more (~440 words)

SNA with R: Loading large networks using the igraph library

6 May 2009

We are interested in Social Network Analysis using the statistical analysis and computing platform R. The documentation for R is voluminous but typically not very good, so this entry is part of a series where we document what we learn as we explore the tool and the packages.

Read more (~620 words)

SNA with R: Loading your network data in statnet

1 April 2009

We are interested in Social Network Analysis using the statistical analysis and computing platform R. As usual with R, the documentation is pretty bad, so this series collects our notes as we learn more about the available packages and how they work. We use here the statnet group of packages, which seems to be the most comprehensive and most actively maintained set of network analysis packages.

Read more (~1170 words)

R tips: Swapping columns in a matrix

31 March 2009

Using R, the statistical analysis and computing platform, swapping two columns in a matrix is really easy: m[ , c(1,2)] <- m[ , c(2,1)].
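As a small illustration of the one-liner above (the matrix and its column names are made up for the example), note that the assignment swaps the values only; the column names stay where they are. If the names should travel with the columns, reordering by index is one alternative.

```r
# Example data (hypothetical): a 3 x 2 matrix with named columns
m <- matrix(1:6, nrow = 3, dimnames = list(NULL, c("a", "b")))

# The swap from the text: values change places, column names do not
swapped <- m
swapped[, c(1, 2)] <- swapped[, c(2, 1)]

# Alternative: reorder by index, so the names move with the columns
reordered <- m[, c(2, 1)]

print(swapped)
print(reordered)
```

For a matrix with more than two columns, the same assignment works with any pair of column indices in place of 1 and 2.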

Read more (~70 words)

R tips: Eliminating the “save workspace image” prompt on exit

26 March 2009

When using R, the statistical analysis and computing platform, I find it really annoying that it always prompts to save the workspace when I exit. This is how I turn it off.
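The full entry is not reproduced on this page, so the following is not necessarily the approach described there. It is a minimal sketch of one common option: a small wrapper around the base function quit() that always passes save = "no". The name quit_without_saving is a hypothetical helper, not part of R.

```r
# Hypothetical helper (not part of base R): exit without the
# "Save workspace image?" prompt by always passing save = "no".
quit_without_saving <- function(status = 0) {
  quit(save = "no", status = status)
}

# Usage, at the end of an interactive session:
# quit_without_saving()
```

Defining such a helper in a startup file such as ~/.Rprofile would make it available in every session; starting R with the --no-save command line option is another route to the same result.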

Read more (~210 words)

R tips: Keep your packages up-to-date

25 March 2009

In this entry in a small series of tips for using the R statistical analysis and computing tool, we look at how to keep your add-on packages up-to-date.
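The details are in the full entry; as a rough sketch only, and not necessarily the method described there, the standard utils functions old.packages() and update.packages() can be combined as below. The helper name refresh_packages and the choice of CRAN mirror are assumptions made for the example.

```r
# Hypothetical helper: list outdated packages, then update them
# without prompting. The default mirror URL is an assumption.
refresh_packages <- function(repos = "https://cloud.r-project.org") {
  # old.packages() returns NULL when nothing is out of date
  outdated <- old.packages(repos = repos)
  if (is.null(outdated)) {
    message("All packages are up to date.")
    return(invisible(character(0)))
  }
  # ask = FALSE updates everything without a per-package prompt
  update.packages(repos = repos, ask = FALSE)
  invisible(rownames(outdated))
}

# Usage (needs network access and write permission to the library):
# updated <- refresh_packages()
```

The function returns the names of the packages that were out of date, which can be handy for logging what changed.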

Read more (~510 words)

The financial crisis and physicists

20 March 2009

The financial crisis is all my fault. Or so David Smith from our friends at REvolution seems to suggest in his post Physicists, models, and the credit crisis:

Read more (~950 words)

Happy birthday WWW

12 March 2009

Twenty years ago, on 13th March 1989, Tim Berners-Lee wrote the original proposal for what was to become the World Wide Web. Happy birthday!

Read more (~130 words)

5 step process for customer base segmentation

26 February 2009

Over the last few years we have done a tremendous amount of customer segmentation work with the marketing departments of companies across a number of industries. We have found that there are many misconceptions about what “segmentation” really is, why we do it, and what we can expect to achieve from it.

Read more (~1150 words)

3 things we want from a segmentation of the customer base

26 February 2009

Over the last few years we have done a tremendous amount of customer segmentation work with the marketing departments of companies across a number of industries. We have found that there are many misconceptions about what “segmentation” really is, why we do it, and what we can expect to achieve from it.

Read more (~1090 words)