Andrew B. Collier / @datawookie

Comrades Marathon: A Race for Geriatrics?

2014-07-22 R running

It has been suggested that the average Comrades Marathon runner is gradually getting older. As an “average runner” myself, I will not deny that I am personally getting older. But what I really mean is that the average age of all runners taking part in this great event is gradually increasing. This is not just an idle hypothesis: it is supported by the data. If you’re interested in the technical details of the analysis, these are included at the end; otherwise, read on for the results.

Where to Put EAs and Indicators in New MT4 Builds

2014-07-20

If you are creating an EA or indicator from scratch, then the MetaTrader editor places the files in the correct location and the terminal is automatically able to find them. However, if the files originate from a third party then you will need to know where to insert them so that they show up in the terminal. For older builds of MetaTrader 4 the directory structure was fairly simple.

Comrades Marathon Negative Splits: Cheat Strikes Again

2014-07-16 running

It looks like one of the suspect runners from my previous posts cheated again in this year’s Comrades Marathon.

Twins, Tripods and Phantoms at the Comrades Marathon

2014-06-12 R running

Having picked up a viral infection days before this year’s Comrades Marathon, on 1 June I was left with time on my hands and somewhat desperate for any distraction. I spent some time looking at my archive of Comrades data and considering some new questions. For example, what are the chances of two runners passing through halfway and the finish line at exactly the same time? How likely is it that three runners achieve the same feat?

Concatenating a list of data frames

2014-06-06 R

It’s something that I do surprisingly often: concatenating a list of data frames into a single (possibly quite enormous) data frame. Until now my naive solution worked pretty well. However, today I needed to deal with a list of over 6 million elements. The result was hours of page thrashing before my R session finally surrendered. I suppose I should be happy that my hard disk survived.

Comrades Marathon Pacing Chart: Down Run

2014-05-28 Excel running

Although I have been thinking vaguely about my Plan A goal of a Bill Rowan medal at the Comrades Marathon this year, I have not really put a rigorous pacing plan in place. I know from previous experience that I am likely to be quite a bit slower towards the end of the race. I also know that I am going to lose a few minutes at the start. How fast does this mean I need to run in order to get from Pietermaritzburg to Durban in under 9 hours?

What Can We Learn from the Commitments of Traders Report?

2014-05-21

The Commitments of Traders (COT) report is issued weekly by the Commodity Futures Trading Commission (CFTC). It reflects the level of activity in the futures markets. The report, which is issued every Friday, contains the data from the previous Tuesday.

Race Statistics for Comrades Novices: Corrigendum

2014-05-17 R running

There was some significant bias in the histogram from my previous post: the data from all years were lumped together. This is important because as of 2003 (when the Vic Clapham medal was introduced) the final cutoff for the Comrades Marathon was extended from 11:00 to 12:00. An extended cutoff was also applied in 2000.

Race Statistics for Comrades Marathon Novice Runners

2014-05-16 R running

Most novice Comrades Marathon runners finish the race on their first attempt and the majority of them walk (shuffle, crawl?) away with Bronze medals.

Hazardous & Benign Objects: Solar-Ecliptic Orbits

2014-05-12 R

In two previous posts in this series I have wrangled NEO orbital data into R and then solved Kepler’s Equation to get the eccentric anomaly for each NEO. The final stage in the visualisation of the NEO orbits will be the transformation of locations from the respective orbital planes into a single reference frame.

Comrades Marathon Negative Splits: The Plot Thickens

2014-05-10 R running

I have been thinking a little more about those mysterious negative splits. Not too surprisingly, this thinking happened while I was out running along the Durban beachfront this morning.

Hazardous & Benign Objects: Kepler’s Equation

2014-05-08 R

Following on from my previous post about Near Earth Objects, today we are going to solve Kepler’s Equation to find the eccentric anomaly, which is the next step towards plotting the positions of these NEOs relative to Earth.
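
For readers who want a feel for what solving Kepler’s Equation involves, here is a minimal sketch in Python using Newton’s method. It is not the code from the post (which uses R); the function name, tolerance and starting guess are my own assumptions, and it assumes an elliptical orbit with the mean anomaly M given in radians between 0 and 2π.

```python
import math

def eccentric_anomaly(M, e, tol=1e-12, max_iter=50):
    """Solve Kepler's Equation M = E - e*sin(E) for E using Newton's method.

    M is the mean anomaly in radians (assumed to lie in [0, 2*pi]) and
    e is the orbital eccentricity (assumed 0 <= e < 1, i.e. an ellipse).
    """
    if not 0 <= e < 1:
        raise ValueError("eccentricity must satisfy 0 <= e < 1")
    # Starting guess: M works well for low eccentricity; pi is a safer
    # choice for highly eccentric orbits (hypothetical threshold of 0.8).
    E = M if e < 0.8 else math.pi
    for _ in range(max_iter):
        # Newton step on f(E) = E - e*sin(E) - M, with f'(E) = 1 - e*cos(E).
        delta = (E - e * math.sin(E) - M) / (1 - e * math.cos(E))
        E -= delta
        if abs(delta) < tol:
            return E
    raise RuntimeError("Newton iteration did not converge")
```

A quick sanity check is to substitute the result back into the equation: for any returned E, the quantity E - e*sin(E) should reproduce M to within the tolerance.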

Comrades Marathon: Negative Splits and Cheating

2014-05-06 R running

With this year’s Comrades Marathon just less than a month away, I was reminded of a story from earlier in the year. Mark Dowdeswell, a statistician at Wits University, found evidence of cheating by some middle- and back-of-the-pack Comrades runners. He identified a group of 20 athletes who had suspicious negative splits: they ran much faster in the second half of the race. There was one runner in particular whose splits were just too good to be true. When the story was publicised, this particular runner claimed that it was a conspiracy.

Hazardous & Benign Objects: Getting the Data

2014-04-28 R

The recent story about a skydiver nearly being hit by a falling meteor got me thinking about all the pieces of rock floating around in near-Earth space. Despite the fact that the supposed meteor was probably just a chunk of rock mistakenly packed in with a parachute, the fact that something like that could actually happen is quite intriguing. And not a little frightening.

R Interface to Myfxbook

2014-04-17 R

Myfxbook provides an interface to your FOREX trading accounts as well as an active trading community.

Earthquakes: Land / Ocean Distribution

2014-04-13 R

The next stage in my earthquake analysis project is to partition the events into groups with epicentre over land or water.

Largest Volcanoes & Other Statistics

2014-04-11 R

Around 199 years ago the largest volcano in recorded history, Mount Tambora, erupted, spewing an enormous volume of molten rock and ash into the atmosphere and onto the surrounding land.

Earthquakes: Magnitude / Depth Chart

2014-04-07 R

I am working on a project related to secondary effects of earthquakes. To guide me in the analysis I need a chart showing the location, magnitude and depth of recent earthquakes. A host of such charts are already available, but since I had the required data on hand, it seemed like a good idea to take a stab at it myself.

Daylight Saving Effect on Financial Indices

2014-04-01 R

Does the transition to and from Daylight Saving Time (DST) have a (significant) effect on the stock market?

Filtering Data with L1 Regularisation

2014-03-27 R regularisation

A few days ago I posted about Filtering Data with L2 Regularisation. Today I am going to explore the other filtering technique described in the paper by Tung-Lam Dao.
