Blog Posts by Andrew B. Collier / @datawookie

Day 28: Hypothesis Tests

2015-10-05 Julia Month of Julia

It’s all very well generating myriad statistics characterising your data. How do you know whether or not those statistics are telling you something interesting? Hypothesis Tests. To that end, we’ll be looking at the HypothesisTests package today.

Read More →

Day 27: Distributions

2015-10-02 Julia Month of Julia

Today I’m looking at the Distributions package.

Read More →

Day 26: Statistics

2015-10-01 Julia Month of Julia

Read More →

Day 25: Interfacing with Other Languages

2015-09-30 Julia R Python Month of Julia

Julia has native support for calling C and Fortran functions. There are also add on packages which provide interfaces to C++, R and Python. We’ll have a brief look at the support for C and R here. Further details on these and the other supported languages can be found on GitHub.

Read More →

Day 24: Graphs

2015-09-29 Julia Month of Julia

Read More →

Day 23: Data Structures

2015-09-28 Julia Month of Julia

Read More →

Day 22: Optimisation

2015-09-25 Julia Month of Julia

Sudoku-as-a-Service is a great illustration of Julia’s integer programming facilities. Julia has several packages which implement various flavours of optimisation: JuMP, JuMPeR, Gurobi, CPLEX, DReal, CoinOptServices and OptimPack. We’re not going to look at anything quite as elaborate as Sudoku today, but focus instead on finding the extrema in some simple (or perhaps not so simple) mathematical functions. At this point you might find it interesting to browse through this catalog of test functions for optimisation.

Read More →

Day 21: Differential Equations

2015-09-24 Julia Month of Julia

Read More →

Day 20: Calculus

2015-09-23 Julia Month of Julia

Read More →

Day 19: Units of Measurement

2015-09-22 Julia Month of Julia

Read More →

Day 18: Plotting

2015-09-21 Julia Month of Julia

There’s a variety of options for plotting in Julia. We’ll focus on those provided by Gadfly and Plotly.

Read More →

PhysicalConstants.jl: Julia Package of Physical Constants

2015-09-21 Julia

PhysicalConstants is a Julia package which has the values of a range of physical constants. Currently MKS and CGS units are supported.

Read More →

Day 17: Datasets from R

2015-09-18 Julia R Month of Julia

R has an extensive range of builtin datasets, which are useful for experimenting with the language. The RDatasets package makes many of these available within Julia. We’ll see another way of accessing R’s datasets in a couple of days’ time too. In the meantime though, check out the documentation for RDatasets and then read on below.

Read More →

Day 16: Databases

2015-09-17 Julia Month of Julia

Read More →

Setting up ODBC for SQLite on Ubuntu

2015-09-17 Linux SQLite

First install the SQLiteODBC and unixODBC packages. Have a quick look at the documentation for unixODBC and SQLiteODBC.

Read More →

Day 15: Time Series

2015-09-16 Julia Month of Julia

Read More →

Day 14: DataFrames & DataArrays

2015-09-15 Julia Month of Julia

Read More →

urlshorteneR: A package for shortening URLs

2015-09-14 R

This is a small package I put together quickly to satisfy an immediate need: generating abbreviated URLs in R. As it happens I require this functionality in a couple of projects, so it made sense to have a package to handle the details. It’s not perfect but it does the job. The code is available from GitHub along with vague usage information. In essence the functionality is simple: first authenticate to shortening service (goo. Read More →

Day 13: Packages

2015-09-14 Julia Month of Julia

Read More →

Day 12: Parallel Processing

2015-09-11 Julia Month of Julia

As opposed to many other languages, where parallel computing is bolted on as an afterthought, Julia was designed from the start with parallel computing in mind. It has a number of native features which lend themselves to efficient implementation of parallel algorithms. It also has packages which facilitate cluster computing (using MPI, for example). We won’t be looking at those, but focusing instead on coroutines, generic parallel processing and parallel loops. Read More →