Pipelines for data science
Hadley Wickham (The Mitchell Lecture)
Wednesday 27th April, 2016, 16:00-17:00, Maths 515
To do data science you need five sets of verbs: import, tidy, transform, visualise, and model. Importantly, you also need a way to connect these tools together so that your analysis flows from one step to another, without you beating your head against the wall. In this talk, I discuss the idea of the pipe as it is implemented in R with the magrittr package. You'll learn why the pipe makes your code easier to read, and see how it provides a unifying interface throughout your complete workflow.
Come along to learn why I think pipelines are awesome and see how pipelines + tidyr, dplyr, ggplot2, and purrr can make your data analyses fast, fluent and fun. I'm a passionate believer that code should be an artefact of clear communication, so even if you've never used R before, you'll be able to follow this talk.