Skip to content

Latest commit

 

History

History
53 lines (39 loc) · 2.42 KB

README.md

File metadata and controls

53 lines (39 loc) · 2.42 KB

Let the Data Flow: Pipelines in R with dplyr and magrittr

Abstract

Pipelines were the best thing to happen in R in 2014. They let us transform messy, inside-out code like sort(unique(round(xs, 2))) into a clear chain of transformations like xs %>% round(2) %>% unique %>% sort. In this talk I lead a tutorial on how to use pipelines for data-cleaning, transformation and presentation with the packages magrittr and dplyr. For beginners, I also review some of the essential R functions to make the most of pipelines.

Tristan is a PhD student in Communication Sciences and Disorders. He uses in R in the Learning To Talk lab to model eye-tracking and speech perception data. @tjmahr, github.com/tjmahr.

Slides

I prepared three sets of slides:

Resources

Packages

  • magrittr for pipelines
  • dplyr for data-frame functions
  • broom
  • stringr for string manipulation functions
  • pipeR an alternative pipeline package (that I haven't tried yet).

License

Obviously, the GPL-2 license applies only to the code and words I wrote, which are in the .Rpres and .md files and are reproduced with markup in the .html files.