datamations

datamations is a framework for automatically generating explanations of the steps in an analysis pipeline. It turns code into animations that show the state of the data at each step of the analysis.

For more information, please visit the package website, which includes additional examples, defaults and conventions, and more.

Installation

You can install datamations from GitHub with:

# install.packages("devtools")
devtools::install_github("microsoft/datamations")

Usage

To get started, load datamations and dplyr:

library(datamations)
library(dplyr)

A datamation shows a plot of what the data looks like at each step of a tidyverse pipeline, animated by the transitions that lead to each state. The following example takes the built-in small_salary data set, groups by Degree, and calculates the mean Salary.

First, define the code for the pipeline as a string, then generate the datamation with datamation_sanddance():

"small_salary %>%
  group_by(Degree) %>%
  summarize(mean = mean(Salary))" %>%
  datamation_sanddance()
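
To make "the state of the data at each step" concrete, the following base R sketch walks through the three states that a group-then-summarize pipeline passes through: raw rows, rows partitioned by Degree, and one mean per group. The `salaries` data frame is a hypothetical stand-in for small_salary with made-up values, and the `state_*` names are illustrative only. The sketch does not call datamations; it shows only what the animation is meant to explain.

```r
# Hypothetical toy data standing in for small_salary (values are made up)
salaries <- data.frame(
  Degree = c("Masters", "Masters", "PhD", "PhD"),
  Salary = c(80, 90, 85, 95)
)

# State 1: the raw data, one row per observation
state_raw <- salaries

# State 2: after grouping by Degree, rows are partitioned by degree
state_grouped <- split(salaries, salaries$Degree)

# State 3: after summarizing, one mean Salary per group
state_summary <- sapply(state_grouped, function(g) mean(g$Salary))

print(state_summary)
```

A datamation animates the transitions between states like these, so a reader can see how individual rows end up in each summary value.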

datamations supports the following dplyr functions:

  • group_by() (up to three grouping variables)
  • summarize()/summarise() (limited to summarizing one variable)
  • filter()
  • count()/tally()
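
As a sketch of the first item in the list above, a pipeline can group by more than one variable. This example assumes that small_salary has a Work column alongside Degree and Salary. That column name does not appear elsewhere in this README, so check names(small_salary) before relying on it.

```r
library(datamations)
library(dplyr)

# Assumption: small_salary has a Work column in addition to Degree and Salary
pipeline <- "small_salary %>%
  group_by(Degree, Work) %>%
  summarize(mean = mean(Salary))"

# The pipeline is passed as a string, as in the single-variable example
pipeline %>%
  datamation_sanddance()
```

Storing the pipeline in a variable first is a stylistic choice; piping the string literal directly into datamation_sanddance() should behave the same way.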
