Skip to content

Latest commit

 

History

History
102 lines (73 loc) · 6.8 KB

articles.md

File metadata and controls

102 lines (73 loc) · 6.8 KB

Articles & Blogs

Here are a few articles and blogs that relate to the Num command, including Unix command line tools for mathematics, statitics, and data analysis.

Few tools are more indispensable to my work than Unix. Manipulating data into different formats, performing transformations, and conducting exploratory data analysis (EDA) is the lingua franca of data science. The coffers of Unix hold many simple tools, which by themselves are powerful, but when chained together facilitate complex data manipulations. …

By ,

Data science is OSEMN (pronounced as awesome). That is, it involves Obtaining, Scrubbing, Exploring, Modeling, and iNterpreting data.... The tools are: jq, json2csv, csvkit, scrape, xml2json, sample, and Rio.

By ,

I wrote a tool, qstats, that computes simple summary statistics from the command-line. It also includes data-binning and simple bar chart functionality. I designed it, in C, specifically to be as fast as possible, and bare-bones enough to work on any POSIX-compliant system without having to deal with outside dependencies. …

By ,

GNU datamash is a command-line utility that offers simple calculations (e.g. count, sum, min, max, mean, stdev, string coalescing) as well as a rich set of statistical functions, to quickly assess information in textual input files or from a UNIX pipe. .... Let’s calculate the mean, 1st quartile, median, 3rd quarile, IQR, sample-standard-deviation, and p-value of Jarque-Bera test for normal distribution, using some data in file.txt. …

By ,

Jeff Leek, Brian Caffo, and I launched the Johns Hopkins Data Science Specialization on the Coursera platform. This is a sequence of nine courses that intends to provide a “soup-to-nuts” training in data science for people who are highly motivated and have some basic mathematical and computing background. …

By ,

Introduction to Unix command line tools for data processing, with examples of awk, sed, sort, uniq, cut, split, join, cat, conditionals, loops, and references.

By ,


* [Prev Page: Functions: the complete list of all capabilities](functions.md) * [Next Page: FAQ: Frequently Asked Questions](faq.md)