-
Notifications
You must be signed in to change notification settings - Fork 0
Intro to everything
Wiki ▸ Intro to Everything
Introduction to Git and GitHub, and Unix tools
Introduction to HTML, CSS, Javascript and D3.js (and Visualization)
R is a free/open source programming language and software environment for statistical computing and graphics.
R and its libraries implement a wide variety of statistical and graphical techniques, including linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, and others.
The best way to start learning some basics and witness the potentials of R and its ecosystem is to subscribe to few of the Data Science Specialization classes on Coursera (you can still access them for free, but you can always propose to your manager to join the specialization...):
Here are other resource that could be of help:
-
R for Data Science. It covers a lot of material previously scattered in different publications:
-
Tidy data (tidyr): see this (slightly outdated) article.
See also Chapter 9 Data Transformation in the second edition of "ggplot2 - Elegant Graphics for Data Analysis" (Hadley Wickham, the author, posted a pdf version of the book: use you search skills to have a look.) [It looks like there is a github repo of the book]
-
Data manipulations (dplyr): The goal of dplyr is to provide verbs (functions) that help you solve the most common 95% of data manipulation problems.
See also Chapter 10 Data Analysis in the second edition of "ggplot2 - Elegant Graphics for Data Analysis" (Hadley Wickham, the author, posted a pdf version of the book: use you search skills to have a look). [It looks like there is a github repo of the book]
Data Wrangling cheatsheet (PDF).
-
-
Create graphics that help you understand your data (ggplot2):
-
ggplot2 is the R package that you should have a look at.
We, PRU, have a physical copy of the first edition of "ggplot2 - Elegant Graphics for Data Analysis", ask Enrico.
The second edition contains two interesting chapters, Chapter 9 Data Analysis on tidy data via tidyr and Chapter 10 Data Transformation on data wrangling via dplyr.
-
Fundamentals of Data Visualization and its companion github repo where you can see how the plots are done in
ggplot2
-
-
For report, documentation, articles, website production
If you are interested in developing packages
-
R Packages: a book about how to most efficiently write and document R packages.
-
Advanced R: a book about the nitty gritty details of the R system.
We, PRU, have a paper copy of the book, ask Enrico if interested.
RStudio Desktop is an integrated development environment that makes it easy to organise your R work. RStudio, the company, makes a huge contribution to the R ecosystem, see for example their Cheatsheets.
R in the social media: I personally learn a lot from Twitter searching for #rstat
and
following people like @hadleywickham,
@drob, @lenkiefer,
@rdpeng, @JennyBryan,
@xieyihui and many others.
Unix is an example of operating system where automation (via scripting) is the norm.
- Unix Workbench (got it from "Footholds into Research Programming" and there see the very first link to the blog post "Helping my students overcome command-line bullshittery")
- Hacker Tools
We do version control with git
(and possibly on GitHub)
-
Square's Intro to D3.js Briefly touches HTML, CSS, Javascript and D3. Just an high level overview, but useful.
-
Fundamentals for D3: Very good intro.
-
Observable notebooks: massive use of D3 and all other kind of web technology. INCREDIBLY useful.
-
Mozilla Developer Network
- Malcolm Maclean, D3 Tips and Tricks
- Scott Murray, Interactive Data Visualization for the Web
Check Eurocontrol's internal library if not available in PRU.
-
Stephen Few, "Information Dashboard Design: Displaying Data for At-a-Glance Monitoring," Analytics Press; Second Edition (August 15, 2013)
The book is available in our PRU library (ask Enrico)
Read (at least) Chapters 1-6. -
Stephen Few, "Show Me the Numbers: Designing Tables and Graphs to Enlighten," Analytics Press; Second edition (June 1, 2012)
The book is available in our PRU library (ask Enrico) -
Alberto Cairo, "The Functional Art: An introduction to information graphics and visualization," New Riders; 1 edition (September 1, 2012)
The book is available in our PRU library (ask Enrico)
Read (at least) part I and II. -
Alberto Cairo, "The Truthful Art: Data, Charts, and Maps for Communication," New Riders; 1 edition (February 28, 2016)
The book could become available in our PRU library (ask Enrico)
Probably you need to read it all ;-) -
Edward R. Tufte, "The Visual Display of Quantitative Information," Graphics Pr; 2nd edition (January 2001)
The book is available in our PRU library (ask Enrico)
Read (at least) Chapters 2,3,4,5,6 and 8. -
Edward R. Tufte, "Visual Explanations: Images and Quantities, Evidence and Narrative," Graphics Press (February 1997)
-
Edward R. Tufte, "Envisioning Information," Graphics Press (January 1990)
The book is available in our PRU library (ask Enrico)
Read (at least) Chapters 2 and 4.
-
Mike Bostock
- "For Example" + video
- "Let's Make a Map"
- "Visualizing Algorithms" + video
- all his blocks!
-
- fantastic contribution to D3 extended map projections
- special/fancy map projections
-
- his research group, Interactive Data Lab (IDL), at UW
-
- Deceptive Visualizations. What NOT to do.
-
nice visualizations
- Budget Forecast article in NYT and implementation in Vega
- Free Trade Agreements (and its design history)
- The Russia left behind. Nice use of scrolling.
- Adam Pearce's stacked bump charts
- Susie Lu's radial weather plot, see also interactive version from Elijah Meeks