The dplyr package is a popular R package for manipulating and joining datasets. To take this workshop, you need either basic knowledge of R or prior attendance at our Introduction to R for Absolute Beginners workshop. Join JHU Data Services to learn how to use several dplyr functions, including mutate(), filter(), select(), summarize(), and group_by(), to manipulate data in the first half of the workshop. In the second half, you will learn the join functions (e.g. left_join(), right_join(), inner_join(), semi_join(), anti_join(), full_join(), bind_rows(), and bind_cols()) and set operations (e.g. union(), intersect(), and setdiff()) that dplyr provides for combining two datasets. You will have plenty of opportunities for hands-on activities on your laptop using datasets provided by the instructors.
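As a rough preview of both halves of the workshop, the sketch below applies the manipulation verbs, a few joins, and the set operations to two small made-up tables. The students and departments tables, their columns, and their values are hypothetical and invented for illustration; they are not the workshop datasets.

```r
library(dplyr)

# Hypothetical example data (not the workshop datasets)
students <- tibble(
  id    = c(1, 2, 3, 4),
  name  = c("Ana", "Ben", "Cy", "Dee"),
  dept  = c("bio", "bio", "chem", "math"),
  score = c(90, 70, 85, 60)
)
departments <- tibble(
  dept     = c("bio", "chem", "physics"),
  building = c("North Hall", "South Hall", "East Hall")
)

# Part 1: manipulate a single table
summary_by_dept <- students %>%
  mutate(passed = score >= 65) %>%   # add a logical column
  filter(passed) %>%                 # keep rows where passed is TRUE
  select(dept, score) %>%            # keep only the columns we need
  group_by(dept) %>%                 # one group per department
  summarize(mean_score = mean(score), n = n())

# Part 2: combine two tables on the shared "dept" column
left   <- left_join(students, departments, by = "dept")   # all students
inner  <- inner_join(students, departments, by = "dept")  # matches only
full   <- full_join(students, departments, by = "dept")   # all rows from both
no_match <- anti_join(students, departments, by = "dept") # students without a match

# Set operations need two tables with the same columns
high_scores <- filter(students, score >= 70)
bio_or_math <- filter(students, dept %in% c("bio", "math"))

in_both   <- intersect(high_scores, bio_or_math)
in_either <- union(high_scores, bio_or_math)
only_high <- setdiff(high_scores, bio_or_math)
```

Printing each result and comparing row counts (for example, nrow(left) against nrow(inner)) is a quick way to see how the join types differ in the rows they keep.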
Website: dataservices.library.jhu.edu/
Contact us: dataservices@jhu.edu
JHU Data Services, part of the Johns Hopkins University Sheridan Libraries, helps the JHU community find, use, visualize, manage, and share data. We offer live webinars and self-paced online trainings on computational research and coding, GIS, data management, data visualization, and more. See all of our training topics on our website.
This repository contains materials for one of our live webinars open to JHU students, faculty, and staff. Please contact us with any questions.
As of March 2020, Data Services workshops are being held virtually on Zoom. See our calendar to register for upcoming workshops.
Before the class, make sure you have a recent version of R and RStudio installed: https://posit.co/download/rstudio-desktop/
Please also install the tidyverse package with the R command install.packages("tidyverse") or using the package manager.
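A minimal sketch of this setup step, run from the R console: it installs the tidyverse only if it is missing, then loads dplyr to confirm the installation worked. The conditional check is a convenience added here and is not part of the workshop instructions.

```r
# Install the tidyverse only when it is not already available
if (!requireNamespace("tidyverse", quietly = TRUE)) {
  install.packages("tidyverse")
}

# dplyr is installed as part of the tidyverse; load it and record its version
library(dplyr)
dplyr_version <- packageVersion("dplyr")
print(dplyr_version)
```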
- All materials for this workshop can be found at https://jhu-data-services.github.io/dplyr-quarto-site/
- Access the presentation slides at https://jhu-data-services.github.io/dplyr-quarto-site/Manipulating%20and%20Joining%20Data%20in%20R%20with%20dplyr.html
- Access the script we developed in class at https://jhu-data-services.github.io/dplyr-quarto-site/workshop-script.html
If you have taken the live webinar for this class, please take this survey: Reshaping and Joining Data in R with dplyr Survey
The presentation materials are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0), attributable to Data Services, Johns Hopkins University.
See LICENSE file for additional code licensing and re-use information.
The images, external resources, and cheatsheets linked in this repository may have other licenses and terms of use.
Please cite this material as:
Johns Hopkins University Data Services. March 11 2025. Reshaping and Joining Data in R with dplyr. https://github.com/jhu-data-services/.dplyr-quarto-site