13 April 2018
9:30 am - 1:00 pm
Instructors: Ben Marwick (University of Washington)
Helpers: Matt Harris (AECOM), Jon Clindaniel (Harvard University)
This workshop is for any archaeologist who has data they want to analyze, and no prior computational experience is required. This hands-on workshop teaches basic concepts, skills and tools for working more effectively with data. It is loosly based on the Data Carpentry curriculum (which is highly suitable for self-guided learning).
If you want to do data analysis and visualization more efficiently and with less pain, R can help. R is also excellent for reproducible research, a cornerstone of scientific archaeology. Writing scripts in R makes it easy to keep track of your work, easy to redo analyses on new data, and easy to share your work with others. We will introduce common archaeological data analysis and visualisation tasks using modern and widely used tools in the R programming language. Participants should bring their laptops and plan to participate actively. By the end of the workshop learners should be able to more effectively manage and analyze data and be able to apply the tools and approaches directly to their ongoing research.
Who: The course is aimed at archaeologists at all career stages.
The course is intended for archaeologists with no prior experience with R, statistics, or any other programming language.
Some basic familiarity with spreadsheets would be helpful.
Where: 10th Street and Constitution Avenue, NW Washington, DC. Get directions with OpenStreetMap or Google Maps. Our workshop will be held in the NMNH - SIL Training Room - CE107. This is a secure building, and we will issue visitor name badges at the door at 9:15 am so you can get in and out during the workshop. Please bring a print-out of your eventbrite ticket to ensure you can get into the workshop. Drinks are allowed, but not food (small snacks are ok).
Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating sytem (not a tablet, Chromebook, etc.) that they have administrative privileges on. You should have a few specific software packages installed (listed below). You are also required to abide by our Code of Conduct.
Contact: Please mail bmarwick@uw.edu for more information.
Start time | End time | Topic |
---|---|---|
9:30 | 9:45 | Introduction to R and RStudio |
9:45 | 10:00 | Importing, inspecting & cleaning data from spreadsheets |
10:00 | 10:15 | Exploratory Data Analysis with dplyr & tidyr |
10:15 | 10:30 | Exploratory Data Analysis with dplyr & tidyr |
10:30 | 10:45 | Visualising data with ggplot & plotly |
10:45 | 11:00 | Visualising data with ggplot & plotly |
11:00 | 11:15 | Break |
11:15 | 11:30 | Hypothesis testing: two samples |
11:30 | 11:45 | Hypothesis testing: more than two samples |
11:45 | 12:00 | Hypothesis testing: count data |
12:00 | 12:15 | Break |
12:15 | 12:30 | Importing & mapping GIS data with sf & google maps |
12:30 | 12:45 | Spatial data analysis: spatial joins |
12:45 | 1:00 | Spatial data analysis: point pattern analysis |
To participate in this workshop, you will need working copies of the R and RStudio. Please make sure to install everything (or at least to download the installers) before the start of your workshop. If you have previously installed these programs, please download and install the most recent versions (your version may be outdated and not work with the activities in this workshop). Participants should bring and use their own laptops to insure the proper setup of tools for an efficient workflow once you leave the workshop. If you have problems or questions, please send us an email at bmarwick@uw.edu .
Please follow these Setup Instructions, see the section for your operating system for those directions.
Please go through all the installation steps below and make sure that you not only installed them, but start them up to make sure they're working. If you have any problems, don't hesitate to email the instructors to ask for help, or arrive early on the first day of the workshop to get help.
Please go through all the installation steps below and make sure that you not only installed them, but start them up to make sure they're working. If you have any problems, don't hesitate to email the instructors to ask for help, or arrive early on the first day of the workshop to get help.
Please go through all the installation steps below and make sure that you not only installed them, but start them up to make sure they're working. If you have any problems, don't hesitate to email the instructors to ask for help, or arrive early on the first day of the workshop to get help.
sudo apt-get install
libreoffice
for Ubuntu and other Debian-based distributions).
sudo
apt-get install r-base
, and for Fedora run sudo yum install
R
) but make sure that you have at least R 3.2.2 (as pre-packaged
versions might be out of date).
sudo dpkg
-i rstudio-x.yy.zzz-amd64.deb
at the terminal).
We maintain a list of common issues that occur during installation as a reference for instructors that may be useful on the Configuration Problems and Solutions wiki page.
Behaviour not explicitly mentioned above may still constitute harassment. The list above should not be taken as exhaustive but rather as a guide to make it easier to enrich all of us and the communities in which we participate. All interactions should be professional regardless of location: harassment is prohibited whether it occurs on- or offline, and the same standards apply to both.
Enforcement of the Code of Conduct will be respectful and not include any harassing behaviors.
Thank you for helping make this a welcoming, friendly community for all.
This code of conduct is an adaptation of the one used by the Software Carpentry Foundation and is a modified version of that used by PyCon, which in turn is forked from a template written by the Ada Initiative and hosted on the Geek Feminism Wiki. Contributors to this document: Adam Obeng, Aleksandra Pawlik, Bill Mills, Carol Willing, Erin Becker, Hilmar Lapp, Kara Woo, Karin Lagesen, Pauline Barmby, Sheila Miguez, Simon Waldman, Tracy Teal.
Matt Harris is the Director of GIS, Data Analysis, & Geoarchaeology in the Cultural Resources Deptartment of AECOM. He is an advanced R user, with a focus on spatial analysis and simulation. He documents many of his explorations using R on his blog. Matt is a member of the SAA Open Science Interest Group, and has previously instructed R to archaeologists via the SAA Online workshops and in person.
Jon Clindaniel is a Ph.D. candidate in Anthropology at Harvard University (Expected 2019). For his dissertation, Jon uses a combination of archaeological excavation and computational techniques to decipher Inka khipu signs. He uses R for advanced statistical analyses.