SDS 192: Intro to Data Science

Author

Nicholas Schwab

Published

April 4, 2025

Schedule

Day 27 Lab 6

Lab 6

Today: Bonus office hour at 2:30-3:30.

Day 26: No class

Day 25: Iteration

Lecture: across() and map()

In class activity

Anonymous functions

Day 24 Functions and The Tidyverse

No class next Wednesday April 2, finish up your projects.

Data Masking

Practice function writing

More on masking

Day 22/23: Project 2 workday

Thursday Presentation of Major

Short kable() demo

Project 1 grades are up and current. Please let me know if you cannot see grades on moodle.

Project 1 notes

Day 21 lab 5

Intro to Project 2

  • Project 2 groups on Moodle

  • Monday and Wednesday after break are dedicated project 2 days

  • More on git work flow. (Issues)

  • Due: Friday: April 4th.

Lab 5

Day 20 Pivots

Project 1 notes

Pivots

We’ll do Problems Chapter 6: 5 a-c, in class today. Chapter 5: 3,4

Day 19 Joining tables

Reflection Lab 4

Quiz 2 will open in Wednesday and be due the Friday after break.

Exploration: How to join tables in R?

These data are from dplyr

band_members

band_band_instruments

Joining two tables in class exercise

Problems 1

Day 17/18 Lab 4

Section 1: The difference between %in% and ==

Intro to Lab 4

Lab 4 SQF

Day 16/17 Wrangling on one table

(Section 1 is one day below)

Discuss the difference between %in% and ==

Wrangle babynames

Code
install.packages("babynames")

solutions to wrangle babynames

Day 15/16 Wrangling on one table.

(Section 1 is one day below)

Lecture on the five verbs

dpylr cheatsheet for wrangling

Day 14/15 Ethics

(Section 1 is one day below)

Running chunk tips

Running all the above

Data Ethics

Day 13/14 Project 1 work day

(Section 1 is on day 12)

Push all files to github.

grader’s notes in grade.qmd.

Last day to work on Project 1 in class, due in a week.

Day 12/13 Project 1 work day

(Section 1 is on day 10)

Quiz 1 due tonight (both sections).

SAL offering workshops

Teach yourself GIS: Resource and data walk-through

February 27th, 4:15-5:15pm, in the SAL

Interested in teaching yourself some GIS or building on your existing GIS skills? This workshop will walk participants through various tutorials, online short courses, and major data sources, enabling you to find the resources you need. Participants will also create a map in ArcGIS Online from a Map in a Minute tutorial.

Mapping political data with Flourish

March 6th, 4:15-5:15pm, in the SAL

Flourish is a web-based tool for mapping and data visualization. Using data about contemporary politics, learn how to bring in data, customize how it is displayed, and export it to Canva.

Georeferencing historic maps

March 13th, 4:15-5:15pm, in the SAL

Learn how to create digital maps from paper ones through a process called georeferencing. Participants will explore historic maps of Massachusetts in Atlascope and gain practice georeferencing historic maps of Northampton using the AllMaps platform.

Working on projects together.
  • You might realize you are sharing an API key. Consider what that means.

  • Consider code layout.

  • Remember to comment your code.

  • Follow tidyverse style guide.

  • Practice a pull request today.

  • Make your graph legible to the lay person (title, labels, source, captions, etc).

Day 11/12 Lab 3: NYTBooks

Intro to paste()

lab 3

Day 10/11 Practice with git/ Project 1

(Section 1 is still on day 10)

We’ll practice with git in our projects.

Another git lecture

Project 1

See moodle for project groups.

Day 10 Color theory

Figure 2.26 from Modern Dive

Some time to finish up lab 2.

  • All labs must be rendered as a pdf or self-contained html.

Day 9 Lab 2

Quiz 1 is open. It is due in two Mondays.

5 minutes about colors via the cheat sheet.

This lab is due Wednesday at Midnight.

Lab 2

We will discuss colors on Monday when you get to them in ex 6:

You can reread this section if you forget what sequential,diverging,qualitative mean.

Or check out the colors examples on the cheat sheet.

Intuitive Color Examples

Day 8 Taxonomy

Help us help you: take screenshots of your screen and post them to Slack.

  • macOS: Command+Shift+3 to take shot of whole screen, Command+Shift+4 to take subportion

  • Windows: Shift-Windows Key-S and Snip & Sketch

  • Linux: Print screen

  • Chromebook: Shift + Ctrl + Show windows

library(googlesheets4)

#googlesheets4::gs4_auth()
#googlesheets4::deauth()
  
class_info <- read_sheet("https://docs.google.com/spreadsheets/d/1k9qA7XldtHB8-FZjmEymoUoeZJArS4tGy5gyVsvyLB0/edit?usp=sharing") |>
  janitor::clean_names()

If time:

Day 7 Intro to Data Viz

Reflect on the functions we made, taking note of style.

Notes on Style

Data Viz No Computers

Day 6 Making Functions (cont.)

Download this function practice.

Solutions will be posted later in the class.

Solutions

Day 5 Intro to Functions

Data Fest Info Session

Datafest is April 4-6 at UMass

Check lab 1 is successfully submitted on git.

Intro to functions via first_map.qmd (see maps folder in lab 1)

  • Pushing, commiting and pulling from git

    • Students Errors and warnings and comments
  • Strings and Vectors

  • Libraries, functions, arguments and parameters

  • Introduction to the pipe

  • Getting data into R

Lecture: Intro to functions

If time: Anonymous functions

Day 4 File Struture/ Final Project

Where are we going?

  • Final project Website
  • Understand File Structure

    • Filing system

Day 3 R Ecosystem and More SQL

  • Section 1 - about me

  • Section 2 - variable types

sql-practice

  • Problems 1-4 in pairs

  • Problem 5 together

  • Problems 6-9 in pairs

Try without hints, then hints, finally solutions.

  • Open Lab 1

Day 2 Github Lab

Why do we do use git? Collaboration

You are only installing git today. There are other files, Ignore the other files until later

Link github account to github classroom - sign into github

  • prereq -> You need to have a github account

  • Click here for lab 1. This is due in one week.

  • very carefully find your name in the roster and click it.

  • you should see todays lab in what is called a repository.

Day 1 Welcome and SQL

Walk to MCConnell 215 and 214

My office and group study room.