Course Overview

CSI-MTH-190

Schwab

Introduction to Programming Language for Data Science (R) and Computing Environment (RStudio)

Data Science Life Cycle

DS life cycle

Grammar of Graphics

Hadley Wickham

Data Collection

Collect student data here

https://forms.gle/uRxF8UTkgp45ofY46

You must be signed into your HCC email account to fill out form.

Web Scraping and Importing Data ( and a tiny bit of cleaning)

library(googlesheets4)
library(janitor)

student_data <- read_sheet('https://docs.google.com/spreadsheets/d/1aNrNX5xWkrtuJHc-bsCC7e07KbnRbVIxaWdbblOCyxA/edit?usp=sharing')

student_data <- clean_names(student_data)

Data Visualization

# make simple plot of student data. 
library(tidyverse)

student_data |>
  ggplot(mapping = aes(x = what_is_your_height_in_inches) )+
  geom_histogram()+
  labs(
    title = "Students' height in inches"
  )

Data Wrangling

# wrangle student data. edit plot
student_data |>
  
  select(do_you_have_experience_with_the_statistical_computer_language_r) |>
  
  rename(r_experiance = do_you_have_experience_with_the_statistical_computer_language_r )|>
  
  group_by(r_experiance)|>
  
  summarize(
    
    sum()/n()
    
  )
# A tibble: 3 × 2
  r_experiance                                           `sum()/n()`
  <chr>                                                        <dbl>
1 No                                                               0
2 No but I have experience with other computer software.           0
3 Yes                                                              0

Data Management

-   Security
-   Keeping yourself organized!
-   How Data is housed

Data Ethics

How we collect and use data without doing harm.

Introduction to Database Querying with SQL

Wrangle some data with the classic SQL.

Simple Query?

Further Topics based on time and student interest may include:

-   Dynamic and Customized Data Graphics

-   Geospatial Data and Maps

-   Learning with AI

Optional and Advnaced Topics:

-   Function writing and data masking in R

-   Iteration