1B013: Data Analysis Tools and Practice (Spring 2021)


Time: Tuesday 14:00-17:00

Location: 3202, Teaching Building, DaXing

Class web site: https://huipingsun.github.io/da2021

Class live: No

Instructor: Huiping Sun (sunhp(at)ss.pku.edu.cn)

TA: Meilin Lv

Course Description

This course will introduces the feature of data analysis with R and RStudio as well as common methods of data analysis based on combination of essential operation experiments and specific case experiments. It covers R and RStudio software, R programming language, R basic graphics methods, data management, data visualization with ggplot2, statical modeling and procedures, and others useful R packages. This course helps students improve the ability of conducting data analysis in related fields independently and finishing analysis report.

Course Textbook

Course Schedule

Date

Topics

Readings

March 9

00. Course Overview [Slides]

No

March 9

01. R Overview [Slides]

March 15

02. R Object I [Slides]

March 16

03. R Object II [Slides]

March 16

04. R Language [Slides]

March 23

05. R Graphics I [Slides]

  • Textbook. Charpter 3: Getting Started with Graphs.
  • Textbook. Charpter 6: Basic Graphs (6.1-6.3).
  • Hrishi V. Mittal. R Graphs Cookbook. Charpter 1,2,3,4,5,6,7. PACKET. 2011
  • DataCamp Course: Data Visualization in R.[Homework]

March 29

06. R Graphics II [Slides]

March 30

07. Data & Graphics WrapUp [Slides]

April 06

08. ggplot2 I [Slides]

April 12

09. ggplot2 II [Slides]

April 13

10. R Graphics Cookbook [Slides]

April 19

11. Exercise & Quiz WrapUp-Online[Slides]

April 20

12. R Package:

[dplyr, tidyr, tidymodels, tidyquant, tidytext]

[jiebaR&wordcloud2, stringr&lubridate, svm&xgboost

No

May 04

13. R Statistics [Slides]

  • Textbook. Charpter 7: Basic Statistics.
  • Textbook. Charpter 8: Regression.
  • Textbook. Charpter 14: Principal Components and Factor Analysis.

May 11

14. Credit Scoring I [Slides]

May 18

15. Credit Scoring II [Slides]

May 25

16. R Data Science I [Slides]

June 01

17. RiA Other [Slides]

  • Textbook. Charpter 9: Analysis of Variance.
  • Textbook. Charpter 18: Advanced Methods for Missing Data.

June 08

18. R Data Science II [Slides]

June 15

19. Course Summary [Slides]

No

June 22

20. Project Presentations [Slides]

TBD

Course Grading

The grading scheme is as follows:

Course Projects

Course References

Course History