References
These are the course notes for the 2023 version of Fundamentals of Data Science
(MA7419 / MA3419)
(MA7419 / MA3419)
Chang, Winston. 2020. R Graphics Cookbook. O’Reilly Media. https://r-graphics.org/.
Friedl, Jeffrey E. F. 2006. Mastering Regular Expressions. 3rd
ed.. Sebastapol, Calif.: O’Reilly.
IFoA, and RSS. 2019. “A Guide to Ethical Data Science.”
Institute; Faculty of Actuaries; Royal Statistical Society. https://www.actuaries.org.uk/system/files/field/document/An%20Ethical%20Charter%20for%20Date%20Science%20WEB%20FINAL.PDF.
Jonge, E. de, and M. van der Loo. 2013. “An Introduction to Data
Cleaning with r.” Discussion Paper / Statistics Netherlands.
Statistics Netherlands. https://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo-Introduction_to_data_cleaning_with_R.pdf.
Julia, PhD Silge, and PhD Robinson David. 2017. Text Mining with r:
A Tidy Approach. O’Reilly Media. https://www.tidytextmining.com/index.html.
Loo, Mark P. J. van der, and Edwin de Jonge. 2021. “Data
Validation Infrastructure for r.” Journal of Statistical
Software 97 (10): 1–31. https://doi.org/10.18637/jss.v097.i10.
Müller, Kirill, Hadley Wickham, David A. James, and Seth Falcon. 2023.
RSQLite: ’SQLite’ Interface for r. https://CRAN.R-project.org/package=RSQLite.
Nielsen, F. Å. 2011. “AFINN.” Richard Petersens Plads,
Building 321, DK-2800 Kgs. Lyngby: Informatics;
Mathematical Modelling, Technical University of Denmark. http://www2.compute.dtu.dk/pubdb/pubs/6010-full.html.
Odell, Evan. 2018. “nomisr: Access
Nomis UK Labour Market Data with r.” The Journal of Open
Source Software 3 (27): 859. https://doi.org/10.21105/joss.00859.
Peng, Roger D. 2020. R Programming for Data Science.
Morrisville: Lulu.com. https://bookdown.org/rdpeng/rprogdatascience/.
Peng, Roger D. 2019. Report Writing for Data Science in r.
British Columbia, Canada: Leanpub. https://leanpub.com/reportwriting.
Posit team. 2023. RStudio: Integrated Development Environment for
r. Boston, MA: Posit Software, PBC. http://www.posit.co/.
R Core Team. 2022. R: A Language and Environment for Statistical
Computing. Vienna, Austria: R Foundation for Statistical Computing.
https://www.R-project.org/.
Robin Lovelace, Jannes Muenchow, Jakub Nowosad. 2020. Geocomputation
with r. CRC Press. https://geocompr.robinlovelace.net/.
Wickham, Hadley. 2016. Ggplot2: Elegant Graphics for Data
Analysis. Springer-Verlag New York. https://ggplot2.tidyverse.org.
Wickham, Hadley, Mara Averick, Jennifer Bryan, Winston Chang, Lucy
D’Agostino McGowan, Romain François, Garrett Grolemund, et al. 2019.
“Welcome to the tidyverse.”
Journal of Open Source Software 4 (43): 1686. https://doi.org/10.21105/joss.01686.
Wickham, Hadley, Romain François, Lionel Henry, Kirill Müller, and Davis
Vaughan. 2023. Dplyr: A Grammar of Data Manipulation. https://CRAN.R-project.org/package=dplyr.
Wickham, Hadley, and Garrett Grolemund. 2017. R for Data Science:
Import, Tidy, Transform, Visualize, and Model Data. O’Reilly Media.
http://r4ds.had.co.nz/.
Xie, Yihui, J. J. Allaire, and Garrett Grolemund. 2018. R Markdown:
The Definitive Guide. Boca Raton, Florida: Chapman; Hall/CRC. https://bookdown.org/yihui/rmarkdown.