Skip to content
@anthology-of-data-science

Anthology of Data Science

An anthology of open access data science materials

An anthology of open access data science materials

Machine learning is permeating many fields of work. As a new ‘system technology’1, its impact on organizations and society is expected to be of the same magnitude as that of the steam engine or electricity. As such, more and more professionals are seeking to acquire the necessary understanding and skills to apply machine learning in their day-to-day work. Hence more people without a background in either computer science or statistics - let alone both - have a need for high-quality, open access content to explore and learn data science by themselves.

Now there is a lot of machine learning learning materials out there, so why this anthology? Based on my experience in teaching professional education course on data & AI, I am continouly challenged to:

  • curate content for different professional learning paths, combining various existing open access materials that can be readily shared and thus contribute to the democratization of know-how in this field of work;
  • finding a balance between too technical vs. too vague, handwaving or even downright wrong;
  • take a hands-on, problem-based approach. Rather than, say, explaining the principles that underlie regularization, we choose to demonstrate these principles using the simplest algorithms. With a little math, everyone should be able to understand how LASSO performs regularisation for regression models. With this intuitive understanding, you can move on to more complex algorithms and applications, and reason where and how to use regularisation.

Copyright notice

All the work in this GitHub organization is licensed under CC BY-SA 4.0

Footnotes

  1. Sheikh et al. (2023), Mission AI: the New System Technology, https://doi.org/10.1007/978-3-031-21448-6.

Pinned Loading

  1. lecture-composable-data-stack lecture-composable-data-stack Public

    Slides for a 3-hour lecture on modern data engineering

    JavaScript 1

  2. pydata-book pydata-book Public

    Forked from wesm/pydata-book

    Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

    Jupyter Notebook

  3. anthology-of-data-science.github.io anthology-of-data-science.github.io Public

    Source code of https://anthology-of-data.science

    Jupyter Notebook

  4. visualization-curriculum visualization-curriculum Public

    Forked from uwdata/visualization-curriculum

    A data visualization curriculum of interactive notebooks.

    Jupyter Notebook

  5. udlbook udlbook Public

    Forked from udlbook/udlbook

    Understanding Deep Learning - Simon J.D. Prince

    Jupyter Notebook

Repositories

Showing 10 of 20 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…