Skip to content
@anthology-of-data-science

Anthology of Data Science

An anthology of open access data science materials

An anthology of open access data science materials

Machine learning is permeating many fields of work. As a new ‘system technology’1, its impact on organizations and society is expected to be of the same magnitude as that of the steam engine or electricity. As such, more and more professionals are seeking to acquire the necessary understanding and skills to apply machine learning in their day-to-day work. Hence more people without a background in either computer science or statistics - let alone both - have a need for high-quality, open access content to explore and learn data science by themselves.

Now there is a lot of machine learning learning materials out there, so why this anthology? Based on my experience in teaching professional education course on data & AI, I am continouly challenged to:

  • curate content for different professional learning paths, combining various existing open access materials that can be readily shared and thus contribute to the democratization of know-how in this field of work;
  • finding a balance between too technical vs. too vague, handwaving or even downright wrong;
  • take a hands-on, problem-based approach. Rather than, say, explaining the principles that underlie regularization, we choose to demonstrate these principles using the simplest algorithms. With a little math, everyone should be able to understand how LASSO performs regularisation for regression models. With this intuitive understanding, you can move on to more complex algorithms and applications, and reason where and how to use regularisation.

Copyright notice

All the work in this GitHub organization is licensed under CC BY-SA 4.0

Footnotes

  1. Sheikh et al. (2023), Mission AI: the New System Technology, https://doi.org/10.1007/978-3-031-21448-6.

Pinned Loading

  1. lecture-composable-data-stack lecture-composable-data-stack Public

    Slides for a 3-hour lecture on modern data engineering

    JavaScript 1

  2. pydata-book pydata-book Public

    Forked from wesm/pydata-book

    Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

    Jupyter Notebook

  3. interpretable-ml-book interpretable-ml-book Public

    Forked from christophM/interpretable-ml-book

    Book about interpretable machine learning

    Jupyter Notebook

  4. anthology-of-data-science.github.io anthology-of-data-science.github.io Public

    Source code of https://anthology-of-data.science

    Jupyter Notebook

  5. visualization-curriculum visualization-curriculum Public

    Forked from uwdata/visualization-curriculum

    A data visualization curriculum of interactive notebooks.

    Jupyter Notebook

  6. udlbook udlbook Public

    Forked from udlbook/udlbook

    Understanding Deep Learning - Simon J.D. Prince

    Jupyter Notebook

Repositories

Showing 10 of 16 repositories
  • anthology-of-data-science/anthology-of-data-science.github.io’s past year of commit activity
    Jupyter Notebook 0 0 0 0 Updated Dec 5, 2024
  • lecture-gam-ebm Public

    Lecture notes from 1-hour workshop on General Additive Models and Explainable Boosting Machines

    anthology-of-data-science/lecture-gam-ebm’s past year of commit activity
    SCSS 0 0 0 0 Updated Sep 10, 2024
  • template-quarto-reveal Public template
    anthology-of-data-science/template-quarto-reveal’s past year of commit activity
    SCSS 0 0 0 0 Updated Sep 9, 2024
  • .github Public
    anthology-of-data-science/.github’s past year of commit activity
    0 0 0 0 Updated Sep 8, 2024
  • lecture-composable-data-stack Public

    Slides for a 3-hour lecture on modern data engineering

    anthology-of-data-science/lecture-composable-data-stack’s past year of commit activity
    JavaScript 0 MIT 1 0 0 Updated Sep 8, 2024
  • ISLP_labs Public Forked from intro-stat-learning/ISLP_labs

    Up-to-date version of labs for ISLP

    anthology-of-data-science/ISLP_labs’s past year of commit activity
    Jupyter Notebook 0 BSD-2-Clause 460 0 0 Updated Jul 16, 2024
  • ibis-analytics Public Forked from ibis-project/ibis-analytics

    Ibis analytics, with Ibis (and more!)

    anthology-of-data-science/ibis-analytics’s past year of commit activity
    Python 0 MIT 8 0 0 Updated Jun 26, 2024
  • quarto-revealjs-clean Public Forked from grantmcdermott/quarto-revealjs-clean

    A minimalist and elegant presentation theme for Quarto Reveal.js

    anthology-of-data-science/quarto-revealjs-clean’s past year of commit activity
    HTML 0 MIT 46 0 0 Updated Jun 16, 2024
  • timeseries-forecasting Public

    Redirect to Forecasting: Principles and Practice (3rd edition)

    anthology-of-data-science/timeseries-forecasting’s past year of commit activity
    HTML 0 0 0 0 Updated Apr 11, 2024
  • pydata-book Public Forked from wesm/pydata-book

    Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

    anthology-of-data-science/pydata-book’s past year of commit activity
    Jupyter Notebook 0 15,332 0 0 Updated Apr 11, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…