{"id":26447747,"url":"https://github.com/diestok/basic-machine-learning-for-bioinformatics","last_synced_at":"2026-05-20T06:05:36.223Z","repository":{"id":195621046,"uuid":"417821915","full_name":"DieStok/Basic-Machine-Learning-for-Bioinformatics","owner":"DieStok","description":"ML course materials for bioinformatics students following the Basic Machine Learning for Bioinformatics course at Utrecht University. Course created and taught by Dieter Stoker.","archived":false,"fork":false,"pushed_at":"2023-02-27T09:16:31.000Z","size":91377,"stargazers_count":1,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"main","last_synced_at":"2023-09-19T00:43:32.884Z","etag":null,"topics":["course","jupyter-notebook","machine-learning","python","supervised-machine-learning","unsupervised-machine-learning"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/DieStok.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2021-10-16T12:30:14.000Z","updated_at":"2023-09-19T00:43:35.589Z","dependencies_parsed_at":"2023-09-19T00:53:38.388Z","dependency_job_id":null,"html_url":"https://github.com/DieStok/Basic-Machine-Learning-for-Bioinformatics","commit_stats":null,"previous_names":["diestok/basic-machine-learning-for-bioinformatics"],"tags_count":null,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DieStok%2FBasic-Machine-Learning-for-Bioinformatics","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DieStok%2FBasic-Machine-Learning-for-Bioinformatics/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DieStok%2FBasic-Machine-Learning-for-Bioinformatics/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DieStok%2FBasic-Machine-Learning-for-Bioinformatics/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/DieStok","download_url":"https://codeload.github.com/DieStok/Basic-Machine-Learning-for-Bioinformatics/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244236059,"owners_count":20420753,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["course","jupyter-notebook","machine-learning","python","supervised-machine-learning","unsupervised-machine-learning"],"created_at":"2025-03-18T13:58:16.370Z","updated_at":"2026-05-20T06:05:36.188Z","avatar_url":"https://github.com/DieStok.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Basic-Machine-Learning-for-Bioinformatics\nML course materials for bioinformatics students following the course Basic Machine Learning for Bioinformatics at UU. \u003cbr\u003e\n\n# Topics covered per day (lectures and practicals)\n* Day 1: Linear regression, gradient descent, introduction to linear algebra\n* Day 2: Logistic regression, regularisation, ROC curve, introduction to neural networks (NNs)\n* Day 3: NN Backpropagation algorithm, convolutional neural networks explained, guest speaker on deep learning in Oxford Nanopore sequencing (13:15-14:05)\n* Day 4: K-means clustering, hierarchical clustering, deep dive into phylogenetics\n* Day 5: Problems with high-dimensional data, Principal Component Analysis (PCA)\n* Day 6: Working with scikit-learn, introduction to Keras and TensorFlow, project introduction and start\n\n# Dependencies and running the practicals.\nThe material assumes a local installation of Anaconda, including the packages `numpy`, `scipy`, `pandas`, `sklearn`, `biopython`, `pandas-plink`, `tensorflow`, `notebook`, `matplotlib`, and `seaborn`.\n\n# More information\nFor more information and resources, read the [course reader](CourseReaderMLBasics2021_Final.pdf).\n\n# Words of thanks\nGreatly inspired by/based on [Andrew Ng's course on Coursera](https://www.coursera.org/learn/machine-learning/home/welcome). The PCA part is based on [Prof. Victor Lavrenko's excellent lecture series](https://www.youtube.com/watch?v=IbE0tbjy6JQ\u0026list=PLBv09BD7ez_5_yapAg86Od6JeeypkS4YM). Many thanks are owed to [Dr. Jeroen de Ridder](https://www.umcutrecht.nl/en/research/researchers/de-ridder-jeroen-j) for expert assistance. I thank [Dr. ir. Bas van Breukelen](https://www.uu.nl/staff/BvanBreukelen) for long-term assistance and [Prof. Dr. Berend Snel](https://tbb.bio.uu.nl/snel/group.html) for comments on the phylogenetics part. Any errors remain my own (and, with your help, will hopefully be noticed and rectified soon).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdiestok%2Fbasic-machine-learning-for-bioinformatics","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdiestok%2Fbasic-machine-learning-for-bioinformatics","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdiestok%2Fbasic-machine-learning-for-bioinformatics/lists"}