{"id":19554628,"url":"https://github.com/m4tx/masters-thesis","last_synced_at":"2025-07-19T19:34:16.030Z","repository":{"id":181041947,"uuid":"532071141","full_name":"m4tx/masters-thesis","owner":"m4tx","description":"Implementation of Context Binning and Model Clustering for Compression of Genetic Data","archived":false,"fork":false,"pushed_at":"2023-07-13T19:12:05.000Z","size":62,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-02-26T07:34:15.939Z","etag":null,"topics":["compression","genetic-data","latex","thesis"],"latest_commit_sha":null,"homepage":"","language":"TeX","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"cc-by-sa-4.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/m4tx.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2022-09-02T20:33:03.000Z","updated_at":"2023-07-13T20:12:19.000Z","dependencies_parsed_at":null,"dependency_job_id":"c1115cc5-b740-482a-8617-0a2f7e75c7f1","html_url":"https://github.com/m4tx/masters-thesis","commit_stats":null,"previous_names":["m4tx/masters-thesis"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/m4tx/masters-thesis","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/m4tx%2Fmasters-thesis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/m4tx%2Fmasters-thesis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/m4tx%2Fmasters-thesis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/m4tx%2Fmasters-thesis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/m4tx","download_url":"https://codeload.github.com/m4tx/masters-thesis/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/m4tx%2Fmasters-thesis/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265998744,"owners_count":23862176,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["compression","genetic-data","latex","thesis"],"created_at":"2024-11-11T04:28:22.690Z","updated_at":"2025-07-19T19:34:16.009Z","avatar_url":"https://github.com/m4tx.png","language":"TeX","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Implementation of Context Binning and Model Clustering for Compression of Genetic Data\n\nMy master's thesis written as part of the computer science course at Jagiellonian University.\n\n## Abstract\n\nIn recent years, there happened a gigantic leap in the speed of DNA sequencing\nmethods, which allowed us to sequence DNAs of complex organisms, such as humans,\nquickly.  However, this leads to increasing demand for disk storage, as the\nsizes of the databases containing such data can easily reach dozens of\nterabytes. In his article \"Context binning, model clustering and adaptivity\nfor data compression of genetic data\", Jarek Duda proposes promising compression\ntechniques that should help build a compressor better than the current state of\nthe art. This thesis describes the compressor built to evaluate those\ntechniques, tests it with real-world data and compares it to other genetic data\ncompression tools.\n\n## Download\n\nThe PDF file can be downloaded from the\n[GitHub Releases page](https://github.com/m4tx/masters-thesis/releases/download/final/Implementation_of_Context_Binning_and_Model_Clustering_for_Compression_of_Genetic_Data.pdf).\n\n## Building\n\nMake sure you have Inkscape and a distribution of LaTeX installed in your\nsystem.\n\n```bash\nmake\n```\n\n## License\nThis work is licensed under a\n[Creative Commons Attribution-ShareAlike 4.0 International License](http://creativecommons.org/licenses/by-sa/4.0/).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fm4tx%2Fmasters-thesis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fm4tx%2Fmasters-thesis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fm4tx%2Fmasters-thesis/lists"}