{"id":20155837,"url":"https://github.com/smoothex/log-based-failure-localization","last_synced_at":"2026-06-01T01:31:05.386Z","repository":{"id":153134212,"uuid":"517777289","full_name":"Smoothex/log-based-failure-localization","owner":"Smoothex","description":"Log-based failure localization in distributed systems - a case study for Apache Cassandra","archived":false,"fork":false,"pushed_at":"2022-12-07T16:34:22.000Z","size":76,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-12-02T10:55:22.887Z","etag":null,"topics":["anomaly-detection","failure-detection","log-analysis","log-parsing","log-template","logs"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Smoothex.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-07-25T18:25:59.000Z","updated_at":"2022-12-03T10:21:01.000Z","dependencies_parsed_at":null,"dependency_job_id":"e8b07863-7114-4d83-9cd2-30881ef0de67","html_url":"https://github.com/Smoothex/log-based-failure-localization","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Smoothex/log-based-failure-localization","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Smoothex%2Flog-based-failure-localization","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Smoothex%2Flog-based-failure-localization/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Smoothex%2Flog-based-failure-localization/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Smoothex%2Flog-based-failure-localization/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Smoothex","download_url":"https://codeload.github.com/Smoothex/log-based-failure-localization/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Smoothex%2Flog-based-failure-localization/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33756575,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-05-31T02:00:06.040Z","response_time":95,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["anomaly-detection","failure-detection","log-analysis","log-parsing","log-template","logs"],"created_at":"2024-11-13T23:36:05.257Z","updated_at":"2026-06-01T01:31:05.366Z","avatar_url":"https://github.com/Smoothex.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Log-based failure localization in distributed systems - A Case Study for Apache Cassandra\n\nThis work proposes a workflow for processing log files, which helps identify errors and presents a clear difference between correct and incorrect execution.\n\n![image](https://user-images.githubusercontent.com/79105432/206227999-cfaea3c6-b9f8-4851-ac9b-9777facf5d82.png)\n\nThe proposed overview includes:\n* generation of the log files (using the Python reproduction scripts)\n* parsing the log files (using `templater.py`)\n* extracting the differences between the parsed log files (using `difference.py`)\n\n## Prerequisites\n* For CASSANDRA-14989 and CASSANDRA-11803: [Docker](https://docs.docker.com/get-docker/)\n* For the rest of the reproduction scripts: [Cassandra Cluster Manager (CCM)](https://github.com/riptano/ccm) (and its dependencies)\n* [DataStax Python Driver](https://docs.datastax.com/en/developer/python-driver/3.18/): `pip install cassandra-driver`\n* It is advised to execute the scripts on a Linux system, since CCM has some known bugs on Windows\n\n## Execution\nThe scripts are to be executed separately by simply running the specific file. The program should terminate with exit code `0` and the log files are generated in `/home/\u003cUSER\u003e/.ccm/\u003cCLUSTER_NAME\u003e/\u003cNODE_NAME\u003e/logs/`, where NODE_NAME is by default node1, node2, etc. For this work, the important file is `debug.log`.\n\nThe `debug.log` can then be copied to a desktop folder named `13346_failure` or `13346_normal` (or the numbers of the other bugs). The copy of `debug.log` is then ingested into `templater.py`.\n\nThe parsed log file then goes through `difference.py`, which generates an output file consisting of the different log entries.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsmoothex%2Flog-based-failure-localization","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsmoothex%2Flog-based-failure-localization","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsmoothex%2Flog-based-failure-localization/lists"}