{"id":22435423,"url":"https://github.com/subbyte/dect","last_synced_at":"2025-03-27T08:23:17.839Z","repository":{"id":36425429,"uuid":"40730423","full_name":"subbyte/DECT","owner":"subbyte","description":"Distributed Evolving Context Tree (DECT)","archived":false,"fork":false,"pushed_at":"2019-11-06T19:14:37.000Z","size":466,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-02-01T13:23:11.249Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Scala","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/subbyte.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2015-08-14T18:34:04.000Z","updated_at":"2020-03-21T15:21:32.000Z","dependencies_parsed_at":"2022-09-03T10:51:03.589Z","dependency_job_id":null,"html_url":"https://github.com/subbyte/DECT","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/subbyte%2FDECT","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/subbyte%2FDECT/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/subbyte%2FDECT/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/subbyte%2FDECT/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/subbyte","download_url":"https://codeload.github.com/subbyte/DECT/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245806818,"owners_count":20675386,"icon_url":"https://github.com/
github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-05T23:13:35.956Z","updated_at":"2025-03-27T08:23:17.819Z","avatar_url":"https://github.com/subbyte.png","language":"Scala","readme":"# Distributed Evolving Context Tree (DECT)\n\nDECT is a time-inhomogeneous variable-order Markov chain model introduced in our paper:\n```\n@inproceedings{shu:edbt:2016:dect,\n    author = {Xiaokui Shu and Nikolay Laptev and Danfeng Yao},\n    title = {{DECT}: Distributed Evolving Context Tree for Mining Web Behavior Evolution},\n    booktitle = {Proceedings of the 19th International Conference on Extending Database Technology (EDBT)},\n    month = {March},\n    year = {2016},\n    pages = {573--579},\n    publisher = {OpenProceedings.org},\n    address = {Konstanz, Germany},\n    location = {Bordeaux, France},\n}\n```\n\nProject goal: model Internet user behavior change through time.\n  - Markov model for recording Internet user behavior\n  - Higher-order Markov model for precise behavior representation\n  - Variable-order Markov model is a succinct representation of a higher-order Markov model\n  - Time-inhomogeneous Markov model tracks user behavior change through time\n\nTechnical highlights:\n  - Flattened Context Tree\n  - Session Batching\n  - Parallel Window Operation\n  - Parallel Pruning\n\nTest Platform:\n  - Spark 1.3.1\n  - Scala 2.10.4\n  - sbt 0.13.8\n\nInput data format:\n  - text file, single or distributed\n  - each line is a user session\n  - line format: timestamp \\t siteID:siteID:siteID:siteID\n  - timestamp should be UNIX time (seconds)\n  - Line example: \"1433198382 
site1:site3:site1:site1:site10:site25:site1:site99:site1\"\n\nCompile and run:\n  - $ sbt package\n  - modify the job-submission script template \"run.sh.template\" to run\n  - find argument details in src/main/scala/Dect.scala\n\nProject dependencies:\n  - https://github.com/scallop/scallop\n      - Copyright (C) 2012 Platon Pronko and Chris Hodapp\n      - Licensed under the MIT license\n  - https://github.com/typesafehub/config\n      - Copyright (C) 2011-2012 Typesafe Inc. \u003chttp://typesafe.com\u003e\n      - Licensed under the Apache 2.0 license\n\nCode licensed under the MIT license. See LICENSE file for terms.\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsubbyte%2Fdect","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsubbyte%2Fdect","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsubbyte%2Fdect/lists"}