{"id":30714256,"url":"https://github.com/citiususc/ludosym","last_synced_at":"2025-09-03T04:43:50.882Z","repository":{"id":312891614,"uuid":"788528411","full_name":"citiususc/ludosym","owner":"citiususc","description":null,"archived":false,"fork":false,"pushed_at":"2025-09-02T15:53:51.000Z","size":46591,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-09-02T17:39:26.874Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/citiususc.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2024-04-18T15:36:17.000Z","updated_at":"2025-09-02T15:53:54.000Z","dependencies_parsed_at":"2025-09-02T17:51:58.556Z","dependency_job_id":null,"html_url":"https://github.com/citiususc/ludosym","commit_stats":null,"previous_names":["citiususc/ludosym"],"tags_count":null,"template":false,"template_full_name":null,"purl":"pkg:github/citiususc/ludosym","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/citiususc%2Fludosym","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/citiususc%2Fludosym/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/citiususc%2Fludosym/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/citiususc%2Fludosym/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/host
s/GitHub/owners/citiususc","download_url":"https://codeload.github.com/citiususc/ludosym/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/citiususc%2Fludosym/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":273392279,"owners_count":25097258,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-03T02:00:09.631Z","response_time":76,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-09-03T04:43:48.026Z","updated_at":"2025-09-03T04:43:50.867Z","avatar_url":"https://github.com/citiususc.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 🎲 Analyzing Gambling Addictions: A Spanish Corpus for Understanding Pathological Behavior\n\nThis repository accompanies the paper:  \n**\"Analyzing Gambling Addictions: A Spanish Corpus for Understanding Pathological Behavior\"**  \n📍 Accepted at *Findings of EMNLP 2025*.\n\n\n## 📂 Dataset\n\nThe main contribution of this work is a **Spanish sentence retrieval dataset** focused on symptoms associated with pathological gambling.\n\n- **Corpus:** `resources/dataset/corpus.jsonl`  \n- **Queries \u0026 Qrels:** also available in the same directory.  
\n- All files follow the [BEIR](https://github.com/beir-cellar/beir)-compatible format, enabling easy use with standard baselines (see Section 4 of the paper).  \n\nAdditionally, a **subfolder with pools** is provided, containing the material used by both human annotators and LLMs for dataset labeling.\n\n---\n\n## ⚙️ Code\n\nThe `src` folder is structured as follows:\n\n- **`train/`** → Training scripts for our domain-adapted **ludoBETO** model.  \n- **`labelling/`** → Statistics and analysis of human vs. automatic label generation.\n\n---\n\n## 🤖 Model\n\nWe introduce **[ludoBETO](https://huggingface.co/citiusLTL/ludoBETO)**, a BETO-based model adapted to the pathological gambling domain.  \nThis model is publicly available on HuggingFace for further research and fine-tuning.\n\n🔧 In our paper, we also implemented a **cross-encoder** using the [SimCSE](https://www.sbert.net/examples/sentence_transformer/unsupervised_learning/SimCSE/README.html) strategy with custom parameters over ludoBETO.\n\n---\n\n## 📖 Citation\n\nIf you use this resource, please cite:\n\n```bibtex\n@inproceedings{couto-etal-2025,\n    title = \"Analyzing Gambling Addictions: A Spanish Corpus for Understanding Pathological Behavior\",\n    author = \"Couto-Pintos, Manuel and\n              Fernández-Pichel, Marcos and\n              Aragón, Mario Ezra and\n              Losada, David E.\",\n    booktitle = \"Findings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP)\"\n}\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcitiususc%2Fludosym","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcitiususc%2Fludosym","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcitiususc%2Fludosym/lists"}