{"id":46675033,"url":"https://github.com/maykolmedrano/usecasen","last_synced_at":"2026-03-12T01:01:02.643Z","repository":{"id":342399440,"uuid":"1173843433","full_name":"MaykolMedrano/usecasen","owner":"MaykolMedrano","description":"Herramienta profesional unificada para descarga, recodificación y análisis de la Encuesta CASEN (Chile) en Python y Stata.","archived":false,"fork":false,"pushed_at":"2026-03-05T21:13:34.000Z","size":45,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2026-03-09T00:58:35.810Z","etag":null,"topics":["casen","chile","data-science","economics","microdata","public-policy","python","stata"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/MaykolMedrano.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-03-05T20:10:04.000Z","updated_at":"2026-03-05T21:13:38.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/MaykolMedrano/usecasen","commit_stats":null,"previous_names":["maykolmedrano/usecasen"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/MaykolMedrano/usecasen","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MaykolMedrano%2Fusecasen","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MaykolMedrano%2Fusecasen/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MaykolMedrano%2Fusecasen/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MaykolMedrano%2Fusecasen/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/MaykolMedrano","download_url":"https://codeload.github.com/MaykolMedrano/usecasen/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MaykolMedrano%2Fusecasen/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":30315943,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-09T20:05:46.299Z","status":"ssl_error","status_checked_at":"2026-03-09T19:57:04.425Z","response_time":61,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["casen","chile","data-science","economics","microdata","public-policy","python","stata"],"created_at":"2026-03-08T22:38:03.373Z","updated_at":"2026-03-09T23:00:59.112Z","avatar_url":"https://github.com/MaykolMedrano.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cdiv align=\"center\"\u003e\n  \n# usecasen\n  \n  **La suite profesional definitiva para descarga, recodificación y análisis de la Encuesta CASEN (Chile) en Python y Stata.**\n\n  [![PyPI Version](https://img.shields.io/pypi/v/usecasen?style=flat-square\u0026color=blue)](https://pypi.org/project/usecasen/)\n  [![Language Support](https://img.shields.io/badge/Python_3.8%2B_%7C_Stata_14%2B-blue?style=flat-square)](https://www.python.org/)\n  [![Tests passing](https://img.shields.io/github/actions/workflow/status/MaykolMedrano/usecasen/python-app.yml?branch=main\u0026style=flat-square)](https://github.com/MaykolMedrano/usecasen/actions)\n  [![Downloads](https://img.shields.io/pypi/dm/usecasen?style=flat-square\u0026color=blue)](https://pypi.org/project/usecasen/)\n  [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg?style=flat-square)](https://opensource.org/licenses/MIT)\n  [Ver Documentación Python](python/README.md) • [Ver Documentación Stata](stata/README.md) • [Reportar Bug](https://github.com/MaykolMedrano/usecasen/issues)\n\u003c/div\u003e\n\n---\n\n## Sobre el Proyecto\n\n**`usecasen`** es una herramienta unificada para **Python** y **Stata** que automatiza la búsqueda, descarga y procesamiento de los microdatos de la **Encuesta de Caracterización Socioeconómica Nacional (CASEN)** de Chile.\n\nCuenta con un sistema de **scoring inteligente** integrado que detecta y extrae de forma automática los archivos oficiales correctos desde los servidores del Ministerio de Desarrollo Social y Familia (MDSF), resolviendo inconsistencias de encoding heredadas y operando de forma transparente sobre múltiples formatos de compresión (`.dta`, `.zip`, `.rar`).\n\n### Características Principales\n\n- **Smart Scoring \u0026 Fallbacks**: Motor de búsqueda que evalúa y aísla el archivo `.dta` principal entre cientos de documentos y anexos disponibles.\n- **Encoding Automático**: Traduce de forma nativa las bases anteriores a 2013 (latin1/ISO-8859) a UTF-8 para garantizar compatibilidad con versiones modernas de software.\n- **Soporte Multi-Compresión**: Trabaja con volúmenes masivos comprimidos en formatos como RAR y ZIP (ej. CASEN 2017) invocando automáticamente las herramientas del sistema (7z, winrar, powershell, unar, bsdtar).\n- **Procesamiento In-Memory (Python)**: Configurado para descargar y cargar los datos directamente en memoria RAM, eliminando cuellos de botella por operaciones de I/O en disco.\n- **Caché Inteligente**: Guarda localmente el dataset tras la primera consulta, reduciendo el tiempo de carga a milisegundos en peticiones subsecuentes.\n\n---\n\n## 1. Paquete Python (`python/casen`)\n\nLa librería de Python es ideal para interactuar con CASEN de manera exploratoria, rápida, y exportando DataFrames, logrando inyecciones directas en `pandas`.\n\n**Instalación Rápida:**\n\n```bash\npip install usecasen\n```\n\n**Uso Exploratorio Rápido:**\n\n```python\nimport casen\n\n# Descargar datos (con logs de progreso)\ndf_2022 = casen.download(2022)\n\n# Buscar en los metadatos (etiquetas de variable) en todo el archivo 2022, SIN descargar\nresultados = casen.search(\"educacion\")\n\n# Consultar el diccionario (codebook) nativo de la CASEN\ndiccionario_regiones = casen.get_labels(\"region\", 2022)\nprint(diccionario_regiones)\n# {1: 'Tarapacá', 2: 'Antofagasta', ...}\n```\n\n\u003e _Ver guía completa de Python y su integración a Stata (sfi) en:_ [`python/README.md`](python/README.md)\n\n---\n\n## 2. Wrapper para Stata (`stata/usecasen.ado`)\n\nComando robusto, compatible desde Stata 14 hasta 19, diseñado para procesar y consolidar masivamente datasets, logrando la limpieza nativa.\n\n**Instalación Rápida:**\n\n```stata\nnet install usecasen, from(\"https://raw.githubusercontent.com/MaykolMedrano/usecasen/master/stata\") replace\n```\n\n**Uso Clásico:**\n\n```stata\n* Descargar/cargar el último año disponible con limpieza en memoria\nusecasen, years(2022) clear\n\n* Descarga masiva para hacer paneles (descarga en /data y reemplaza)\nusecasen, years(2006 2017 2020 2022) path(\"data\") replace\n\n* Modo hardcore con logs y timeouts (para redes lentas o debug)\nusecasen, years(1990) retries(3) timeout(600) debugscore clear\n```\n\n\u003e _Ver guía completa, comandos y ayudas de Stata en:_ [`stata/README.md`](stata/README.md)\n\n---\n\n## Estructura del Repositorio\n\n```text\nusecasen\n |- python/             # API PyPI, Core in-memory, Metadata Scanner, Tests\n |- stata/              # Wrapper Stata (.ado/.sthlp/.pkg), Fallback extracts\n |- .github/workflows/  # CI/CD (Pytest Actions)\n `- README.md           # This file\n```\n\n---\n\n## Licencia \u0026 Citas\n\nEl código de este producto está licenciado nativamente bajo **MIT License**, mira [LICENSE](python/LICENSE) para detalles completos.\n\nSi este proyecto ha acelerado substancialmente tu investigación o tesis, puedes referenciar el repositorio:\n\n```bibtex\n@software{usecasen2026,\n  author = {Medrano, Maykol},\n  title = {usecasen: Herramientas Python y Stata para la Encuesta CASEN},\n  version = {1.0.0},\n  year = {2026},\n  publisher = {GitHub},\n  url = {https://github.com/MaykolMedrano/usecasen}\n}\n```\n\n\u003e **Aviso Legal de Datos**:\nLos microdatos que descarga esta herramienta son propiedad intelectual y pública del Ministerio de Desarrollo Social y Familia (MDSF) del Gobierno de Chile [Observatorio Social](https://observatorio.ministeriodesarrollosocial.gob.cl/).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmaykolmedrano%2Fusecasen","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmaykolmedrano%2Fusecasen","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmaykolmedrano%2Fusecasen/lists"}