{"id":28548686,"url":"https://github.com/zinal/yc-dataproc-snippets","last_synced_at":"2025-07-03T16:31:03.889Z","repository":{"id":63712841,"uuid":"558721444","full_name":"zinal/yc-dataproc-snippets","owner":"zinal","description":"YC Data Proc samples and snippets","archived":false,"fork":false,"pushed_at":"2024-09-27T07:53:07.000Z","size":39263,"stargazers_count":8,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-06-10T01:11:35.420Z","etag":null,"topics":["hadoop","spark","yandex-cloud"],"latest_commit_sha":null,"homepage":"https://cloud.yandex.ru/services/data-proc","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zinal.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-10-28T06:30:28.000Z","updated_at":"2024-09-27T07:53:11.000Z","dependencies_parsed_at":"2023-02-16T18:31:49.219Z","dependency_job_id":"c4e351dd-4225-4535-8af1-4d79d008d1e2","html_url":"https://github.com/zinal/yc-dataproc-snippets","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/zinal/yc-dataproc-snippets","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zinal%2Fyc-dataproc-snippets","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zinal%2Fyc-dataproc-snippets/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zinal%2Fyc-dataproc-snippets/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zinal%2Fyc-dataproc-snippets/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zinal","download_url":"https://codeload.github.com/zinal/yc-dataproc-snippets/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zinal%2Fyc-dataproc-snippets/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263360757,"owners_count":23454780,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["hadoop","spark","yandex-cloud"],"created_at":"2025-06-10T01:10:30.250Z","updated_at":"2025-07-03T16:31:03.874Z","avatar_url":"https://github.com/zinal.png","language":"Java","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Материалы по Yandex Data Proc\n\nВ этом репозитории собраны документы и различные примеры для работы с сервисом [Yandex Data Proc](https://cloud.yandex.ru/services/data-proc).\n\nСтруктура основных материалов:\n* [Инструкция по диагностике работы заданий Spark](dataproc-spark-diag/)\n* [Инструкция по настройке внешней базы данных Apache Hive Metastore](dataproc-hive/)\n* [Настройка S3A Committers для оптимизации записи в Yandex Object Storage](dataproc-s3a-committers/)\n* [Копирование дополнительных файлов на узлы Data Proc](dataproc-copy-files/)\n* [Настройка кластера Data Proc для работы с Apache Kafka](dataproc-kafka/)\n* [Использование автоскейлинга в заданиях Spark](dataproc-scaling/)\n* [Управление дополнительными компонентами Python](dataproc-python-repo/)\n* [Автоматизация настройки хранения ноутбуков Zeppelin в Object Storage](dataproc-zeppelin/)\n* [Использование Delta Lake](https://github.com/yandex-cloud/yc-delta)\n\nПримеры вспомогательных скриптов и программ:\n* [Раскраска вычислительных ресурсов Data Proc](dp-compute-colorizer/)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzinal%2Fyc-dataproc-snippets","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzinal%2Fyc-dataproc-snippets","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzinal%2Fyc-dataproc-snippets/lists"}