{"id":13653857,"url":"https://github.com/dbeley/youtube_extract","last_synced_at":"2025-08-10T00:03:17.569Z","repository":{"id":37663116,"uuid":"213781923","full_name":"dbeley/youtube_extract","owner":"dbeley","description":"Extract metadata for all videos of a youtube channel.","archived":false,"fork":false,"pushed_at":"2025-06-04T09:12:41.000Z","size":150,"stargazers_count":25,"open_issues_count":7,"forks_count":3,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-07-01T16:10:56.981Z","etag":null,"topics":["youtube","youtube-dl"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dbeley.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-10-09T00:15:46.000Z","updated_at":"2025-06-04T09:12:44.000Z","dependencies_parsed_at":"2024-11-10T04:31:30.294Z","dependency_job_id":"a4dea146-96bb-4023-9516-3b5a6f420e0a","html_url":"https://github.com/dbeley/youtube_extract","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/dbeley/youtube_extract","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbeley%2Fyoutube_extract","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbeley%2Fyoutube_extract/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbeley%2Fyoutube_extract/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbeley%2Fyoutube_extract/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dbeley","download_url":"https://codeload.github.com/dbeley/youtube_extract/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbeley%2Fyoutube_extract/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":269654968,"owners_count":24454349,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-09T02:00:10.424Z","response_time":111,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["youtube","youtube-dl"],"created_at":"2024-08-02T02:01:19.140Z","updated_at":"2025-08-10T00:03:17.441Z","avatar_url":"https://github.com/dbeley.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# youtube_extract\n\n[![Codacy Badge](https://api.codacy.com/project/badge/Grade/131858400ee84232a50c03f4b06c9344)](https://app.codacy.com/manual/dbeley/youtube_extract?utm_source=github.com\u0026utm_medium=referral\u0026utm_content=dbeley/youtube_extract\u0026utm_campaign=Badge_Grade_Dashboard)\n![Build Status](https://github.com/dbeley/youtube_extract/workflows/CI/badge.svg)\n[![codecov](https://codecov.io/gh/dbeley/youtube_extract/branch/master/graph/badge.svg)](https://codecov.io/gh/dbeley/youtube_extract)\n\nExtract metadata for all videos from a youtube channel and exports it into a csv or xlsx file.\n\nBe sure to read the csv file using the tab character `\\t` as field separator in your spreadsheet software of choice.\n\nAs of now it's quite slow and unpredictable, expect ~400 seconds for extracting all videos metadata from a channel containing 400 videos.\n\n## Fields extracted\n\n| Field          | Description                    |\n|----------------|--------------------------------|\n| author         | Channel Name                   |\n| channel_url    | Channel URL                    |\n| title          | Video Title                    |\n| webpage_url    | Video URL                      |\n| view_count     | View Count                     |\n| like_count     | Like Count                     |\n| duration       | Duration in seconds            |\n| upload_date    | Upload Date in YYYYMMDD Format |\n| tags           | Tags                           |\n| categories     | Categories                     |\n| description    | Description                    |\n| thumbnail      | Thumbnail URL                  |\n| best_format    | Highest Format Available       |\n| filesize_bytes | Filesize in bytes              |\n\n## Requirements\n\n- python \u003e=3.8\n- yt-dlp\n- pandas\n- openpyxl\n\n## Installation\n\n### Preferred install method\n\n```bash\npip install youtube_extract\n```\n\nIf you are an Archlinux user, you can install the AUR package [youtube_extract-git](https://aur.archlinux.org/packages/youtube_extract-git).\n\n### Run from source\n\n```bash\ngit clone https://github.com/dbeley/youtube_extract\ncd youtube_extract\npip install yt-dlp pandas openpyxl\npython setup.py install\nyoutube_extract -h\n```\n\n## Usage\n\nIf installed :\n\n```bash\nyoutube_extract CHANNEL_URL\n# or xlsx format\nyoutube_extract CHANNEL_URL -e xlsx\n```\n\nOtherwise, in the directory containing the source code :\n\n```bash\npython -m youtube_extract CHANNEL_URL\n# or xlsx format\npython -m youtube_extract CHANNEL_URL -e xlsx\n```\n\n### Using Cookies\n\nThe `--cookies` option allows you to provide a Netscape-formatted cookies file which can be used to access age-restricted content, private videos, or content that requires authentication.\nYou can obtain a cookies file using browser extensions like:\n\n- [cookies.txt](https://chromewebstore.google.com/detail/get-cookiestxt-locally/cclelndahbckbenkjhflpdbgdldlbecc?pli=1) for Chrome\n- [cookies.txt](https://addons.mozilla.org/en-US/firefox/addon/cookies-txt/) for Firefox\n\nThe cookies file should be in the standard Netscape format:\n\n#### Netscape HTTP Cookie File\n\n```\n.domain.com TRUE / FALSE 1234567890 name value\n```\n\n### Rate Limiting\n\nYouTube may rate-limit your requests if you extract data from channels with many videos. To avoid this, you can use the --sleep-requests option to add a delay between requests:\n\n```bash\nyoutube_extract CHANNEL_URL --sleep-requests 10\n```\nThis will pause for 10 seconds between requests, which can help avoid rate limiting at the cost of longer extraction time.\nSee: https://github.com/yt-dlp/yt-dlp/wiki/Extractors#this-content-isnt-available-try-again-later\n\n## Help\n\n```bash\nyoutube_extract -h\n```\n\n```\nusage: youtube_extract [-h] [--debug] [-e EXPORT_FORMAT] [--cookies COOKIE_FILE] [--sleep-requests SECONDS] [channel_url]\n\nExtract metadata for all videos from a youtube channel into a csv or xlsx\nfile.\n\npositional arguments:\n  channel_url           Youtube channel url.\n\noptional arguments:\n  -h, --help            show this help message and exit\n  --debug               Display debugging information.\n  -e EXPORT_FORMAT, --export_format EXPORT_FORMAT\n                        Export format (csv or xlsx). Default : csv.\n  --cookies COOKIE_FILE Path to cookies.txt file. \n                        Use for age-restricted content.\n  --sleep-requests SECONDS\n                        Number of seconds to sleep between requests during data extraction.\n```","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdbeley%2Fyoutube_extract","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdbeley%2Fyoutube_extract","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdbeley%2Fyoutube_extract/lists"}