{"id":30676015,"url":"https://github.com/web-dev-codi/transcript-productivity-parser","last_synced_at":"2025-09-01T09:13:44.825Z","repository":{"id":306077810,"uuid":"1024918669","full_name":"Web-Dev-Codi/transcript-productivity-parser","owner":"Web-Dev-Codi","description":"A Python application that automatically extracts actionable tasks from transcript files and converts them into organized daily todo lists in CSV format. Perfect for developers who record meetings, brainstorming sessions, or voice notes and want to quickly identify and track follow-up tasks.","archived":false,"fork":false,"pushed_at":"2025-07-23T14:05:49.000Z","size":14,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-07-23T15:15:20.095Z","etag":null,"topics":["csv","microsoft","parser","python","teams","todo","todolist","transcript"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Web-Dev-Codi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-07-23T12:51:52.000Z","updated_at":"2025-07-23T14:05:53.000Z","dependencies_parsed_at":"2025-07-23T15:15:52.600Z","dependency_job_id":"83af7586-858d-4c8a-98f3-2b44625c4d68","html_url":"https://github.com/Web-Dev-Codi/transcript-productivity-parser","commit_stats":null,"previous_names":["web-dev-codi/transcript-productivity-parser"],"tags_count":null,"template":false,"template_full_name":null,"purl":"pkg:github/Web-Dev-Codi/transcript-productivity-parser","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Web-Dev-Codi%2Ftranscript-productivity-parser","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Web-Dev-Codi%2Ftranscript-productivity-parser/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Web-Dev-Codi%2Ftranscript-productivity-parser/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Web-Dev-Codi%2Ftranscript-productivity-parser/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Web-Dev-Codi","download_url":"https://codeload.github.com/Web-Dev-Codi/transcript-productivity-parser/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Web-Dev-Codi%2Ftranscript-productivity-parser/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":273099317,"owners_count":25045697,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-01T02:00:09.058Z","response_time":120,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["csv","microsoft","parser","python","teams","todo","todolist","transcript"],"created_at":"2025-09-01T09:13:40.109Z","updated_at":"2025-09-01T09:13:44.819Z","avatar_url":"https://github.com/Web-Dev-Codi.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Transcript Todo Parser\n\nA format-agnostic Python tool that extracts todo items and action items from Microsoft Teams meeting transcripts and exports them to CSV format.\n\n## Features\n\n- **Format Agnostic**: Supports VTT, SRT, TXT, and JSON transcript formats\n- **Intelligent Extraction**: Identifies todos using multiple keyword patterns and context analysis  \n- **Speaker Detection**: Automatically extracts speaker names from transcripts\n- **Assignment Recognition**: Identifies who is assigned to each todo item\n- **CSV Export**: Outputs structured data with timestamp, speaker, todo text, and assignee\n- **CLI Interface**: Easy-to-use command line interface with preview mode\n\n## Installation\n\n1. Create and activate a virtual environment:\n```bash\npython -m venv venv\nsource venv/bin/activate  # On Windows: venv\\Scripts\\activate\n```\n\n2. Install required dependencies:\n```bash\npip install -r requirements.txt\n```\n\n## Usage\n\n### Basic Usage\n```bash\npython main.py transcript.vtt\npython main.py meeting_notes.txt -o todos.csv\n```\n\n### Preview Mode (without saving)\n```bash\npython main.py transcript.txt --preview\n```\n\n### Verbose Output\n```bash\npython main.py transcript.vtt --verbose -o detailed_todos.csv\n```\n\n## Supported Keywords\n\nThe parser automatically detects todos when speakers use these keywords:\n- \"todo\", \"action item\"\n- \"need to\", \"have to\", \"must do\"\n- \"will do\", \"should do\", \"going to\"\n- \"will handle\", \"will work on\", \"will take care of\"\n- \"responsible for\", \"assigned to\"\n\n## Output Format\n\nThe CSV output includes these columns:\n- **timestamp**: When the todo was mentioned\n- **speaker**: Who mentioned the todo\n- **keyword**: The trigger keyword found\n- **todo**: The extracted todo text\n- **assignee**: Who is assigned (if identified)\n- **context**: Full context of the statement\n\n## Examples\n\nRun with the provided sample files:\n```bash\npython main.py example_transcript.txt --preview\npython main.py example_teams.vtt -o sample_output.csv\n```\n\n## Supported Formats\n\n- **VTT/WebVTT**: Microsoft Teams and other video platform exports\n- **SRT**: Subtitle format files\n- **TXT**: Plain text transcripts with speaker identification\n- **JSON**: Structured transcript data","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fweb-dev-codi%2Ftranscript-productivity-parser","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fweb-dev-codi%2Ftranscript-productivity-parser","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fweb-dev-codi%2Ftranscript-productivity-parser/lists"}