https://github.com/cadensstudio/pptx-image-scraper
Extract images from a powerpoint slide deck.
https://github.com/cadensstudio/pptx-image-scraper
graphics image-extractor powerpoint
Last synced: about 1 year ago
JSON representation
Extract images from a powerpoint slide deck.
- Host: GitHub
- URL: https://github.com/cadensstudio/pptx-image-scraper
- Owner: cadensstudio
- License: mit
- Created: 2024-08-05T20:40:15.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-08-06T16:08:24.000Z (almost 2 years ago)
- Last Synced: 2025-01-30T21:32:49.767Z (over 1 year ago)
- Topics: graphics, image-extractor, powerpoint
- Language: PowerShell
- Homepage:
- Size: 3.91 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# PowerPoint Image Scraper
This project provides a PowerShell script to extract images from a PowerPoint (.pptx) file. It leverages two main functions: one to unzip the PowerPoint file into its raw XML components (which includes a media folder), and another to extract the images from the media folder into a dedicated directory at the root of the project.
## Usage
1. **Clone the repository:**
```sh
git clone https://https://github.com/cadensstudio/pptx-image-scraper.git
cd pptx-image-scraper
```
2. **Create a `.env` file:**
Copy the `.env.example` file to `.env` and set the `FILEPATH` variable to the path of your PowerPoint file.
```sh
cp .env.example .env
```
Update the `FILEPATH` variable in the `.env` file:
```
FILEPATH=/path/to/your/powerpoint.pptx
```
3. **Run the script:**
Open a PowerShell terminal and execute the `extract.ps1` script:
```powershell
.\extract.ps1
```
4. **Check the `media` folder for images:**
After running the script, the images will be available in the `media` directory wherever the script was run from.
## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.