https://github.com/tiagocavalcante/extract-folder

Automates the extraction of compressed files (which may not have the correct extension) within a folder

Host: GitHub
URL: https://github.com/tiagocavalcante/extract-folder
Owner: TiagoCavalcante
License: mit
Created: 2024-02-06T22:55:12.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2024-04-19T22:02:00.000Z (almost 2 years ago)
Last Synced: 2025-03-24T06:16:58.429Z (11 months ago)
Topics: script, unzip, wayback-machine
Language: Python
Homepage:
Size: 5.86 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

# Compressed File Extractor

This project provides a Python script that automates the extraction of compressed files within a given directory, including files whose extension does not match their actual format. It supports both gzip and zip formats, which makes it particularly useful for processing site archives such as those downloaded from the Internet Archive with `wayback_machine_downloader`.
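
As an illustration of how a file's real format can be recognized when its extension is wrong, the sketch below inspects the leading "magic" bytes of the data. It is not the script's own code: the script relies on the `magic` library, whereas this example swaps in a standard-library-only check, and the name `sniff_format` is hypothetical.

```python
import gzip
import io
import zipfile
from typing import Optional

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream
ZIP_MAGICS = (b"PK\x03\x04", b"PK\x05\x06")  # regular and empty zip archives


def sniff_format(data: bytes) -> Optional[str]:
    """Return "gzip", "zip", or None based on the leading bytes (hypothetical helper)."""
    if data.startswith(GZIP_MAGIC):
        return "gzip"
    if data.startswith(ZIP_MAGICS):
        return "zip"
    return None


# Build small in-memory samples so the example needs no files on disk.
gz_sample = gzip.compress(b"hello")

zip_buffer = io.BytesIO()
with zipfile.ZipFile(zip_buffer, "w") as archive:
    archive.writestr("hello.txt", "hello")
zip_sample = zip_buffer.getvalue()

print(sniff_format(gz_sample))
print(sniff_format(zip_sample))
print(sniff_format(b"<html></html>"))
```

Checking content rather than the file name is what allows a gzip-compressed page saved as `index.html` to be identified correctly.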

## Prerequisites

The script requires Python 3 and the `magic` library (distributed on PyPI as `python-magic`), which it uses for MIME type identification. Install the dependency before running the script:

```sh
python3 -m pip install python-magic
```

## Installation

No installation is needed. Download the script file or clone this repository:

```sh
git clone https://github.com/TiagoCavalcante/extract-folder
```

## Usage

To use the script, navigate to the directory containing the script in your terminal and execute the following command:

```sh
python3 script.py <directory>
```

Replace `<directory>` with the path to the directory containing the compressed files you wish to extract.
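
For readers curious what a pass over such a directory might involve, here is a minimal sketch. It is written under assumptions and is not the repository's implementation: the function name `extract_folder`, the choice to decompress gzip files in place, and the `.extracted` output directory for zip archives are all hypothetical, and format detection uses leading bytes from the standard library rather than the `magic` library.

```python
import gzip
import tempfile
import zipfile
from pathlib import Path


def extract_folder(root: Path) -> list:
    """Extract every gzip or zip file under root; return the paths handled.

    Hypothetical sketch: the real script may behave differently.
    """
    handled = []
    # Materialize the file list first so newly extracted files are not revisited.
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        with path.open("rb") as handle:
            head = handle.read(4)
        if head.startswith(b"\x1f\x8b"):
            # gzip: overwrite the file with its decompressed content (assumption).
            path.write_bytes(gzip.decompress(path.read_bytes()))
            handled.append(path)
        elif head == b"PK\x03\x04" and zipfile.is_zipfile(path):
            # zip: unpack next to the archive, into "<name>.extracted" (assumption).
            target = path.with_name(path.name + ".extracted")
            with zipfile.ZipFile(path) as archive:
                archive.extractall(target)
            handled.append(path)
    return handled


# Demonstration on a temporary directory with misleading file extensions.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "index.html").write_bytes(gzip.compress(b"<h1>hi</h1>"))
    with zipfile.ZipFile(root / "assets.bin", "w") as archive:
        archive.writestr("style.css", "body {}")

    handled = extract_folder(root)
    html = (root / "index.html").read_text()
    css = (root / "assets.bin.extracted" / "style.css").read_text()
    print(len(handled), html, css)
```

Decompressing gzip files in place keeps links between downloaded pages intact, which is one plausible reason to prefer it for mirrored websites; whether the actual script does this is not stated here.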

## Additional Information

To download a site's files from the Internet Archive, you can use `wayback_machine_downloader` with the following commands:

```sh
sudo apt install ruby-rubygems
sudo gem install wayback_machine_downloader
wayback_machine_downloader -a URL -p 1000 -s
```

Replace `URL` with the target website's URL.
