https://github.com/lucksi/mycroft
Mycroft is an Offline File Analyzer and Metadata Scraper
- Host: GitHub
- URL: https://github.com/lucksi/mycroft
- Owner: Lucksi
- License: gpl-3.0
- Created: 2024-09-23T13:30:06.000Z (3 months ago)
- Default Branch: master
- Last Pushed: 2024-10-28T17:20:52.000Z (2 months ago)
- Last Synced: 2024-10-28T18:46:15.673Z (2 months ago)
- Topics: file-analyzer, hash, hashing, metadata, metadata-extraction, python, zip-extraction, zip-files
- Language: Python
- Homepage:
- Size: 235 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: Readme.md
- License: LICENSE
README
## Introduction
**Mycroft is an offline file analyzer and metadata scraper. Besides extracting metadata from certain types of files, it also analyzes files and reports extra information such as hashes, checksums, and other file details.**
## Requirements:
```
Python3
```

## Installation Linux:
```bash
sudo apt-get update
sudo apt-get install python3
sudo apt-get install git
git clone https://github.com/Lucksi/Mycroft
```
## Execution:
```bash
cd Mycroft
python3 main.py
```
## Usage Example:
**After starting the program, a simple query looks like this:**
```bash
/home/test/Desktop/example.pdf
```
**If no input folder is specified, the input folder defaults to the `Files` folder inside the Mycroft folder. When only the filename is given, the program extracts only the metadata of that file.**

## Operating Systems:
| Platform | Tested |
| ------------- | ------------- |
| Linux | ✅ |
| macOS | ❌ |
| Windows | ❌ |

## Optional Commands:
| Option name | Description | Query Example |
| ------------- | ------------- | -------------|
| `--internal` | Scan files from 'Files' internal folder| test.docx --internal |
| `--output` | Insert path to a custom output folder | --output /home/test/Desktop |
| `--ext_d` | List Files of an external directory | list --ext_d /home/test/Desktop |
| `--folder_name` | Change the name of the output directory | --folder_name Test |
| `--advanced` | Extract advanced information from metadata | test.docx --advanced |
| `--text` | Extract file text-content from document | test.docx --text |
| `--search` | Search for a specific keyword in the document | test.docx --text --search Test |
| `--verbose` | Disable screen output | test.docx --verbose |
| `--noOutput` | Disable file output | test.docx --noOutput |
| `--extraction` | Extract media from documents or files from archives | test.docx --extraction |

## Features:
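As one illustration of the metadata extraction listed in this section, the sketch below reads the core properties of a `.docx`, `.pptx`, or `.xlsx` file using only the Python standard library. It is not taken from Mycroft's source: the function name `core_properties` is hypothetical, and the sketch assumes the file is a standard Office Open XML package that stores its properties in `docProps/core.xml`.

```python
import zipfile
import xml.etree.ElementTree as ET


def core_properties(path):
    """Return the core document properties of an Office Open XML file.

    Hypothetical helper for illustration; assumes the package contains
    docProps/core.xml, as standard docx/pptx/xlsx files do.
    """
    with zipfile.ZipFile(path) as package:
        root = ET.fromstring(package.read("docProps/core.xml"))
    # Tags look like "{namespace}creator"; keep only the local name.
    return {child.tag.split("}")[-1]: (child.text or "") for child in root}
```

Calling `core_properties("/home/test/Desktop/example.docx")` would return a dictionary with keys such as `creator` or `title`, depending on which properties the document actually stores.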
| Description | File-Format |
| ------------- | ------------- |
| Extract metadata | docx, pdf, xlsx, pptx, jpg, png, mp4, odt, odp, odc, zip, apk, jar, psd|
| Get file hashes | All formats |
| Get local information | All formats |
| Extract and Download media | docx, pptx, odp, odt |
| Extract Text | docx, pptx, odt, odp|
| Search Keyword | docx, pptx, odt, odp|
| Extract files | zip, apk, jar |
| Get archives file information | zip, apk, jar |
| Get archive information | zip, apk, jar |
| Get Pdf File Permission | pdf |
| Recognize archive Compression Type | zip, apk, jar |
| Recognize Password Protection | zip, apk, jar|
| Recognize Shared File | docx, pptx, xlsx |
| Recognize Suspicious keywords | pdf |
| Recognize Encryption Algorithm | pdf |

## File Hashing formats:
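The hash algorithms listed in this section are all available in Python's standard `hashlib` module. The following is a minimal sketch of computing them for a single file, not Mycroft's actual implementation; the function name `file_hashes` and the chunk size are assumptions made for illustration.

```python
import hashlib

# Algorithm names as hashlib knows them; these match the table in this section.
ALGORITHMS = ("md5", "sha1", "sha256", "sha384", "sha512")


def file_hashes(path, chunk_size=65536):
    """Return a dict mapping each algorithm name to the file's hex digest.

    Hypothetical helper for illustration only.
    """
    hashers = {name: hashlib.new(name) for name in ALGORITHMS}
    with open(path, "rb") as handle:
        # Read in chunks so large files are never loaded fully into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}
```

Feeding every chunk to all five hashers means the file is read from disk only once, however many digests are requested.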
| Name | File-Format |
| ------------- | ------------- |
| `MD5` | All files except password-protected archives |
| `SHA1` | All files except password-protected archives |
| `SHA256` | All files except password-protected archives |
| `SHA384` | All files except password-protected archives |
| `SHA512` | All files except password-protected archives |

## File Checksum formats:
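CRC32 and Adler-32 checksums can be computed with Python's standard `zlib` module. The sketch below is an illustration under that assumption rather than Mycroft's own code, and the function name `file_checksums` is hypothetical. It covers only CRC32 and Adler-32; the source does not say how `Crc32-Control-Code` is derived, so it is left out.

```python
import zlib


def file_checksums(path, chunk_size=65536):
    """Return CRC32 and Adler-32 of a file as 8-digit hex strings.

    Hypothetical helper for illustration only.
    """
    crc = 0    # CRC32 running value starts at 0
    adler = 1  # Adler-32 running value starts at 1
    with open(path, "rb") as handle:
        # Both zlib functions accept the previous value, so the
        # checksum can be updated chunk by chunk.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            crc = zlib.crc32(chunk, crc)
            adler = zlib.adler32(chunk, adler)
    return {"crc32": format(crc, "08x"), "adler32": format(adler, "08x")}
```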
| Name | File-Format |
| ------------- | ------------- |
| `Crc32` | All Formats |
| `Crc32-Control-Code` | All Formats |
| `Adler32-Checksum` | All Formats |

## Compression Algorithm:
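The compression names listed in this section correspond to method IDs stored in each ZIP entry header, and password protection is signalled by bit 0 of the entry's general-purpose flags. The following sketch reads both with Python's standard `zipfile` module. It is an assumption about how such detection can be done, not Mycroft's implementation; `METHOD_NAMES` and `describe_archive` are hypothetical names, and the ID-to-name mapping follows the ZIP specification.

```python
import zipfile

# ZIP compression method IDs mapped to the names used in this section.
METHOD_NAMES = {
    0: "Storage",
    8: "Deflate",
    9: "Enhanced Deflate",
    12: "Bzip2",
    14: "LZMA",
    95: "XZ",
    96: "JPEG",
    97: "Wavpack",
    98: "PPMD",
    99: "AES",
}


def describe_archive(path):
    """List name, compression method, and encryption flag per entry.

    Hypothetical helper; only reads the central directory, so it also
    works for methods that zipfile itself cannot decompress.
    """
    entries = []
    with zipfile.ZipFile(path) as archive:
        for info in archive.infolist():
            method = METHOD_NAMES.get(
                info.compress_type, f"Unknown ({info.compress_type})"
            )
            entries.append({
                "name": info.filename,
                "compression": method,
                # Bit 0 of the general-purpose flags marks an encrypted entry.
                "encrypted": bool(info.flag_bits & 0x1),
                "size": info.file_size,
                "compressed_size": info.compress_size,
            })
    return entries
```

Because `.apk` and `.jar` files are ZIP archives, the same function applies to them unchanged.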
| Name | Archive Format |
| ------------- | ------------- |
| `Storage` | zip, apk, jar |
| `Deflate` | zip, apk, jar |
| `Enhanced Deflate` | zip, apk, jar |
| `Bzip2` | zip, apk, jar |
| `LZMA` | zip, apk, jar |
| `Wavpack` | zip, apk, jar |
| `JPEG` | zip, apk, jar |
| `XZ` | zip, apk, jar |
| `PPMD` | zip, apk, jar |
| `AES` | zip, apk, jar |

## STARGAZERS OVER TIME
[![Stargazers over time](https://starchart.cc/Lucksi/Mycroft.svg)](https://starchart.cc/Lucksi/Mycroft)
## ORIGINAL CREATOR: LUCA GAROFALO (Lucksi)
## LICENSE: GPL-3.0 License
COPYRIGHT: (C) 2024 Lucksi