Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dennybiasiolli/compare-datasets
Playground repo to compare csv files
https://github.com/dennybiasiolli/compare-datasets
csv csv-comparison python rust
Last synced: 22 days ago
JSON representation
Playground repo to compare csv files
- Host: GitHub
- URL: https://github.com/dennybiasiolli/compare-datasets
- Owner: dennybiasiolli
- Created: 2024-03-22T08:54:51.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-03-22T08:54:57.000Z (10 months ago)
- Last Synced: 2024-10-24T11:24:56.054Z (2 months ago)
- Topics: csv, csv-comparison, python, rust
- Language: Rust
- Homepage:
- Size: 7.81 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Compare Datasets
Documentation for the `pandas.DataFrame.compare` method:
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.compare.html## Setup
### Python version
Tested with Python 3.12
```bash
python3 -m venv venv
source venv/bin/activate
pip install --upgrade pippip install --upgrade pipenv
pipenv sync
# or without pipenv
pip install -r requirements.txt
```### Rust version
Tested with Rust 1.76
```bash
cargo build --release
```## Usage
### Python version
`pipenv run python compare_datasets.py [path/file1.csv] [path/file2.csv] [id_key_name]`
or without pipenv
`python compare_datasets.py [path/file1.csv] [path/file2.csv] [id_key_name]`
Example:
```bash
pipenv run python compare_datasets.py data/example1.csv data/example2.csv PassengerId
# or
python compare_datasets.py data/example1.csv data/example2.csv PassengerId
```### Rust version
`cargo run [path/file1.csv] [path/file2.csv]`
or with the compiled binary
`target/release/compare-datasets [path/file1.csv] [path/file2.csv]`
Example:
```bash
cargo run data/example1.csv data/example2.csv
# or
target/release/compare-datasets data/example1.csv data/example2.csv
```