Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/intake/intake
Intake is a lightweight package for finding, investigating, loading and disseminating data.
https://github.com/intake/intake
data-access data-catalog python
Last synced: 6 days ago
JSON representation
Intake is a lightweight package for finding, investigating, loading and disseminating data.
- Host: GitHub
- URL: https://github.com/intake/intake
- Owner: intake
- License: bsd-2-clause
- Created: 2017-08-14T20:44:22.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2024-09-24T17:54:01.000Z (about 1 month ago)
- Last Synced: 2024-10-30T03:36:24.259Z (9 days ago)
- Topics: data-access, data-catalog, python
- Language: Python
- Homepage: https://intake.readthedocs.io/
- Size: 14.6 MB
- Stars: 1,009
- Watchers: 42
- Forks: 141
- Open Issues: 110
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- best-of-python - GitHub - 28% open · ⏱️ 05.06.2024): (Data Loading & Extraction)
- awesome-mlops - Intake - A lightweight set of tools for loading and sharing data in data science projects. (Data Management)
- awesome-python-machine-learning-resources - GitHub - 27% open · ⏱️ 22.08.2022): (数据读写与提取)
- jimsghstars - intake/intake - Intake is a lightweight package for finding, investigating, loading and disseminating data. (Python)
README
# Intake: Take 2
**A general python package for describing, loading and processing data**
![Logo](https://github.com/intake/intake/raw/master/logo-small.png)
[![Build Status](https://github.com/intake/intake/workflows/CI/badge.svg)](https://github.com/intake/intake/actions)
[![Documentation Status](https://readthedocs.org/projects/intake/badge/?version=latest)](http://intake.readthedocs.io/en/latest/?badge=latest)*Taking the pain out of data access and distribution*
Intake is an open-source package to:
- describe your data declaratively
- gather data sets into catalogs
- search catalogs and services to find the right data you need
- load, transform and output data in many formats
- work with third party remote storage and compute platformsDocumentation is available at [Read the Docs](http://intake.readthedocs.io/en/latest).
Please report issues at https://github.com/intake/intake/issues
Install
-------Recommended method using conda:
```bash
conda install -c conda-forge intake
```You can also install using `pip`, in which case you have a choice as to how many of the optional
dependencies you install, with the simplest having least requirements```bash
pip install intake
```Note that you may well need specific drivers and other plugins, which usually have additional
dependencies of their own.Development
-----------
* Create development Python environment with the required dependencies, ideally with `conda`.
The requirements can be found in the yml files in the `scripts/ci/` directory of this repo.
* e.g. `conda env create -f scripts/ci/environment-py311.yml` and then `conda activate test_env`
* Install intake using `pip install -e .`
* Use `pytest` to run tests.
* Create a fork on github to be able to submit PRs.
* We respect, but do not enforce, pep8 standards; all new code should be covered by tests.