https://github.com/alteryx/featuretools_sql
Automated creation of EntitySets from relational data stored in SQL databases
https://github.com/alteryx/featuretools_sql
automated-feature-engineering automated-machine-learning automl data-science feature-engineering featuretools machine-learning mysql postgres postgresql sql
Last synced: 2 months ago
JSON representation
Automated creation of EntitySets from relational data stored in SQL databases
- Host: GitHub
- URL: https://github.com/alteryx/featuretools_sql
- Owner: alteryx
- License: bsd-3-clause
- Created: 2022-07-07T18:10:15.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2023-11-02T21:30:42.000Z (over 1 year ago)
- Last Synced: 2023-12-14T09:22:43.733Z (over 1 year ago)
- Topics: automated-feature-engineering, automated-machine-learning, automl, data-science, feature-engineering, featuretools, machine-learning, mysql, postgres, postgresql, sql
- Language: Python
- Homepage:
- Size: 139 KB
- Stars: 11
- Watchers: 16
- Forks: 0
- Open Issues: 13
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
README
![]()
Automated creation of EntitySets from relational data stored in SQL databases
The `featuretools_sql` library allows you to directly import your relational data into [Featuretools](https://github.com/alteryx/featuretools) to run automated feature engineering.
## Installation
Install with pip:
```shell
python -m pip install "featuretools[sql]"
```
or from the Conda-forge channel on [conda](https://anaconda.org/conda-forge/featuretools_sql):```
conda install -c conda-forge featuretools
```## Example
Simply pass in the database connection information:```python
from featuretools_sql.connector import DBConnectorsql_connector = DBConnector(
system_name = "mysql",
host = "127.0.0.1:3306"
user = "root",
password = "password",
database = "db"
)
entityset = sql_connector.get_entityset()
```The `entityset` object will have the `relationships` and `DataFrames` already populated, allowing you to call featuretools.DFS and run automated feature generation.
```python
import featuretools as ftfeature_defs, feature_matrix = ft.dfs(
entityset=entityset,
target_entity='target_table_name'
)
```We currently supports importing data from the following relational database systems:
- `MySQL`
- `PostgreSQL`
- `Snowflake`## Support
The Featuretools community is happy to provide support to users. Project support can be found in four places depending on the type of question:
1. For usage questions, use [Stack Overflow](https://stackoverflow.com/questions/tagged/featuretools) with the `featuretools` tag.
2. For bugs, issues, or feature requests start a [Github issue](https://github.com/alteryx/featuretools_sql/issues).
3. For discussion regarding development, use [Slack](https://join.slack.com/t/alteryx-oss/shared_invite/zt-182tyvuxv-NzIn6eiCEf8TBziuKp0bNA).
4. For everything else, the core developers can be reached by email at [email protected]## Built at Alteryx
`featuretools_sql` is an open source project maintained by [Alteryx](https://www.alteryx.com). To see the other open source projects we’re working on, visit [Alteryx Open Source](https://www.alteryx.com/open-source). If building impactful data science pipelines is important to you or your business, please get in touch.