https://github.com/apache/texera
Collaborative Machine-Learning-Centric Data Analytics Using Workflows
artificial-intelligence cloud-native data data-analytics data-science machine-learning texera workflow
- Host: GitHub
- URL: https://github.com/apache/texera
- Owner: apache
- License: apache-2.0
- Created: 2016-03-15T20:38:46.000Z (about 10 years ago)
- Default Branch: main
- Last Pushed: 2026-04-28T06:42:57.000Z (24 days ago)
- Last Synced: 2026-04-28T08:17:27.432Z (24 days ago)
- Topics: artificial-intelligence, cloud-native, data, data-analytics, data-science, machine-learning, texera, workflow
- Language: Scala
- Homepage: https://texera.io/
- Size: 77.4 MB
- Stars: 231
- Watchers: 23
- Forks: 120
- Open Issues: 206
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Security: SECURITY.md
- Notice: NOTICE
README
Apache Texera - Human-AI Collaborative Data Science Using Visual Workflows
Official Site | Video | Publications | Blog
Apache Texera (Incubating) is an open-source platform for human-AI collaborative data science using visual workflows. It enables human analysts to construct, execute, and refine data analysis tasks through an intuitive GUI, assisted by AI agents that understand natural-language instructions. Texera is well suited for a wide range of applications, including “AI for Science,” by making advanced AI and data science capabilities accessible to a broader community. It can run on a laptop for local use or be deployed in the cloud to support scalable processing of large datasets.
The platform has the following key features:
* Natural-language data science through AI agents
* Intuitive GUI-based workflows for data science
* Real-time collaboration for workflow editing and execution
* Runtime debugging and interactive workflow execution
* Language-agnostic workflow runtime with native support for Python and Java
* Parallel backend engine for scalable big-data processing
* Separation of compute and storage for flexible cloud deployment

# Citation
Please cite Texera as:
```
@article{DBLP:journals/pvldb/WangHNKALLDL24,
  author       = {Zuozhi Wang and
                  Yicong Huang and
                  Shengquan Ni and
                  Avinash Kumar and
                  Sadeem Alsudais and
                  Xiaozhen Liu and
                  Xinyuan Lin and
                  Yunyan Ding and
                  Chen Li},
  title        = {Texera: {A} System for Collaborative and Interactive Data Analytics
                  Using Workflows},
  journal      = {Proc. {VLDB} Endow.},
  volume       = {17},
  number       = {11},
  pages        = {3580--3588},
  year         = {2024},
  url          = {https://www.vldb.org/pvldb/vol17/p3580-wang.pdf},
  timestamp    = {Thu, 19 Sep 2024 13:09:37 +0200},
  biburl       = {https://dblp.org/rec/journals/pvldb/WangHNKALLDL24.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}
```