{"id":23029747,"url":"https://github.com/antononcube/raku-dsl-english-dataacquisitionworkflows","last_synced_at":"2025-04-02T20:25:51.099Z","repository":{"id":71383528,"uuid":"368874059","full_name":"antononcube/Raku-DSL-English-DataAcquisitionWorkflows","owner":"antononcube","description":"A Raku package for data acquisition workflows specification and translation. ","archived":false,"fork":false,"pushed_at":"2024-02-16T02:14:37.000Z","size":889,"stargazers_count":1,"open_issues_count":0,"forks_count":1,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-02-08T11:13:30.724Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Raku","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/antononcube.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2021-05-19T13:11:59.000Z","updated_at":"2023-03-30T17:29:48.000Z","dependencies_parsed_at":"2024-02-16T03:24:19.728Z","dependency_job_id":"3e837fd0-df9e-4915-ae17-2a362e45b54f","html_url":"https://github.com/antononcube/Raku-DSL-English-DataAcquisitionWorkflows","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/antononcube%2FRaku-DSL-English-DataAcquisitionWorkflows","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/antononcube%2FRaku-DSL-English-DataAcquisitionWorkflows/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/antononcube%2FRaku-DSL-English-DataAcquisitionWorkflows/releases","manifests_url":"https://repo
s.ecosyste.ms/api/v1/hosts/GitHub/repositories/antononcube%2FRaku-DSL-English-DataAcquisitionWorkflows/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/antononcube","download_url":"https://codeload.github.com/antononcube/Raku-DSL-English-DataAcquisitionWorkflows/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246886517,"owners_count":20849883,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-15T14:16:59.234Z","updated_at":"2025-04-02T20:25:51.078Z","avatar_url":"https://github.com/antononcube.png","language":"Raku","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Raku DSL::English::DataAcquisitionWorkflows\n\n[![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0)\n\n## In brief\n\nThis Raku (Perl 6) package has grammar classes and action classes for the parsing and\ninterpretation of natural language commands that specify Data Acquisition (DA) workflows.\n\nThe interpreters (actions) are intended to target different\nprogramming languages: R, Mathematica, Python, etc.\n\nThis mind map shows the conversational agent components this grammar addresses:\n\n![MindMap](./org/Data-acquisition-workflows-mind-map.png)\n\nThis \n[org-mode file](./org/DataAcquisitionWorkflows.org) \nis used to track the project's progress.\n\n------\n\n## Installation\n\n```shell\nzef install https://github.com/antononcube/Raku-DSL-Shared.git\nzef install 
https://github.com/antononcube/Raku-DSL-Entity-English-Metadata.git\nzef install https://github.com/antononcube/Raku-DSL-English-DataAcquisitionWorkflows.git\n```\n\n------\n\n## Examples\n\nGeneral recommendation request:\n\n```perl6\nuse DSL::English::DataAcquisitionWorkflows;\n\nsay ToDataAcquisitionWorkflowCode(\n    \"what data can I get for time series investigations?;\n     why did you recommend those\",\n    \"WL-Ecosystem\");\n``` \n\nRecommendation request with subsequent filtering:\n\n```perl6\nsay ToDataAcquisitionWorkflowCode(\n    \"I want to investigate data that cross references good purchases with customer demographics\n     keep only datasets that can be transformed to star schema\",\n    \"WL-Ecosystem\");\n``` \n\nData quality verification specification:\n\n```perl6\nsay ToDataAcquisitionWorkflowCode(\n    \"verify the quality of the database dbGJ99;\n     what fraction of records have missing data;\n     what are the distributions of the numerical columns\",\n    \"WL-Ecosystem\");\n``` \n\nHere is a more complicated statistics pipeline specification:\n\n```perl6\nsay ToDataAcquisitionWorkflowCode(\n    \"how many people used customer service data last month;\n     what is the breakdown of data sources over data types;\n     where textual data is utilized the most;\n     plot the results;\", \n    \"R-tidyverse\")\n```\n\nHere is a recommendation specification (by collaborative filtering):\n\n```perl6\nsay ToDataAcquisitionWorkflowCode(\n    \"what data people like me acquired last month;\n     which of those I can use for classifier investigations;\n     show me the data sizes and metadata;\", \n    \"WL-Ecosystem\")\n```\n\n------\n\n## Implementation notes\n\nThe general structure of this package and the design of its grammar (and sub-grammars) are analogous to \nthe structure and grammars of the Raku package\n[DSL::English::FoodPreparationWorkflows](https://github.com/antononcube/Raku-DSL-English-FoodPreparationWorkflows),\n[AAr3].\n\nThe original 
versions of the grammars were generated using Mathematica.\nSee the notebook \n[\"Data-Acquisition-Workflows-grammar-generation.nb\"]()\n\n------\n\n## References\n\n### Repositories\n\n[AAr1] Anton Antonov,\n[DSL::Shared Raku package](https://github.com/antononcube/Raku-DSL-Shared),\n(2020),\n[GitHub/antononcube](https://github.com/antononcube).\n\n[AAr2] Anton Antonov,\n[DSL::Entity::Metadata Raku package](https://github.com/antononcube/Raku-DSL-Entity-Metadata),\n(2021),\n[GitHub/antononcube](https://github.com/antononcube).\n\n[AAr3] Anton Antonov,\n[DSL::English::FoodPreparationWorkflows Raku package](https://github.com/antononcube/Raku-DSL-English-FoodPreparationWorkflows),\n(2021),\n[GitHub/antononcube](https://github.com/antononcube).\n\n### Videos\n\n[AAv1] Anton Antonov,\n[\"Multi-language Data Acquisition Conversational Agent (extended version)\"](https://www.youtube.com/watch?v=KlEl2b8oxb8),\n(2021),\n[YouTube.com](https://www.youtube.com/channel/UC5qMPIsJeztfARXWdIw3Xzw).","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fantononcube%2Fraku-dsl-english-dataacquisitionworkflows","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fantononcube%2Fraku-dsl-english-dataacquisitionworkflows","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fantononcube%2Fraku-dsl-english-dataacquisitionworkflows/lists"}