{"id":18031525,"url":"https://github.com/sabasyed/synthetic-data-generation","last_synced_at":"2026-04-11T13:07:54.763Z","repository":{"id":258397051,"uuid":"850007180","full_name":"SabaSyed/Synthetic-Data-Generation","owner":"SabaSyed","description":"Synthetically generated latex expressions along with their python codes.","archived":false,"fork":false,"pushed_at":"2024-10-16T20:47:43.000Z","size":41,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-10T06:24:16.866Z","etag":null,"topics":["lambdify","latex","python","sympy","sympy-expressions","synthetic-data","synthetic-dataset-generation"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/SabaSyed.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-30T17:27:46.000Z","updated_at":"2024-10-16T21:02:11.000Z","dependencies_parsed_at":"2024-10-18T20:14:20.556Z","dependency_job_id":"4275f96b-6963-4e44-9a9b-fc582f69dd29","html_url":"https://github.com/SabaSyed/Synthetic-Data-Generation","commit_stats":null,"previous_names":["sabasyed/synthetic-data-generation"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SabaSyed%2FSynthetic-Data-Generation","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SabaSyed%2FSynthetic-Data-Generation/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SabaSyed%2FSynthetic-Data-
Generation/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SabaSyed%2FSynthetic-Data-Generation/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/SabaSyed","download_url":"https://codeload.github.com/SabaSyed/Synthetic-Data-Generation/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247256072,"owners_count":20909240,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["lambdify","latex","python","sympy","sympy-expressions","synthetic-data","synthetic-dataset-generation"],"created_at":"2024-10-30T10:09:28.472Z","updated_at":"2025-12-30T23:08:24.444Z","avatar_url":"https://github.com/SabaSyed.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Synthetic Data Generation\n\nThis repository contains scripts and resources for generating synthetic datasets, focusing on mathematical expressions. The aim is to create diverse, randomized data that can be used for training models, particularly for tasks involving LaTeX-to-Python code conversion.\n\n## Overview\n\nTo run the scripts in this repository, you need to install the **SymPy** library in Python. 
SymPy is used to generate and manipulate mathematical expressions programmatically.\n\n```bash\npip install sympy\n```\n\n### Generated Data\n\nWe created 100 random expressions for each of the following categories:\n\n- **Multivariable equations**\n- **Trigonometric functions**\n- **Geometric expressions**\n- **Diophantine equations**\n- **Summation equations**\n\nThese expressions are generated using Python's SymPy library. SymPy's `lambdify` function converts each symbolic expression into a callable Python function that can be evaluated and tested numerically.\n\n## Future Improvements\n\n- Expand the range of mathematical functions and expressions.\n- Optimize test case generation for more complex scenarios.\n\nFeel free to contribute by providing feedback, issues, or pull requests!\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsabasyed%2Fsynthetic-data-generation","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsabasyed%2Fsynthetic-data-generation","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsabasyed%2Fsynthetic-data-generation/lists"}