{"id":19544650,"url":"https://github.com/viig99/step_dspy","last_synced_at":"2026-02-13T17:32:28.420Z","repository":{"id":247681943,"uuid":"822854844","full_name":"viig99/step_dspy","owner":"viig99","description":"STEP Web agent for WebArena using dspy.","archived":false,"fork":false,"pushed_at":"2024-07-21T08:47:47.000Z","size":30,"stargazers_count":3,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-05-13T15:57:13.620Z","etag":null,"topics":["agents","dspy","dspy-ai","step"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/viig99.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-07-02T01:01:34.000Z","updated_at":"2024-08-29T14:13:27.000Z","dependencies_parsed_at":"2025-05-13T15:58:13.637Z","dependency_job_id":null,"html_url":"https://github.com/viig99/step_dspy","commit_stats":null,"previous_names":["viig99/step_dspy"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/viig99/step_dspy","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viig99%2Fstep_dspy","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viig99%2Fstep_dspy/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viig99%2Fstep_dspy/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viig99%2Fstep_dspy/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/viig99","download_url":"https://co
deload.github.com/viig99/step_dspy/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/viig99%2Fstep_dspy/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":262178241,"owners_count":23271020,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["agents","dspy","dspy-ai","step"],"created_at":"2024-11-11T03:30:20.344Z","updated_at":"2026-02-13T17:32:28.373Z","avatar_url":"https://github.com/viig99.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Agents on DSPy\nBuilding an agent using DSPy that can interact with the [WebArena](https://webarena.dev/) environment.\n\nWe aim to achieve state-of-the-art (SOTA) performance on the [WebArena benchmark](https://docs.google.com/spreadsheets/d/1M801lEpBbKSNwP-vDBkC_pF7LdyGU1f_ufZb_NWNBZQ/edit?usp=sharing) by implementing ideas such as:\n* [SteP: Stacked LLM Policies for Web Actions](https://arxiv.org/pdf/2310.03720)\n* [Tree Search For Language Model Agents](https://jykoh.com/search-agents/paper.pdf)\n\n## Sample Agent\nThe sample agent solves the task of finding the walking distance between two locations on OpenStreetMap.\n\n[![Watch Video](https://img.youtube.com/vi/vXUkQjeIhbo/hqdefault.jpg)](https://www.youtube.com/watch?v=vXUkQjeIhbo)\n\n### Installation\n1. Install the WebArena dependencies\n```bash\ncd webarena\nuv venv -p 3.11 --seed\nsource .venv/bin/activate\npip install -r requirements.txt\nplaywright install\npip install -e .\n```\n\n2. 
Configure the environment\n```bash\n# export MAP=\"http://ec2-3-131-244-37.us-east-2.compute.amazonaws.com:3000\"\nexport MAP=\"https://www.openstreetmap.org\"\nexport SHOPPING=\"\u003cyour_shopping_site_domain\u003e:7770\"\nexport SHOPPING_ADMIN=\"\u003cyour_e_commerce_cms_domain\u003e:7780/admin\"\nexport REDDIT=\"\u003cyour_reddit_domain\u003e:9999\"\nexport GITLAB=\"\u003cyour_gitlab_domain\u003e:8023\"\n```\n\n3. Obtain the auto-login cookies for all websites\n```bash\nmkdir -p ./.auth\npython browser_env/auto_login.py\n```\n\n4. Copy the map configs to the `config_data` folder\n```bash\npython scripts/generate_test_data.py\n# For Fedora:\ngrep -ol \"\\\"map\\\"\" config_files/*.json | xargs cp -t ../config_data/\n# For Mac:\ngrep -ol \"\\\"map\\\"\" config_files/*.json | xargs -I {} cp {} ../config_data/\nrm ../config_data/test*.json\n```\n\n### Running\n1. Activate the environment and run the debug script from the root directory\n```bash\nsource webarena/.venv/bin/activate\npython -m scripts.evaluate.debug_webarena\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fviig99%2Fstep_dspy","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fviig99%2Fstep_dspy","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fviig99%2Fstep_dspy/lists"}