{"id":29179512,"url":"https://github.com/genivia/ugrep-testing","last_synced_at":"2025-07-01T19:05:55.603Z","repository":{"id":301434411,"uuid":"1009231161","full_name":"Genivia/ugrep-testing","owner":"Genivia","description":"ugrep pattern match testing and validation","archived":false,"fork":false,"pushed_at":"2025-06-26T20:46:46.000Z","size":36771,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-06-26T22:02:46.029Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"https://ugrep.com","language":"Shell","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Genivia.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-06-26T19:41:21.000Z","updated_at":"2025-06-26T20:46:49.000Z","dependencies_parsed_at":"2025-06-26T22:12:50.392Z","dependency_job_id":null,"html_url":"https://github.com/Genivia/ugrep-testing","commit_stats":null,"previous_names":["genivia/ugrep-testing"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Genivia/ugrep-testing","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Genivia%2Fugrep-testing","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Genivia%2Fugrep-testing/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Genivia%2Fugrep-testing/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Genivia%2Fugrep-testing/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Genivia","download_url":"https://codeload.github.com/Genivia/ugrep-testing/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Genivia%2Fugrep-testing/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263021812,"owners_count":23401148,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-07-01T19:02:47.930Z","updated_at":"2025-07-01T19:05:55.594Z","avatar_url":"https://github.com/Genivia.png","language":"Shell","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ugrep pattern match testing and validation\n\n## Setup\n\nCompile two small utilities `pick` and `trickle` to support the tests:\n```console\n$ ./setup.sh\n```\n\n## Run\n\nRun the extensive barrage of tests, which can take hours to complete:\n```console\n$ ./run.sh\n...\nOK\n```\nReports `OK` or halts when an error is detected, where `temp_words.txt` is the set of words seaarched as a pattern and `temp_results.txt` is the output with a problem.\n\nThe bulk of the tests are designed go through all possible pattern match methods and optimizations with randomized patterns.  This is independent of the ugrep command line options.  The SIMD optimizations with which ugrep was compiled are tested, when SIMD is enabled, which is one of SSE2, AVX2, AVX512BW, NEON, or AArch64.\n\n## Data\n\n- `words` extracted from `enwik8` with `ugrep -iwo '[a-z]+' enwik8 | sort -u \u003e words`\n- `enwik8` 100MB Wikipedia file\n\nNote: we pick words of a specific byte length or within a byte length range to test with.  Therefore, we test with ASCII words only to match byte lengths.  This has no impact on validation of the internal byte-based pattern match methods that don't care what bytes represent, i.e. ASCII or UTF-8 or raw binary.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgenivia%2Fugrep-testing","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgenivia%2Fugrep-testing","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgenivia%2Fugrep-testing/lists"}