{"id":20356394,"url":"https://github.com/v74c63t/zot-query","last_synced_at":"2026-04-15T14:03:15.692Z","repository":{"id":240026051,"uuid":"601412214","full_name":"v74c63t/zot-query","owner":"v74c63t","description":"This is a search engine that uses rank retrieval to retrieve urls that are most relevant to the query based on the index that was built.","archived":false,"fork":false,"pushed_at":"2024-05-17T10:22:32.000Z","size":17416,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-15T01:09:45.562Z","etag":null,"topics":["css","flask","html","python"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/v74c63t.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-02-14T02:15:24.000Z","updated_at":"2024-05-17T10:22:42.000Z","dependencies_parsed_at":"2024-05-16T10:51:03.312Z","dependency_job_id":null,"html_url":"https://github.com/v74c63t/zot-query","commit_stats":null,"previous_names":["v74c63t/search-engine"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/v74c63t%2Fzot-query","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/v74c63t%2Fzot-query/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/v74c63t%2Fzot-query/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/v74c63t%2Fzot-query/manifests","owner_url":"https://repos.ecosyste.ms/a
pi/v1/hosts/GitHub/owners/v74c63t","download_url":"https://codeload.github.com/v74c63t/zot-query/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":241890103,"owners_count":20037637,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["css","flask","html","python"],"created_at":"2024-11-14T23:16:17.732Z","updated_at":"2026-04-15T14:03:10.660Z","avatar_url":"https://github.com/v74c63t.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ZOT Query \n\n## About\n\nThis is a search engine that uses rank retrieval to retrieve urls that are most relevant to the query based on the index that was built.\n\n## Configuration\n\n1. Installing libraries\n   - There are some required libraries that must be installed that are not part of the standard Python libraries. These must be installed prior to running anything and can be done by running the command below.\n\n     ```\n     pip install -r requirements.txt\n     ```\n\n3. Documents\n   - The documents that the index will be built from need to be placed in a folder named `DEV`\n   - All the documents within in this folder should be JSON files\n\n\n## Execution\n\n1. 
**Building the Index**\n   - To build the index run the command below\n     ```\n     python index.py\n     ```\n   - As long as the documents have been placed in the appropriate folder and all required packages are installed, a folder called `indexes` and three files will be created in total\n      - `doc_url_dev.json`\n        - This is the dictionary that contains the document id to document url key pairs\n      - `indexes/final_index.json`\n         - This is the built index\n      - `indexes/index_of_index.json`\n         - This is an index of the built index for optimization purposes so the entire index will not be needed to load into memory     \n   - *Note: This process takes a while to run, but there should be print statements that provides progress updates throughout the build. For reference, building the index for 55K+ documents took around an hour.*\n\n3. **Starting up the Search Interface**\n   - To start up the search interface, run the command below\n     ```\n     python run_server.py\n     ```\n   - This will start up the server and automatically open a new window to display the search interface via a web browser\n   - *Note: If a new window does not open with the app, go to `http://127.0.0.1:5000` on any web browser to access it.*\n   - Displayed below is an image of the search interface\n\n      ![](assets/search_engine.png)\n      \n5. 
**Performing a Query**\n   - To perform a query, type it into the input prompt and press either the Enter key or the search button\n   - If results are found in the index, up to the top 5 URLs are displayed in a list under the input prompt, along with the time in ms it took to retrieve them for that query\n   - These URLs are clickable and lead to the actual web pages\n   - Below are some example queries\n\n      ![](assets/query1.png)\n      \n      ![](assets/query2.png)\n      \n      ![](assets/query3.png)\n      \n      ![](assets/query4.png)\n      \n7. **Additional Notes**\n    - If an error such as access denied pops up, wait a few minutes for everything to finish rendering/loading and then try again\n    - Make sure the URL starts with `http://` instead of `https://`\n\n## Video Walkthrough\n\n\u003cimg src='assets/walkthrough.gif' alt='walkthrough'/\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fv74c63t%2Fzot-query","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fv74c63t%2Fzot-query","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fv74c63t%2Fzot-query/lists"}