{"id":22479022,"url":"https://github.com/janforman/aspseek","last_synced_at":"2025-03-27T18:22:53.182Z","repository":{"id":153593233,"uuid":"179712245","full_name":"janforman/aspseek","owner":"janforman","description":"ASPseek is a full-featured medium-to-large scale SQL-based Internet search engine. It consists of an indexing robot, search daemon and search frontend (CGI program). These programs are written in C++ using the STL library.","archived":false,"fork":false,"pushed_at":"2020-07-20T12:28:38.000Z","size":1051,"stargazers_count":3,"open_issues_count":1,"forks_count":1,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-02-01T21:27:44.844Z","etag":null,"topics":["full-text-search","indexing-engine","internet-archive","search-engine","searching","spider"],"latest_commit_sha":null,"homepage":"","language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/janforman.png","metadata":{"files":{"readme":"README","changelog":"NEWS","contributing":null,"funding":null,"license":"COPYING","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":"AUTHORS","dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-04-05T15:54:50.000Z","updated_at":"2024-01-31T10:26:34.000Z","dependencies_parsed_at":"2023-05-19T22:30:39.885Z","dependency_job_id":null,"html_url":"https://github.com/janforman/aspseek","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/janforman%2Faspseek","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/janforman%2Faspseek/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/janforman%2Faspseek/releases","mani
fests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/janforman%2Faspseek/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/janforman","download_url":"https://codeload.github.com/janforman/aspseek/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245898706,"owners_count":20690540,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["full-text-search","indexing-engine","internet-archive","search-engine","searching","spider"],"created_at":"2024-12-06T15:12:25.316Z","updated_at":"2025-03-27T18:22:53.172Z","avatar_url":"https://github.com/janforman.png","language":"C++","funding_links":[],"categories":[],"sub_categories":[],"readme":"\tASPseek v.1.2\n\t    Advanced Internet search engine\n\tCopyright (C) 2000, 2001, 2002 by SWsoft\n\n\n\nASPseek is a full-featured medium-to-large scale Internet search engine.\nIt consists of an indexing robot, a search daemon and a search front-ends\n(CGI or Apache module). 
These programs are written in C++ using the STL library.\nASPseek uses a mix of an SQL database and binary files for data storage.\n\n\n\tASPseek features\n\t----------------\n\nTo learn about ASPseek features, please read the aspseek(7) man page.\nHere is just a brief list:\n\n* Ability to index and search through several million documents\n* HTTP, HTTP proxy, FTP (via proxy) protocols\n* HTTP basic authorization\n* HTTPS protocol\n* text/html and text/plain documents\n* Support for other document types via external converters\n* Architecture optimized for multiple sites\n* Multithreaded\n* Async DNS resolver\n* Stopwords\n* Unicode support to deal with many character sets (including CJK) at once\n* Charset guesser (optional)\n* Language guesser\n* Robot exclusion standard (robots.txt) support\n* Settings to control network bandwidth usage and Web server load\n* Real-time asynchronous indexing\n* Very good relevancy of results\n* Sorting results by relevance or by date\n* Smart results cache\n* Advanced search capabilities\n* Ispell support\n* Excerpts\n* Grouping results by site\n* Clones (mirrored documents) detection\n* Spaces and subsets\n* Query words highlighting in results\n* Cached compressed local copy of every indexed document\n* HTML templates for easy-to-customize search results\n\n\n\tHow to use it\n\t-------------\n\nPlease start by reading the INSTALL file, where you can find detailed\ninstructions about installation, run-time configuration and usage of ASPseek.\n\n\n\tDisclaimer (see COPYING for details)\n\t------------------------------------\n\nThis program is free software; you can redistribute it and/or modify\nit under the terms of the GNU General Public License as published by\nthe Free Software Foundation; either version 2 of the License, or\n(at your option) any later version.\n\nThis program is distributed in the hope that it will be useful,\nbut WITHOUT ANY WARRANTY; without even the implied warranty of\nMERCHANTABILITY or FITNESS FOR A PARTICULAR 
PURPOSE.  See the\nGNU General Public License for more details.\n\nYou should have received a copy of the GNU General Public License\nalong with this program; if not, write to the Free Software\nFoundation, Inc., 59 Temple Place, Suite 330, Boston, MA  02111-1307  USA\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjanforman%2Faspseek","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjanforman%2Faspseek","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjanforman%2Faspseek/lists"}