{"id":32115279,"url":"https://github.com/paulhoule/infovore","last_synced_at":"2026-02-27T22:11:33.032Z","repository":{"id":5317442,"uuid":"6500041","full_name":"paulhoule/infovore","owner":"paulhoule","description":"RDF-Centric Map/Reduce Framework and Freebase data conversion tool","archived":false,"fork":false,"pushed_at":"2021-11-15T06:03:33.000Z","size":4385,"stargazers_count":148,"open_issues_count":49,"forks_count":21,"subscribers_count":21,"default_branch":"master","last_synced_at":"2025-10-20T15:52:35.742Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/paulhoule.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2012-11-02T01:56:00.000Z","updated_at":"2025-10-10T08:12:10.000Z","dependencies_parsed_at":"2022-07-05T13:31:45.271Z","dependency_job_id":null,"html_url":"https://github.com/paulhoule/infovore","commit_stats":null,"previous_names":[],"tags_count":55,"template":false,"template_full_name":null,"purl":"pkg:github/paulhoule/infovore","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paulhoule%2Finfovore","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paulhoule%2Finfovore/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paulhoule%2Finfovore/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/paulhoule%2Finfovore/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/paulhoule","download_url":"https://codeload.github.com/paulhoule/infovore/tar.gz/refs/heads/master","sbom_url":"https://repo
s.ecosyste.ms/api/v1/hosts/GitHub/repositories/paulhoule%2Finfovore/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":280118752,"owners_count":26275307,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-20T02:00:06.978Z","response_time":62,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-10-20T15:52:47.297Z","updated_at":"2025-10-20T15:52:51.924Z","avatar_url":"https://github.com/paulhoule.png","language":"Java","funding_links":[],"categories":["Graph Data Model","Machine Learning"],"sub_categories":["BBedit"],"readme":"Overview\n--------\n\nInfovore is an RDF processing system that uses Hadoop to process RDF data\nsets in the billion-triple range and beyond.  Infovore was originally designed to process\nthe (old) proprietary Freebase dump into RDF,  but once Freebase came out with an official RDF\ndump,  Infovore gained the ability to clean and purify the dump,  making it not just possible\nbut easy to process Freebase data with triple stores such as Virtuoso 7.\n\nEvery week we run Infovore on Amazon Elastic MapReduce to produce a product known as\n[:BaseKB](http://basekb.com/).\n\nInfovore depends on the [Centipede](https://github.com/paulhoule/centipede/wiki) framework for packaging\nand processing command-line arguments.  
The [Telepath](https://github.com/paulhoule/telepath/wiki) project\nextends the Infovore project in order to process Wikipedia usage information to produce a product called\n[:SubjectiveEye3D](https://github.com/paulhoule/telepath/wiki/SubjectiveEye3D).\n\n\nSupporting\n----------\n\nIt costs several hundred dollars per month to process and store files in connection with this work.\nPlease join \u003ca href=\"https://www.gittip.com/\"\u003eGittip\u003c/a\u003e and make a \u003ca href=\"https://www.gittip.com/paulhoule/\"\u003esmall weekly donation\u003c/a\u003e to keep this data free.\n\n\nBuilding\n--------\n\nInfovore software requires JDK 7.\n\nmvn clean install\n\nInstalling\n----------\n\nThe following cantrip, run from the top level \"infovore\" directory, initializes the bash shell\nfor the use of the \"haruhi\" program,  which can be used to run Infovore applications\npackaged in the Bakemono Jar.\n\nsource haruhi/target/path.sh\n\nMore Information\n----------------\n\nSee \n\nhttps://github.com/paulhoule/infovore/wiki \n\nfor documentation and join the discussion group at\n\nhttps://groups.google.com/forum/#!forum/infovore-basekb\n\n\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpaulhoule%2Finfovore","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpaulhoule%2Finfovore","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpaulhoule%2Finfovore/lists"}