{"id":17318183,"url":"https://github.com/nathanmarz/elephantdb","last_synced_at":"2025-04-12T20:42:09.877Z","repository":{"id":62432590,"uuid":"1372117","full_name":"nathanmarz/elephantdb","owner":"nathanmarz","description":"Distributed database specialized in exporting key/value data from Hadoop","archived":false,"fork":false,"pushed_at":"2014-06-27T19:41:17.000Z","size":3037,"stargazers_count":558,"open_issues_count":4,"forks_count":51,"subscribers_count":40,"default_branch":"develop","last_synced_at":"2025-04-04T00:07:48.268Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/nathanmarz.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2011-02-16T01:54:18.000Z","updated_at":"2025-04-02T07:24:39.000Z","dependencies_parsed_at":"2022-11-01T21:00:47.950Z","dependency_job_id":null,"html_url":"https://github.com/nathanmarz/elephantdb","commit_stats":null,"previous_names":[],"tags_count":11,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nathanmarz%2Felephantdb","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nathanmarz%2Felephantdb/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nathanmarz%2Felephantdb/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nathanmarz%2Felephantdb/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/nathanmarz","download_url":"https://codeload.github.com/nathanmarz/elephantdb/tar.gz/refs/heads/develop","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248631668,"owners_count":21136554,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-15T13:19:07.868Z","updated_at":"2025-04-12T20:42:09.856Z","avatar_url":"https://github.com/nathanmarz.png","language":"Java","funding_links":[],"categories":["Key-value Data Model","II. Databases, search engines, big data and machine learning","数据库"],"sub_categories":["1. Databases and storages"],"readme":"[![Build Status](https://travis-ci.org/nathanmarz/elephantdb.png?branch=develop)](https://travis-ci.org/nathanmarz/elephantdb)\n\n# ElephantDB 0.5.1 (cascalog 2.x)\n\n## ElephantDB 0.4.5 (cascalog 1.x)\n\n# About\n\nElephantDB is a database that specializes in exporting key/value data\nfrom Hadoop. ElephantDB is composed of two components. The first is a\nlibrary that is used in MapReduce jobs for creating an indexed\nkey/value dataset that is stored on a distributed filesystem. The\nsecond component is a daemon that can download a subset of a dataset\nand serve it in a read-only, random-access fashion. A group of\nmachines working together to serve a full dataset is called a ring.\n\nSince ElephantDB server doesn't support random writes, it is almost\nlaughingly simple. Once the server loads up its subset of the data, it\ndoes very little. This leads to ElephantDB being rock-solid in\nproduction, since there's almost no moving parts.\n\nElephantDB server has a Thrift interface, so any language can make\nreads from it. The database itself is implemented in Clojure.\n\nAn ElephantDB datastore contains a fixed number of shards of a \"Local\nPersistence\". ElephantDB's local persistence engine is pluggable, and\nElephantDB comes bundled with local persistence implementations for\nBerkeley DB Java Edition and LevelDB. On the MapReduce side, each\nreducer creates or updates a single shard into the DFS, and on the\nserver side, each server serves a subset of the shards.\n\nElephantDB support hot-swapping so that a live server can be updated\nwith a new set of shards without downtime.\n\n# Questions\n\nGoogle group: [elephantdb-user](http://groups.google.com/group/elephantdb-user)\n\n# Introduction\n\n[Introduction to ElephantDB](https://speakerdeck.com/sorenmacbeth/introduction-to-elephantdb)\n\n# Tutorials\n\nTODO: Write an updated tutorial for ElephantDB 0.4.x\n\n# Using ElephantDB in MapReduce Jobs\n\nElephantDB is hosted at [Clojars](http://clojars.org/elephantdb).\nClojars is a maven repo that is trivially easy to use with maven or\nleiningen. You should use this dependency when using ElephantDB within\nyour MapReduce jobs to create ElephantDB datastores. ElephantDB\ncontains a module elephantdb-cascading which allows you to easily create\ndatastores from your Cascading workflows. elephantdb-cascalog is available\nfor use with [Cascalog](http://github.com/nathanmarz/cascalog) \u003e= 1.10.1.\n\n# Deploying ElephantDB server\n\nTODO: Documentation on how to deploy ElephantDB.\n\n# Running the EDB Jar\n\nTODO: Documentation on how to run ElephantDB\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnathanmarz%2Felephantdb","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnathanmarz%2Felephantdb","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnathanmarz%2Felephantdb/lists"}