{"id":18541878,"url":"https://github.com/cltk/cltk_docker","last_synced_at":"2025-07-02T01:32:57.734Z","repository":{"id":76838741,"uuid":"53440130","full_name":"cltk/cltk_docker","owner":"cltk","description":"Docker script for cltk","archived":false,"fork":false,"pushed_at":"2019-09-19T01:47:52.000Z","size":16,"stargazers_count":6,"open_issues_count":0,"forks_count":4,"subscribers_count":36,"default_branch":"master","last_synced_at":"2025-04-10T11:48:30.802Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/cltk.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-03-08T19:43:17.000Z","updated_at":"2019-11-21T21:15:59.000Z","dependencies_parsed_at":null,"dependency_job_id":"154d4c61-210f-413b-9079-b7c60ed9eb21","html_url":"https://github.com/cltk/cltk_docker","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/cltk/cltk_docker","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Fcltk_docker","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Fcltk_docker/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Fcltk_docker/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Fcltk_docker/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/cltk","download_url":"https://codeload.github.com/cltk/cltk_docker/tar.gz/refs/heads/master","sbom
_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cltk%2Fcltk_docker/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263060214,"owners_count":23407473,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-06T20:06:36.421Z","updated_at":"2025-07-02T01:32:57.707Z","avatar_url":"https://github.com/cltk.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"[![Build Status](https://travis-ci.org/cltk/cltk_docker.svg?branch=master)](https://travis-ci.org/cltk/cltk_docker)\n\n# Docker for CLTK core software\nThis repository contains a Docker container for the [CLTK](http://cltk.org).\n\n\n# Build\nFirst, clone this repository:\n``` bash\n$ git clone https://github.com/cltk/cltk_docker.git\n$ cd cltk_docker\n```\n\nBuild the image:\n```bash\n$ docker build -t cltk .\n```\n\n# Running\nTo run the image:\n```bash\n$ docker run -it cltk\n```\n\n```python\n\u003e\u003e\u003e from cltk.corpus.utils.importer import CorpusImporter\n\u003e\u003e\u003e c = CorpusImporter('latin')\n\u003e\u003e\u003e c.list_corpora\n['latin_text_perseus', 'latin_treebank_perseus', 'latin_text_lacus_curtius', 'latin_text_latin_library', 'phi5', 'phi7', 'latin_proper_names_cltk', 'latin_models_cltk', 'latin_pos_lemmata_cltk', 'latin_treebank_index_thomisticus', 'latin_lexica_perseus', 'latin_training_set_sentence_cltk', 'latin_word2vec_cltk', 'latin_text_antique_digiliblt', 'latin_text_corpus_grammaticorum_latinorum']\n```\n\n# Data Volumes\n\nThis `Dockerfile` uses three data volumes, which you 
can use to persist data across runs or map a directory from the Docker host:\n\n* `/cltk_data`\n* `/nltk_data`\n* `/data`\n\nSo if you use e.g. `docker volume create cltk_data`, you can then use `docker run -ti -v cltk_data:/cltk_data cltk`, and any corpora installed will persist when you use the same volume. If your Docker host has already installed corpora locally, you could instead use e.g. `docker run -ti -v $HOME/cltk_data:/cltk_data cltk`.\n\n# Installing Corpora\n\nThis container also comes with a helper script, `install_corpora.py`, which can be used to install all corpora:\n\n    docker run -ti -v cltk_data:/cltk_data cltk install_corpora.py\n\nOr corpora for specific languages:\n\n    docker run -ti -v cltk_data:/cltk_data cltk install_corpora.py greek latin\n\n# Jupyter Notebook\n\nThe `Dockerfile.jupyter` file also defines a Jupyter Notebook container with CLTK installed. You can build it with `docker build -t cltk-jupyter -f Dockerfile.jupyter .`, and run it with e.g. (also using a mapped data volume as in the example above) `docker run -p 8888:8888 -v cltk_data:/cltk_data cltk-jupyter` (see the [Jupyter Docker Stacks Quick Start documentation](https://github.com/jupyter/docker-stacks#quick-start) for more examples).\n\n# License\nMIT. See LICENSE.txt.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcltk%2Fcltk_docker","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcltk%2Fcltk_docker","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcltk%2Fcltk_docker/lists"}