{"id":13910576,"url":"https://github.com/biolab/orange3-text","last_synced_at":"2025-08-14T17:38:57.812Z","repository":{"id":37431401,"uuid":"38108816","full_name":"biolab/orange3-text","owner":"biolab","description":"🍊 :page_facing_up: Text Mining add-on for Orange3","archived":false,"fork":false,"pushed_at":"2024-10-11T10:04:43.000Z","size":48696,"stargazers_count":127,"open_issues_count":56,"forks_count":84,"subscribers_count":20,"default_branch":"master","last_synced_at":"2024-10-29T20:54:53.113Z","etag":null,"topics":["bag-of-words","lemmatization","newspapers","nltk","orange","sentiment-analysis","stemming","stopwords","text","text-analysis","text-mining","twitter"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/biolab.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"github":"biolab","patreon":null,"open_collective":null,"ko_fi":null,"tidelift":null,"community_bridge":null,"liberapay":null,"issuehunt":null,"otechie":null,"custom":null}},"created_at":"2015-06-26T12:02:06.000Z","updated_at":"2024-10-24T23:39:32.000Z","dependencies_parsed_at":"2023-02-18T06:45:48.817Z","dependency_job_id":"48900ba6-ef05-4f3a-81a7-c46f1028bbb5","html_url":"https://github.com/biolab/orange3-text","commit_stats":{"total_commits":1562,"total_committers":30,"mean_commits":52.06666666666667,"dds":0.7336747759282971,"last_synced_commit":"50ff18f17f6f0275a7b9ecf37345290d38933207"},"previous_names":[],"tags_count":59,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/biolab%2Forange3-text","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/biolab%2Forange3-text/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/biolab%2Forange3-text/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/biolab%2Forange3-text/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/biolab","download_url":"https://codeload.github.com/biolab/orange3-text/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":226388662,"owners_count":17617315,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bag-of-words","lemmatization","newspapers","nltk","orange","sentiment-analysis","stemming","stopwords","text","text-analysis","text-mining","twitter"],"created_at":"2024-08-07T00:01:35.229Z","updated_at":"2024-11-25T19:31:24.218Z","avatar_url":"https://github.com/biolab.png","language":"Python","funding_links":["https://github.com/sponsors/biolab"],"categories":["Python"],"sub_categories":[],"readme":"Orange3 Text \n============\n\n[![Discord Chat](https://img.shields.io/discord/633376992607076354)](https://discord.gg/FWrfeXV)\n[![Build Status](https://travis-ci.org/biolab/orange3-text.svg?branch=master)](https://travis-ci.org/biolab/orange3-text)\n[![codecov](https://codecov.io/gh/biolab/orange3-text/branch/master/graph/badge.svg)](https://codecov.io/gh/biolab/orange3-text)\n[![Documentation Status](https://readthedocs.org/projects/orange3-text/badge/?version=latest)](http://orange3-text.readthedocs.org/en/latest/?badge=latest)\n\nOrange3 Text extends [Orange3](http://orange.biolab.si), a data mining software\npackage, with common functionality for text mining. It provides access\nto publicly available data, like NY Times, Twitter, Wikipedia and PubMed. Furthermore,\nit provides tools for preprocessing, constructing vector spaces (like\nbag-of-words, topic modeling, and similarity hashing) and visualizations like word cloud\nend geo map. All features can be combined with powerful data mining techniques\nfrom the Orange data mining framework.\n\nPlease note that Text add-on won't work on 32-bit Windows systems. The add-on depends on conda-forge and they have [removed support for Windows 32](https://github.com/conda-forge/staged-recipes/issues/5640) in April 2018.\n\nAnaconda installation\n---------------------\n\nThe easiest way to install Orange3-Text is with Anaconda distribution. Download [Anaconda](https://www.continuum.io/downloads) \nfor your OS (Python version 3.5). In your Anaconda Prompt first add conda-forge to your channels:\n\n    conda config --add channels conda-forge\n\nThen install Orange3-Text\n\n    conda install orange3-text\n\nRun\n\n    python -m Orange.canvas\n\nto open Orange and check if everything is installed properly.\n\nInstallation from source\n------------------------\n\nTo install the add-on from source\n\n    # Clone the repository and move into it\n    git clone https://github.com/biolab/orange3-text.git\n    cd orange3-text\n\n    # Install the dependencies:\n    pip install -r requirements.txt\n\n    # Finally install Orange3-Text in editable/development mode.\n    pip install -e .\n\nTo register this add-on with Orange, but keep the code in the development directory (do not copy it to \nPython's site-packages directory), run\n\n    python setup.py develop\n\nWindows setup for biopython library\n-----------------------------------\n\nIf you're not using Anaconda distribution, you can manually install biopython library before installing the add-on.\nFirst, download the compiler [Visual Studio](http://landinghub.visualstudio.com/visual-cpp-build-tools) and run the setup with:\n\n    python setup.py build_ext --inplace --compiler=msvc install\n\nUsage\n-----\n\nAfter the installation, the widgets from this add-on are registered with Orange. To run Orange from the terminal,\nuse\n\n    python3 -m Orange.canvas\n\nor\n\n\torange-canvas\n\nThe new widgets are in the toolbox bar under Text Mining section.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbiolab%2Forange3-text","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbiolab%2Forange3-text","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbiolab%2Forange3-text/lists"}