{"id":15296146,"url":"https://github.com/mlampros/texttinyr","last_synced_at":"2025-04-13T19:31:01.228Z","repository":{"id":39706135,"uuid":"78033269","full_name":"mlampros/textTinyR","owner":"mlampros","description":"Text Processing for Small or Big Data Files in R","archived":false,"fork":false,"pushed_at":"2023-12-05T08:38:33.000Z","size":1623,"stargazers_count":39,"open_issues_count":0,"forks_count":8,"subscribers_count":10,"default_branch":"master","last_synced_at":"2025-03-27T10:13:30.247Z","etag":null,"topics":["bh","boost","cpp11","processing","r","rcpp","rcpparmadillo","text"],"latest_commit_sha":null,"homepage":"https://mlampros.github.io/textTinyR/","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mlampros.png","metadata":{"funding":{"github":["mlampros"],"patreon":null,"open_collective":null,"ko_fi":null,"tidelift":null,"community_bridge":null,"liberapay":null,"issuehunt":null,"otechie":null,"custom":null},"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-01-04T16:34:50.000Z","updated_at":"2025-03-19T10:18:16.000Z","dependencies_parsed_at":"2024-06-11T20:37:46.756Z","dependency_job_id":"450e8d10-1f44-4dd3-9eb2-afc4a42290d5","html_url":"https://github.com/mlampros/textTinyR","commit_stats":{"total_commits":64,"total_committers":2,"mean_commits":32.0,"dds":0.328125,"last_synced_commit":"6e38cff8d36641e00e0e422bcc47410150e2e023"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mlampros%2FtextTinyR","tags_url":"https://repos
.ecosyste.ms/api/v1/hosts/GitHub/repositories/mlampros%2FtextTinyR/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mlampros%2FtextTinyR/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mlampros%2FtextTinyR/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mlampros","download_url":"https://codeload.github.com/mlampros/textTinyR/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248767845,"owners_count":21158540,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bh","boost","cpp11","processing","r","rcpp","rcpparmadillo","text"],"created_at":"2024-09-30T18:09:34.534Z","updated_at":"2025-04-13T19:31:00.206Z","avatar_url":"https://github.com/mlampros.png","language":"R","funding_links":["https://github.com/sponsors/mlampros","https://www.buymeacoffee.com/VY0x8snyh"],"categories":[],"sub_categories":[],"readme":"\n[![tic](https://github.com/mlampros/textTinyR/workflows/tic/badge.svg?branch=master)](https://github.com/mlampros/textTinyR/actions)\n[![codecov.io](https://codecov.io/github/mlampros/textTinyR/coverage.svg?branch=master)](https://codecov.io/github/mlampros/textTinyR?branch=master)\n[![CRAN_Status_Badge](http://www.r-pkg.org/badges/version/textTinyR)](http://cran.r-project.org/package=textTinyR)\n[![Downloads](http://cranlogs.r-pkg.org/badges/grand-total/textTinyR?color=blue)](http://www.r-pkg.org/pkg/textTinyR)\n\u003ca href=\"https://www.buymeacoffee.com/VY0x8snyh\" target=\"_blank\"\u003e\u003cimg 
src=\"https://www.buymeacoffee.com/assets/img/custom_images/orange_img.png\" alt=\"Buy Me A Coffee\" height=\"21px\" \u003e\u003c/a\u003e\n[![Dependencies](https://tinyverse.netlify.com/badge/textTinyR)](https://cran.r-project.org/package=textTinyR)\n[![](https://img.shields.io/docker/automated/mlampros/texttinyr.svg)](https://hub.docker.com/r/mlampros/texttinyr)\n\n\n## textTinyR\n\u003cbr\u003e\n\nThe *textTinyR* package consists of text processing functions for small or big data files. More details on the functionality of textTinyR can be found in [blog-post1](http://mlampros.github.io/2017/01/05/textTinyR_package/) and [blog-post2](http://mlampros.github.io/2018/04/04/extending_textTinyR_package/). The R package can be installed on the following operating systems: Linux, macOS and Windows. However, there is one limitation: *Chinese*, *Japanese*, *Korean*, *Thai* and other *languages with ambiguous word boundaries* are not supported.\n\n\n\u003cbr\u003e\n\n\n**UPDATE 01-04-2018** : *boost-locale* is no longer a system requirement for the textTinyR package.\n\n\n\u003cbr\u003e\n\n\n### **Installation of the textTinyR package (CRAN, Github)**\n\n\u003cbr\u003e\n\nTo install the package from CRAN use, \n\n```R\n\ninstall.packages('textTinyR')\n\n\n```\n\u003cbr\u003e\n\nand to download the latest version from Github use the *install_github* function of the devtools package,\n\u003cbr\u003e\u003cbr\u003e\n\n```R\n\ndevtools::install_github(repo = 'mlampros/textTinyR')\n\n\n```\n\u003cbr\u003e\nUse the following link to report bugs/issues,\n\u003cbr\u003e\n\n[https://github.com/mlampros/textTinyR/issues](https://github.com/mlampros/textTinyR/issues)\n\n\u003cbr\u003e\u003cbr\u003e\n\n\n**UPDATE 06-02-2020**\n\n\u003cbr\u003e\n\n**Docker images** of the *textTinyR* package are available to download from my [dockerhub](https://hub.docker.com/r/mlampros/texttinyr) account. The images come with *Rstudio* and the *R-development* version (latest) installed. 
The whole process was tested on Ubuntu 18.04. To **pull** \u0026 **run** the image do the following,\n\n\u003cbr\u003e\n\n```R\n\ndocker pull mlampros/texttinyr:rstudiodev\n\ndocker run -d --name rstudio_dev -e USER=rstudio -e PASSWORD=give_here_your_password --rm -p 8787:8787 mlampros/texttinyr:rstudiodev\n\n```\n\n\u003cbr\u003e\n\nThe user can also **bind** a home directory / folder to the container to use its files by specifying the **-v** option,\n\n\u003cbr\u003e\n\n```R\n\ndocker run -d --name rstudio_dev -e USER=rstudio -e PASSWORD=give_here_your_password --rm -p 8787:8787 -v /home/YOUR_DIR:/home/rstudio/YOUR_DIR mlampros/texttinyr:rstudiodev\n\n\n```\n\n\u003cbr\u003e\n\nIn the latter case you might first have to grant write access to the **YOUR_DIR** directory (this is not always necessary) using,\n\n\u003cbr\u003e\n\n```R\n\nchmod -R 777 /home/YOUR_DIR\n\n\n```\n\n\u003cbr\u003e\n\nThe **USER** defaults to *rstudio* but you have to give your **PASSWORD** of preference (see [https://rocker-project.org/](https://rocker-project.org/) for more information).\n\n\u003cbr\u003e\n\nOpen your web browser and, depending on where the docker image was *built / run*, enter, \n\n\u003cbr\u003e\n\n**1st. Option** on your personal computer,\n\n\u003cbr\u003e\n\n```R\nhttp://0.0.0.0:8787 \n\n```\n\n\u003cbr\u003e\n\n**2nd. 
Option** on a cloud instance, \n\n\u003cbr\u003e\n\n```R\nhttp://Public DNS:8787\n\n```\n\n\u003cbr\u003e\n\nto access the Rstudio console in order to give your username and password.\n\n\u003cbr\u003e\n\n### **Citation:**\n\nIf you use the code of this repository in your paper or research please cite both **textTinyR** and the **original software** [https://CRAN.R-project.org/package=textTinyR/citation.html](https://CRAN.R-project.org/package=textTinyR/citation.html):\n\n\u003cbr\u003e\n\n```R\n@Manual{,\n  title = {{textTinyR}: Text Processing for Small or Big Data Files},\n  author = {Lampros Mouselimis},\n  year = {2021},\n  note = {R package version 1.1.8},\n  url = {https://CRAN.R-project.org/package=textTinyR},\n}\n```\n\n\u003cbr\u003e\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmlampros%2Ftexttinyr","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmlampros%2Ftexttinyr","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmlampros%2Ftexttinyr/lists"}