{"id":27119259,"url":"https://github.com/mrchypark/elbird","last_synced_at":"2025-04-07T08:52:13.758Z","repository":{"id":38038019,"uuid":"275399040","full_name":"mrchypark/elbird","owner":"mrchypark","description":"R binding package Kiwi(Korean Intelligent Word Identifier)","archived":false,"fork":false,"pushed_at":"2023-06-20T03:38:30.000Z","size":3435,"stargazers_count":32,"open_issues_count":20,"forks_count":3,"subscribers_count":4,"default_branch":"main","last_synced_at":"2024-04-14T07:11:03.551Z","etag":null,"topics":["analyzer","hacktoberfest","hacktoberfest2021","morphological","r","r-package","rstats"],"latest_commit_sha":null,"homepage":"https://mrchypark.github.io/elbird/index.html","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mrchypark.png","metadata":{"files":{"readme":"README.Rmd","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null},"funding":{"github":null,"patreon":null,"open_collective":null,"ko_fi":null,"tidelift":null,"community_bridge":null,"liberapay":null,"issuehunt":null,"otechie":null,"custom":["https://mrchypark.github.io/donateme/","https://paypal.me/mrchypark"]}},"created_at":"2020-06-27T15:22:41.000Z","updated_at":"2024-03-10T10:49:45.000Z","dependencies_parsed_at":"2022-08-08T22:46:18.599Z","dependency_job_id":"42577b22-6249-43c3-a252-d58bb188f48b","html_url":"https://github.com/mrchypark/elbird","commit_stats":{"total_commits":357,"total_committers":3,"mean_commits":119.0,"dds":0.07282913165266103,"last_synced_commit":"6f72ba8044f04cb5bd772a466de7ccbc95dc6307"},"previous_names":[],"tags_count":5,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/r
epositories/mrchypark%2Felbird","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mrchypark%2Felbird/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mrchypark%2Felbird/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mrchypark%2Felbird/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mrchypark","download_url":"https://codeload.github.com/mrchypark/elbird/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246563368,"owners_count":20797444,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["analyzer","hacktoberfest","hacktoberfest2021","morphological","r","r-package","rstats"],"created_at":"2025-04-07T08:52:13.192Z","updated_at":"2025-04-07T08:52:13.750Z","avatar_url":"https://github.com/mrchypark.png","language":"R","funding_links":["https://mrchypark.github.io/donateme/","https://paypal.me/mrchypark"],"categories":[],"sub_categories":[],"readme":"---\noutput: github_document\n---\n\n\u003c!-- README.md is generated from README.Rmd. 
Please edit that file --\u003e\n\n```{r, include = FALSE}\nknitr::opts_chunk$set(\n  collapse = TRUE,\n  comment = \"#\u003e\",\n  fig.path = \"man/figures/README-\",\n  out.width = \"100%\"\n)\noptions(crayon.enabled = NULL)\n```\n\n# elbird [\u003cimg src=\"man/figures/logo.png\" align=\"right\" height=140/\u003e](https://mrchypark.github.io/elbird/index.html)\n\n\u003c!-- badges: start --\u003e\n[![Lifecycle: experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)\n[![R-CMD-check](https://github.com/mrchypark/elbird/workflows/R-CMD-check/badge.svg)](https://github.com/mrchypark/elbird/actions)\n[![CRAN status](https://www.r-pkg.org/badges/version/elbird)](https://cran.r-project.org/package=elbird)\n[![runiverse-name](https://mrchypark.r-universe.dev/badges/:name)](https://mrchypark.r-universe.dev/)\n[![runiverse-package](https://mrchypark.r-universe.dev/badges/elbird)](https://mrchypark.r-universe.dev/ui#packages)\n[![metacran downloads](https://cranlogs.r-pkg.org/badges/elbird)](https://cran.r-project.org/package=elbird)\n[![Downloads](https://cranlogs.r-pkg.org/badges/grand-total/elbird)](https://cran.r-project.org/package=elbird)\n[![Codecov test coverage](https://codecov.io/gh/mrchypark/elbird/branch/main/graph/badge.svg)](https://app.codecov.io/gh/mrchypark/elbird?branch=main)\n\u003c!-- badges: end --\u003e\n\n* [Korean version README](https://mrchypark.github.io/elbird/articles/README_kr.html)\n\nThe `elbird` package is a Korean morphological analyzer that wraps [Kiwi](https://github.com/bab2min/Kiwi).\nIt is built on the C++ library `Kiwi`, which offers faster performance than other tokenizers, easy addition of user dictionaries, extraction of unregistered nouns, and more.\n\n### logo\n\n\u003ca href=\"https://www.flaticon.com/free-icons/wings\" title=\"wings icons\"\u003eWings icons created by Good Ware - Flaticon\u003c/a\u003e    \n\u003ca 
href=\"https://www.flaticon.com/free-icons/africa\" title=\"africa icons\"\u003eAfrica icons created by Eucalyp - Flaticon\u003c/a\u003e\n\n## Installation\n\nYou can install elbird with:\n\n```r\n# CRAN\ninstall.packages(\"elbird\")\n\n# Dev version\ninstall.packages('elbird', repos = c('https://mrchypark.r-universe.dev', 'https://cloud.r-project.org'))\n```\n\n## Example\n\nThe examples below introduce the behavior of `elbird`'s functions.\n\n### tokenize\n\nThe `tokenize` function returns a list, `tokenize_tbl` returns a tibble, and `tokenize_tidy` is provided for compatibility with tidytext syntax.\n\n```{r}\nlibrary(elbird)\ntokenize(\"안녕하세요 kiwi 형태소 분석기의 R wrapper인 elbird를 소개합니다.\")\ntokenize_tidy(\"안녕하세요 kiwi 형태소 분석기의 R wrapper인 elbird를 소개합니다.\")\n```\n\nMultiple sentences can be passed as a `vector` or `list`; the result is returned as a `list`.\n\n```{r}\ntokenize(c(\"새롭게 작성된 패키지 입니다.\", \"tidytext와의 호환을 염두하고 작성하였습니다.\"))\ntokenize_tidy(c(\"새롭게 작성된 패키지 입니다.\", \"tidytext와의 호환을 염두하고 작성하였습니다.\"))\n```\n\n### With tidytext\n\nThe `tokenize_tidy` function is also available under the aliases `tokenize_tt` and `tokenize_tidytext`.\nBelow is an example of using it with the `tidytext` package.\nThe `tar` object below holds the target text for morphological analysis.\n\n```{r}\nsuppressMessages(library(dplyr))\n# install.packages(\"komment\", repos = \"https://forkonlp.r-universe.dev/\")\nlibrary(stringr)\nlibrary(tidytext)\nlibrary(komment)\n\nspeech_list %\u003e% \n  filter(president == \"이명박\") %\u003e% \n  filter(str_detect(title, \"취임사\")) %\u003e% \n  pull(link) %\u003e% \n  get_speech(paragraph = TRUE) %\u003e%\n  select(paragraph, content) -\u003e tar\ntar\n```\n\nThe example below passes `tar` to `unnest_tokens` from the `tidytext` package, using `tokenize_tidy` from `elbird` as the tokenizer.\n\n```{r}\ntar %\u003e% \n  unnest_tokens(\n    input = content,\n    output = word,\n    token = tokenize_tidy\n    
)\n```\n\n```{r}\nlibrary(ggplot2)\ntar %\u003e% \n  unnest_tokens(\n    input = content,\n    output = word,\n    token = tokenize_tidy\n    ) %\u003e%\n  count(word) %\u003e%\n  top_n(10) %\u003e%\n  ggplot(aes(n, word)) +\n  geom_col(show.legend = FALSE)\n```\n\n\n### analyze\n\nIn addition, the `analyze` function returns multiple candidate analyses together with their scores.\n\n```{r}\nlibrary(elbird)\nanalyze(\"안녕하세요 kiwi 형태소 분석기의 R wrapper인 elbird를 소개합니다.\")\nanalyze(c(\"안녕하세요. kiwi 형태소 분석기의 R wrapper인 elbird를 소개합니다.\"), top_n = 1)\n```\n\n## tag set\n\n[Tag list](https://github.com/bab2min/kiwipiepy#%ED%92%88%EC%82%AC-%ED%83%9C%EA%B7%B8) used in the [kiwipiepy](https://github.com/bab2min/kiwipiepy) package.\n\n```{r echo=FALSE, results='asis'}\ncat(paste0(\"* The table below was fetched at \", Sys.time(),\" \",Sys.timezone(),\".\"))\n```\n\n```{r echo=FALSE}\nhttr::GET(\"https://github.com/bab2min/kiwipiepy/blob/master/README.md\") %\u003e% \n  httr::content() %\u003e% \n  rvest::html_table() %\u003e% \n  knitr::kable(format = \"markdown\")\n```\n\n## Special Thanks to\n\n### kiwi package\n[bab2min](https://github.com/bab2min), author of the [kiwi package](https://github.com/bab2min/Kiwi).\n\n### logo\n[jhk0530](https://github.com/jhk0530) for the [suggestion](https://github.com/mrchypark/elbird/issues/6).\n\n### cpp backend\n[kkweon](https://github.com/kkweon) for the [kiwigo package](https://github.com/codingpot/kiwigo).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmrchypark%2Felbird","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmrchypark%2Felbird","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmrchypark%2Felbird/lists"}