https://github.com/toyama0919/embulk-filter-woothee
- Host: GitHub
- URL: https://github.com/toyama0919/embulk-filter-woothee
- Owner: toyama0919
- License: mit
- Created: 2015-08-24T07:51:10.000Z (almost 11 years ago)
- Default Branch: master
- Last Pushed: 2016-06-14T04:14:29.000Z (almost 10 years ago)
- Last Synced: 2025-02-02T08:31:58.525Z (over 1 year ago)
- Topics: agent, embulk, embulk-plugin, java, user-agent, woothee
- Language: Java
- Size: 81.1 KB
- Stars: 4
- Watchers: 4
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
# Woothee filter plugin for Embulk [Build Status](http://travis-ci.org/toyama0919/embulk-filter-woothee)
Parses User-Agent strings and filters or drops records by user terminal category.
See [woothee/woothee-java](https://github.com/woothee/woothee-java).
## Overview
* **Plugin type**: filter
## Configuration
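- **key_name**: name of the column that holds the User-Agent string (string, required)
- **out_key_name**: output column name for the agent name (string, default: agent_name)
- **out_key_category**: output column name for the agent category (string, default: agent_category)
- **out_key_os**: output column name for the agent OS (string, default: agent_os)
- **out_key_version**: output column name for the agent version (string, default: agent_version)
- **out_key_vendor**: output column name for the agent vendor (string, default: agent_vendor)
- **filter_categories**: categories to keep; records in any other category are filtered out (array, default: null)
- **drop_categories**: categories to drop; records in these categories are removed (array, default: null)
- **merge_agent_info**: whether to add the parsed agent info to each record (bool, default: false)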
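
The options above can be combined to rename the output columns. The following is a minimal sketch, not taken from the plugin's own documentation: the `ua_*` column names are hypothetical, and it assumes the `out_key_*` options apply only when `merge_agent_info` is enabled.

```yaml
filters:
  - type: woothee
    key_name: user_agent          # column holding the raw User-Agent string
    merge_agent_info: true        # add the parsed fields to each record
    out_key_name: ua_name         # hypothetical column names from here on
    out_key_category: ua_category
    out_key_os: ua_os
    out_key_version: ua_version
    out_key_vendor: ua_vendor
out:
  type: stdout
```

Any `out_key_*` option left out falls back to its documented default, for example `agent_name`.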
## Example 1 (add agent info)
```yaml
filters:
  - type: woothee
    key_name: user_agent
    merge_agent_info: true
out:
  type: stdout
```
## Example 2 (filter categories)
```yaml
filters:
  - type: woothee
    key_name: user_agent
    merge_agent_info: true
    filter_categories:
      - pc
      - smartphone
      - mobilephone
      - appliance
out:
  type: stdout
```
## Example 3 (drop categories)
```yaml
filters:
  - type: woothee
    key_name: user_agent
    merge_agent_info: true
    drop_categories:
      - crawler
      - UNKNOWN
out:
  type: stdout
```
## Build
```
$ ./gradlew gem
```