Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/toyama0919/embulk-filter-woothee
https://github.com/toyama0919/embulk-filter-woothee
agent embulk embulk-plugin java user-agent woothee
Last synced: 27 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/toyama0919/embulk-filter-woothee
- Owner: toyama0919
- License: mit
- Created: 2015-08-24T07:51:10.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2016-06-14T04:14:29.000Z (over 8 years ago)
- Last Synced: 2024-09-17T12:22:54.069Z (about 2 months ago)
- Topics: agent, embulk, embulk-plugin, java, user-agent, woothee
- Language: Java
- Size: 81.1 KB
- Stars: 4
- Watchers: 4
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# Woothee filter plugin for Embulk[![Build Status](https://secure.travis-ci.org/toyama0919/embulk-filter-woothee.png?branch=master)](http://travis-ci.org/toyama0919/embulk-filter-woothee)
parse UserAgent strings and to filter/drop specified categories of user terminals.
see. [woothee/woothee-java](https://github.com/woothee/woothee-java)
## Overview
* **Plugin type**: filter
## Configuration
- **key_name**: target key name (string, required)
- **out_key_name**: out key name (string, default: agent_name)
- **out_key_category**: out key category (string, default: agent_category)
- **out_key_os**: out key os (string, default: agent_os)
- **out_key_version**: out key version (string, default: agent_version)
- **out_key_vendor**: out key vendor (string, default: agent_vendor)
- **filter_categories**: filter categories (array, default: null)
- **drop_categories**: drop categories (array, default: null)
- **merge_agent_info**: merge agent info (bool, default: false)## Example1(add agent info)
```yaml
filters:
- type: woothee
key_name: user_agent
merge_agent_info: true
out:
type: stdout
```## Example2(filter categories)
```yaml
filters:
- type: woothee
key_name: user_agent
merge_agent_info: true
filter_categories:
- pc
- smartphone
- mobilephone
- appliance
out:
type: stdout
```## Example3(drop categories)
```yaml
filters:
- type: woothee
key_name: user_agent
merge_agent_info: true
drop_categories:
- crawler
- UNKNOWN
out:
type: stdout
```## Build
```
$ ./gradlew gem
```