https://github.com/hpprc/konoha-sentence-tokenizer-regex-compile
- Host: GitHub
- URL: https://github.com/hpprc/konoha-sentence-tokenizer-regex-compile
- Owner: hppRC
- Created: 2021-03-01T13:15:10.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2021-03-01T14:04:09.000Z (over 4 years ago)
- Last Synced: 2025-02-10T13:37:32.947Z (4 months ago)
- Language: Python
- Homepage:
- Size: 2.93 KB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Konoha verification repository
## Environment
```
MacBook Pro (16-inch, 2019)
2.4 GHz 8-Core Intel Core i9
32 GB 2667 MHz DDR4
Macintosh HD
```
```
Python 3.8.2 (default, Apr 13 2020, 16:24:13)
[Clang 11.0.3 (clang-1103.0.32.29)] on darwin
```
## Evaluation method
A `for` loop runs newline-based sentence splitting 100,000 times.
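
The compiling variants listed in this section differ only in how the regular expression reaches Python's `re` module. The following is a minimal sketch of that difference, not the repository's actual scripts: the pattern `PATTERN`, the sample `TEXT`, and the three function bodies are assumptions chosen for illustration, and Konoha's `SentenceTokenizer` itself is not used.

```python
import re
import timeit

# Hypothetical pattern and sample text; the repository's real inputs may differ.
PATTERN = r"\n+"
TEXT = "First sentence.\nSecond sentence.\nThird sentence."

# Compiled once, when the module is loaded.
PRE_COMPILED = re.compile(PATTERN)


def compile_always(text):
    # Calls re.compile() on every invocation.
    return re.compile(PATTERN).split(text)


def no_compile(text):
    # Passes the raw pattern string to the module-level function.
    return re.split(PATTERN, text)


def pre_compile(text):
    # Reuses the pattern object that was compiled in advance.
    return PRE_COMPILED.split(text)


if __name__ == "__main__":
    # Same iteration count as the evaluation described above.
    for fn in (compile_always, no_compile, pre_compile):
        seconds = timeit.timeit(lambda: fn(TEXT), number=100_000)
        print(f"{fn.__name__}: {seconds:.3f} s")
```

All three functions return the same list of sentences; only the cost per call differs. Timings from this sketch depend on the machine and Python version, so they are not directly comparable to the figures reported in this README.
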
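The following scripts are evaluated:
- do_nothing: a script that only runs the loop, to measure the overhead
- compile_always: compiles the regular expression every time `SentenceTokenizer#tokenize()` is called
- no_compile: does not compile; passes the raw pattern string
- pre_compile: stores a regular expression object that was compiled in advance

Commands: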
```bash
time python do_nothing.py
time python compile_always.py
time python no_compile.py
time python pre_compile.py
```
## Execution time
### do_nothing
| Executed in | 90.23 millis | fish | external |
| ----------- | -----------: | ------------: | -----------: |
| usr time | 45.31 millis | 105.00 micros | 45.20 millis |
| sys time | 38.16 millis | 479.00 micros | 37.68 millis |

### compile_always
| Executed in | 879.63 millis | fish | external |
| ----------- | ------------: | ------------: | ------------: |
| usr time | 826.11 millis | 115.00 micros | 826.00 millis |
| sys time | 41.67 millis | 707.00 micros | 40.97 millis |

### no_compile
| Executed in | 743.88 millis | fish | external |
| ----------- | ------------: | ------------: | ------------: |
| usr time | 642.70 millis | 136.00 micros | 642.57 millis |
| sys time | 52.06 millis | 550.00 micros | 51.51 millis |

### pre_compile
| Executed in | 573.41 millis | fish | external |
| ----------- | ------------: | ------------: | ------------: |
| usr time | 525.01 millis | 158.00 micros | 524.85 millis |
| sys time | 41.56 millis | 755.00 micros | 40.81 millis |
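
To compare the variants without the fixed cost of starting the interpreter and running the loop, one option is to subtract the do_nothing time from each result. The sketch below does this with the "Executed in" values from the tables above. Treating do_nothing as a pure baseline is an assumption, and each figure comes from a single run, so the results are indicative only.

```python
# Wall-clock times in milliseconds, copied from the "Executed in" rows above.
executed_ms = {
    "do_nothing": 90.23,
    "compile_always": 879.63,
    "no_compile": 743.88,
    "pre_compile": 573.41,
}

# Assumption: do_nothing approximates the overhead shared by every script.
baseline = executed_ms["do_nothing"]

# Net time per variant after removing the baseline.
net_ms = {
    name: round(ms - baseline, 2)
    for name, ms in executed_ms.items()
    if name != "do_nothing"
}

for name, ms in net_ms.items():
    ratio = ms / net_ms["pre_compile"]
    print(f"{name}: {ms:.2f} ms net, {ratio:.2f}x pre_compile")
```

Under that assumption the net times are roughly 789 ms for compile_always, 654 ms for no_compile, and 483 ms for pre_compile, which puts compile_always at about 1.6 times and no_compile at about 1.35 times the cost of pre_compile.
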