https://github.com/tinbreaker/tangut-script-annotation-tool

This is an online tool for automatically generating Tangut linguistic annotations. It helps researchers and learners to quickly produce linguistic annotations for Tangut and provides multiple output formats and bilingual explanations. 这是一个用于自动生成西夏文语言学标注的在线工具。它可以帮助研究者和学习者快速生成西夏文的语言学标注,并提供多种输出格式和双语解释。

interlinear-gloss linguistics linguistics-dataset obsidian tangut typst

# 西夏文自动标注工具 α
[中文](#西夏文自动标注工具-α) | [English](#tangut-annotation-tool-α)

这是一个用于西夏文标注的在线工具,可以帮助研究者和学习者快速生成西夏文的语言学标注。

## 功能特点

- 支持西夏文字符输入和查询
- 提供笔画检索功能
- 自动生成语言学标注
- 支持多种输出格式(Typst和Obsidian)
- 提供中英双语解释
- 支持两种读音系统(龚勋和龚煌城)

## 使用说明

### 基本操作

1. 在输入框中输入西夏文字符
2. 选择所需的配置:
   - 语言选择: 中文/English
   - 读音系统: 龚勋/龚煌城
   - 输出格式: Typst/Obsidian
3. 点击"生成"按钮获取标注结果
4. 使用"复制到剪贴板"按钮复制结果

### 笔画检索

工具提供了笔画检索功能,支持以下操作:

- 使用笔画按钮输入检索序列
- 支持起始(^)和结束($)位置限定
- 支持通配符(*)
- 实时显示检索结果

可用的笔画代码:
- A: 一
- B: 丨
- C: 丿
- D: 丶
- E: 𠃍
- F: ㇈
- G: ㇇
- H: 𘠄
- I: 𘠅
- J: 𠄎
- K: ㇍(𘇥上部,不含丶)
- L: 𠄌
- M: 乚
- N: ㇊(𗕷左下角)
- O: 𘠈
- P: 𡿨
- Q: ㇏

## 技术特性

- 支持复杂的字符组合规则
- 智能词组识别
- 自动处理连字符和等号
- 实时检索结果更新
- 响应式界面设计

## 系统要求

- 现代网络浏览器(支持Unicode)
- Obsidian 需安装[Obsidian Interlinear Glosses](https://github.com/Mijyuoon/obsidian-ling-gloss),并参考使用文档
- Typst 需引用[leipzig-glossing](https://typst.app/universe/package/leipzig-glossing/),并参考使用文档

## 致谢

- 输入法基于 [TXGlook](https://github.com/adlpr/TXGlook) 开发。
- 语法标注基于:[Grammaire du tangoute. Phonologie et morphologie / Mathieu Beaudouin](https://theses.hal.science/tel-04711865)。
- 龚勋拟音:[GX202409 transcription of Tangut](https://homepage.univie.ac.at/xun.gong/tangut/phonology-202409.html)。
- 龚煌城拟音:[Module:txg/data/LFW2008](https://en.wiktionary.org/wiki/Module:txg/data/LFW2008)
- 词表来自:[Module:txg/data/wordlist](https://en.wiktionary.org/wiki/Module:txg/data/wordlist)
- 制作过程中使用了[古今文字集成](http://ccamc.org)部分数据,特此感谢。

## 版权信息

© 𗼇𘝞𗫸𗯿𘍞

## 注意事项

- 工具目前处于 α 测试阶段
- 字典/词典数据持续更新中
- 某些复杂组合可能需要手动调整

# Tangut Annotation Tool α
[中文](#西夏文自动标注工具-α) | [English](#tangut-annotation-tool-α)

This is an online tool for annotating Tangut text that helps researchers and learners quickly generate linguistic annotations for Tangut characters.

## Features

- Support for Tangut character input and lookup
- Stroke-based search functionality
- Automatic linguistic annotation generation
- Multiple output formats (Typst and Obsidian)
- Bilingual explanations (Chinese/English)
- Support for two phonological systems (Gong Xun and Gong Hwang-cherng)

## Usage Instructions

### Basic Operations

1. Enter Tangut characters in the input box.
2. Select the desired configuration:
   - Language: Chinese/English
   - Phonological system: Gong Xun/Gong Hwang-cherng
   - Output format: Typst/Obsidian
3. Click the "Generate" button to produce the annotations.
4. Use the "Copy to Clipboard" button to copy the results.

### Stroke Search

The tool provides stroke-based search with the following features:

- Use stroke buttons to input search sequences
- Support for start (^) and end ($) position markers
- Wildcard (*) support
- Real-time search results

Available stroke codes:
- A: Horizontal (一)
- B: Vertical (丨)
- C: Falling (丿)
- D: Dot (丶)
- E: Hook (𠃍)
- F: Rising hook (㇈)
- G: Curved hook (㇇)
- H: Double hook (𘠄)
- I: Triple hook (𘠅)
- J: Curved stroke (𠄎)
- K: Upper curve (㇍; the upper part of 𘇥, excluding the dot 丶)
- L: Angled hook (𠄌)
- M: Curved tail (乚)
- N: Lower left hook (㇊; the lower-left corner of 𗕷)
- O: Complex hook (𘠈)
- P: Double curve (𡿨)
- Q: Diagonal (㇏)
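
The query syntax above maps naturally onto regular expressions. The following Python sketch is one possible interpretation, not the tool's actual implementation. It assumes that each character is stored as a string of stroke codes (A to Q), that `*` matches any run of strokes, and that a query without `^` or `$` may match anywhere in the sequence. The function names and the sample sequences are hypothetical.

```python
import re

STROKE_CODES = "ABCDEFGHIJKLMNOPQ"  # the 17 stroke codes listed above


def query_to_regex(query: str) -> re.Pattern:
    """Translate a stroke query into a compiled regular expression.

    Assumed semantics (not confirmed by the tool's documentation):
    '^' anchors the start, '$' anchors the end, '*' matches any run of strokes.
    """
    parts = []
    for i, ch in enumerate(query):
        if ch == "^" and i == 0:
            parts.append("^")
        elif ch == "$" and i == len(query) - 1:
            parts.append("$")
        elif ch == "*":
            parts.append(f"[{STROKE_CODES}]*")
        elif ch in STROKE_CODES:
            parts.append(ch)
        else:
            raise ValueError(f"unsupported character in query: {ch!r}")
    return re.compile("".join(parts))


def search(query: str, index: dict[str, str]) -> list[str]:
    """Return the keys of `index` whose stroke sequence matches `query`."""
    pattern = query_to_regex(query)
    return [key for key, strokes in index.items() if pattern.search(strokes)]


# Hypothetical stroke sequences keyed by placeholder names, for illustration only.
sample_index = {"char1": "ABCD", "char2": "CABD", "char3": "ABQ"}

print(search("^AB", sample_index))    # ['char1', 'char3']
print(search("D$", sample_index))     # ['char1', 'char2']
print(search("^A*Q$", sample_index))  # ['char3']
```

Building the pattern from a whitelist of stroke codes, rather than passing the query straight to `re.compile`, keeps stray regex metacharacters in user input from changing the meaning of the search.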

## Technical Features

- Support for complex character combinations
- Intelligent phrase recognition
- Automatic handling of hyphens and equals signs
- Real-time search result updates
- Responsive interface design
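
In interlinear glossing under the Leipzig Glossing Rules, a hyphen marks a morpheme boundary and an equals sign marks a clitic boundary. How the tool handles these internally is not documented here. The Python sketch below only illustrates one way to split a glossed word into segments while recording which boundary precedes each segment. The function name `split_boundaries` and the sample string are hypothetical.

```python
import re


def split_boundaries(word: str) -> list[tuple[str, str]]:
    """Split a glossed word on '-' and '=' boundaries.

    Returns (boundary, segment) pairs, where boundary is '' for the first
    segment, '-' for a morpheme boundary, or '=' for a clitic boundary.
    """
    # The capturing group makes re.split keep the delimiters in the result.
    pieces = re.split(r"([-=])", word)
    segments = [("", pieces[0])]
    for boundary, segment in zip(pieces[1::2], pieces[2::2]):
        segments.append((boundary, segment))
    return segments


# Made-up gloss string, for illustration only.
print(split_boundaries("go-PST=Q"))  # [('', 'go'), ('-', 'PST'), ('=', 'Q')]
```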

## System Requirements

- Modern web browser (with Unicode support)
- For Obsidian: install the [Obsidian Interlinear Glosses](https://github.com/Mijyuoon/obsidian-ling-gloss) plugin and refer to its documentation
- For Typst: import the [leipzig-glossing](https://typst.app/universe/package/leipzig-glossing/) package and refer to its documentation
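
As a rough illustration of what Obsidian-targeted output could look like, the Python sketch below assembles a `gloss` code block using the `\gla`, `\glb`, and `\ft` commands of the Obsidian Interlinear Glosses plugin. The function name `to_obsidian_gloss` and the placeholder tokens are hypothetical, and the block the tool actually generates may differ; consult the plugin documentation for the full command set.

```python
def to_obsidian_gloss(source: list[str], glosses: list[str], translation: str) -> str:
    """Build a `gloss` code block for the Obsidian Interlinear Glosses plugin."""
    if len(source) != len(glosses):
        raise ValueError("source and glosses must have the same number of words")
    fence = "`" * 3  # built at runtime so this example can sit inside a fenced block
    lines = [
        f"{fence}gloss",
        "\\gla " + " ".join(source),   # source line, one token per word
        "\\glb " + " ".join(glosses),  # glosses aligned word by word
        "\\ft " + translation,         # free translation
        fence,
    ]
    return "\n".join(lines)


# Placeholder tokens only; real input would be Tangut text and its glosses.
print(to_obsidian_gloss(["W1", "W2"], ["GLOSS1", "GLOSS2"], "Free translation."))
```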

## Acknowledgements

- Input method based on [TXGlook](https://github.com/adlpr/TXGlook).
- Grammar annotations based on: [Grammaire du tangoute. Phonologie et morphologie / Mathieu Beaudouin](https://theses.hal.science/tel-04711865).
- Gong Xun transcription: [GX202409 transcription of Tangut](https://homepage.univie.ac.at/xun.gong/tangut/phonology-202409.html).
- Gong Hwang-cherng transcription: [Module:txg/data/LFW2008](https://en.wiktionary.org/wiki/Module:txg/data/LFW2008).
- Word list from: [Module:txg/data/wordlist](https://en.wiktionary.org/wiki/Module:txg/data/wordlist).
- Some data from [CCAMC](http://ccamc.org) was used in development, with thanks.

## Copyright

© 𗼇𘝞𗫸𗯿𘍞

## Notes

- The tool is currently in the α testing phase
- Dictionary and lexicon data are updated continuously
- Some complex combinations may require manual adjustment