https://github.com/scut-dlvclab/pavenet

[IEEE TPAMI 2025] Official repository of "Privacy-Preserving Biometric Verification With Handwritten Random Digit String".
https://github.com/scut-dlvclab/pavenet

Last synced: about 1 month ago
JSON representation

[IEEE TPAMI 2025] Official repository of "Privacy-Preserving Biometric Verification With Handwritten Random Digit String".

Host: GitHub
URL: https://github.com/scut-dlvclab/pavenet
Owner: SCUT-DLVCLab
License: gpl-3.0
Created: 2024-10-29T13:07:54.000Z (7 months ago)
Default Branch: main
Last Pushed: 2025-03-18T02:14:40.000Z (2 months ago)
Last Synced: 2025-03-29T19:02:53.768Z (about 2 months ago)
Language: Python
Homepage: https://ieeexplore.ieee.org/document/10840296
Size: 27.3 MB
Stars: 5
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

:shield:PAVENet

Privacy-Preserving Biometric Verification with
Handwritten Random Digit String

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

:star:Official code of the PAVENet model and the release of the HRDS4BV dataset.

:ocean:Introduction

This paper proposes using Random Digit String (RDS) for privacy-preserving handwriting verification. Users can perform identity authentication by writing a digit sequence of arbitrary content rather than signing signatures that contains personal information, effectively protecting privacy.

To this end, we first propose the **HRDS4BV** dataset, consisting of handwritten RDS acquiring from 402 writers. Second, we propose the Pattern Attentive VErification Network (**PAVENet**) to extract discriminative handwriting patterns, enhancing writing style representation.

![](./asset/framework.png)

The framework of PAVENet

:scroll:HRDS4BV Dataset

### Description

HRDS4BV dataset is a handwriting verification benchmark dataset that contains 16080 RDS samples from 402 users, with 20 genuine samples and 20 skilled forgeries per user. Each RDS is composed of random digits in a length of 7~11. The dataset is acquired in two separate sessions, in which 10 genuine samples and 10 skilled forgeries per user are collected in each session. More details are presented in the below table.

| Content | Length | Modality | Session | User | Genuine Sample | Skilled Forgery | Features |
| ------------------- | ------ | :------: | ------- | ---- | --------------------------- | --------------------------- | ----------- |
| Random Digit String | 7~11 | Online | 2 | 402 | $402\times(10 + 10) = 8040$ | $402\times(10 + 10) = 8040$ | $X,Y,P,T,U$ |

$X,Y,P,T,U$ respectively denote the $x$ coordinates, $y$ coordinates, pressure, timestamps, and the pen-up/pen-down information. The pen-down/pen-up information is represented by 0~3. 0 indicates that this is not a pen-up/pen-down point. 1 indicates that this is a pen-down point. 2 indicates that this is a pen-up point. 3 indicates that this point is both a pen-up and pen-down point, which is isolated.

### Dataset Accessibility

You can access the dataset following the instructions:

- The HRDS4BV dataset can only be used for non-commercial research purposes. For scholar or organization who wants to use the HRDS4BV dataset, please first fill in this [Application Form](./application-form/Application-Form-for-Using-MSDS.docx) and sign the [Legal Commitment](./application-form/Legal-Commitment.docx) and email them to us. When submitting the application form to us, please list or attached 1-2 of your publications in the recent 6 years to indicate that you (or your team) do research in the related research fields of handwriting verification, handwriting analysis and recognition, document image processing, and so on.
- We will give you the download link and the decompression password after your application has been received and approved.
- All users must follow all use conditions; otherwise, the authorization will be revoked.

### Data Format

The dataset is organized in the following directory format:

```bash
HRDS4BV
├─session1
│ ├─0
│ │ ├─f_0_0.txt
│ │ ├─f_0_1.txt
│ │ ├─...
│ │ ├─g_0_0.txt
│ │ ├─g_0_1.txt
│ │ └─...
│ ├─1
│ │ ├─f_0_0.txt
│ │ ├─f_0_1.txt
│ │ ├─...
│ │ ├─g_0_0.txt
│ │ ├─g_0_1.txt
│ │ └─...
│ ├─...
├─session2
│ ├─...
```

- Data of two sessions is stored in `session1` and `session2`.
- The users are arranged from `0` to `401`, with online dynamic time series and offline static images provided in `series` and `images`. The time series are saved as `.txt` files and the images are in `.png` format.
- The naming of each file follows the same format: `flag_user_index`.
- - `flag` is `f` or `g`. `f` indicates that this file is a skilled forgery, while `g` indicates that it is a genuine sample.
- - `user` indicates the number of user of this file.
- - `index` indicates the number of this file in the current folder.
- - For example, `f_0_0.txt` represents the first file (time series) of all skilled forgeries of the user `0`.

### Data License

HRDS4BV should be used and distributed under [Creative Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) License](https://creativecommons.org/licenses/by-nc-nd/4.0/) for non-commercial research purposes.

:earth_asia:Environment

```bash
git clone https://github.com/SCUT-DLVCLab/PAVENet.git
cd PAVENet
conda create -n pavenet python=3.8.16
conda activate pavenet
pip install -r requirements.txt
```

:hammer_and_pick:Data Preparation

Download the HRDS4BV dataset and unzip it using the following commands (`7z` is recommended for the unzipping; please ensure that `7z` is installed and available):

```bash
mkdir data
7z x HRDS4BV.zip -odata
```

You can enter the decompression password here.

Run `process.py` for data preprocessing and data splitting:

```bash
python process.py
```

Now the data should be all preprocessed and splitted. The final data directory should look like:

```bash
data
├── HRDS4BV
├── hrds4bv-across-test.pkl
└── hrds4bv-across-train.pkl
```

:rocket:Test

```
python test.py --weights weights/model.pth
```

:bookmark_tabs:Citation

```bibtex
@ARTICLE{pavenet2025zhang,
author={Zhang, Peirong and Liu, Yuliang and Lai, Songxuan and Li, Hongliang and Jin, Lianwen},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
title={Privacy-Preserving Biometric Verification With Handwritten Random Digit String},
year={2025},
volume={47},
number={4},
pages={3049-3066}
}
```

:phone:Cotact

Peirong Zhang: [email protected]

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/scut-dlvclab/pavenet

Awesome Lists containing this project

README