https://github.com/sshwy/yaoj-judger

Judger for the future yaoj
https://github.com/sshwy/yaoj-judger

c judger kafel online-judge seccomp

Last synced: 6 months ago
JSON representation

Judger for the future yaoj

Host: GitHub
URL: https://github.com/sshwy/yaoj-judger
Owner: sshwy
License: apache-2.0
Archived: true
Created: 2022-01-31T16:47:15.000Z (over 4 years ago)
Default Branch: master
Last Pushed: 2022-07-05T10:54:35.000Z (almost 4 years ago)
Last Synced: 2025-09-27T02:25:44.281Z (9 months ago)
Topics: c, judger, kafel, online-judge, seccomp
Language: C
Homepage:
Size: 940 KB
Stars: 4
Watchers: 1
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Judger of Yao OJ

It's worth known that this project is still under development, thus everything is varying :).

[![codecov](https://codecov.io/gh/sshwy/yaoj-judger/branch/master/graph/badge.svg?token=J2YFL24TOH)](https://codecov.io/gh/sshwy/yaoj-judger) [![CodeFactor](https://www.codefactor.io/repository/github/sshwy/yaoj-judger/badge/master)](https://www.codefactor.io/repository/github/sshwy/yaoj-judger/overview/master)

（大概）是新一代 OJ 评测模块！

基于 [kafel](https://github.com/google/kafel) 的一个沙箱模块，支持资源用度报告。主要用于 OI/ACM 的代码评测。

项目核心：在限制条件下执行某一程序（可能带参数）并得到相应的运行结果分析。

目前只支持在 x86_64 架构上运行。

## Features

- 使用 C 语言编写，直接调用系统 API
- 模块化架构，易于扩展，目前已支持：传统题、交互题、任意命令的执行
- 使用 [kafel](https://github.com/google/kafel) 配置更易读的 syscall 限制策略，并添加扩展语法以实现在运行时动态加载策略
- rlimit + rusage + pthread 全面限制/监控进程的资源使用
- 可以实现大家梦寐以求（？）的 CCF 数组开爆的功能（即数组超过空间限制直接 MLE）

## Getting Start

### Download

右转 Github Release 获取构建好的二进制文件和静态链接库。

### Build from source

在构建项目之前请确保您的 Linux 系统安装有

- make（GNU Make）
- flex（the fast lexical analyser generator）
- ausyscall（a program that allows mapping syscall names and numbers）
- clang 系列命令
- gengetopt

命令。或者（前提有 make）手动在项目根目录执行 `make check_buildenv` 看是否报错。

接下在项目根目录执行：

```bash
make all
```

## Usage

一个简单易懂的例子见 `tests/stress/01_OK`。

```bash
$ ./yaoj-judger --detailed-help
Usage: yaoj-judger [OPTION]... [arguments]...
judger for the future yaoj

A set of program runners with resource limitation, syscall limitation and final
status report.

-h, --help Print help and exit
--detailed-help Print help, including all details and hidden
options, and exit
-V, --version Print version and exit
-j, --judger=judgername specify which judger to use (possible
values="interactive", "general")
(required)
-r, --result=string predict judgement result (possible
values="OK", "RE", "MLE", "TLE",
"OLE", "SE", "DSC", "ECE")

Meanings of those shortname:
OK: all correct
RE: runtime error
MLE: memory limitation exceed
TLE: time limitation exceed
OLE: output limitation exceed
SE: system error, aka judger error
DSC: dangerous system call
ECE: exit code error

--log=filename specify judger result file (required)
--log-color whether display colorful log (default=off)
-p, --policy=filename specify policy name (required)

Note that if using builtin policy, add 'builtin:' prefix to policy's name.

-P, --policy-dir=filename specify policy search directory, depend on
'policy' option (default=`.')

If using builtin policy, this option is meaningless.

--json output judgement result to stdout in JSON
format (default=off)
--fork fork a new process to perform (default=off)

Resource Limitations:
note that 'timeout' and 'memory' option can be override by their
corresponding detailed options, such as realtime, stack-memory.
-t, --timeout=integer specify both time limits in milliseconds
--realtime=integer specify the runtime limit in milliseconds
--cputime=integer specify the cpu limit in milliseconds
-m, --memory=integer specify all three memory limits in MB
--virtual-memory=integer specify the virtual memory limit in MB
--real-memory=integer specify the actual memory limit in MB
--stack-memory=integer specify the stack memory limit in MB
-g, --output-size=integer specify the output limit in MB
-f, --fileno=integer specify the maximum number of opened fd
```

### Resource Limitation

`realtime` 指实际的运行时间，例如包括 sleep 占用的时间。而 `cputime` 只考虑 cpu 使用时间。事实上如今大多数 OJ 都是限制 cputime，realtime 一般做为一个超时阈值。毕竟考虑到多个进程同时评测，realtime 很多时候是不准的。`timeout` 会同时设置 `realtime` 和 `cputime` 的限制。

`virtual-memory` 表示对进程内存地址空间的限制，而 `real-memory` 表示对实际使用的内存的限制。例如大多数 OI 选手所知，数组开了很大一般是用不完的，而大多数 OJ 显示的是实际被使用的内存，也是在这上面做出的限制。而 CCF 评测会出现数组开爆的情况，大概是因为他们对前者也做出了相同的限制。`stack-memory` 则是栈空间限制。`memory` 会同时设置上述三个限制。

`output-size` 则限制进程创建文件的大小，即输出限制。

### Notes

另外如果想要安装为系统命令：

```bash
make all
sudo make install
```

删除安装的系统命令

```bash
sudo make uninstall
```

内建规则见 `src/builtin_policy` 目录。

对于更多使用方法，您可以去 [tests/](https://github.com/sshwy/yaoj-judger/tree/master/tests) 了解一下！

如果你在尝试链接库文件来自己写 main，那么在编译时需加上 `-Lpath/to/libyjudger -lpthread -lyjudger`，并且链接的顺序有时候会影响编译结果（[Why does the order in which libraries are linked sometimes cause errors in GCC?](https://stackoverflow.com/questions/45135/why-does-the-order-in-which-libraries-are-linked-sometimes-cause-errors-in-gcc)）。解决方法就是穷举……一般来说 `-lyjudger` 放最前面就可以。

## Development

首先你需要有 clang。

code coverage（需要 gcovr）：

```
make clean
make gcovr
```

可以在 local.cov 文件夹下查看代码覆盖情况。

文档生成（需要 doxygen）：

```
make docs
```

可以在 `docs/web` 里查看文档。

测试：

```
make test
```

会执行所有的测试。

## Design

早期思路来源于 [QingdaoU/Judger](https://github.com/QingdaoU/Judger)，在此鸣谢。在此基础上引入 kafel 以更友好的方式配置系统调用规则，并重新整理了执行逻辑。

首先分析一下 OJ 判题的流程。一个题目有若干个测试点，可能有子任务依赖关系，这些都是顶层逻辑。下一层就是在沙箱中执行程序，而执行程序是一个通用的模块，囊括编译、执行和检查等的步骤。执行单个程序的过程中不会有与外界的信息交换，但在执行两个程序之间可以进行信息交换（比如看看当前评测到哪个点了），这部分也可以交给顶层逻辑完成。

而执行一个程序要考虑三个要素：资源限制（时间、空间）、系统安全（系统调用的限制）、数据处理（执行程序获得输出）。

容易发现三个要素互不相关。其中资源限制是高度通用的模块，适用于几乎所有地方。系统调用的限制则需要考虑不同的执行过程以应用不同的 policy，也相对通用。而数据处理的部分则稍显复杂。不同程序的传参不同，并且 IO 的配置也有区别。

不过考虑到数据处理的情况是有限的，目前来看 OI 评测有以下几种代码：

- 标准输入输出型（std_io）: exec + input_file + output_file + error_file
- 文件 IO 型（file_io）: exec + input_file + output_file + error_file
- 交互型（interaction）: exec + interactor + input_file(both) + output_file(interactor) + error_file(both)
- 通用型（代码编译、执行一段命令、testlib）（general）: execve() + input_file + output_file + error_file

### Hooks

为了处理资源限制和系统安全限制，我们引入 hooks 框架（其实也可以理解为一个 event 模式），一定程度上规范了程序的评测过程。简单来说所有对当前进程施加的影响都可以封装为一个模块来调用。而把模块挂载（register）到它该执行的阶段就是 hook。

一个典型的 judger 应该具有以下过程：

```mermaid
graph TD
init(initialize) --> bfh(hook:before_fork)
bfh --> fork{fork} -->|parent| afh(hook:after_fork)
rd -.-> afh --> c_t{{child_terminate}} --> awh(hook:after_wait)
fork -->|child| cafh(child_prework) --> rd{{ready}} --> child_run -.-> c_t
```

除了 child 的部分需要一些逻辑实现，其他部分都可以用 hook 的形式挂载到框架上。原理在于，父进程不会与子进程有通讯，它只需要明确：

- 子进程的较为准确的开始（方便计时）
- 子进程的结束以及其状态码（结果判定）

对于第一个任务，我们采用无名管道的方式，在 child_prework 完成前一直阻塞父进程，直到 ready 信号发出。这样 after_fork 的第一个 hook 执行的时间就非常接近于 child_run 的执行时间。

对于第二个任务，使用 wait 和 getrusage 即可。前者需要逻辑方面的实现，而后者也可以封装为 hook。

### Policy

对于系统安全方面的限制，可以想到 seccomp，以及其 bpf 等。本项目采用更加人性化的方式：kafel 规则。它可读且易于扩展。

当然，有一些单纯的 kafel 做不了的需求，例如动态的规则内容。这时我们可以在 kafel 编译前外加一些字符串处理，把规则内容模板化。这样可以动态插入所需内容。

于是，我们将定制化的需求通过魔改 kafel-lang 实现。例如引入 `%[0-9]s` 占位符表示 runner 的参数列表字符串地址。这样可以更好地对 execve 等系统调用进行限制。

已知问题：只能在注册的入口 policy 中使用占位符（没有实现递归替换）。

## Reference

- [linux 安全模块 -- seccomp 详解](https://zhuanlan.zhihu.com/p/363174561)
- [Conventional Commits](https://www.conventionalcommits.org/en/v1.0.0/)
- [A list of signals and what they mean](https://www-uxsup.csx.cam.ac.uk/courses/moved.Building/signals.pdf)
- https://opensource.qduoj.com/#/judger/how_it_works
- https://notes.sshwy.name/Interactive-Prob-Judger/

## Thanks to

[QingdaoU/Judger](https://github.com/QingdaoU/Judger): [SATA LICENSE](https://github.com/QingdaoU/Judger/raw/newnew/LICENSE)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sshwy/yaoj-judger

Awesome Lists containing this project

README