Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/jonathansalwan/triton

Triton is a dynamic binary analysis library. Build your own program analysis tools, automate your reverse engineering, perform software verification or just emulate code.
https://github.com/jonathansalwan/triton

binary-analysis binary-translation deobfuscation dynamic-analysis emulator instruction-semantics lifter program-analysis reverse-engineering symbolic-execution taint-analysis

Last synced: about 1 hour ago
JSON representation

Triton is a dynamic binary analysis library. Build your own program analysis tools, automate your reverse engineering, perform software verification or just emulate code.

Awesome Lists containing this project

README

        

**Triton** is a dynamic binary analysis library. It provides internal components that allow you to build your program analysis tools,
automate reverse engineering, perform software verification or just emulate code.

* Dynamic **symbolic** execution
* Dynamic **taint** analysis
* AST representation of the **x86**, **x86-64**, **ARM32**, **AArch64** and **RISC-V 32/64** ISA semantic
* Expressions **synthesis**
* SMT **simplification** passes
* **Lifting** to **LLVM** as well as **Z3** and back
* **SMT solver** interface to **Z3** and **Bitwuzla**
* **C++** and **Python** API




As **Triton** is a kind of a part-time project, please, **don't blame us** if it is not fully reliable. [Open issues](https://github.com/JonathanSalwan/Triton/issues) or
[pull requests](https://github.com/JonathanSalwan/Triton/pulls) are always better than trolling =). However, you can follow the development on twitter
[@qb_triton](https://twitter.com/qb_triton).





 



 



 

Codecov

 



 



 



# Quick start

* [Installation](#install)
* [Python API](https://triton-library.github.io/documentation/doxygen/py_triton_page.html)
* [C++ API](https://triton-library.github.io/documentation/doxygen/annotated.html)
* [Python Examples](https://github.com/JonathanSalwan/Triton/tree/master/src/examples/python)
* [They already used Triton](#they-already-used-triton)

## Getting started

```python
from triton import *

>>> # Create the Triton context with a defined architecture
>>> ctx = TritonContext(ARCH.X86_64)

>>> # Define concrete values (optional)
>>> ctx.setConcreteRegisterValue(ctx.registers.rip, 0x40000)

>>> # Symbolize data (optional)
>>> ctx.symbolizeRegister(ctx.registers.rax, 'my_rax')

>>> # Execute instructions
>>> ctx.processing(Instruction(b"\x48\x35\x34\x12\x00\x00")) # xor rax, 0x1234
>>> ctx.processing(Instruction(b"\x48\x89\xc1")) # mov rcx, rax

>>> # Get the symbolic expression
>>> rcx_expr = ctx.getSymbolicRegister(ctx.registers.rcx)
>>> print(rcx_expr)
(define-fun ref!8 () (_ BitVec 64) ref!1) ; MOV operation - 0x40006: mov rcx, rax

>>> # Solve constraint
>>> ctx.getModel(rcx_expr.getAst() == 0xdead)
{0: my_rax:64 = 0xcc99}

>>> # 0xcc99 XOR 0x1234 is indeed equal to 0xdead
>>> hex(0xcc99 ^ 0x1234)
'0xdead'
```

## Install

Triton relies on the following dependencies:

```
* libcapstone >= 5.0.x https://github.com/capstone-engine/capstone
* libboost (optional) >= 1.68
* libpython (optional) >= 3.6
* libz3 (optional) >= 4.6.0 https://github.com/Z3Prover/z3
* libbitwuzla (optional) >= 0.4.x https://github.com/bitwuzla/bitwuzla
* llvm (optional) >= 12
```

### Linux and MacOS

```console
$ git clone https://github.com/JonathanSalwan/Triton
$ cd Triton
$ mkdir build ; cd build
$ cmake ..
$ make -j3
$ sudo make install
```

By default, LLVM and Bitwuzla are not compiled. If you want to enjoy the full power of Triton, the cmake compile is:

```console
$ cmake -DLLVM_INTERFACE=ON -DCMAKE_PREFIX_PATH=$(llvm-config --prefix) -DBITWUZLA_INTERFACE=ON ..
```

#### MacOS M1 Note:

In case if you get compilation errors like:

```
Could NOT find PythonLibs (missing: PYTHON_LIBRARIES PYTHON_INCLUDE_DIRS)
```

Try to specify `PYTHON_EXECUTABLE`, `PYTHON_LIBRARIES` and `PYTHON_INCLUDE_DIRS` for your specific Python version:

```console
cmake -DCMAKE_INSTALL_PREFIX=/opt/homebrew/ \
-DPYTHON_EXECUTABLE=/opt/homebrew/bin/python3 \
-DPYTHON_LIBRARIES=/opt/homebrew/Cellar/[email protected]/3.10.8/Frameworks/Python.framework/Versions/3.10/lib/libpython3.10.dylib \
-DPYTHON_INCLUDE_DIRS=/opt/homebrew/opt/[email protected]/Frameworks/Python.framework/Versions/3.10/include/python3.10/ \
..
```

This information you can get out from this snippet:

```python
from sysconfig import get_paths
info = get_paths()
print(info)
```

### Windows

You can use cmake to generate the .sln file of libTriton.

```console
> git clone https://github.com/JonathanSalwan/Triton.git
> cd Triton
> mkdir build
> cd build
> cmake -G "Visual Studio 14 2015 Win64" \
-DBOOST_ROOT="C:/Users/jonathan/Works/Tools/boost_1_61_0" \
-DPYTHON_INCLUDE_DIRS="C:/Python36/include" \
-DPYTHON_LIBRARIES="C:/Python36/libs/python36.lib" \
-DZ3_INCLUDE_DIRS="C:/Users/jonathan/Works/Tools/z3-4.6.0-x64-win/include" \
-DZ3_LIBRARIES="C:/Users/jonathan/Works/Tools/z3-4.6.0-x64-win/bin/libz3.lib" \
-DCAPSTONE_INCLUDE_DIRS="C:/Users/jonathan/Works/Tools/capstone-5.0.1-win64/include" \
-DCAPSTONE_LIBRARIES="C:/Users/jonathan/Works/Tools/capstone-5.0.1-win64/capstone.lib" ..
```

However, if you prefer to directly download the precompiled library, check out our AppVeyor's [artefacts](https://ci.appveyor.com/project/JonathanSalwan/triton/history).
Note that if you use AppVeyor's artefacts, you probably have to install the [Visual C++ Redistributable](https://www.microsoft.com/en-US/download/details.aspx?id=30679)
packages for Visual Studio 2012.

### Installing from vcpkg

The Triton port in vcpkg is kept up to date by Microsoft team members and community contributors.
The url of vcpkg is: https://github.com/Microsoft/vcpkg. You can download and install Triton using
the vcpkg dependency manager:

```console
$ git clone https://github.com/Microsoft/vcpkg.git
$ cd vcpkg
$ ./bootstrap-vcpkg.sh # ./bootstrap-vcpkg.bat for Windows
$ ./vcpkg integrate install
$ ./vcpkg install triton
```

If the version is out of date, please [create an issue or pull request](https://github.com/Microsoft/vcpkg) on the vcpkg repository.

# Contributors

* [**Alberto Garcia Illera**](https://twitter.com/algillera) - Cruise Automation
* [**Alexey Vishnyakov**](https://vishnya.xyz/) - ISP RAS
* [**Black Binary**](https://github.com/black-binary) - n/a
* [**Christian Heitman**](https://github.com/cnheitman) - Quarkslab
* [**Daniil Kuts**](https://github.com/apach301) - ISP RAS
* [**Jessy Campos**](https://github.com/ek0) - n/a
* [**Matteo F.**](https://twitter.com/fvrmatteo) - n/a
* [**Pierrick Brunet**](https://github.com/pbrunet) - Quarkslab
* [**PixelRick**](https://github.com/PixelRick) - n/a
* [**Romain Thomas**](https://twitter.com/rh0main) - Quarkslab
* [**And many more**](https://github.com/JonathanSalwan/Triton/graphs/contributors)

## They already used Triton

### Tools

* [Exrop](https://github.com/d4em0n/exrop): Automatic ROPChain Generation.
* [Pimp](https://github.com/kamou/pimp): Triton based R2 plugin for concolic execution and total control.
* [Ponce](https://github.com/illera88/Ponce): IDA 2016 plugin contest winner! Symbolic Execution just one-click away!
* [QSynthesis](https://github.com/quarkslab/qsynthesis): Greybox Synthesizer geared for deobfuscation of assembly instructions.
* [TritonDSE](https://github.com/quarkslab/tritondse): Triton-based DSE library with loading and exploration capabilities.
* [Titan](https://github.com/archercreat/titan): Titan is a VMProtect devirtualizer using Triton.

### Papers and conference



  • Sydr-Fuzz: Continuous Hybrid Fuzzing and Dynamic Analysis for Security Development Lifecycle

    Talk at: Ivannikov ISP RAS Open Conference, Moscow, Russia, 2022. [paper] [slide]

    Authors: Vishnyakov A., Kuts D., Logunova V., Parygina D., Kobrin E., Savidov G., Fedotov A.

    Abstract: Nowadays automated dynamic analysis frameworks for
    continuous testing are in high demand to ensure software safety and satisfy the
    security development lifecycle (SDL) requirements. The security bug hunting
    efficiency of cutting-edge hybrid fuzzing techniques outperforms widely
    utilized coverage-guided fuzzing. We propose an enhanced dynamic analysis
    pipeline to leverage productivity of automated bug detection based on hybrid
    fuzzing. We implement the proposed pipeline in the continuous fuzzing toolset
    Sydr-Fuzz which is powered by hybrid fuzzing orchestrator, integrating our DSE
    tool Sydr with libFuzzer and AFL++. Sydr-Fuzz also incorporates security
    predicate checkers, crash triaging tool Casr, and utilities for corpus
    minimization and coverage gathering. The benchmarking of our hybrid fuzzer
    against alternative state-of-the-art solutions demonstrates its superiority
    over coverage-guided fuzzers while remaining on the same level with advanced
    hybrid fuzzers. Furthermore, we approve the relevance of our approach by
    discovering 85 new real-world software flaws within the OSS-Sydr-Fuzz project.
    Finally, we open Casr source code to the community to facilitate examination of
    the existing crashes.





  • Strong Optimistic Solving for Dynamic Symbolic Execution

    Talk at: Ivannikov Memorial Workshop, Kazan, Russia, 2022. [paper] [slide]

    Authors: Parygina D., Vishnyakov A., Fedotov A.

    Abstract: Dynamic symbolic execution (DSE) is an effective method
    for automated program testing and bug detection. It is increasing the code
    coverage by the complex branches exploration during hybrid fuzzing. DSE tools
    invert the branches along some execution path and help fuzzer examine
    previously unavailable program parts. DSE often faces over- and underconstraint
    problems. The first one leads to significant analysis complication while the
    second one causes inaccurate symbolic execution.
    We propose strong optimistic solving method that eliminates irrelevant path
    predicate constraints for target branch inversion. We eliminate such symbolic
    constraints that the target branch is not control dependent on. Moreover, we
    separately handle symbolic branches that have nested control transfer
    instructions that pass control beyond the parent branch scope, e.g. return,
    goto, break, etc. We implement the proposed method in our dynamic symbolic
    execution tool Sydr.
    We evaluate the strong optimistic strategy, the optimistic strategy that
    contains only the last constraint negation, and their combination. The results
    show that the strategies combination helps increase either the code coverage or
    the average number of correctly inverted branches per one minute. It is optimal
    to apply both strategies together in contrast with other configurations.





  • Greybox Program Synthesis: A New Approach to Attack Dataflow Obfuscation

    Talk at: Blackhat USA, Las Vegas, Nevada, 2021. [slide]

    Authors: Robin David

    Abstract: This talk presents the latest advances in program synthesis applied for deobfuscation. It aims at demystifying this analysis technique
    by showing how it can be put into action on obfuscation. Especially the implementation Qsynthesis released for this talk shows a complete end-to-end workflow
    to deobfuscate assembly instructions back in optimized (deobfuscated) instructions reassembled back in the binary.





  • From source code to crash test-case through software testing automation

    Talk at: C&ESAR, Rennes, France, 2021. [paper] [slide]

    Authors: Robin David, Jonathan Salwan, Justin Bourroux

    Abstract: This paper present an approach automating the software testing process from a source code to the dynamic testing of the compiled program. More specifically, from a static
    analysis report indicating alerts on source lines it enables testing to cover these lines dynamically and opportunistically checking whether whether or not they can trigger
    a crash. The result is a test corpus allowing to cover alerts and to trigger them if they happen to be true positives. This paper discuss the methodology employed to track
    alerts down in the compiled binary, the testing engines selection process and the results obtained on a TCP/IP stack implementation for embedded and IoT systems.





  • Symbolic Security Predicates: Hunt Program Weaknesses

    Talk at: Ivannikov ISP RAS Open Conference, Moscow, Russia, 2021. [paper] [slide]

    Authors: A.Vishnyakov, V.Logunova, E.Kobrin, D.Kuts, D.Parygina, A.Fedotov

    Abstract: Dynamic symbolic execution (DSE) is a powerful method for
    path exploration during hybrid fuzzing and automatic bug detection. We propose
    security predicates to effectively detect undefined behavior and memory access
    violation errors. Initially, we symbolically execute program on paths that
    don’t trigger any errors (hybrid fuzzing may explore these paths). Then we
    construct a symbolic security predicate to verify some error condition. Thus, we
    may change the program data flow to entail null pointer dereference, division
    by zero, out-of-bounds access, or integer overflow weaknesses. Unlike static
    analysis, dynamic symbolic execution does not only report errors but also
    generates new input data to reproduce them. Furthermore, we introduce function
    semantics modeling for common C/C++ standard library functions. We aim to model
    the control flow inside a function with a single symbolic formula. This assists
    bug detection, speeds up path exploration, and overcomes overconstraints in
    path predicate. We implement the proposed techniques in our dynamic symbolic
    execution tool Sydr. Thus, we utilize powerful methods from Sydr such as path
    predicate slicing that eliminates irrelevant constraints.
    We present Juliet Dynamic to measure dynamic bug detection tools accuracy. The
    testing system also verifies that generated inputs trigger sanitizers. We
    evaluate Sydr accuracy for 11 CWEs from Juliet test suite. Sydr shows 95.59%
    overall accuracy. We make Sydr evaluation artifacts publicly available to
    facilitate results reproducibility.





  • Towards Symbolic Pointers Reasoning in Dynamic Symbolic Execution

    Talk at: Ivannikov Memorial Workshop, Nizhny Novgorod, Russia, 2021. [paper] [slide]

    Authors: Daniil Kuts

    Abstract: Dynamic symbolic execution is a widely used technique for
    automated software testing, designed for execution paths exploration and
    program errors detection. A hybrid approach has recently become widespread,
    when the main goal of symbolic execution is helping fuzzer increase program
    coverage. The more branches symbolic executor can invert, the more useful it is
    for fuzzer. A program control flow often depends on memory values, which are
    obtained by computing address indexes from user input. However, most DSE tools
    don't support such dependencies, so they miss some desired program branches. We
    implement symbolic addresses reasoning on memory reads in our dynamic symbolic
    execution tool Sydr. Possible memory access regions are determined by either
    analyzing memory address symbolic expressions, or binary searching with
    SMT-solver. We propose an enhanced linearization technique to model memory
    accesses. Different memory modeling methods are compared on the set of
    programs. Our evaluation shows that symbolic addresses handling allows to
    discover new symbolic branches and increase the program coverage.





  • QSynth: A Program Synthesis based Approach for Binary Code Deobfuscation

    Talk at: BAR, San Diego, California, 2020. [paper]

    Authors: Robin David, Luigi Coniglio, Mariano Ceccato

    Abstract: We present a generic approach leveraging both DSE and program synthesis to successfully synthesize programs obfuscated with Mixed-Boolean-Arithmetic, Data-Encoding
    or Virtualization. The synthesis algorithm proposed is an offline enumerate synthesis primitive guided by top-down breath-first search. We shows its effectiveness
    against a state-of-the-art obfuscator and its scalability as it supersedes other similar approaches based on synthesis. We also show its effectiveness in presence of
    composite obfuscation (combination of various techniques). This ongoing work enlightens the effectiveness of synthesis to target certain kinds of obfuscations and
    opens the way to more robust algorithms and simplification strategies.





  • Sydr: Cutting Edge Dynamic Symbolic Execution

    Talk at: Ivannikov ISP RAS Open Conference, Moscow, Russia, 2020. [paper] [slide] [video]

    Authors: A.Vishnyakov, A.Fedotov, D.Kuts, A.Novikov, D.Parygina, E.Kobrin, V.Logunova, P.Belecky, S.Kurmangaleev

    Abstract: Dynamic symbolic execution (DSE) has enormous amount of applications in computer security (fuzzing, vulnerability discovery, reverse-engineering, etc.). We propose
    several performance and accuracy improvements for dynamic symbolic execution. Skipping non-symbolic instructions allows to build a path predicate 1.2--3.5 times faster.
    Symbolic engine simplifies formulas during symbolic execution. Path predicate slicing eliminates irrelevant conjuncts from solver queries. We handle each jump table
    (switch statement) as multiple branches and describe the method for symbolic execution of multi-threaded programs. The proposed solutions were implemented in Sydr tool.
    Sydr performs inversion of branches in path predicate. Sydr combines DynamoRIO dynamic binary instrumentation tool with Triton symbolic engine.





  • Symbolic Deobfuscation: From Virtualized Code Back to the Original

    Talk at: DIMVA, Paris-Saclay, France, 2018. [paper] [slide]

    Authors: Jonathan Salwan, Sébastien Bardin, Marie-Laure Potet

    Abstract: Software protection has taken an important place during the last decade in order to protect legit software against reverse engineering or tampering.
    Virtualization is considered as one of the very best defenses against such attacks. We present a generic approach based on symbolic path exploration, taint and
    recompilation allowing to recover, from a virtualized code, a devirtualized code semantically identical to the original one and close in size. We define criteria
    and metrics to evaluate the relevance of the deobfuscated results in terms of correctness and precision. Finally we propose an open-source setup allowing to evaluate
    the proposed approach against several forms of virtualization.





  • Deobfuscation of VM based software protection

    Talk at: SSTIC, Rennes, France, 2017. [french paper] [english slide] [french video]

    Authors: Jonathan Salwan, Sébastien Bardin, Marie-Laure Potet

    Abstract: In this presentation we describe an approach which consists to automatically analyze virtual machine based software protections and which recompiles a new
    version of the binary without such protections. This automated approach relies on a symbolic execution guide by a taint analysis and some concretization policies, then
    on a binary rewriting using LLVM transition.





  • How Triton can help to reverse virtual machine based software protections

    Talk at: CSAW SOS, NYC, New York, 2016. [slide]

    Authors: Jonathan Salwan, Romain Thomas

    Abstract: The first part of the talk is going to be an introduction to the Triton framework to expose its components and to explain how they work together.
    Then, the second part will include demonstrations on how it's possible to reverse virtual machine based protections using taint analysis, symbolic execution, SMT
    simplifications and LLVM-IR optimizations.





  • Dynamic Binary Analysis and Obfuscated Codes

    Talk at: St'Hack, Bordeaux, France, 2016. [slide]

    Authors: Jonathan Salwan, Romain Thomas

    Abstract: At this presentation we will talk about how a DBA (Dynamic Binary Analysis) may help a reverse engineer to reverse obfuscated code. We will first
    introduce some basic obfuscation techniques and then expose how it's possible to break some stuffs (using our open-source DBA framework - Triton) like detect opaque
    predicates, reconstruct CFG, find the original algorithm, isolate sensible data and many more... Then, we will conclude with a demo and few words about our future work.





  • How Triton may help to analyse obfuscated binaries

    Publication at: MISC magazine 82, 2015. [french article]

    Authors: Jonathan Salwan, Romain Thomas

    Abstract: Binary obfuscation is used to protect software's intellectual property. There exist different kinds of obfucation but roughly, it transforms a binary
    structure into another binary structure by preserving the same semantic. The aim of obfuscation is to ensure that the original information is "drown" in useless information
    that will make reverse engineering harder. In this article we will show how we can analyse an ofbuscated program and break some obfuscations using the Triton framework.





  • Triton: A Concolic Execution Framework

    Talk at: SSTIC, Rennes, France, 2015. [french paper] [detailed english slide]

    Authors: Jonathan Salwan, Florent Saudel

    Abstract: This talk is about the release of Triton, a concolic execution framework based on Pin. It provides components like a taint engine, a dynamic symbolic execution
    engine, a snapshot engine, translation of x64 instruction to SMT2, a Z3 interface to solve constraints and Python bindings. Based on these components, Triton offers the possibility
    to build tools for vulnerabilities research or reverse-engineering assistance.





  • Dynamic Behavior Analysis Using Binary Instrumentation

    Talk at: St'Hack, Bordeaux, France, 2015. [slide]

    Authors: Jonathan Salwan

    Abstract: This talk can be considered like the part 2 of our talk at SecurityDay. In the previous part, we talked about how it was possible to cover a targeted function
    in memory using the DSE (Dynamic Symbolic Execution) approach. Cover a function (or its states) doesn't mean find all vulnerabilities, some vulnerability doesn't crashes the program.
    That's why we must implement specific analysis to find specific bugs. These analysis are based on the binary instrumentation and the runtime behavior analysis of the program. In this
    talk, we will see how it's possible to find these following kind of bugs : off-by-one, stack / heap overflow, use-after-free, format string and {write, read}-what-where.





  • Covering a function using a Dynamic Symbolic Execution approach

    Talk at: Security Day, Lille, France, 2015. [slide]

    Authors: Jonathan Salwan

    Abstract: This talk is about binary analysis and instrumentation. We will see how it's possible to target a specific function, snapshot the context memory/registers before the
    function, translate the instrumentation into an intermediate representation,apply a taint analysis based on this IR, build/keep formulas for a Dynamic Symbolic Execution (DSE), generate
    a concrete value to go through a specific path, restore the context memory/register and generate another concrete value to go through another path then repeat this operation until the
    target function is covered.


## Cite Triton

```latex
@inproceedings{SSTIC2015-Saudel-Salwan,
author = {Saudel, Florent and Salwan, Jonathan},
title = {Triton: A Dynamic Symbolic Execution Framework},
booktitle = {Symposium sur la s{\'{e}}curit{\'{e}} des technologies de l'information
et des communications},
series = {SSTIC},
pages = {31--54},
address = {Rennes, France},
month = jun,
year = {2015},
}
```