Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/ocharles/weeder

A re-implementation of weeder using HIE files
https://github.com/ocharles/weeder

Last synced: 2 months ago
JSON representation

A re-implementation of weeder using HIE files

Host: GitHub
URL: https://github.com/ocharles/weeder
Owner: ocharles
License: bsd-3-clause
Created: 2019-09-10T09:23:38.000Z (over 4 years ago)
Default Branch: master
Last Pushed: 2024-02-22T15:02:08.000Z (3 months ago)
Last Synced: 2024-02-26T22:13:21.414Z (3 months ago)
Language: Haskell
Size: 213 KB
Stars: 157
Watchers: 8
Forks: 22
Open Issues: 30
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE

Lists

static-analysis - Weeder
awesome-cabal - `weeder`

README

        # Weeder

Weeder is an application to perform whole-program dead-code analysis. Dead code

is code that is written, but never reachable from any other code. Over the

lifetime of a project, this happens as code is added and removed, and leftover

code is never cleaned up. While GHC has warnings to detect dead code is a single

module, these warnings don't extend across module boundaries - this is where

Weeder comes in.

Weeder uses HIE files produced by GHC - these files can be thought of as source

code that has been enhanced by GHC, adding full symbol resolution and type

information. Weeder builds a dependency graph from these files to understand how

code interacts. Once all analysis is done, Weeder performs a traversal of this

graph from a set of roots (e.g., your `main` function), and determines which

code is reachable and which code is dead.

# Using Weeder

## Preparing Your Code for Weeder

To use Weeder, you will need to generate `.hie` files from your source code.

### Cabal

If you use Cabal, this is easily done by adding one line to your

`cabal.project.local` file:

``` cabal

package *

  ghc-options: -fwrite-ide-info

```

Once this has been added, perform a full rebuild of your project:

``` shell

cabal clean

cabal build all

```

### Stack

If you use `stack`, add the following to your `stack.yaml`:

``` yaml

ghc-options:

  "$locals": -fwrite-ide-info

```

and rebuild:

``` shell

stack clean

stack build

```

## Calling Weeder

To call Weeder, you first need to provide a configuration file, `weeder.toml`. Weeder uses

[TOML](https://toml.io/en/) as its configuration format.

`roots` is a list of regular expressions of symbols that are considered as

alive. If you're building an executable, the pattern `^Main.main$` is a

good starting point - specifying that `main` is a root. Weeder currently doesn't

add all exported functions as roots automatically but in many cases `main` from a

test suite could be a good workaround for that

`type-class-roots` configures whether or not Weeder should consider all instances

of type classes as roots. Defaults to `false`.

``` toml

roots = [ "^Main.main$" ]

type-class-roots = true

```

Now invoke the `weeder` executable, and - if your project has weeds - you will

see something like the following:

``` shell

$ weeder

src/Dhall/TH.hs:187: toDeclaration

src/Dhall/TH.hs:196: toNestedHaskellType

```

… which indicates the location of two unused symbols.

(Please note these warnings are just for demonstration and not necessarily weeds

in the Dhall project).

## Configuration options

| Name             | Default value                        | Description |

| ---------------- | ------------------------------------ | --- |

| roots            | `[ "Main.main", "^Paths_weeder.*" ]` | Any declarations matching these regular expressions will be considered as alive. |

| type-class-roots | `false`                              | Consider all instances of type classes as roots. Overrides `root-instances`. |

| root-instances   | `[ {class = '\.IsString$'}, {class = '\.IsList$'} ]` | Type class instances that match on all specified fields will be considered as roots. Accepts the fields `instance` matching on the pretty-printed type of the instance (visible in the output), `class` matching on its parent class declaration, and `module` matching on the module the instance is in. |

| unused-types     | `false`                              | Enable analysis of unused types. |

`root-instances` can also accept string literals as a shorthand for writing a table

containing only the `instance` field. See the following example from the test suite:

``` toml

root-instances = [ { module = "Spec.ConfigInstanceModules.Module1", instance = "Bounded T" }

                 , "Read T" 

                 , { module = "Spec.ConfigInstanceModules.Module3" }

                 , { class = '\.Enum$' }

                 , { module = "Spec.ConfigInstanceModules.Module2", class = '\.Show$' }

                 ]

```

## Exit codes

Weeder emits the following exit codes:

| Exit code | Cause |

| --- | --- |

|  0  | No weeds were found |

| 228 | One or more weeds found |

|  1  | Generic failing exit code |

|  2  | Failure to read HIE file due to GHC version mismatch |

|  3  | Failure to parse config file |

|  4  | No HIE files found |

# Tips

- You may want to add `^Paths_.*` to the roots in `weeder.toml` to ignore the

  `Paths_packageName` module automatically generated by Cabal.

- You can automatically write and use a default configuration file by calling 

  Weeder with the `--write-default-config` flag, if no configuration file is

  found.

- You can mandate explicitly specifying every option in the configuration by 

  calling Weeder with the `--no-default-fields` flag. This can prevent being

  caught off guard by new configuration options or changes to default values.

- To mark all instances in a module `M` as roots, add `{ module = "^M$" }`

  to `root-instances`.

# Limitations

Weeder currently has a few limitations:

## Overloaded syntax

On some versions of GHC, Weeder might report various type classes that are used

for syntax extensions as weeds. For example, `Num` and `IsString` classes might be

flagged as weeds if they are only used for overloaded literal syntax (that is,

the `fromInteger` and `fromString` methods).

You can add instances of specific type classes as roots with the `root-instances` 

field, or toggle whether Weeder considers all type class instances as roots with 

the `type-class-roots` configuration option.

## Type families

Weeder cannot yet analyse uses of type family instances. For this reason type

family instances will be marked as implicit roots if analysis of types is

enabled via `unused-types`.

## Template Haskell

Weeder is currently unable to parse the result of a Template Haskell splice. If

some Template Haskell code refers to other source code, this dependency won't be

tracked by Weeder, and thus Weeder might end up with false positives.