Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/weppos/publicsuffix-go
Domain name parser for Go based on the Public Suffix List.
https://github.com/weppos/publicsuffix-go
go golang psl publicsuffix
Last synced: 3 days ago
JSON representation
Domain name parser for Go based on the Public Suffix List.
- Host: GitHub
- URL: https://github.com/weppos/publicsuffix-go
- Owner: weppos
- License: mit
- Created: 2016-03-10T21:36:33.000Z (almost 9 years ago)
- Default Branch: main
- Last Pushed: 2024-12-12T08:32:50.000Z (10 days ago)
- Last Synced: 2024-12-12T22:03:43.178Z (10 days ago)
- Topics: go, golang, psl, publicsuffix
- Language: Go
- Homepage:
- Size: 1.95 MB
- Stars: 192
- Watchers: 7
- Forks: 34
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Funding: .github/FUNDING.yml
- License: LICENSE.txt
- Security: SECURITY.md
Awesome Lists containing this project
README
# Public Suffix for Go
The package publicsuffix provides a Go domain name parser based on the [Public Suffix List](http://publicsuffix.org/).
[![Tests](https://github.com/weppos/publicsuffix-go/workflows/Tests/badge.svg)](https://github.com/weppos/publicsuffix-go/actions?query=workflow%3ATests)
[![GoDoc](https://godoc.org/github.com/weppos/publicsuffix-go/publicsuffix?status.svg)](https://pkg.go.dev/github.com/weppos/publicsuffix-go/publicsuffix)Currently, **publicsuffix-go requires Go version 1.21 or greater**. We do our best not to break older versions of Go if we don't have to, but due to tooling constraints, we don't always test older versions.
## Getting started
Clone the repository [in your workspace](https://golang.org/doc/code.html#Organization) and move into it:
```shell
mkdir -p $GOPATH/src/github.com/weppos && cd $_
git clone [email protected]:weppos/publicsuffix-go.git
cd publicsuffix-go
```Fetch the dependencies:
```shell
go get ./...
```Run the test suite.
```shell
go test ./...
```## Testing
The following command runs the entire test suite.
```shell
go test ./...
```There are 3 different test suites built into this library:
- Acceptance: the acceptance test suite contains some high level tests to ensure the library behaves as expected
- PSL: the PSL test suite runs the library against the [official Public Suffix test cases](https://github.com/publicsuffix/list/blob/master/tests/tests.txt)
- Unit: the unit test suite stresses the various single components of this package## Installation
```shell
go get github.com/weppos/publicsuffix-go
```## Usage
This is a simple example that demonstrates how to use the package with the default options and the default Public Suffix list packaged with the library.
```go
package mainimport (
"fmt""github.com/weppos/publicsuffix-go/publicsuffix"
)func main() {
// Extract the domain from a string
// using the default list
fmt.Println(publicsuffix.Domain("example.com")) // example.com
fmt.Println(publicsuffix.Domain("www.example.com")) // example.com
fmt.Println(publicsuffix.Domain("example.co.uk")) // example.co.uk
fmt.Println(publicsuffix.Domain("www.example.co.uk")) // example.co.uk// Parse the domain from a string
// using the default list
fmt.Println(publicsuffix.Parse("example.com")) // &DomainName{"com", "example", ""}
fmt.Println(publicsuffix.Parse("www.example.com")) // &DomainName{"com", "example", "www"}
fmt.Println(publicsuffix.Parse("example.co.uk")) // &DomainName{"co.uk", "example", ""}
fmt.Println(publicsuffix.Parse("www.example.co.uk")) // &DomainName{"co.uk", "example", "www"}
}
```#### Ignoring Private Domains
The PSL is composed by two list of suffixes: IANA suffixes, and Private Domains.
Private domains are submitted by private organizations. By default, private domains are not ignored.
Sometimes, you want to ignore these domains and only query against the IANA suffixes. You have two options:1. Ignore the domains at runtime
2. Create a custom list without the private domainsIn the first case, the private domains are ignored at runtime: they will still be included in the lists but the lookup will skip them when found.
```go
publicsuffix.DomainFromListWithOptions(publicsuffix.DefaultList(), "google.blogspot.com", nil)
// google.blogspot.compublicsuffix.DomainFromListWithOptions(publicsuffix.DefaultList(), "google.blogspot.com", &publicsuffix.FindOptions{IgnorePrivate: true})
// blogspot.com// Note that the DefaultFindOptions includes the private domains by default
publicsuffix.DomainFromListWithOptions(publicsuffix.DefaultList(), "google.blogspot.com", publicsuffix.DefaultFindOptions)
// google.blogspot.com
```This solution is easy, but slower. If you find yourself ignoring the private domains in all cases (or in most cases), you may want to create a custom list without the private domains.
```go
list := NewListFromFile("path/to/list.txt", &publicsuffix.ParserOption{PrivateDomains: false})
publicsuffix.DomainFromListWithOptions(list, "google.blogspot.com", nil)
// blogspot.com
```## IDN domains, A-labels and U-labels
[A-label and U-label](https://tools.ietf.org/html/rfc5890#section-2.3.2.1) are two different ways to represent IDN domain names. These two encodings are also known as ASCII (A-label) or Pynucode vs Unicode (U-label). Conversions between U-labels and A-labels are performed according to the ["Punycode" specification](https://tools.ietf.org/html/rfc3492), adding or removing the ACE prefix as needed.
IDNA-aware applications generally use the A-label form for storing and manipulating data, whereas the U-labels can appear in presentation and user interface forms.
Although the PSL list has been traditionally U-label encoded, this library follows the common industry standards and stores the rules in their A-label form. Therefore, unless explicitly mentioned, any method call, comparison or internal representation is expected to be ASCII-compatible encoded (ACE).
Passing Unicode names to the library may either result in error or unexpected behaviors.
If you are interested in the details of this decision, you can read the full discussion [here](https://github.com/weppos/publicsuffix-go/issues/31).
## Differences with `golang.org/x/net/publicsuffix`
The [`golang.org/x/net/publicsuffix`](https://godoc.org/golang.org/x/net/publicsuffix) is a package part of the Golang `x/net` package, that provides a public suffix list implementation.
The main difference is that the `x/net` package is optimized for speed, but it's less flexible. The list is compiled and embedded into the package itself. However, this is also the main downside.
The [list is not frequently refreshed](https://github.com/letsencrypt/boulder/issues/1374#issuecomment-182429297), hence the results may be inaccurate, in particular if you heavily rely on the private domain section of the list. Changes in the IANA section are less frequent, whereas changes in the Private Domains section happens weekly.This package provides the following extra features:
- Ability to load an arbitrary list at runtime (e.g. you can feed your own list, or create multiple lists)
- Ability to create multiple lists
- Ability to parse a domain using a previously defined list
- Ability to add custom rules to an existing list, or merge/load rules from other lists (provided as file or string)
- Advanced access to the list rules
- Ability to ignore private domains at runtime, or when the list is parsedThis package also aims for 100% compatibility with the `x/net` package. A special adapter is provided as a drop-in replacement. Simply change the include statement from
```go
import (
"golang.org/x/net/publicsuffix"
)
```to
```go
import (
"github.com/weppos/publicsuffix-go/net/publicsuffix"
)
```The `github.com/weppos/publicsuffix-go/net/publicsuffix` package defines the same methods defined in `golang.org/x/net/publicsuffix`, but these methods are implemented using the `github.com/weppos/publicsuffix-go/publicsuffix` package.
Note that the adapter doesn't offer the flexibility of `github.com/weppos/publicsuffix-go/publicsuffix`, such as the ability to use multiple lists or disable private domains at runtime.
## `cookiejar.PublicSuffixList` interface
This package implements the [`cookiejar.PublicSuffixList` interface](https://godoc.org/net/http/cookiejar#PublicSuffixList). It means it can be used as a value for the `PublicSuffixList` option when creating a `net/http/cookiejar`.
```go
import (
"net/http/cookiejar"
"github.com/weppos/publicsuffix-go/publicsuffix"
)deliciousJar := cookiejar.New(&cookiejar.Options{PublicSuffixList: publicsuffix.CookieJarList})
```## License
Copyright (c) 2016-2024 Simone Carletti. This is Free Software distributed under the MIT license.