Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/scrapy/w3lib

Python library of web-related functions
https://github.com/scrapy/w3lib

hacktoberfest python

Last synced: 3 days ago
JSON representation

Python library of web-related functions

Awesome Lists containing this project

README

        

=====
w3lib
=====

.. image:: https://github.com/scrapy/w3lib/actions/workflows/tests.yml/badge.svg
:target: https://github.com/scrapy/w3lib/actions

.. image:: https://img.shields.io/codecov/c/github/scrapy/w3lib/master.svg
:target: http://codecov.io/github/scrapy/w3lib?branch=master
:alt: Coverage report

Overview
========

This is a Python library of web-related functions, such as:

* remove comments, or tags from HTML snippets
* extract base url from HTML snippets
* translate entites on HTML strings
* convert raw HTTP headers to dicts and vice-versa
* construct HTTP auth header
* converting HTML pages to unicode
* sanitize urls (like browsers do)
* extract arguments from urls

Requirements
============

Python 3.9+

Install
=======

``pip install w3lib``

Documentation
=============

See http://w3lib.readthedocs.org/

License
=======

The w3lib library is licensed under the BSD license.