An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with html-parser

A curated list of projects in awesome lists tagged with html-parser.

https://github.com/fb55/htmlparser2

The fast & forgiving HTML and XML parser

dom html html-parser htmlparser2 javascript parser xml

Last synced: 13 May 2025

https://github.com/oblac/jodd

Jodd! Lightweight. Java. Zero dependencies. Use what you like.

aop database html-parser http-client ioc java java8 jodd jquery json-parser mail micro-framework utility-library

Last synced: 12 May 2025

https://github.com/posthtml/posthtml

PostHTML is a tool to transform HTML/XML with JS plugins

html html-parser parser posthtml transformer xml xml-parser

Last synced: 13 May 2025

https://github.com/zzzprojects/html-agility-pack

Html Agility Pack (HAP) is a free and open-source HTML parser written in C# that reads and writes the DOM and supports plain XPath or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

hap html-parser htmlagilitypack parse xpath

Last synced: 14 May 2025

https://github.com/jsdf/react-native-htmlview

A React Native component which renders HTML content as native views

html html-parser html-renderer react react-component react-native

Last synced: 13 May 2025

https://github.com/tid-kijyun/kanna

Kanna(鉋) is an XML/HTML parser for Swift.

html-parser swift xml-parser

Last synced: 17 Dec 2025

https://github.com/imangazaliev/didom

Simple and fast HTML and XML parser

dom html html-parser parser xml xml-parser xpath

Last synced: 14 May 2025

https://github.com/philss/floki

Floki is a simple HTML parser that enables search for nodes using CSS selectors.

css-selector css-selectors elixir erlang fast-html floki html-parser html5ever

Last synced: 14 May 2025

https://github.com/sub6resources/flutter_html

A Flutter widget for rendering static html as Flutter widgets (Will render over 80 different html tags!)

flutter flutter-html flutter-package flutter-widget flutter-widgets html-css html-parser html-tags

Last synced: 14 May 2025

https://github.com/lexborisov/myhtml

Fast C/C++ HTML 5 Parser. Using threads.

c html html-parser pure-c

Last synced: 14 May 2025

https://github.com/psharanda/atributika

Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.

attributedstring hashtag html html-parser ios nsattributedstring swift tags textkit uilabel uitextview

Last synced: 15 May 2025

https://github.com/yorickpeterse/oga

Oga is an XML/HTML parser written in Ruby.

html html-parser parser ruby xml xml-parser

Last synced: 15 May 2025

https://github.com/cezheng/fuzi

A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

css html html-parser html-parsing ios parser parsing swift xml xml-parser xml-parsing xpath

Last synced: 05 Sep 2025

https://github.com/skrapeit/skrape.it

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

crawler dom hacktoberfest html-parser integration-testing jsoup kotlin kotlin-dsl parse scraper skrape system-testing test-automation testing

Last synced: 09 Apr 2025

https://github.com/lexborisov/modest

Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.

c css css-parser css-selector html html-parser html-renderer pure-c

Last synced: 04 Apr 2025

https://github.com/miso-belica/jusText

Heuristic-based boilerplate removal tool

html-parser html-parsing python text-extraction

Last synced: 14 Mar 2025

https://github.com/antchfx/htmlquery

htmlquery is a Go XPath package for querying HTML.

go golang html html-parser xpath xpath-selector xpath2

Last synced: 14 Mar 2025

https://github.com/clj-commons/hickory

HTML as data

clojure html-parser

Last synced: 18 Jan 2026

https://github.com/rajatomar788/pywebcopy

Locally saves webpages to your hard disk with images, css, js & links as is.

archive-tool crawler html html-parser mirror python web webpage

Last synced: 08 Jul 2025

https://github.com/bupt1987/html-parser

PHP HTML parser, similar to PHP Simple HTML DOM Parser but several times faster

html html-parser parser

Last synced: 25 Mar 2025

https://github.com/zhegexiaohuozi/jsoupxpath

A pure-Java HTML parser that supports the W3C XPath 1.0 standard syntax, based on Jsoup and Antlr4. Maybe it is the best in Java. Just try it.

antlr4 html-parser jsoupxpath xpath

Last synced: 15 May 2025

https://github.com/b-fuze/deno-dom

Browser DOM & HTML parser in Deno

browser-dom deno dom html-parser rust typescript wasm

Last synced: 15 May 2025

https://github.com/mohamedrejeb/ksoup

Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.

android html-parser kotlin kotlin-android kotlin-js kotlin-jvm kotlin-library kotlin-multiplatform kotlin-native parser parser-library parsing

Last synced: 10 Jul 2025

https://github.com/duzun/hquery.php

An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.

broken-html crawler css-selectors domcrawler fast hquery html html-parser invalid-html jquery-like jquery-selectors parser php psr-0 psr-4 scraper selectors xml xml-parser

Last synced: 14 May 2025

https://github.com/ZhgChgLi/ZMarkupParser

ZMarkupParser is a pure-Swift library that helps you convert HTML strings into NSAttributedString with customized styles and tags.

cocoapods html html-converter html-parser html-renderer ios nsattributedstring swift swift-package textfield uikit uilabel uitextview

Last synced: 02 Aug 2025

https://github.com/prettyhtml/prettyhtml

💅 The formatter for the modern web https://prettyhtml.netlify.com/

angular formatter html html-parser prettier rehype svelte vue web-components

Last synced: 03 Sep 2025

https://github.com/ispras/dedoc

Dedoc is a library (service) for automated document parsing and conversion to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

doc document-analysis document-content-extraction documents docx docx-parser excel html html-parser logical-structure-extraction ocr odt pdf pdf-parser scanned-documents table-of-contents table-recognition txt

Last synced: 15 May 2025

https://github.com/hexilee/unhtml.rs

A magic html parser

html-parser rust rust-crate scraper

Last synced: 05 Apr 2025

https://github.com/justinwilaby/sax-wasm

The first streamable, fixed memory XML, HTML, and JSX parser for WebAssembly.

html-parser jsx jsx-parser node parser rust sax-js webassembly xml xml-parser

Last synced: 15 May 2025

https://github.com/acrazing/html5parser

A super tiny and fast html5 AST parser.

ast dom html html-parser html5 parser

Last synced: 05 Apr 2025

https://github.com/AutoCSer/AutoCSer

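AutoCSer is a high-performance RPC framework. AutoCSer is an integrated development framework oriented toward high efficiency. It mainly comprises a series of seamlessly integrated high-performance components, including a TCP interface service framework, a TCP function service framework, a remote expression chain component, an integrated front-end/back-end web view framework, an ORM in-memory index cache framework, a log-stream in-memory database cache component, a message queue component, and binary/JSON/XML data serialization.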

cache-server code-generator gif html-parser html-title-crawler http-server json message-queue orm-cache raw-socket rpc serialization webview xml

Last synced: 04 May 2025

https://github.com/gerev/nsoup

NSoup is a .NET port of the jsoup (http://jsoup.org) HTML parser and sanitizer originally written in Java

c-sharp dot-net html-parser jsoup

Last synced: 10 Apr 2025

https://github.com/mykolaharmash/hyntax

Straightforward HTML parser for JavaScript

dom html html-parser javascript

Last synced: 05 Apr 2025

https://github.com/sihaelov/harser

Easy way for HTML parsing and building XPath

html html-parser parser python xpath

Last synced: 15 May 2025

https://github.com/craigbarnes/lua-gumbo

Moved to https://gitlab.com/craigbarnes/lua-gumbo

dom html html-parser html5 lua parser sanitize-html

Last synced: 27 Jul 2025

https://github.com/kata198/advancedhtmlparser

Fast, indexed Python HTML parser that builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modification, and formatting. Also supports XPath.

attributes create dom dom-tree filter formatter getelementbyid getelementsbyclassname getelementsbyname getelementsbytagname html html-parser parser python tags tree

Last synced: 25 Oct 2025

https://github.com/vectorized/aris

Aris - A fast and powerful tool to write HTML in JS easily. Includes syntax highlighting, templates, SVG, CSS autofixing, debugger support and more...

angular babel css css-in-js css-prefixer css-preprocessor css3 frontend html-css-javascript html-parser html-template html5 jquery js js-framework jsx lodash react svg syntax-highlighting

Last synced: 14 Apr 2025

https://github.com/rusterlium/html5ever_elixir

NIF wrapper of html5ever using Rustler

binding elixir erlang html-parser html5ever nif rustler

Last synced: 15 May 2025

https://github.com/jstedfast/htmlkit

A cross-platform .NET framework for parsing HTML

c-sharp html html-parser html5 parser

Last synced: 16 May 2025

https://github.com/huozhi/html2any

🌀 parse and convert html string to anything

html-parser rich-text xml-parser

Last synced: 30 Apr 2025

https://github.com/burnoo/kspoon

Annotation-based HTML to Kotlin class parser with KMP support, kotlinx (de)serialization format, jspoon successor

html html-parser kotlin kotlin-multiplatform kotlinx-serialization ktor retrofit2

Last synced: 12 Jan 2026

https://github.com/ying32/htmlparser

Delphi HTML parser (code adapted from wr960204's original HtmlParser)

html-parser

Last synced: 01 Dec 2025

https://github.com/leafo/web_sanitize

Lua library for sanitizing, parsing, and editing untrusted HTML

css-sanitization html html-parser html-sanitization lua moonscript security

Last synced: 06 Oct 2025

https://github.com/softcircuits/htmlmonkey

Lightweight HTML/XML parser written in C#.

csharp dotnet html html-parser parser

Last synced: 04 Oct 2025

https://github.com/orottier/webpage-rs

Small Rust library to fetch info about a web page: title, description, language, HTTP info, RSS feeds, Opengraph, Schema.org, and more

html html-parser json-ld opengraph rust

Last synced: 07 Apr 2025

https://github.com/hexilee/unhtml

HTML unmarshaler for golang

go golang html-parser unmarshaller

Last synced: 15 Apr 2025

https://github.com/app-generator/html-parser

Html Parser - Html to Pug, Jinja2, Blade Converter | AppSeed

appseed developer-tools html-parser open-source pug

Last synced: 19 Mar 2025

https://github.com/coolspring8/go-lolhtml

An idiomatic Go wrapper for Rust crate `lol-html` (Low Output Latency streaming HTML parser/rewriter)

bindings cgo css-selector html html-parser rewriting

Last synced: 14 Jan 2026

https://github.com/naqvis/crystal-html5

Crystal implementation of HTML5-Compliant Tokenizer and Parser with XPath & CSS Selector support

crystal crystal-lang crystal-language crystal-shard css-selectors html-parser html-tokenizer html5 xpath2

Last synced: 07 May 2025

https://github.com/f34nk/modest_ex

Elixir library to do pipeable transformations on html strings (with CSS selectors)

css-selector elixir html-parser html-renderer

Last synced: 22 Apr 2025

https://github.com/ange007/htmlp

Delphi Dom HTML Parser and Converter. Fork (not from the original author): https://sourceforge.net/projects/htmlp/

delphi dom dom-parser formatter html html-formatter html-parser html-parsing parser xpath

Last synced: 28 Jan 2026

https://github.com/andreaspitzer/hypertag

🏎 The fastest HTML tag and attributes parser for JavaScript

html-parser javascript nodejs tag-parsing

Last synced: 25 Jul 2025

https://github.com/hexydec/htmldoc

A token based HTML Document parser and minifier written in PHP. Extract attribute values and text using CSS selectors.

html html-dom-parser html-parser html5 minification minify minify-html php simplehtmldom svg tokenize tokenizer

Last synced: 17 Oct 2025

https://github.com/erdomke/bracketpipe

BracketPipe is a .NET library for building parsing and processing pipelines for web languages like HTML, CSS, JavaScript, SVG, and MathML

html-markdown html-minifier html-parser html-sanitization

Last synced: 21 Mar 2025

https://github.com/snapframework/xmlhtml

XML parser and renderer with HTML 5 quirks mode

haskell html-parser xml-parser

Last synced: 11 Apr 2025

https://github.com/oblac/jodd-lagarto

Java HTML parsers suite.

html html-parser java jquery parser

Last synced: 27 Apr 2025

https://github.com/serpapi/google-local-results-ai-parser

A ruby gem to extract structured data from Google Local Search Results using the serpapi/bert-base-local-results model, enabling parsing, classification, and information extraction from English HTML content.

data-parsing html-parser information-extraction natural-language-processing opensource rubygem structured-data-extraction text-classification webscraping

Last synced: 01 Jul 2025

https://github.com/yannickperrenet/bookmarkdown

✅ Parse your browser's exported HTML bookmark file to Markdown.

brave brave-browser html-parser html-to-markdown html-to-md markdown

Last synced: 30 Dec 2025

https://github.com/inversoft/prime-transformer

Fast Java8 BBCode & HTML parser and transformation library.

bbcode bbcode-parser html-parser java8

Last synced: 13 Apr 2025

https://github.com/zadean/htmerl

HTML Parser in Erlang

erlang html-parser html5

Last synced: 30 Oct 2025

https://github.com/vincentlaucsb/pgreaper

A Python library for loading data from various formats into PostgreSQL databases.

convert-data csv-converter html-parser postgresql python sql sql-database sql-table sqlite3-database

Last synced: 09 Sep 2025

https://github.com/viur-framework/html5

Pure Python HTML abstraction layer, parser and interpreter

framework html html-parser html5 library pyjs pyodide python viur

Last synced: 22 Jun 2025

https://github.com/fefit/rphtml

An HTML parser written in Rust that parses HTML into node trees.

html-minify html-parser html-parsing

Last synced: 27 Dec 2025

https://github.com/yeonjuan/es-html-parser

HTML parser for static analysis

html html-parser

Last synced: 28 Oct 2025

https://github.com/danny1113/html-parser-builder

A result builder that builds HTML parsers and transforms HTML elements into strongly typed results, inspired by RegexBuilder.

dsl html-parser swift

Last synced: 21 Oct 2025

https://github.com/duncan3dc/domparser

Wrappers for the PHP DomDocument class to provide extra functionality for html/xml parsing

html html-parser php xml xml-parsing

Last synced: 05 May 2025

https://github.com/ariary/jsextractor

Quickly gather all JavaScript from a URL (CLI + TUI)

bug-bounty cli extract extractor html-parser javascript js parser pentest recon tui web-pentest xss

Last synced: 14 Jul 2025

https://github.com/imelgrat/feed-finder

A PHP class for extracting the URLs of RSS (1.0 and 2.0) and ATOM feeds associated with a page, as well as OPML outline documents.

atom atom-feed composer-package html-parser html-scraper opml opml-outline php regex regular-expression rss rss-feed rss-feed-scraper

Last synced: 16 May 2025

https://github.com/sunshineplan/node

HTML parsing library, the alternative to BeautifulSoup in Golang.

beautifulsoup css-selectors generic go golang html-parser xpath xpath-query

Last synced: 16 Jun 2025

https://github.com/creeperyang/html-parser-lite

A lightweight HTML parser and more.

html-parser html-parser-lite parser

Last synced: 23 Mar 2025

https://github.com/rohitawate/domengine

DOM manipulation engine written in C++.

cpp11 dom-manipulation dom-tree html-parser interactive-shell interpreter

Last synced: 10 Apr 2025

https://github.com/willianantunes/pyfriends

Let's research all the seasons of the Friends sitcom and try to get some insights from them 🕵

html-parser jupyter-notebook pandas parquet postgresql python

Last synced: 15 Apr 2025

https://github.com/algunion/htmlforge.jl

Flexible HTML parsing and manipulation in the Julia programming language

html html-manipulation html-parser html5 htmx julia

Last synced: 15 Apr 2025