Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with html-parser

A curated list of projects in awesome lists tagged with html-parser .

https://github.com/fb55/htmlparser2

The fast & forgiving HTML and XML parser

dom html html-parser htmlparser2 javascript parser xml

Last synced: 16 Dec 2024

https://github.com/oblac/jodd

Jodd! Lightweight. Java. Zero dependencies. Use what you like.

aop database html-parser http-client ioc java java8 jodd jquery json-parser mail micro-framework utility-library

Last synced: 16 Dec 2024

https://github.com/posthtml/posthtml

PostHTML is a tool to transform HTML/XML with JS plugins

html html-parser parser posthtml transformer xml xml-parser

Last synced: 16 Dec 2024

https://github.com/jsdf/react-native-htmlview

A React Native component which renders HTML content as native views

html html-parser html-renderer react react-component react-native

Last synced: 16 Dec 2024

https://github.com/zzzprojects/html-agility-pack

Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.

hap html-parser htmlagilitypack parse xpath

Last synced: 19 Dec 2024

https://github.com/tid-kijyun/Kanna

Kanna(鉋) is an XML/HTML parser for Swift.

html-parser swift xml-parser

Last synced: 09 Dec 2024

https://github.com/imangazaliev/didom

Simple and fast HTML and XML parser

dom html html-parser parser xml xml-parser xpath

Last synced: 18 Dec 2024

https://github.com/Imangazaliev/DiDOM

Simple and fast HTML and XML parser

dom html html-parser parser xml xml-parser xpath

Last synced: 08 Nov 2024

https://github.com/philss/floki

Floki is a simple HTML parser that enables search for nodes using CSS selectors.

css-selector css-selectors elixir erlang fast-html floki hacktoberfest html-parser html5ever myhtml

Last synced: 16 Dec 2024

https://github.com/sub6resources/flutter_html

A Flutter widget for rendering static html as Flutter widgets (Will render over 80 different html tags!)

flutter flutter-html flutter-package flutter-widget flutter-widgets html-css html-parser html-tags

Last synced: 16 Dec 2024

https://github.com/Sub6Resources/flutter_html

A Flutter widget for rendering static html as Flutter widgets (Will render over 80 different html tags!)

flutter flutter-html flutter-package flutter-widget flutter-widgets html-css html-parser html-tags

Last synced: 03 Nov 2024

https://github.com/lexborisov/myhtml

Fast C/C++ HTML 5 Parser. Using threads.

c html html-parser pure-c

Last synced: 15 Dec 2024

https://github.com/psharanda/atributika

Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.

attributedstring hashtag html html-parser ios nsattributedstring swift tags textkit uilabel uitextview

Last synced: 19 Dec 2024

https://github.com/psharanda/Atributika

Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.

attributedstring hashtag html html-parser ios nsattributedstring swift tags textkit uilabel uitextview

Last synced: 09 Dec 2024

https://github.com/yorickpeterse/oga

Oga is an XML/HTML parser written in Ruby.

html html-parser parser ruby xml xml-parser

Last synced: 21 Dec 2024

https://github.com/YorickPeterse/oga

Oga is an XML/HTML parser written in Ruby.

html html-parser parser ruby xml xml-parser

Last synced: 22 Nov 2024

https://github.com/cezheng/fuzi

A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

css html html-parser html-parsing ios parser parsing swift xml xml-parser xml-parsing xpath

Last synced: 19 Dec 2024

https://github.com/cezheng/Fuzi

A fast & lightweight XML & HTML parser in Swift with XPath & CSS support

css html html-parser html-parsing ios parser parsing swift xml xml-parser xml-parsing xpath

Last synced: 09 Dec 2024

https://github.com/lexborisov/modest

Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.

c css css-parser css-selector html html-parser html-renderer pure-c

Last synced: 20 Dec 2024

https://github.com/lexborisov/Modest

Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.

c css css-parser css-selector html html-parser html-renderer pure-c

Last synced: 27 Oct 2024

https://github.com/miso-belica/jusText

Heuristic based boilerplate removal tool

html-parser html-parsing python text-extraction

Last synced: 25 Oct 2024

https://github.com/miso-belica/justext

Heuristic based boilerplate removal tool

html-parser html-parsing python text-extraction

Last synced: 20 Dec 2024

https://github.com/antchfx/htmlquery

htmlquery is golang XPath package for HTML query.

go golang html html-parser xpath xpath-selector xpath2

Last synced: 26 Oct 2024

https://github.com/clj-commons/hickory

HTML as data

clojure html-parser

Last synced: 20 Dec 2024

https://github.com/rajatomar788/pywebcopy

Locally saves webpages to your hard disk with images, css, js & links as is.

archive-tool crawler html html-parser mirror python web webpage

Last synced: 20 Nov 2024

https://github.com/bupt1987/html-parser

php html parser,类似与PHP Simple HTML DOM Parser,但是比它快好几倍

html html-parser parser

Last synced: 29 Oct 2024

https://github.com/zhegexiaohuozi/jsoupxpath

纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java.Just try it.

antlr4 html-parser jsoupxpath xpath

Last synced: 30 Nov 2024

https://github.com/b-fuze/deno-dom

Browser DOM & HTML parser in Deno

browser-dom deno dom html-parser rust typescript wasm

Last synced: 20 Dec 2024

https://github.com/duzun/hquery.php

An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.

broken-html crawler css-selectors domcrawler fast hquery html html-parser invalid-html jquery-like jquery-selectors parser php psr-0 psr-4 scraper selectors xml xml-parser

Last synced: 21 Dec 2024

https://github.com/MohamedRejeb/ksoup

Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.

android html-parser kotlin kotlin-android kotlin-js kotlin-jvm kotlin-library kotlin-multiplatform kotlin-native parser parser-library parsing

Last synced: 10 Nov 2024

https://github.com/MohamedRejeb/Ksoup

Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.

android html-parser kotlin kotlin-android kotlin-js kotlin-jvm kotlin-library kotlin-multiplatform kotlin-native parser parser-library parsing

Last synced: 07 Nov 2024

https://github.com/prettyhtml/prettyhtml

💅 The formatter for the modern web https://prettyhtml.netlify.com/

angular formatter html html-parser prettier rehype svelte vue web-components

Last synced: 17 Dec 2024

https://github.com/Prettyhtml/prettyhtml

💅 The formatter for the modern web https://prettyhtml.netlify.com/

angular formatter html html-parser prettier rehype svelte vue web-components

Last synced: 12 Nov 2024

https://github.com/ZhgChgLi/ZMarkupParser

ZMarkupParser is a pure-Swift library that helps you convert HTML strings into NSAttributedString with customized styles and tags.

cocoapods html html-converter html-parser html-renderer ios nsattributedstring swift swift-package textfield uikit uilabel uitextview

Last synced: 06 Dec 2024

https://github.com/ispras/dedoc

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

doc document-analysis document-content-extraction documents docx docx-parser excel html html-parser logical-structure-extraction ocr odt pdf pdf-parser scanned-documents table-of-contents table-recognition txt

Last synced: 15 Dec 2024

https://github.com/Hexilee/unhtml.rs

A magic html parser

html-parser rust rust-crate scraper

Last synced: 19 Nov 2024

https://github.com/hexilee/unhtml.rs

A magic html parser

html-parser rust rust-crate scraper

Last synced: 21 Dec 2024

https://github.com/acrazing/html5parser

A super tiny and fast html5 AST parser.

ast dom html html-parser html5 parser

Last synced: 21 Dec 2024

https://github.com/justinwilaby/sax-wasm

The first streamable, fixed memory XML, HTML, and JSX parser for WebAssembly.

html-parser jsx jsx-parser node parser rust sax-js webassembly xml xml-parser

Last synced: 21 Dec 2024

https://github.com/AutoCSer/AutoCSer

AutoCSer is a high-performance RPC framework. AutoCSer 是一个以高效率为目标向导的整体开发框架。主要包括 TCP 接口服务框架、TCP 函数服务框架、远程表达式链组件、前后端一体 WEB 视图框架、ORM 内存索引缓存框架、日志流内存数据库缓存组件、消息队列组件、二进制 / JSON / XML 数据序列化 等一系列无缝集成的高性能组件。

cache-server code-generator gif html-parser html-title-crawler http-server json message-queue orm-cache raw-socket rpc serialization webview xml

Last synced: 13 Nov 2024

https://github.com/gerev/nsoup

NSoup is a .NET port of the jsoup (http://jsoup.org) HTML parser and sanitizer originally written in Java

c-sharp dot-net html-parser jsoup

Last synced: 12 Oct 2024

https://github.com/mykolaharmash/hyntax

Straightforward HTML parser for JavaScript

dom html html-parser javascript

Last synced: 22 Dec 2024

https://github.com/sihaelov/harser

Easy way for HTML parsing and building XPath

html html-parser parser python xpath

Last synced: 19 Nov 2024

https://github.com/craigbarnes/lua-gumbo

Moved to https://gitlab.com/craigbarnes/lua-gumbo

dom html html-parser html5 lua parser sanitize-html

Last synced: 22 Nov 2024

https://github.com/Vectorized/Aris

Aris - A fast and powerful tool to write HTML in JS easily. Includes syntax highlighting, templates, SVG, CSS autofixing, debugger support and more...

angular babel css css-in-js css-prefixer css-preprocessor css3 frontend html-css-javascript html-parser html-template html5 jquery js js-framework jsx lodash react svg syntax-highlighting

Last synced: 28 Oct 2024

https://github.com/vectorized/aris

Aris - A fast and powerful tool to write HTML in JS easily. Includes syntax highlighting, templates, SVG, CSS autofixing, debugger support and more...

angular babel css css-in-js css-prefixer css-preprocessor css3 frontend html-css-javascript html-parser html-template html5 jquery js js-framework jsx lodash react svg syntax-highlighting

Last synced: 01 Nov 2024

https://github.com/rusterlium/html5ever_elixir

NIF wrapper of html5ever using Rustler

binding elixir erlang html-parser html5ever nif rustler

Last synced: 22 Dec 2024

https://github.com/jstedfast/htmlkit

A cross-platform .NET framework for parsing HTML

c-sharp html html-parser html5 parser

Last synced: 15 Dec 2024

https://github.com/huozhi/html2any

🌀 parse and convert html string to anything

html-parser rich-text xml-parser

Last synced: 10 Dec 2024

https://github.com/leafo/web_sanitize

Lua library for sanitizing, parsing, and editing untrusted HTML

css-sanitization html html-parser html-sanitization lua moonscript security

Last synced: 09 Nov 2024

https://github.com/softcircuits/htmlmonkey

Lightweight HTML/XML parser written in C#.

csharp dotnet html html-parser parser

Last synced: 11 Nov 2024

https://github.com/hexilee/unhtml

HTML unmarshaler for golang

go golang html-parser unmarshaller

Last synced: 08 Nov 2024

https://github.com/orottier/webpage-rs

Small Rust library to fetch info about a web page: title, description, language, HTTP info, RSS feeds, Opengraph, Schema.org, and more

html html-parser json-ld opengraph rust

Last synced: 20 Dec 2024

https://github.com/app-generator/html-parser

Html Parser - Html to Pug, Jinja2, Blade Converter | AppSeed

appseed developer-tools html-parser open-source pug

Last synced: 27 Oct 2024

https://github.com/naqvis/crystal-html5

Crystal implementation of HTML5-Compliant Tokenizer and Parser with XPath & CSS Selector support

crystal crystal-lang crystal-language crystal-shard css-selectors html-parser html-tokenizer html5 xpath2

Last synced: 02 Nov 2024

https://github.com/f34nk/modest_ex

Elixir library to do pipeable transformations on html strings (with CSS selectors)

css-selector elixir html-parser html-renderer

Last synced: 01 Nov 2024

https://github.com/andreaspizsa/hypertag

🏎 The fastest HTML tag and attributes parser for JavaScript

html-parser javascript nodejs tag-parsing

Last synced: 01 Dec 2024

https://github.com/erdomke/bracketpipe

BracketPipe is a .NET library for building parsing and processing piplines for web languages like HTML, CSS, Javscript, SVG, and MathML

html-markdown html-minifier html-parser html-sanitization

Last synced: 28 Oct 2024

https://github.com/snapframework/xmlhtml

XML parser and renderer with HTML 5 quirks mode

haskell html-parser xml-parser

Last synced: 21 Dec 2024

https://github.com/oblac/jodd-lagarto

Java HTML parsers suite.

html html-parser java jquery parser

Last synced: 11 Nov 2024

https://github.com/yannickperrenet/bookmarkdown

✅ Parse your browser's exported HTML bookmark file to Markdown.

brave brave-browser html-parser html-to-markdown html-to-md markdown

Last synced: 25 Nov 2024

https://github.com/zadean/htmerl

HTML Parser in Erlang

erlang html-parser html5

Last synced: 28 Oct 2024

https://github.com/inversoft/prime-transformer

Fast Java8 BBCode & HTML parser and transformation library.

bbcode bbcode-parser html-parser java8

Last synced: 07 Nov 2024

https://github.com/serpapi/google-local-results-ai-parser

A ruby gem to extract structured data from Google Local Search Results using the serpapi/bert-base-local-results model, enabling parsing, classification, and information extraction from English HTML content.

data-parsing html-parser information-extraction natural-language-processing opensource rubygem structured-data-extraction text-classification webscraping

Last synced: 20 Nov 2024

https://github.com/viur-framework/html5

Pure Python HTML abstraction layer, parser and interpreter

framework html html-parser html5 library pyjs pyodide python viur

Last synced: 07 Nov 2024

https://github.com/vincentlaucsb/pgreaper

A Python library for loading data from various formats into PostgreSQL databases.

convert-data csv-converter html-parser postgresql python sql sql-database sql-table sqlite3-database

Last synced: 06 Nov 2024

https://github.com/yeonjuan/es-html-parser

HTML parser for static analysis

html html-parser

Last synced: 11 Oct 2024

https://github.com/danny1113/html-parser-builder

A result builder that build HTML parser and transform HTML elements to strongly-typed result, inspired by RegexBuilder.

dsl html-parser swift

Last synced: 02 Nov 2024

https://github.com/imelgrat/feed-finder

A PHP class for extracting the URLs of RSS (1.0 and 2.0) and ATOM feeds associated to a page, as well as OPML outline documents.

atom atom-feed composer-package html-parser html-scraper opml opml-outline php regex regular-expression rss rss-feed rss-feed-scraper

Last synced: 19 Nov 2024

https://github.com/duncan3dc/domparser

Wrappers for the PHP DomDocument class to provide extra functionality for html/xml parsing

html html-parser php xml xml-parsing

Last synced: 09 Nov 2024

https://github.com/creeperyang/html-parser-lite

A light weight html parser and more.

html-parser html-parser-lite parser

Last synced: 28 Oct 2024

https://github.com/algunion/htmlforge.jl

Flexible HTML parsing and manipulation in Julia Programming Language

html html-manipulation html-parser html5 htmx julia

Last synced: 11 Dec 2024

https://github.com/lorien/lxmlbench

Simple benchmark to test CPU speed by parsing HTML with lxml

benchmark cpu-benchmark cpu-speed html html-parser lxml python python3 speedtest

Last synced: 13 Nov 2024

https://github.com/hean01/domx

HTML parser and DOM tree builder for rust

dom html-parser rust

Last synced: 16 Nov 2024

https://github.com/victornpb/benchmark-html-parser-libraries

A Benchmark of javascript libraries for parsing HTML (CPU/RAM)

benchmark html html-parser htmlparser2 jsdom nodejs parse5 parser

Last synced: 21 Oct 2024

https://github.com/ariary/jsextractor

Fastly gather all JavaScript from url (CLi+TUI)

bug-bounty cli extract extractor html-parser javascript js parser pentest recon tui web-pentest xss

Last synced: 11 Nov 2024

https://github.com/rohitawate/domengine

DOM manipulation engine written in C++.

cpp11 dom-manipulation dom-tree html-parser interactive-shell interpreter

Last synced: 12 Oct 2024

https://github.com/lrswss/akal2ical

Perl-Skript um Abfuhrtermine des AfA Karlsruhe für den angegebenen Straßenzug abzurufen und als iCal-Datei zu speichern

converter garbageday german html-parser ical karlsruhe perl-script

Last synced: 28 Sep 2024

https://github.com/karambir/ugc-colleges

Python Script to extract college names from UGC, India website.

college crawler extract html-parser python python-script ugc

Last synced: 12 Dec 2024

https://github.com/tbjgolden/deno-htmlparser2

Deno port of `htmlparser2`

deno html html-parser htmlparser2 parser

Last synced: 17 Dec 2024

https://github.com/romagny13/html-parser

TypeScript/JavaScript HTML Parser

html-parser typescript

Last synced: 30 Nov 2024

https://github.com/vitkarpov/fast-xml-parser

🚀 Is a fast XML parser in TypeScript with zero dependencies

dom html-parser html-parser-library javascript typescript

Last synced: 27 Oct 2024

https://github.com/sunshineplan/node

HTML parsing library, the alternative to BeautifulSoup in Golang.

beautifulsoup css-selectors generic go golang html-parser xpath xpath-query

Last synced: 06 Nov 2024

https://github.com/Anikeshpatel/dompy

JavaScript Dom Api for Python, Html Parser and a Web scraping tool in python

dompy html-parser htmlparser javascript-dom-api python3 pythondomapi webscraping

Last synced: 11 Oct 2024

https://github.com/iamareebjamal/get_results

Python Script to download results of whole class/branch by providing attendance Excel file.

attendance html-parser python-script result-analysis student-information

Last synced: 29 Nov 2024

https://github.com/coderosh/docpa

A simple library that I use for web scraping. Uses htmlparser2 to parse dom.

docpa document-parser dom html-parser

Last synced: 21 Nov 2024

https://github.com/nix1707/webscrapper-browserextension

Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.

chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping

Last synced: 12 Dec 2024

https://github.com/timolinn/nginb

[WIP] This is a hobbyist browser engine written in Go

browser-engine html-parser parsers

Last synced: 28 Nov 2024