Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with html-parser
A curated list of projects in awesome lists tagged with html-parser .
https://github.com/fb55/htmlparser2
The fast & forgiving HTML and XML parser
dom html html-parser htmlparser2 javascript parser xml
Last synced: 16 Dec 2024
https://github.com/oblac/jodd
Jodd! Lightweight. Java. Zero dependencies. Use what you like.
aop database html-parser http-client ioc java java8 jodd jquery json-parser mail micro-framework utility-library
Last synced: 16 Dec 2024
https://github.com/posthtml/posthtml
PostHTML is a tool to transform HTML/XML with JS plugins
html html-parser parser posthtml transformer xml xml-parser
Last synced: 16 Dec 2024
https://github.com/jsdf/react-native-htmlview
A React Native component which renders HTML content as native views
html html-parser html-renderer react react-component react-native
Last synced: 16 Dec 2024
https://github.com/zzzprojects/html-agility-pack
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
hap html-parser htmlagilitypack parse xpath
Last synced: 19 Dec 2024
https://github.com/tid-kijyun/Kanna
Kanna(鉋) is an XML/HTML parser for Swift.
Last synced: 09 Dec 2024
https://github.com/imangazaliev/didom
Simple and fast HTML and XML parser
dom html html-parser parser xml xml-parser xpath
Last synced: 18 Dec 2024
https://github.com/Imangazaliev/DiDOM
Simple and fast HTML and XML parser
dom html html-parser parser xml xml-parser xpath
Last synced: 08 Nov 2024
https://github.com/philss/floki
Floki is a simple HTML parser that enables search for nodes using CSS selectors.
css-selector css-selectors elixir erlang fast-html floki hacktoberfest html-parser html5ever myhtml
Last synced: 16 Dec 2024
https://github.com/sub6resources/flutter_html
A Flutter widget for rendering static html as Flutter widgets (Will render over 80 different html tags!)
flutter flutter-html flutter-package flutter-widget flutter-widgets html-css html-parser html-tags
Last synced: 16 Dec 2024
https://github.com/Sub6Resources/flutter_html
A Flutter widget for rendering static html as Flutter widgets (Will render over 80 different html tags!)
flutter flutter-html flutter-package flutter-widget flutter-widgets html-css html-parser html-tags
Last synced: 03 Nov 2024
https://github.com/lexborisov/myhtml
Fast C/C++ HTML 5 Parser. Using threads.
Last synced: 15 Dec 2024
https://github.com/psharanda/atributika
Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.
attributedstring hashtag html html-parser ios nsattributedstring swift tags textkit uilabel uitextview
Last synced: 19 Dec 2024
https://github.com/psharanda/Atributika
Convert text with HTML tags, links, hashtags, mentions into NSAttributedString. Make them clickable with UILabel drop-in replacement.
attributedstring hashtag html html-parser ios nsattributedstring swift tags textkit uilabel uitextview
Last synced: 09 Dec 2024
https://github.com/yorickpeterse/oga
Oga is an XML/HTML parser written in Ruby.
html html-parser parser ruby xml xml-parser
Last synced: 21 Dec 2024
https://github.com/YorickPeterse/oga
Oga is an XML/HTML parser written in Ruby.
html html-parser parser ruby xml xml-parser
Last synced: 22 Nov 2024
https://github.com/cezheng/fuzi
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
css html html-parser html-parsing ios parser parsing swift xml xml-parser xml-parsing xpath
Last synced: 19 Dec 2024
https://github.com/cezheng/Fuzi
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
css html html-parser html-parsing ios parser parsing swift xml xml-parser xml-parsing xpath
Last synced: 09 Dec 2024
https://github.com/lexborisov/modest
Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.
c css css-parser css-selector html html-parser html-renderer pure-c
Last synced: 20 Dec 2024
https://github.com/lexborisov/Modest
Modest is a fast HTML renderer implemented as a pure C99 library with no outside dependencies.
c css css-parser css-selector html html-parser html-renderer pure-c
Last synced: 27 Oct 2024
https://github.com/miso-belica/jusText
Heuristic based boilerplate removal tool
html-parser html-parsing python text-extraction
Last synced: 25 Oct 2024
https://github.com/miso-belica/justext
Heuristic based boilerplate removal tool
html-parser html-parsing python text-extraction
Last synced: 20 Dec 2024
https://github.com/antchfx/htmlquery
htmlquery is golang XPath package for HTML query.
go golang html html-parser xpath xpath-selector xpath2
Last synced: 26 Oct 2024
https://github.com/rajatomar788/pywebcopy
Locally saves webpages to your hard disk with images, css, js & links as is.
archive-tool crawler html html-parser mirror python web webpage
Last synced: 20 Nov 2024
https://github.com/bupt1987/html-parser
php html parser,类似与PHP Simple HTML DOM Parser,但是比它快好几倍
Last synced: 29 Oct 2024
https://github.com/zhegexiaohuozi/jsoupxpath
纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java.Just try it.
antlr4 html-parser jsoupxpath xpath
Last synced: 30 Nov 2024
https://github.com/b-fuze/deno-dom
Browser DOM & HTML parser in Deno
browser-dom deno dom html-parser rust typescript wasm
Last synced: 20 Dec 2024
https://github.com/duzun/hquery.php
An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.
broken-html crawler css-selectors domcrawler fast hquery html html-parser invalid-html jquery-like jquery-selectors parser php psr-0 psr-4 scraper selectors xml xml-parser
Last synced: 21 Dec 2024
https://github.com/MohamedRejeb/ksoup
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.
android html-parser kotlin kotlin-android kotlin-js kotlin-jvm kotlin-library kotlin-multiplatform kotlin-native parser parser-library parsing
Last synced: 10 Nov 2024
https://github.com/MohamedRejeb/Ksoup
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.
android html-parser kotlin kotlin-android kotlin-js kotlin-jvm kotlin-library kotlin-multiplatform kotlin-native parser parser-library parsing
Last synced: 07 Nov 2024
https://github.com/prettyhtml/prettyhtml
💅 The formatter for the modern web https://prettyhtml.netlify.com/
angular formatter html html-parser prettier rehype svelte vue web-components
Last synced: 17 Dec 2024
https://github.com/Prettyhtml/prettyhtml
💅 The formatter for the modern web https://prettyhtml.netlify.com/
angular formatter html html-parser prettier rehype svelte vue web-components
Last synced: 12 Nov 2024
https://github.com/ZhgChgLi/ZMarkupParser
ZMarkupParser is a pure-Swift library that helps you convert HTML strings into NSAttributedString with customized styles and tags.
cocoapods html html-converter html-parser html-renderer ios nsattributedstring swift swift-package textfield uikit uilabel uitextview
Last synced: 06 Dec 2024
https://github.com/ispras/dedoc
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
doc document-analysis document-content-extraction documents docx docx-parser excel html html-parser logical-structure-extraction ocr odt pdf pdf-parser scanned-documents table-of-contents table-recognition txt
Last synced: 15 Dec 2024
https://github.com/Hexilee/unhtml.rs
A magic html parser
html-parser rust rust-crate scraper
Last synced: 19 Nov 2024
https://github.com/hexilee/unhtml.rs
A magic html parser
html-parser rust rust-crate scraper
Last synced: 21 Dec 2024
https://github.com/acrazing/html5parser
A super tiny and fast html5 AST parser.
ast dom html html-parser html5 parser
Last synced: 21 Dec 2024
https://github.com/justinwilaby/sax-wasm
The first streamable, fixed memory XML, HTML, and JSX parser for WebAssembly.
html-parser jsx jsx-parser node parser rust sax-js webassembly xml xml-parser
Last synced: 21 Dec 2024
https://github.com/Swaagie/minimize
Minimize HTML
dom html-minifier html-parser javascript
Last synced: 28 Oct 2024
https://github.com/swaagie/minimize
Minimize HTML
dom html-minifier html-parser javascript
Last synced: 26 Oct 2024
https://github.com/AutoCSer/AutoCSer
AutoCSer is a high-performance RPC framework. AutoCSer 是一个以高效率为目标向导的整体开发框架。主要包括 TCP 接口服务框架、TCP 函数服务框架、远程表达式链组件、前后端一体 WEB 视图框架、ORM 内存索引缓存框架、日志流内存数据库缓存组件、消息队列组件、二进制 / JSON / XML 数据序列化 等一系列无缝集成的高性能组件。
cache-server code-generator gif html-parser html-title-crawler http-server json message-queue orm-cache raw-socket rpc serialization webview xml
Last synced: 13 Nov 2024
https://github.com/gerev/nsoup
NSoup is a .NET port of the jsoup (http://jsoup.org) HTML parser and sanitizer originally written in Java
c-sharp dot-net html-parser jsoup
Last synced: 12 Oct 2024
https://github.com/mykolaharmash/hyntax
Straightforward HTML parser for JavaScript
dom html html-parser javascript
Last synced: 22 Dec 2024
https://github.com/sihaelov/harser
Easy way for HTML parsing and building XPath
html html-parser parser python xpath
Last synced: 19 Nov 2024
https://github.com/jonascz/save-for-offline
Android app for saving webpages for offline reading.
android android-application html-files html-parser java offline offline-storage parser viewer web-scraping
Last synced: 07 Nov 2024
https://github.com/craigbarnes/lua-gumbo
Moved to https://gitlab.com/craigbarnes/lua-gumbo
dom html html-parser html5 lua parser sanitize-html
Last synced: 22 Nov 2024
https://github.com/Vectorized/Aris
Aris - A fast and powerful tool to write HTML in JS easily. Includes syntax highlighting, templates, SVG, CSS autofixing, debugger support and more...
angular babel css css-in-js css-prefixer css-preprocessor css3 frontend html-css-javascript html-parser html-template html5 jquery js js-framework jsx lodash react svg syntax-highlighting
Last synced: 28 Oct 2024
https://github.com/vectorized/aris
Aris - A fast and powerful tool to write HTML in JS easily. Includes syntax highlighting, templates, SVG, CSS autofixing, debugger support and more...
angular babel css css-in-js css-prefixer css-preprocessor css3 frontend html-css-javascript html-parser html-template html5 jquery js js-framework jsx lodash react svg syntax-highlighting
Last synced: 01 Nov 2024
https://github.com/rusterlium/html5ever_elixir
NIF wrapper of html5ever using Rustler
binding elixir erlang html-parser html5ever nif rustler
Last synced: 22 Dec 2024
https://github.com/jstedfast/htmlkit
A cross-platform .NET framework for parsing HTML
c-sharp html html-parser html5 parser
Last synced: 15 Dec 2024
https://github.com/huozhi/html2any
🌀 parse and convert html string to anything
html-parser rich-text xml-parser
Last synced: 10 Dec 2024
https://github.com/leafo/web_sanitize
Lua library for sanitizing, parsing, and editing untrusted HTML
css-sanitization html html-parser html-sanitization lua moonscript security
Last synced: 09 Nov 2024
https://github.com/softcircuits/htmlmonkey
Lightweight HTML/XML parser written in C#.
csharp dotnet html html-parser parser
Last synced: 11 Nov 2024
https://github.com/hexilee/unhtml
HTML unmarshaler for golang
go golang html-parser unmarshaller
Last synced: 08 Nov 2024
https://github.com/orottier/webpage-rs
Small Rust library to fetch info about a web page: title, description, language, HTTP info, RSS feeds, Opengraph, Schema.org, and more
html html-parser json-ld opengraph rust
Last synced: 20 Dec 2024
https://github.com/app-generator/html-parser
Html Parser - Html to Pug, Jinja2, Blade Converter | AppSeed
appseed developer-tools html-parser open-source pug
Last synced: 27 Oct 2024
https://github.com/naqvis/crystal-html5
Crystal implementation of HTML5-Compliant Tokenizer and Parser with XPath & CSS Selector support
crystal crystal-lang crystal-language crystal-shard css-selectors html-parser html-tokenizer html5 xpath2
Last synced: 02 Nov 2024
https://github.com/RobustaStudio/bkit
build a messenger bot using HTML
bot bot-framework facebook golang html-parser messenger messenger-bot robusta
Last synced: 29 Oct 2024
https://github.com/stevenvachon/handlebars-html-parser
Parse Handlebars and HTML.
handlebars html html-parser html5 nodejs whatwg
Last synced: 07 Nov 2024
https://github.com/f34nk/modest_ex
Elixir library to do pipeable transformations on html strings (with CSS selectors)
css-selector elixir html-parser html-renderer
Last synced: 01 Nov 2024
https://github.com/kshru9/web-crawler
A multithreaded web crawler using two mechanism - single lock and thread safe data structures
concurrency concurrent-data-structure cpp crawler data-structures html-parser lock multithreading openssl pagerank pthread reader-writer-lock search-engine socket threading threadsafe webcrawler website-downloader
Last synced: 28 Oct 2024
https://github.com/andreaspizsa/hypertag
🏎 The fastest HTML tag and attributes parser for JavaScript
html-parser javascript nodejs tag-parsing
Last synced: 01 Dec 2024
https://github.com/erdomke/bracketpipe
BracketPipe is a .NET library for building parsing and processing piplines for web languages like HTML, CSS, Javscript, SVG, and MathML
html-markdown html-minifier html-parser html-sanitization
Last synced: 28 Oct 2024
https://github.com/snapframework/xmlhtml
XML parser and renderer with HTML 5 quirks mode
haskell html-parser xml-parser
Last synced: 21 Dec 2024
https://github.com/hamedmasafi/tooska
css css-parser html html-parser json json-parser serializa
Last synced: 20 Nov 2024
https://github.com/oblac/jodd-lagarto
Java HTML parsers suite.
html html-parser java jquery parser
Last synced: 11 Nov 2024
https://github.com/mauriceconrad/xml-parser
A Node.js XML DOM, Parser & Stringifier.
crawler crawling dom html html-parser html-parsing xml xml-parser xml-parsing xml-schema
Last synced: 28 Oct 2024
https://github.com/yannickperrenet/bookmarkdown
✅ Parse your browser's exported HTML bookmark file to Markdown.
brave brave-browser html-parser html-to-markdown html-to-md markdown
Last synced: 25 Nov 2024
https://github.com/nuzulul/telegram-scraper
A simple Telegram channel scraper
bot html-parser scraper telegram telegram-bot telegram-channel-scraper telegram-channel-to-json telegram-scraper telegram-to-json web-scraper web-scraping
Last synced: 12 Nov 2024
https://github.com/inversoft/prime-transformer
Fast Java8 BBCode & HTML parser and transformation library.
bbcode bbcode-parser html-parser java8
Last synced: 07 Nov 2024
https://github.com/serpapi/google-local-results-ai-parser
A ruby gem to extract structured data from Google Local Search Results using the serpapi/bert-base-local-results model, enabling parsing, classification, and information extraction from English HTML content.
data-parsing html-parser information-extraction natural-language-processing opensource rubygem structured-data-extraction text-classification webscraping
Last synced: 20 Nov 2024
https://github.com/viur-framework/html5
Pure Python HTML abstraction layer, parser and interpreter
framework html html-parser html5 library pyjs pyodide python viur
Last synced: 07 Nov 2024
https://github.com/vincentlaucsb/pgreaper
A Python library for loading data from various formats into PostgreSQL databases.
convert-data csv-converter html-parser postgresql python sql sql-database sql-table sqlite3-database
Last synced: 06 Nov 2024
https://github.com/danny1113/html-parser-builder
A result builder that build HTML parser and transform HTML elements to strongly-typed result, inspired by RegexBuilder.
Last synced: 02 Nov 2024
https://github.com/imelgrat/feed-finder
A PHP class for extracting the URLs of RSS (1.0 and 2.0) and ATOM feeds associated to a page, as well as OPML outline documents.
atom atom-feed composer-package html-parser html-scraper opml opml-outline php regex regular-expression rss rss-feed rss-feed-scraper
Last synced: 19 Nov 2024
https://github.com/duncan3dc/domparser
Wrappers for the PHP DomDocument class to provide extra functionality for html/xml parsing
html html-parser php xml xml-parsing
Last synced: 09 Nov 2024
https://github.com/creeperyang/html-parser-lite
A light weight html parser and more.
html-parser html-parser-lite parser
Last synced: 28 Oct 2024
https://github.com/algunion/htmlforge.jl
Flexible HTML parsing and manipulation in Julia Programming Language
html html-manipulation html-parser html5 htmx julia
Last synced: 11 Dec 2024
https://github.com/lorien/lxmlbench
Simple benchmark to test CPU speed by parsing HTML with lxml
benchmark cpu-benchmark cpu-speed html html-parser lxml python python3 speedtest
Last synced: 13 Nov 2024
https://github.com/victornpb/benchmark-html-parser-libraries
A Benchmark of javascript libraries for parsing HTML (CPU/RAM)
benchmark html html-parser htmlparser2 jsdom nodejs parse5 parser
Last synced: 21 Oct 2024
https://github.com/imingyu/forgiving-xml-parser
An XML/HTML parser and serializer for JavaScript.
forgiving-xml-parser html html-parser html-parsing html2js html2json javascript json parser serializer transformation typescript xml xml-parser xml-parsing xml2js xml2json
Last synced: 15 Nov 2024
https://github.com/ariary/jsextractor
Fastly gather all JavaScript from url (CLi+TUI)
bug-bounty cli extract extractor html-parser javascript js parser pentest recon tui web-pentest xss
Last synced: 11 Nov 2024
https://github.com/ocramz/twelve
Like @11ty , but this goes up to 12
html-parser html-template-engine html-templating static-site-generator static-website-generation template-engine
Last synced: 15 Oct 2024
https://github.com/kevinhermawan/markup2json
A library for converting HTML and XML into JSON
html-parser html-to-json html2json javascript typescript xml-parser xml-to-json xml2json
Last synced: 06 Dec 2024
https://github.com/rohitawate/domengine
DOM manipulation engine written in C++.
cpp11 dom-manipulation dom-tree html-parser interactive-shell interpreter
Last synced: 12 Oct 2024
https://github.com/lrswss/akal2ical
Perl-Skript um Abfuhrtermine des AfA Karlsruhe für den angegebenen Straßenzug abzurufen und als iCal-Datei zu speichern
converter garbageday german html-parser ical karlsruhe perl-script
Last synced: 28 Sep 2024
https://github.com/karambir/ugc-colleges
Python Script to extract college names from UGC, India website.
college crawler extract html-parser python python-script ugc
Last synced: 12 Dec 2024
https://github.com/tbjgolden/deno-htmlparser2
Deno port of `htmlparser2`
deno html html-parser htmlparser2 parser
Last synced: 17 Dec 2024
https://github.com/vitkarpov/fast-xml-parser
🚀 Is a fast XML parser in TypeScript with zero dependencies
dom html-parser html-parser-library javascript typescript
Last synced: 27 Oct 2024
https://github.com/sunshineplan/node
HTML parsing library, the alternative to BeautifulSoup in Golang.
beautifulsoup css-selectors generic go golang html-parser xpath xpath-query
Last synced: 06 Nov 2024
https://github.com/Anikeshpatel/dompy
JavaScript Dom Api for Python, Html Parser and a Web scraping tool in python
dompy html-parser htmlparser javascript-dom-api python3 pythondomapi webscraping
Last synced: 11 Oct 2024
https://github.com/iamareebjamal/get_results
Python Script to download results of whole class/branch by providing attendance Excel file.
attendance html-parser python-script result-analysis student-information
Last synced: 29 Nov 2024
https://github.com/coderosh/docpa
A simple library that I use for web scraping. Uses htmlparser2 to parse dom.
docpa document-parser dom html-parser
Last synced: 21 Nov 2024
https://github.com/nix1707/webscrapper-browserextension
Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.
chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping
Last synced: 12 Dec 2024
https://github.com/timolinn/nginb
[WIP] This is a hobbyist browser engine written in Go
browser-engine html-parser parsers
Last synced: 28 Nov 2024