Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Parsing

A grammar describes the syntax of a programming language, and might be defined in Backus-Naur form (BNF). A lexer performs lexical analysis, turning text into tokens. A parser takes tokens and builds a data structure like an abstract syntax tree (AST). The parser is concerned with structure: does the sequence of tokens fit the grammar? A compiler's front end typically combines a lexer and a parser built for a specific grammar; later stages analyze the resulting tree and generate code.
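As a rough illustration of the lexer and parser roles described above, here is a minimal Python sketch. The toy arithmetic grammar, the token kinds (`NUMBER`, `OP`), and the names `tokenize`, `Parser`, and `parse` are all assumptions made for this example; they are not taken from any project listed on this page.

```python
import re

# Hypothetical toy grammar in BNF (left recursion is handled with loops below):
#   <expr>   ::= <term>   | <expr> ("+" | "-") <term>
#   <term>   ::= <factor> | <term> ("*" | "/") <factor>
#   <factor> ::= NUMBER   | "(" <expr> ")"
TOKEN_RE = re.compile(r"\d+|\S")  # a run of digits, or any single non-space char


def tokenize(text):
    """Lexer: turn raw text into a list of (kind, value) tokens."""
    tokens = []
    for lexeme in TOKEN_RE.findall(text):
        if lexeme.isdecimal():
            tokens.append(("NUMBER", int(lexeme)))
        elif lexeme in "+-*/()":
            tokens.append(("OP", lexeme))
        else:
            raise SyntaxError(f"unexpected character {lexeme!r}")
    return tokens


class Parser:
    """Recursive-descent parser: checks tokens against the grammar, builds an AST."""

    def __init__(self, tokens):
        self.tokens = tokens
        self.pos = 0

    def peek(self):
        if self.pos < len(self.tokens):
            return self.tokens[self.pos]
        return ("EOF", None)

    def advance(self):
        token = self.peek()
        self.pos += 1
        return token

    def expr(self):
        node = self.term()
        while self.peek() in (("OP", "+"), ("OP", "-")):
            _, op = self.advance()
            node = (op, node, self.term())  # left-associative binary node
        return node

    def term(self):
        node = self.factor()
        while self.peek() in (("OP", "*"), ("OP", "/")):
            _, op = self.advance()
            node = (op, node, self.factor())
        return node

    def factor(self):
        kind, value = self.advance()
        if kind == "NUMBER":
            return value
        if (kind, value) == ("OP", "("):
            node = self.expr()
            if self.advance() != ("OP", ")"):
                raise SyntaxError("expected ')'")
            return node
        raise SyntaxError(f"unexpected token {value!r}")


def parse(text):
    """Run the lexer, then the parser; the AST is a nested tuple (op, left, right)."""
    parser = Parser(tokenize(text))
    tree = parser.expr()
    if parser.peek() != ("EOF", None):
        raise SyntaxError("unexpected trailing tokens")
    return tree


print(parse("1 + 2 * (3 - 4)"))
```

The nested-tuple AST is chosen only to keep the sketch short; a real parser would more likely use dedicated node classes and attach source positions to tokens for error reporting.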

https://github.com/kivanc57/nlp_data_visualization

This project provides Python scripts for analyzing and visualizing text data using efficient NLP methods. It includes tools for creating bar plots, histograms, pie charts, treemaps, violin plots, and word clouds, using libraries such as matplotlib, seaborn, wordcloud, spacy, and textblob.

data-science matplotlib nlp parsing plotting python spacy visualization

Last synced: 09 Oct 2024

https://github.com/seykes/parsing-freelance.ru

Collects all IT specialists from the site.

bs4 csv csv-parser parsing python

Last synced: 09 Nov 2024

https://github.com/mnemnion/jlpeg.jl

A bytecode Parsing Expression Grammar VM closely inspired by LPeg

parsing parsing-expression-grammars peg

Last synced: 12 Nov 2024

https://github.com/hansalozu/bincol-tgbot

Telegram bot that makes it easier to get information from the bincol.ru website. It parses the data of the electronic schedule and notifies about changes in it

nodejs parsing scraping-websites telegraf telegram telegram-bot

Last synced: 09 Nov 2024

https://github.com/rani-abha/mini-search-engine

This mini search engine should be programmed to perform parsing, crawling, indexing, and query-serving functions and return the results on a result page.

content-pages crawling indexing lucene parsing query-service rabittmq search-algorithm search-engine server spring-boot

Last synced: 15 Nov 2024

https://github.com/mcdostone/nom-config-in

A config.in parser written in Rust.

config-in linux-kernel nom parsing rust

Last synced: 17 Nov 2024

https://github.com/rryi/tokens.jl

Parse text into tokens, build memory-efficient token lists and trees.

lexer mutable-strings parser parsing string string-manipulation string-tree string-vector token tokenize

Last synced: 10 Nov 2024

https://github.com/davidvujic/clojure-email-experiment

Experimenting with parsing email files using Clojure

clojure email parsing

Last synced: 15 Oct 2024

https://github.com/dark-lon3r/aiomediagrabber

AioMediaGrabber: a Telegram bot built on the aiogram library that lets you download videos from TikTok 🎵 and tracks from SoundCloud 🎧! 🚀

aiogram aiogram-bot bot download downloader easy music os parser parsing python scraper telegram telegram-bot telegram-bot-api ukraine ukrainian video

Last synced: 12 Nov 2024

https://github.com/chewbaccayeti/octo_games_metrics

This repo is dedicated to scraping data (SteamDB, RedditAPI) and parsing it into front-end statistics with Highchart.js

ajax axios cheerio cors dotenv express hot-reload javascript momentjs nodejs nodemon parsing puppeteer react reddit scraping steam typescript webpack

Last synced: 10 Oct 2024

https://github.com/marinagorbacheva/information-retrieval

Website scraping, parsing, indexing and search.

indexing parsing scraping search

Last synced: 16 Nov 2024

https://github.com/nikolai2038/politech-get-schedule-for-google-calendar

Bash script to get a schedule for Google Calendar from https://my.spbstu.ru/

bash google-calendar ics parsing

Last synced: 14 Nov 2024

https://github.com/nikolai2038/moodle-downloader

Bash script to download videos from a Moodle page

bash curl downloader moodle parsing

Last synced: 14 Nov 2024

https://github.com/nikolai2038/webtoons-parser

Bash script to download images from www.webtoons.com

bash parsing webtoon webtoons

Last synced: 14 Nov 2024

https://github.com/607011/getopt-cpp

A sleek and slender C++'ish alternative to getopt and getopt_long

command-line cplusplus cplusplus-11 getopt getopt-long parsing

Last synced: 12 Oct 2024

https://github.com/jgarber623/microformats-ruby

A Ruby gem for parsing HTML documents containing microformats.

html indieweb microformats microformats2 parsing ruby rubygems

Last synced: 23 Oct 2024

https://github.com/sigmapitech/chaos

A tool to automatically apply bits of the Epitech norm

epitech epitech-norme from-scratch lexer linter normalization norminette parsing python

Last synced: 11 Nov 2024

https://github.com/felixschl/purescript-neodoc-parsing

Parsing library for PureScript, originally written as an efficient parser implementation for http://github.com/felixSchl/neodoc

parsing purescript

Last synced: 07 Nov 2024

https://github.com/thetimick/tivkparser

An application for retrieving various data from the VK social network

parser parsing vk vkontakte

Last synced: 26 Sep 2024

https://github.com/hansalemaos/wmicprocsdict

Converts the output of a PowerShell (Get-...)/wmic command to a dict

dict parsing powershell python wmic

Last synced: 12 Nov 2024

https://github.com/42yerevanprojects/minishell

As beautiful as a shell. Implementation of the 42 project minishell

42born2code 42school bash c execution parsing processes shell signal-handling

Last synced: 11 Nov 2024

https://github.com/adiepenbrock/rjvm

Parse JVM class files with Rust

jvm parsing rust

Last synced: 13 Oct 2024

https://github.com/drdaeman/destruct

Tiny library to parse binary structures into Python objects.

binary-data parsing python struct

Last synced: 22 Oct 2024

https://github.com/kubinyete/edi-php

A standard library for declaring EDI parsers

edi library parsing php php8

Last synced: 12 Nov 2024

https://github.com/kubinyete/getnet-edi-php

A standard library for parsing EDI files from Getnet.

acquirer conciliation edi getnet getnet-brasil library parser parsing

Last synced: 12 Nov 2024

https://github.com/kubinyete/pagseguro-edi-php

A straightforward library for sequential loading of EDI entries from the PagSeguro API

edi library pagbank pagseguro pagseguro-api pagseguro-php parsing php php8

Last synced: 12 Nov 2024

https://github.com/stereobooster/instaparsejs

What if context-free grammars were as easy to use as regular expressions? Clojure instaparse compiled to ESM

gll gll-parser instaparse parser parsing peg pegll

Last synced: 31 Oct 2024

https://github.com/pustovitdmytro/calendar-maker

Module that can transform .txt files into .csv or .ics (iCal) calendars

calendar csv ical parsing python3

Last synced: 11 Nov 2024

https://github.com/whothefluff/abap-jsonparser

JSON parser

parsing

Last synced: 14 Nov 2024

https://github.com/andreie91/java-lambda-expressions-exercises

A collection of problems which are solved with the use of lambda expressions, such as parsing a file and only saving objects which meet certain criteria.

java lambda-expressions parsing

Last synced: 14 Nov 2024

https://github.com/turbaszek/knotes

Minimalist kindle notes reader

api-service elm haskell parsing web-application

Last synced: 16 Oct 2024

https://github.com/julianalonso/jasoon

Swift JSON parsing library

json json-parser parsing parsing-library swift swift3

Last synced: 30 Oct 2024

https://github.com/matej-ch/html-helpers

Two simple classes for removing HTML tags and extracting file paths from tags

dom-manipulation domdocument html html-tags parsing php

Last synced: 14 Nov 2024

https://github.com/mahdrentys/chemin-rs

An enum-based router generator for rust, supporting query strings and i18n.

agnostic i18n parsing query-string router rust rust-crate rust-library

Last synced: 12 Oct 2024

https://github.com/yvann-ba/ft_printf

Custom `printf` implementation in C. Supports various format specifiers and variadic arguments

42projects parsing printf stringmanipulation

Last synced: 03 Nov 2024

https://github.com/yves-biener/zparser-html

HTML spec-compliant parser written in Zig.

html parsing tokenizer tree-structure zig

Last synced: 24 Oct 2024

https://github.com/viktor-akusoff/gallerythief

Simple Python package for scraping images from different search engines by prompt.

parsing searching web-scraping

Last synced: 13 Nov 2024

https://github.com/vadimkantorov/fasttsv

TSV parser for Python in pure vectorized NumPy code

numpy parsing simd tsv vectorized-code

Last synced: 13 Nov 2024

https://github.com/alextkdev/resume_parsing

A Python solution for parsing and sorting resumes from popular job websites.

beautifulsoup4 ci-cd flake8 parser parsing python3 selenium

Last synced: 11 Oct 2024

https://github.com/robertsaupe/php-json

PHP library for JSON and JSONC parsing

json jsonc library parsing php php8

Last synced: 14 Nov 2024

https://github.com/dylanlacey/gown

Gown helps you eliminate unwanted things from your Sauce Connect logs

parsing sauce-connect sauce-labs

Last synced: 16 Nov 2024

https://github.com/aggstam/btc-classifier

Python scripts suite for generating Bitcoin transactions graphs and analyzing them using machine learning.

bitcoin classification cryptocurrency graph machine-learning parsing python

Last synced: 13 Nov 2024

https://github.com/sintef/rusty-chunkenc

A Rust implementation of Prometheus' chunkenc library

chunkenc gorilla parsing prometheus rust-crate serialization time-series

Last synced: 13 Nov 2024

https://github.com/hyperimpose/minutia

Summarizing the content of internet services

html http hyperimpose hyperlink minutia parsing python url

Last synced: 13 Nov 2024

https://github.com/elifoster/headercount

A MediaWiki extension that adds a parser function to count the number of headers in a given page.

mediawiki-extension parsing

Last synced: 31 Oct 2024

https://github.com/abombalemba/ozon

Parsing Ozon site

ozon parsing python

Last synced: 08 Nov 2024

https://github.com/eel-moun/bashclone

This project is about building a minimalistic shell from scratch, emulating the behavior of Bash.

abstract-syntax-tree c collaboration git lexing parsing process shell system-design tokenizer

Last synced: 16 Nov 2024

https://github.com/lupcode/java-json

Robust and efficient JSON parser and generator that includes line and column tracker for error handling. Can handle strings as well as streams for parsing and generating.

api column-tracker efficient efficient-json-parser java java-8 java-json json json-api json-parser library parsing robust streams utf8 utf8-characters utf8-string

Last synced: 14 Nov 2024

https://github.com/fdero/minigrammar

A parser-generation library that uses Python metaprogramming to inject the parsing logic into user-defined AST classes

grammar grammar-parser metaprogramming parsing parsing-library python3 reflection

Last synced: 14 Nov 2024

https://github.com/mezoni/parser_combinator

Parser combinator is a collection of parsers that can be used to combine basic parsers to create parsers for more complex rules.

dart dart3 parser parser-combinators parser-library parsers parsing

Last synced: 13 Nov 2024

https://github.com/n-c0de-r/jobinterviewcode1

JavaScript parsing test for a student job

application-tests jobinterview parsing

Last synced: 15 Nov 2024

https://github.com/natsionalnoedostoyanie/metro_parser

Parser for the METRO site

aiohttp parsing poetry

Last synced: 15 Nov 2024

https://github.com/g3th/fit_file_decoder

Decodes '*.fit' files and returns readable values.

bytes data decoder fit-file hex parsing

Last synced: 15 Nov 2024

https://github.com/seantiz/dryfold-cli

A tool to help me plan C++ codebase migration ahead of time. Dryfold breaks the work down into kanban-board tasks.

cpp kanban kanban-boards parsing report-generator rust static-analysis system-design tree-sitter typescript

Last synced: 15 Nov 2024

https://github.com/touhi99/parsing

Parsing Course Project - A CYK implementation with CNF

cky cnf cyk-algorithm nltk parsing

Last synced: 17 Nov 2024

https://github.com/aivanf/aidantic

Data parsing and validation with OneOf using Python type hints

gamedev oneof parsing python python3 validation

Last synced: 08 Nov 2024

https://github.com/oxylabs/how-to-automate-recurring-scraping-jobs-with-scheduler

Automation is key when it comes to web data acquisition. If you have identical scraping jobs that you execute on a regular basis, let Scheduler do the work for you

github-python job-scheduler parsing python scheduler-service scraper web-scheduler

Last synced: 17 Nov 2024

https://github.com/s0fft/learning-lab

Code Notes & Test-Learn // Micro Pet-Projects: Python / Asynchrony / FastAPI / Django-Tastypie / Parsing / SQL / Docker / DS / ML / etc.

asynchrony data-science django docker fastapi jupyter-lab jupyter-notebook mashine-learning matplotlib notes numpy pandas parsing python3 scikit-learn seaborn sql sqlalchemy tastypie telegram-bot

Last synced: 15 Nov 2024

https://github.com/bilalhameed248/pdf-document-extraction

Python PDF-to-HTML Converter: Transforming PDF Documents into Structured HTML Tags. - Feb 2022 - Jun 2023

document extraction fitz parser parsing pdf pymupdf pymupdf-fitz python python3

Last synced: 15 Nov 2024

https://github.com/bilalhameed248/news-website-nwsfd

CodeIgniter Based News Website - Sep 2020 - Aug 2021

codeigniter css gmail htlm5 news parsing scraping

Last synced: 15 Nov 2024

https://github.com/govcraft/mti

A Rust library that implements type-safe, prefix-enhanced identifiers based on the TypeID Specification

crate database distributed-systems identifier parsing rust rust-lang rustlang string-extensions typeid typeid-implementation unique-identifier web-development web-identity

Last synced: 12 Oct 2024

https://github.com/styczynski/jnp1-task-1

A task assigned as part of a course at the University of Warsaw

c jnp parsing regex task warsaw-university

Last synced: 15 Nov 2024

https://github.com/sbstjn/link

Go package to parse links from strings

go golang parsing regexp

Last synced: 09 Nov 2024

https://github.com/iammosespaulr/nl2sympy

This is under construction, I'll push code when I make progress.

nlp nmt parsing sympy sympy-gamma

Last synced: 15 Nov 2024

https://github.com/flyer-dm/webdataparser

An application for parsing websites and exporting the data to various formats

daq data-collection-system data-export desktop-application flet parsing playwright

Last synced: 17 Nov 2024

https://github.com/akanoa/nom-stream-parser

A Rust library that lets you pass result data downstream from an upstream source

nom parsing performance rust stream

Last synced: 10 Nov 2024

https://github.com/arunanshub/bwscanner

Scan for technologies that match a given regex.

builtwith lxml parsing webscraping

Last synced: 18 Oct 2024
