Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by GateNLP
A curated list of projects in awesome lists by GateNLP .
https://github.com/gatenlp/ultimate-sitemap-parser
Ultimate Website Sitemap Parser
python python-3 python3 robots-txt sitemap sitemap-xml xml-sitemap xml-sitemap-parser
Last synced: 15 Jan 2025
https://github.com/gatenlp/gate-core
The GATE Embedded core API and GATE Developer application
Last synced: 18 Jan 2025
https://github.com/gatenlp/python-gatenlp
Python text processing, pattern matching, and NLP framework
annotations gatenlp language-engineering natural-language-processing nlp pattern-matching python python-gatenlp python3 text-processing
Last synced: 19 Dec 2024
https://github.com/gatenlp/broad_twitter_corpus
The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016)
Last synced: 13 Nov 2024
https://github.com/gatenlp/gateplugin-learningframework
A plugin for the GATE language technology framework for training and using machine learning models. Currently supports Mallet (MaxEnt, NaiveBayes, CRF and others), LibSVM, Scikit-Learn, Weka, and DNNs through Pytorch and Keras.
classification crf machine-learning nlp sequence-tagging
Last synced: 13 Nov 2024
https://github.com/gatenlp/semeval2019-hyperpartisan-bertha-von-suttner
SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution
Last synced: 13 Nov 2024
https://github.com/gatenlp/gateplugin-python
Python integration for the GATE framework
Last synced: 13 Nov 2024
https://github.com/gatenlp/bio-yodie
Bio-YODIE is GATE's biomedical named entity linking pipeline.
Last synced: 13 Nov 2024
https://github.com/gatenlp/mimir
Multi-paradigm Information Management Index and Repository
Last synced: 13 Nov 2024
https://github.com/gatenlp/cluster-embeddings
Simple script to create clusters from embeddings in word2vec format
Last synced: 13 Nov 2024
https://github.com/gatenlp/jaspell
Fork of http://jaspell.sourceforge.net to allow control over the character encoding used for the dictionary files.
Last synced: 13 Nov 2024
https://github.com/gatenlp/stanceclassifier
Stance Classifier for the WeVerify project
Last synced: 13 Nov 2024
https://github.com/gatenlp/gateplugin-stanford_corenlp
GATE wrappers for the Stanford CoreNLP tool set
Last synced: 13 Nov 2024
https://github.com/gatenlp/gate-teamware
A web application for collaborative document annotation.
Last synced: 13 Nov 2024
https://github.com/gatenlp/gate-lf-python-data
Python library for handling (dense) training/application data produced by the Learning Framework
Last synced: 13 Nov 2024
https://github.com/gatenlp/wpextract
Create datasets from WordPress sites for research or archiving
corpus crawler nlp text-extraction text-mining web-scraping wordpress
Last synced: 13 Nov 2024
https://github.com/gatenlp/gateplugin-dict-lemmatizer
A plugin for the GATE language technology framework for finding lemmata of words.
Last synced: 13 Nov 2024
https://github.com/gatenlp/gate-cloud-python-example
example of using the GATE Cloud on-line API
Last synced: 13 Jan 2025
https://github.com/gatenlp/emina
Emergent Informativeness and Actionability
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-tagger_googlenlp
GATE NLP plugin for the Google NLP
Last synced: 13 Jan 2025
https://github.com/gatenlp/tweet-network-gexf-generator
Tweet Network GEXF Generator
Last synced: 13 Jan 2025
https://github.com/gatenlp/clef2024_incrediblae_manual_evaluation_dataset
Manual evaluation dataset of CheckThat! Lab at CLEF 2024 Task 6: Robustness of Credibility Assessment with Adversarial Examples (InCrediblAE)
Last synced: 13 Jan 2025
https://github.com/gatenlp/tweet-rehydrater
Tool to take standoff annotations against a list of Tweets and merge them with the original text from Twitter
Last synced: 13 Jan 2025
https://github.com/gatenlp/surveykeywordsextraction
Keywords extraction from survey questions
Last synced: 13 Jan 2025
https://github.com/gatenlp/corpusconversion-bnc
Tool to convert the British National Corpus to GATE format
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-jape_plus
An alternative, usually more efficient and faster, JAPE implementation
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-tagger_syntaxnet
A GATE plugin for using a Google Tensorflow Serving SyntaxNet server
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-lang_chinese
Support for processing Chinese documents
Last synced: 13 Jan 2025
https://github.com/gatenlp/bio-yodie-resource-prep
Scripts to prepare the informational resources required by GATE Bio-YODIE.
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-modularpipelines
A plugin for the GATE language technology framework that helps creating modular pipelines and parametrizing them
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-jdbclookup
A plugin for the GATE language technology framework for adding and updating annotations from a JDBC table.
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-metamaplite
A GATE plugin wrapping MetaMapLite.
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-gazetteer_ontology_based
An ontology based gazetteer for GATE
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-lang_german
German language support for GATE
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-sentiment
Provides resources for Sentiment Analysis in GATE
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-cistem
A GATE wrapper around the CISTEM German Stemmer (see https://github.com/LeonieWeissweiler/CISTEM)
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-lang_danish
Support for processing Danish documents
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-unga
Information extraction for United Nations General Assembly Resolutions
Last synced: 13 Jan 2025
https://github.com/gatenlp/gate-lf-keras-sparse
A lightweight wrapper around keras mainly for use with the GATE LearningFramework plugin
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-groovy
Adds support for the Groovy scripting language to GATE as well as making GATE easier to use from Groovy scripts
Last synced: 13 Jan 2025
https://github.com/gatenlp/unga-search
Exploration webapp for the UN GA Mímir index.
Last synced: 13 Jan 2025
https://github.com/gatenlp/gate-dsl
Write GATE applications in a Groovy DSL.
Last synced: 13 Jan 2025
https://github.com/gatenlp/gate-lf-pytorch-json
PyTorch wrapper for the LearningFramework GATE plugin
Last synced: 13 Jan 2025
https://github.com/gatenlp/corpusconversion-conll2003
Tool/scripts to help converting the CoNLL 2003 corpora to GATE format
Last synced: 13 Jan 2025
https://github.com/gatenlp/gate-lf-keras-json
Keras wrapper for the LearningFramework GATE plugin
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-java
A plugin for the GATE language technology framework that allows on-the fly use of Java programs as Processing Resources
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-liwc
A gate plugin to extract LIWC features
Last synced: 13 Jan 2025
https://github.com/gatenlp/cloud-client
Client library for the GATE Cloud REST APIs
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-crowd_sourcing
GATE plugin to interface with the CrowdFlower crowd sourcing platform
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-format_datasift
Document Format plugin to support reading DataSift JSON files
Last synced: 13 Jan 2025
https://github.com/gatenlp/sklearn-wrapper
A lightweight wrapper around scikit-learn for the GATE LearningFramework plugin
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-tagger_tagme
GATE NLP plugin for the TagMe service
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-stringannotation
A plugin for the GATE language technology framework that provides gazetteer and regular expression annotator PRs for string annotation
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-format_twitter
Document Format plugin to support reading and writing Twitter style JSON files
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-twitter
A suite of tools designed for processing Tweets
Last synced: 13 Jan 2025
https://github.com/gatenlp/weka-wrapper
A very lightweight wrapper around Weka
Last synced: 13 Jan 2025
https://github.com/gatenlp/topicllm_granularity_hallucination
Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling
Last synced: 13 Jan 2025
https://github.com/gatenlp/cluster-brown4wikipedia
Tools to simplify creating brown clusters from Wikipedia dump files
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-documentnormalizer
Tools for normalizing documents before processing
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-tagger_datenormalizer
Tools for detecting and normalising date expressions
Last synced: 13 Jan 2025
https://github.com/gatenlp/corpusconversion-universal-dependencies
Tool to convert the Universal Dependencies Treebanks to GATE format
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-termraider
TermRaider is a set of term extraction and scoring tools.
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-format_alto
GATE document format for reading ALTO XML documents
Last synced: 13 Jan 2025
https://github.com/gatenlp/mimir-python
A client library for working with Mimir search in Python
Last synced: 13 Jan 2025
https://github.com/gatenlp/python-gatenlp-ml-tner
Token classification training and application using transformers via the tner package
deep-learning gatenlp machine-learning nlp python-gatenlp pytorch
Last synced: 13 Jan 2025
https://github.com/gatenlp/gate-lf-tests
Separate repository to contain the pre-configured ML tasks for testing
Last synced: 13 Jan 2025
https://github.com/gatenlp/gatelib-interaction
Various bits and pieces for interacting between GATE plugins and other software via command line or microservice
Last synced: 13 Jan 2025
https://github.com/gatenlp/food-sustainability
Alpro Project related files (and the dashboard)
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-annotatorframework
A generic framework for making use of external annotators in GATE
Last synced: 13 Jan 2025
https://github.com/gatenlp/gate-lf-tutorial-pos1
GATE LearningFramework tutorial on POS tagging
Last synced: 13 Jan 2025
https://github.com/gatenlp/usp-test-cassettes
VCR.py cassette files for Ultimate Sitemap Parser integration tests
Last synced: 10 Dec 2024
https://github.com/gatenlp/wv-covid19-annotation-exercise
WV Covid19 Annotation Exercise
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-format_fastinfoset
FastInfoset document format parser
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-format_bdoc
Store and load GATE documents represented as BasicDocument instances in JSON, YAML and MessagePack formats
gate-document json messagepack serialization yaml
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-corpusstats
A plugin for the GATE language technology framework for creating and storing corpus statistics like tf, df.
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-context
GATE plugin implementing the ConText algorithm (see http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2757457/)
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-evaluation
A plugin for the GATE language technology framework that provides API methods and tools for evaluating and analyzing of documents and corpora.
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateapplication-hyperpartisanclassification
GATE's Hyper-partisan scorer, based on SemEval 2019 Task 4
Last synced: 13 Jan 2025
https://github.com/gatenlp/gate-lf-tutorial-textclassification1
GATE LearningFramework tutorial on text classification
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-schema_tools
A set of tools for using XML schemas in GATE
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-stemmer_snowball
GATE wrapper for the Snowball stemmer (see http://snowballstem.org/)
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-tagger_corenlp
A GATE plugin for annotating documents using a CoreNLP server
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-inter_annotator_agreement
A PR for calculating numerous Inter-Annotator Agreement measures
Last synced: 13 Jan 2025
https://github.com/gatenlp/gateplugin-oscar4
GATE PR that uses OSCAR4 to annotate chemical named entities.
Last synced: 13 Jan 2025