Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-stars
A curated list of my GitHub stars!
https://github.com/kohei-shinden/awesome-stars
Last synced: 4 days ago
JSON representation
-
Python
- UKPLab/sentence-transformers - State-of-the-Art Text Embeddings
- allegro/allRank - allRank is a framework for training learning-to-rank neural models based on PyTorch.
- arezooSarvi/outlierbias - This repository contains the code used for the experiments in "On the Impact of Outlier Bias on User Clicks", which will be published at SIGIR 2023.
- YadaYuki/news-recommendation-llm - Pre-trained Large Language Model (BERT) Based News Recommendation using Python / PyTorch 🌎
- hakuhodo-technologies/scope-rl - SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
- orcax/PGPR - Reinforcement Knowledge Graph Reasoning for Explainable Recommendation
- oreilly-japan/building-search-app-w-ml - 『機械学習による検索ランキング改善ガイド』のサンプルコードのリポジトリ
- NVIDIA/Megatron-LM - Ongoing research training transformer models at scale
- thongnt99/learned-sparse-retrieval - Unified Learned Sparse Retrieval Framework
- tristandeleu/pytorch-maml-rl - Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch
- MichSchli/RelationPrediction - Implementation of R-GCNs for Relational Link Prediction
- yxuansu/OpenAlpaca - OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
- bitsandbytes-foundation/bitsandbytes - Accessible large language models via k-bit quantization for PyTorch.
- huggingface/peft - 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
- microsoft/LoRA - Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
- meta-llama/llama - Inference code for Llama models
- tatsu-lab/stanford_alpaca - Code and documentation to train Stanford's Alpaca models, and generate the data.
- THUIR/T2Ranking - T2Ranking: A large-scale Chinese benchmark for passage ranking.
- openai/chatgpt-retrieval-plugin - The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
- databrickslabs/dolly - Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
- BlinkDL/RWKV-LM - RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa
- HewlettPackard/swarm-learning - A simplified library for decentralized, privacy preserving machine learning
- earth-genome/ChatGeoPT - A very basic, very brittle proof of concept for an AI assistant for geospatial search
- m3dev/kannon - Kannon is a wrapper for the gokart library that allows gokart tasks to be easily executed in a distributed and parallel manner on multiple kubernetes jobs.
- Anaig/OpenAI-and-Cognitive-Search - Azure OpenAI integration as a custom skillset in Azure Cognitive Search
- hurutoriya/doraemon-himitsu-dogu-search - Doraemon Himitsu Dogu Japanese hybrid search based on Elascticsearch ANN x multi match
- namuan/dr-doc-search - Converse with book - Built with GPT-3
- karapostK/ProtoMF - This repository hosts the code and the additional materials for the paper "ProtoMF: Prototype-based Matrix Factorization for Effective and Explainable Recommendations" by Alessandro B. Melchiorre, Nav
- hcai-mms/ProtoMF - This repository hosts the code and the additional materials for the paper "ProtoMF: Prototype-based Matrix Factorization for Effective and Explainable Recommendations" by Alessandro B. Melchiorre, Nav
- shunk031/huggingface-datasets_JGLUE - JGLUE: Japanese General Language Understanding Evaluation for huggingface datasets
- pytorch/torchrec - Pytorch domain library for recommendation systems
- facebookresearch/dpr-scale - Scalable training for dense retrieval models.
- khanhnamle1994/MetaRec - PyTorch Implementations For A Series Of Deep Learning-Based Recommendation Models
- yao8839836/kg-bert - KG-BERT: BERT for Knowledge Graph Completion
- acheong08/ChatGPT-Proxy - Forward requests and inject cloudflare cookies
- acheong08/EdgeGPT - Reverse engineered API of Microsoft's Bing Chat AI
- audreyfeldroy/cookiecutter-pypackage - Cookiecutter template for a Python package.
- rixwew/pytorch-fm - Factorization Machine models in PyTorch
- yaoxingcheng/TLM - ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework
- cbfinn/maml_rl - Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
- run-llama/llama_index - LlamaIndex is a data framework for your LLM applications
- ielab/Counterfactual-DR - The official repository for 'Implicit Feedback for Dense Passage Retrieval: A Counterfactual Approach', Shengyao Zhuang, Hang Li and Guido Zuccon, SIGIR2022
- twjiang/graphSAGE-pytorch - A PyTorch implementation of GraphSAGE. This package contains a PyTorch implementation of GraphSAGE.
- kavehhassani/mvgrl
- NVIDIA/NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
- hppRC/bert-classification-tutorial - 【2023年版】BERTによるテキスト分類
- maguowei/starred - creating your own Awesome List by GitHub stars!
- sisinflab/elliot - Comprehensive and Rigorous Framework for Reproducible Recommender Systems Evaluation
- linkedin/greykite - A flexible, intuitive and fast forecasting library
- INK-USC/RE-Net - Recurrent Event Network: Autoregressive Structure Inference over Temporal Knowledge Graphs (EMNLP 2020)
- hungpthanh/GRU4REC-pytorch - An other implementation of GRU4REC using PyTorch
- openai/tiktoken - tiktoken is a fast BPE tokeniser for use with OpenAI's models.
- castorini/mr.tydi - Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.
- facebookresearch/contriever - Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
- rejasupotaro/amazon-product-search
- amazon-science/esci-data - Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search
- pakio/EsBM25SemanticHybridComparison - (Demo) Elasticsearch with ML node and ingest pipeline for hybrid search (Lexical + Semantic)
- fairlearn/fairlearn - A Python package to assess and improve fairness of machine learning models.
- marqo-ai/marqo - Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
- Zhang-xiaokun/CoHHN - Session-based Recommendation, CoHHN, price preferences, interest preferences, Heterogeneous Hypergraph, Co-guided Learning, SIGIR2022
- elastic/rally - Macrobenchmarking framework for Elasticsearch
- RUCAIBox/DPGNN - Codes of Modeling Two-Way Selection Preference for Person-Job Fit
- cookiecutter/cookiecutter - A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
- XinyiYan/duelingBandits
- elastic/eland - Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
- xiaxin1998/OD-Rec - Codes for SIGIR'22 Paper 'On-Device Next-Item Recommendation with Self-Supervised Knowledge Distillation'
- facebookresearch/LAMA - LAnguage Model Analysis
- shenweichen/GraphEmbedding - Implementation and experiments of graph embedding algorithms.
- ubie-oss/esqa - Testing tool to verify the search qualities of the Elasticsearch indices
- po3rin/kuro2sudachi - kuro2sudachi lets you to convert kuromoji user dict to sudachi user dict.
- iesl/CSFCube - A Test Collection of Computer Science Papers for Faceted Query by Example
- allenai/scidocs - Dataset accompanying the SPECTER model
- allenai/aspire - Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.
- wildltr/ptranking - Learning to Rank in PyTorch
- sobamchan/xscitldr - X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022)
- getalp/wikIR - A python tool for building large scale Wikipedia-based Information Retrieval datasets
- microsoft/HittER - Hierarchical Transformers for Knowledge Graph Embeddings (EMNLP 2021)
- shivsahni/FireBaseScanner - The scripts helps security analsts to identify misconfigured firebase instances.
- webis-de/small-text - Active Learning for Text Classification in Python
- openwpm/OpenWPM - A web privacy measurement framework
- RUCAIBox/RecBole - A unified, comprehensive and efficient recommendation library
- RUCAIBox/RecSysDatasets - This is a repository of public data sources for Recommender Systems (RS).
- hand10ryo/PyTorchCML - PyTorchCML is a library of PyTorch implementations of matrix factorization (MF) and collaborative metric learning (CML), algorithms used in recommendation systems and data mining.
- yahoojapan/JGLUE - JGLUE: Japanese General Language Understanding Evaluation
- recommenders-team/recommenders - Best Practices on Recommendation Systems
- irecsys/DeepCARSKit - A Deep Learning Based Context-Aware Recommendation Library
- hppRC/template-pytorch-lightning-hydra-mlflow-poetry - An easy to use ML template project using Pytorch-Lightning, Hydra, Mlflow, and Poetry.
- texttron/tevatron - Tevatron - A flexible toolkit for neural retrieval research and development.
- allenai/dont-stop-pretraining - Code associated with the Don't Stop Pretraining ACL 2020 paper
- joisino/wordtour - Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)
- cbfinn/maml - Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
- wuliwei9278/SSE-PT - Codes and Datasets for paper RecSys'20 "SSE-PT: Sequential Recommendation Via Personalized Transformer" and NurIPS'19 "Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers"
- encode/uvicorn - An ASGI web server, for Python. 🦄
- LibRerank-Community/LibRerank - LibRerank is a toolkit for re-ranking algorithms. There are a number of re-ranking algorithms, such as PRM, DLCM, GSF, miDNN, SetRank, EGRerank, Seq2Slate.
- rish-16/grafog - Graph Data Augmentation Library for PyTorch Geometric
- PaddlePaddle/PaddleNLP - 👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search,
- PaddlePaddle/ERNIE - Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
- naver/splade - SPLADE: sparse neural search (SIGIR21, SIGIR22)
- easezyc/WSDM2022-PTUPCDR - This is the official implementation of our paper Personalized Transfer of User Preferences for Cross-domain Recommendation (PTUPCDR), which has been accepted by WSDM2022.
- DyGRec/TGSRec
- mquad/sars_tutorial - Repository for the tutorial on Sequence-Aware Recommender Systems held at TheWebConf 2019 and ACM RecSys 2018
- hidasib/GRU4Rec - GRU4Rec is the original Theano implementation of the algorithm in "Session-based Recommendations with Recurrent Neural Networks" paper, published at ICLR 2016 and its follow-up "Recurrent Neural Netwo
- cheungdaven/DeepRec - An Open-source Toolkit for Deep Learning based Recommendation with Tensorflow.
- mlco2/codecarbon - Track emissions from Compute and recommend ways to reduce their impact on the environment.
- horovod/horovod - Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
- benfred/implicit - Fast Python Collaborative Filtering for Implicit Feedback Datasets
- joisino/private-recsys - Code for "Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data?" (SDM 2022)
- Coder-Yu/QRec - QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)
- S-aiueo32/lpips-pytorch - A simple and useful implementation of LPIPS.
- castorini/bertserini - BERTserini
- microsoft/MSMARCO-Passage-Ranking-Submissions - Submission archive for the MS MARCO passage ranking leaderboard
- drivendataorg/cookiecutter-data-science - A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
- awslabs/sockeye - Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
- hltcoe/HC4 - HLTCOE CLIR Common-Crawl Collection
- hltcoe/patapsco - Cross language information retrieval pipeline
- JunMa11/SegLossOdyssey - A collection of loss functions for medical image segmentation
- unicamp-dl/mMARCO - A multilingual version of MS MARCO passage ranking dataset
- lpworld/PURS
- mindsdb/mindsdb - The platform for building AI from enterprise data
- snap-stanford/GreaseLM - [ICLR 2022 spotlight]GreaseLM: Graph REASoning Enhanced Language Models for Question Answering
- vespa-engine/pyvespa - Python API for https://vespa.ai, the open big data serving engine
- jina-ai/finetuner - :dart: Task-oriented embedding tuning for BERT, CLIP, etc.
- jtibshirani/text-embeddings
- GemsLab/REGAL - Representation learning-based graph alignment based on implicit matrix factorization and structural embeddings
- benedekrozemberczki/role2vec - A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
- maxuewei2/RiWalk - Code for the ICDM 2019 Paper "RiWalk: Fast Structural Node Embedding via Role Identification".
- GemsLab/MultiLENS - Framework for latent network summarization: bridging network embedding and summarization
- tadpole/DRNE - The Implementation of "Deep Recursive Network Embedding with Regular Equivalence"(KDD 2018)
- bmitra-msft/TREC-Deep-Learning-Quick-Start
- argonism/piyo - 🐣 esa API v1 client library, written in python
- Albert-Ma/PROP - WSDM'2021, PROP and SIGIR'2021,B-PROP
- caojiangxia/BiGI - [WSDM 2021]Bipartite Graph Embedding via Mutual Information Maximization
- GemsLab/StrucEmbedding-GraphLibrary - The SEMB library is an easy-to-use tool for getting and evaluating structural node embeddings in graphs.
- PaddlePaddle/PaddleSeg - Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Mat
- jina-ai/jina - ☁️ Build multimodal AI applications with cloud-native stack
- AkariAsai/CORA - This is the official implementation of NeurIPS 2021 "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval".
- facebookresearch/DPR - Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
- cl-tohoku/AIO2_DPR_baseline - https://www.nlp.ecei.tohoku.ac.jp/projects/aio/
- hical/HiCAL - HiCAL is a system for efficient high-recall retrieval with an adaptable assessing interface.
- PengjieRen/CaSE_WISE - This repo contains the code and data used in the paper "Wizard of Search Engine: Access to Information Through Conversations with Search Engines"
- PrefectHQ/prefect - Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
- Sujit-O/pykg2vec - Python library for knowledge graph embedding and representation learning.
- baharefatemi/SimplE - Implementation of SimplE Embedding for Link Prediction in Knowledge Graphs in PyTorch
- Mehran-k/SimplE - SimplE Embedding for Link Prediction in Knowledge Graphs
- mnick/holographic-embeddings - Code for experiments in the AAAI 2016 paper "Holographic Embeddings of Knowledge Graphs"
- leoribeiro/struc2vec - This repository provides a reference implementation of struc2vec.
- susheels/gnns-and-local-assortativity - This repo contains a reference implementation for the paper "Breaking the Limit of Graph Neural Networks by Improving the Assortativity of Graphs with Local Mixing Patterns"
- DLR-RM/stable-baselines3 - PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
- krishnanlab/node2vecplus_benchmarks
- shenweichen/DeepMatch - A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.
- megagonlabs/ginza - A Japanese NLP Library using spaCy as framework based on Universal Dependencies
- kanjirz50/mnist-pipeline-sample
- m3dev/thunderbolt - gokart file manager
- DaoD/COCA - CIKM 2021: Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking
- fajieyuan/SIGIR2021_Conure - Pre-training and Lifelong learning for User Embedding and Recommender System
- EthanRosenthal/torchmf - matrix factorization in PyTorch
- m3dev/cookiecutter-gokart - cookiecutter for gokart
- krishnanlab/PecanPy - A fast, parallelized, memory efficient, and cache-optimized Python implementation of node2vec
- spotify/luigi - Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
- m3dev/gokart - Gokart solves reproducibility, task dependencies, constraints of good code, and ease of use for Machine Learning Pipeline.
- vaexio/vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
- dask/dask - Parallel computing with task scheduling
- annamalai-nr/subgraph2vec_gensim - Contains the code (and working vm setup) for our KDD MLG 2016 paper titled: "subgraph2vec: Learning Distributed Representations of Rooted Sub-graphs from Large Graphs"
- ultralytics/yolov5 - YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
- sansan-inc/OneNER
- st-tech/zr-obp - Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
- nttcslab/japanese-dialog-transformers - Code for evaluating Japanese pretrained models provided by NTT Ltd.
- khoadoan/GraphOTSim
- NicolasHug/Surprise - A Python scikit for building and analyzing recommender systems
- polm/ipadic-py - IPAdic packaged for easy use from Python.
- luyug/COIL - NAACL2021 - COIL Contextualized Lexical Retriever
- Textualize/rich - Rich is a Python library for rich text and beautiful formatting in the terminal.
- megagonlabs/t5-japanese - Codes to pre-train Japanese T5 models
- yagays/wikipedia_graph_embedding
- plasticityai/magnitude - A fast, efficient universal vector embedding utility package.
- xiangwang1223/knowledge_graph_attention_network - KGAT: Knowledge Graph Attention Network for Recommendation, KDD2019
- megagonlabs/jrte-corpus - Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
- facebookresearch/hydra - Hydra is a framework for elegantly configuring complex applications
- nikhilbarhate99/PPO-PyTorch - Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
- Linear95/BinarySentEmb - Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.
- OpenMined/PyDP - The Python Differential Privacy Library. Built on top of: https://github.com/google/differential-privacy
- IBM/differential-privacy-library - Diffprivlib: The IBM Differential Privacy Library
- hanjialiang/DeepRec - Code for the WWW 2021 paper - DeepRec: On-device Deep Learning for Privacy-Preserving Sequential Recommendation in Mobile Commerce
- tmu-nlp/JapaneseWordSimilarityDataset - Japanese Word Similarity Dataset
- plotly/plotly.py - The interactive graphing library for Python :sparkles: This project now includes Plotly Express!
- informagi/REL - REL: Radboud Entity Linker
- robustness-gym/robustness-gym - Robustness Gym is an evaluation toolkit for machine learning.
- microsoft/nni - An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
- Layout-Parser/layout-model-training - The scripts for training Detectron2-based Layout Models on popular layout analysis datasets
- allenai/ir_datasets - Provides a common interface to many IR ranking datasets.
- ibm-research-tokyo/diffsnn - An implementation of a differentiable point process and a differentiable spiking neural network.
- coveooss/SIGIR-ecom-data-challenge
- WorksApplications/chikkarpy - Japanese synonym library
- microsoft/tf-gnn-samples - TensorFlow implementations of Graph Neural Networks
- tensorflow/ranking - Learning to Rank in TensorFlow
- medipixel/rl_algorithms - Structural implementation of RL key algorithms
- dongminlee94/deep_rl - PyTorch implementation of deep reinforcement learning algorithms
- liyinxiao/UnifiedEmbeddingModel - Implementation of unified embedding model from Embedding-based Retrieval in Facebook Search.
- thomas0809/GraphIE - A Graph-Based Framework for Information Extraction
- Layout-Parser/layout-parser - A Unified Toolkit for Deep Learning Based Document Image Analysis
- AdeDZY/DeepCT - DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.
- Khrylx/PyTorch-RL - PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
- ashkamath/mdetr
- lucidrains/perceiver-pytorch - Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
- beir-cellar/beir - A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
- liuyukid/transformers-ner - Pytorch-Named-Entity-Recognition-with-transformers
- stanford-futuredata/ColBERT - ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
- capreolus-ir/capreolus - A toolkit for end-to-end neural ad hoc retrieval
- facebookresearch/KILT - Library for Knowledge Intensive Language Tasks
- megagonlabs/bunkai - Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
- hoya012/deep_learning_object_detection - A paper list of object detection using deep learning.
- IntelLabs/academic-budget-bert - Repository containing code for "How to Train BERT with an Academic Budget" paper
- aladdinpersson/Machine-Learning-Collection - A resource for learning about Machine learning & Deep Learning
- snap-stanford/ogb - Benchmark datasets, data loaders, and evaluators for graph machine learning
- Zhiyu-Chen/SIGIR2020-BERT-Table-Search
- sebastian-hofstaetter/teaching - Open-Source Information Retrieval Courses @ TU Wien
- thunlp/OpenMatch - An Open-Source Package for Information Retrieval.
- rinnakk/japanese-pretrained-models - Code for producing Japanese pretrained models provided by rinna Co., Ltd.
- erikbern/ann-benchmarks - Benchmarks of approximate nearest neighbor libraries in Python
- garrettj403/SciencePlots - Matplotlib styles for scientific plotting
- EleutherAI/gpt-neox - An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
- EleutherAI/gpt-neo - An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
- TheAtticusProject/cuad - CUAD (NeurIPS 2021)
- instillai/deep-learning-roadmap - :satellite: All You Need to Know About Deep Learning - A kick-starter
- floodsung/Deep-Learning-Papers-Reading-Roadmap - Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
- terrier-org/pyterrier - A Python framework for performing information retrieval experiments, building on http://terrier.org/
- cmacdonald/pyterrier_bert
- microsoft/MSMARCO-Document-Ranking - MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, and passage/document ranking
- WorksApplications/SudachiDict - A lexicon for Sudachi
- facebookresearch/PyTorch-BigGraph - Generate embeddings from large-scale graph-structured data.
- explosion/spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
- castorini/pyserini - Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
- JudyYe/zero-shot-gcn - Zero-Shot Learning with GCN (CVPR 2018)
- microsoft/DeepSpeed - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
- online-judge-tools/oj - Tools for various online judges. Downloading sample cases, generating additional test cases, testing your code, and submitting it.
- lucidrains/bottleneck-transformer-pytorch - Implementation of Bottleneck Transformer in Pytorch
- nyu-dl/dl4marco-bert
- TheAlgorithms/Python - All Algorithms implemented in Python
- lucidrains/DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
- shanglianlm0525/PyTorch-Networks - Pytorch implementation of cnn network
- castorini/docTTTTTquery - docTTTTTquery document expansion model
- zihangdai/xlnet - XLNet: Generalized Autoregressive Pretraining for Language Understanding
- taesunwhang/UMS-ResSel - PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"
- facebookresearch/fairseq - Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
- Georgetown-IR-Lab/OpenNIR - An end-to-end neural ad-hoc ranking pipeline.
- Georgetown-IR-Lab/cedr - Code for CEDR: Contextualized Embeddings for Document Ranking, accepted at SIGIR 2019.
- sigmeta/distillation-BERT - knowledge distillation on BERT
- AdeDZY/K-NRM - K-NRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling
- sebastian-hofstaetter/neural-ranking-drmm - Implementation and Evaluation of "A Deep Relevance Matching Model for Ad-hoc Retrieval"
- AdeDZY/SIGIR19-BERT-IR - Repo of code and data for SIGIR-19 short paper "Deeper Text Understanding for IR with Contextual NeuralLanguage Modeling"
- HHousen/TransformerSum - Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
- google-research/batch_rl - Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
- microsoft/TREC-2019-Deep-Learning - Website for the TREC Deep Learning Track 2019
- diffbot/knowledge-net - KnowledgeNet: A Benchmark Dataset for Knowledge Base Population
- universome/class-norm - Class Normalization for Continual Zero-Shot Learning
- congyingxia/ZeroShotCapsule - Code for Paper "Zero-shot User Intent Detection via Capsule Neural Networks".
- pengbaolin/SC-GPT - Few-shot Natural Language Generation for Task-Oriented Dialog
- yumeng5/LOTClass - [EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach
- namisan/mt-dnn - Multi-Task Deep Neural Networks for Natural Language Understanding
- spipm/Depix - Recovers passwords from pixelized screenshots
- castorini/duobert - Multi-stage passage ranking: monoBERT + duoBERT
- cl-tohoku/bert-japanese - BERT models for Japanese text.
- singletongue/WikiEntVec - Distributed representations of words and named entities trained on Wikipedia.
- onnx/onnx - Open standard for machine learning interoperability
- yinwenpeng/BenchmarkingZeroShot
- spotify-research/cosernn - Code for the paper "Contextual and Sequential User Embeddings for Large-Scale Music Recommendation".
- google-deepmind/jraph - A Graph Neural Network Library in Jax
- lightly-ai/lightly - A python library for self-supervised learning on images.
- google-research/electra - ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
- hottolink/hottoSNS-w2v - hottoSNS-w2v: 日本語大規模SNS+Webコーパスによる単語分散表現モデル
- lgalke/vec4ir - Word Embeddings for Information Retrieval
- mpkato/pyNTCIREVAL - Python version of NTCIREVAL http://research.nii.ac.jp/ntcir/tools/ntcireval-en.html
- fastapi/fastapi - FastAPI framework, high performance, easy to learn, fast to code, ready for production
- google-research/bert - TensorFlow code and pre-trained models for BERT
- yagays/embedrank - Python Implementation of EmbedRank
- takuseno/d4rl-atari - Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)
- takuseno/d4rl-pybullet - Datasets for data-driven deep reinforcement learning with PyBullet environments
- pytorch/pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
- google-research/text-to-text-transfer-transformer - Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
- mpkato/interleaving - A python library for conducting interleaving, which compares two or multiple rankers based on observed user clicks by interleaving their results.
- dorianbrown/rank_bm25 - A Collection of BM25 Algorithms in Python
- Hironsan/natural-language-preprocessings - Some recipes of natural language pre-processing
- flairNLP/flair - A very simple framework for state-of-the-art Natural Language Processing (NLP)
- hottolink/hottoSNS-bert - hottoSNS-BERT: 大規模SNSコーパスによる文分散表現モデル
- takuseno/d3rlpy - An offline deep reinforcement learning library
- xinyandai/string-embed - string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].
- google/budou - Budou is an automatic organizer tool for beautiful line breaking in CJK (Chinese, Japanese, and Korean).
- sebastianruder/NLP-progress - Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
- benedekrozemberczki/GraphWaveMachine - A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".
- kivy/pyjnius - Access Java classes from Python
- mocobeta/janome - Japanese morphological analysis engine written in pure Python
- castorini/birch - Document ranking via sentence modeling using BERT
- huggingface/transformers - 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
- oreilly-japan/deep-learning-from-scratch-2 - 『ゼロから作る Deep Learning ❷』(O'Reilly Japan, 2018)
- zalandoresearch/fashion-mnist - A MNIST-like fashion product database. Benchmark :point_down:
- c-bata/pysearch - Web crawler and Search engine in Python.
- HumanSignal/labelImg - LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source dat
- meokz/looking-to-listen - Deep neural network (DNN) for noise reduction, removal of background music, and speech separation
- h2oai/h2o-llmstudio - H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
- explodinggradients/ragas - Supercharge Your LLM Application Evaluations 🚀
- deepset-ai/haystack - :mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your d
- wandb/wandb - The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
- jina-ai/serve - ☁️ Build multimodal AI applications with cloud-native stack
- digital-go-jp/mojxml2geojson - 法務省登記所備付地図データ(地図XML)をGeoJSONに変換するコンバータ
- megagonlabs/asdc - Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)
- PaddlePaddle/RocketQA - 🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
- Lightning-AI/pytorch-lightning - Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
- googleapis/google-api-python-client - 🐍 The official Python client library for Google's discovery based APIs.
- PaddlePaddle/PaddleRec - Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESMM、E
- huggingface/datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
-
Dart
- Solido/awesome-flutter - An awesome list that curates the best Flutter libraries, tools, tutorials, articles and more.
- gskinnerTeam/flutter-folio - A platform adaptive Flutter app for desktop, mobile and web.
- flutter/flutter - Flutter makes it easy and fast to build beautiful apps for mobile and beyond
-
Others
- EbookFoundation/free-programming-books - :books: Freely available programming books
- ujjwalkarn/Machine-Learning-Tutorials - machine learning and deep learning tutorials, articles and other resources
- kunishou/databricks-dolly-15k-ja
- LIANGKE23/Awesome-Knowledge-Graph-Reasoning - AKGR: Awesome Knowledge Graph Reasoning is a collection of knowledge graph reasoning works, including papers, codes and datasets
- google-research/tuning_playbook - A playbook for systematically maximizing the performance of deep learning models.
- project-miracl/miracl - A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.
- OnYuKang/Recommendation-systems-paperlist - Papers about recommendation systems that I am interested in
- RUCAIBox/RecBole2.0 - An up-to-date, comprehensive and flexible recommendation library
- masamasa59/uncertainty-paper
- AustinNeverPee/FedRecPapers - A project for collecting and showing the current research progress of FedRec
- mercari/ml-system-design-pattern - System design patterns for machine learning
- lmcRS/AWS-recommendation-papers
- doc-analysis/TableBank - TableBank: A Benchmark Dataset for Table Detection and Recognition
- DEIM-Management-System/Manual
- apache/lucene-solr - Apache Lucene and Solr open-source search software
- ir100/ir100 - 情報検索100本ノック
- faneshion/HiNT - Implementation of Hierarchical Neural maTching model proposed in SIGIR'18 for ad-hoc retrieval
- chauff/conversationalIR - Overview of venues, research themes and datasets relevant for conversational search.
- yasoob/Flask-React-JWT - A tutorial on how to setup a JWT auth app using React and Flask and then deploy it using NGINX and Gunicorn.
- hideojoho/kb - Joho Lab Knowledge Base 上保研究室ナレッジベース 1.0
- zhoushengisnoob/DeepClustering - Methods and Implements of Deep Clustering
- imsheridan/DeepRec - 推荐、广告工业界经典以及最前沿的论文、资料集合/ Must-read Papers on Recommendation System and CTR Prediction
- thunlp/GNNPapers - Must-read papers on graph neural networks (GNN)
- faster-cpython/ideas
- tesseract-ocr/tessdata - Trained models with fast variant of the "best" LSTM models + legacy models
- FedML-AI/FedNLP - FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Research Version is Accepted to NAACL 2022
- hanjuku-kaso/awesome-offline-rl - An index of algorithms for offline reinforcement learning (offline-rl)
- wantedly/machine-learning-round-table - Gather around the table, and have a discussion to catch up the latest trend of machine learning 🤖
- mrdbourke/machine-learning-roadmap - A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
- sbharadwajj/awesome-zero-shot-learning - A curated list of papers, code and resources pertaining to zero shot learning
- virtualsecureplatform/MitouDocument - 未踏応募書類と成果報告書
- ycjing/Neural-Style-Transfer-Papers - :pencil2: Neural Style Transfer: A Review
- eugeneyan/applied-ml - 📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
- ml-tooling/best-of-ml-python - 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
- dformoso/machine-learning-mindmap - A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.
- openai/gpt-3 - GPT-3: Language Models are Few-Shot Learners
- DigitalNatureGroup/Remote_Voice_Recognition - リモートミーティングでの音声認識の活用事例
- arXivTimes/arXivTimes - repository to research & share the machine learning articles
- ossu/computer-science - 🎓 Path to a free self-taught education in Computer Science!
- kimtth/awesome-azure-openai-llm - "Awesome-AzureOpenAI-LLM: a curated list of Azure OpenAI & Large Language Models" 🔎References to Azure OpenAI, 🦙Large Language Models, and related 🌌 services and 🎋libraries.
-
Assembly
- HazyResearch/H3 - Language Modeling with the H3 State Space Model
-
C
- antimatter15/alpaca.cpp - Locally run an Instruction-Tuned Chat-Style LLM
- kohei-shinden/cintro - C introduction for a software engineering course
- AlexeyAB/darknet - YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )
- tarotez/cintro - C introduction for a software engineering course
-
C++
- ggerganov/llama.cpp - LLM inference in C/C++
- aksnzhy/xlearn - High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI int
- thunlp/KB2E - Knowledge Graph Embeddings including TransE, TransH, TransR and PTransE
- kuzudb/kuzu - Embeddable property graph database management system built for query speed and scalability. Implements Cypher.
- facebookresearch/StarSpace - Learning embeddings for classification, retrieval and ranking.
- DeepRec-AI/HybridBackend - A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
- tangjianpku/LINE - LINE: Large-scale information network embedding
- tensorflow/serving - A flexible, high-performance serving system for machine learning models
- polm/fugashi - A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
- xptree/LightNE
- bab2min/tomotopy - Python package of Tomoto, the Topic Modeling Tool
- OctoberChang/X-Transformer - X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text Classification
- microsoft/LightGBM - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tas
- cvangysel/pytrec_eval - pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
- yahoojapan/NGT - Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data
- nmslib/hnswlib - Header-only C++/python library for fast approximate nearest neighbors
- facebookresearch/faiss - A library for efficient similarity search and clustering of dense vectors.
- spotify/annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
- tensorflow/tensorflow - An Open Source Machine Learning Framework for Everyone
- kohei-shinden/cppintro - C++ introduction for a software engineering course
- TatsuyaShirakawa/poincare-embedding - Poincaré Embedding (unofficial)
-
Go
- YadaYuki/omochi - Full text search engine from scratch by Goʕ◔ϖ◔ʔ (Just a toy) 😊
- po3rin/eskeeper - eskeeper synchronizes index and alias with configuration files while ensuring idempotency.
- mosuka/wikipedia-jsonl - wikipedia-jsonl is a CLI that converts Wikipedia dump XML to JSON Lines format.
- isucon/isucon11-qualify - ISUCON11 予選 (ISUCONDITION)
- tkuchiki/alp - Access Log Profiler
- google/differential-privacy - Google's differential privacy libraries.
- isucon/isucon11-prior
- mathetake/intergo - A package for interleaving / multileaving ranking generation in go
- vdaas/vald - Vald. A Highly Scalable Distributed Vector Search Engine
- simonwhitaker/gibo - 🤫 Easy access to gitignore boilerplates
- arosh/ublacklist-stackoverflow-translation - Exclude machine-translated sites of Stack Exchange from Google search results
- line/line-bot-sdk-go - LINE Messaging API SDK for Go
- shogo82148/go-mecab - MeCab binding for Golang
- takatori/go-tinysearch
- ikawaha/kagome - Self-contained Japanese Morphological Analyzer written in pure Go
- neo4j-examples/golang-bolt-movie-example - Movie example for the golang neo4j bolt driver
- neo4j/neo4j-go-driver - Neo4j Bolt Driver for Go
- souring001/illumi-packet - Illuminating Packets on an Ethernet Cable using LED Strip.
- maxence-charriere/go-app - A package to build progressive web apps with Go programming language and WebAssembly.
- ynqa/wego - Word Embeddings in Go!
-
HTML
- f/awesome-chatgpt-prompts - This repo includes ChatGPT prompt curation to use ChatGPT better.
- recsyslab/recsys-text
- themesberg/neumorphism-ui-bootstrap - Neumorphism inspired UI Kit: web components, sections and pages in neumorphic style built with Bootstrap CSS Framework
- NeuCLIR/neuclir.github.io - Website for the NeuCLIR track at NIST TREC.
- tesseract-ocr/tessdoc - Tesseract documentation
- KnowledgeGraphJapan/KGRC-RDF - RDF data for Knowledge Graph Reasoning Challenge.
- UTokyo-ICEPP/qc-workbook - Quantum computing workbook by ICEPP
- ir-anthology/ir-anthology - Software for building the IR Anthology.
- facebookresearch/fastText - Library for fast text representation and classification.
- docker/awesome-compose - Awesome Docker Compose samples
- HugoBlox/hugo-blox-builder - 🚨 GROW YOUR AUDIENCE WITH HUGOBLOX! 🚀 HugoBlox is an easy, fast no-code website builder for researchers, entrepreneurs, data scientists, and developers. Build stunning sites in minutes. 适合研究人员、企业家、数据科
-
Java
- irecsys/CARSKit - Java-Based Context-aware Recommendation Library
- vespa-engine/vespa - AI + Data, online. https://vespa.ai
- terrier-org/terrier-core - Terrier IR Platform
- ppasupat/web-entity-extractor-ACL2014
- tmanabe/RankLibBM25F-IA - A Learning to Rank Library
- castorini/anserini - Anserini is a Lucene toolkit for reproducible information retrieval research
- spring-io/start.spring.io - https://start.spring.io
- elastic/elasticsearch - Free and Open Source, Distributed, RESTful Search Engine
- E869120/math-algorithm-book - 拙著『「アルゴリズム×数学」が基礎からしっかり身につく本』(2021/12/25 発売)の GitHub ページです。演習問題の解答や、C++ 以外のソースコードなどが掲載されています。ぜひご活用ください。
-
JavaScript
- iterative/cml - ♾️ CML - Continuous Machine Learning | CI/CD for ML
- HumanSignal/label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format
- lmenezes/cerebro
- joisino/clear - A fully user-side image search engine. Accepted to CIKM 2022 demo track.
- AKAspanion/ui-neumorphism - 📕 React component library designed on the "new skeuomorphism" or "neumorphism" UI/UX trend.
- marcotcr/lime - Lime: Explaining the predictions of any machine learning classifier
- decomoji/decomoji - Custom icon collection for slack reaction
- kenkawakenkenke/sleeping_population - Script to estimate the % of the population of the world sleeping at any point in time
- AMAI-GmbH/AI-Expert-Roadmap - Roadmap to becoming an Artificial Intelligence Expert in 2022
- xdevplatform/Twitter-API-v2-sample-code - Sample code for the Twitter API v2 endpoints
- overleaf/overleaf - A web-based collaborative LaTeX editor
- takuseno/minerva - An out-of-the-box GUI tool for offline deep reinforcement learning
- appbaseio/reactivesearch - Search UI components for React and Vue
- Templarian/slack-emoji-pokemon - Slack Pokemon Emojis
-
Julia
- sammy-suyama/BayesBook - 「機械学習スタートアップシリーズ ベイズ推論による機械学習入門」のサンプルコード
-
Jupyter Notebook
- tira-io/ir-experiment-platform
- facebookresearch/LASER - Language-Agnostic SEntence Representations
- EleutherAI/pythia - The hub for EleutherAI's work on interpretability and learning dynamics
- kunishou/Japanese-Alpaca-LoRA
- tloen/alpaca-lora - Instruct-tune LLaMA on consumer hardware
- state-spaces/s4 - Structured state space sequence models
- ntcirtransfer/transfer1 - Repository for NTCIR-17 Transfer Task
- oshizo/JapaneseEmbeddingEval
- oshizo/JapaneseEmbeddingTrain
- oshizo/gpt_index_japanese_trial
- microsoft/MSMARCO-Passage-Ranking - MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, and passage ranking. A variant of this task will be the part of
- microsoft/AI-For-Beginners - 12 Weeks, 24 Lessons, AI for All!
- gasteigerjo/ppnp - PPNP & APPNP models from "Predict then Propagate: Graph Neural Networks meet Personalized PageRank" (ICLR 2019)
- graphdeeplearning/benchmarking-gnns - Repository for benchmarking graph neural networks
- nlp-with-transformers/notebooks - Jupyter notebooks for the Natural Language Processing with Transformers book
- fastforwardlabs/session_based_recommenders - Official repo for FF19: Session-based Recommender Systems
- Netflix-Skunkworks/rl_for_budget_constrained_recs
- malteos/aspect-document-embeddings - Code, dataset & models for the paper Specialized Document Embeddings for Aspect-based Similarity of Research Papers (#JCDL2022)
- usydnlp/Glocal_K - This repository contains code for paper GLocal-K: Global and Local Kernels for Recommender Systems, published in CIKM 2021
- AtomScott/SportsLabKit - A python package for turning sports video into csv files
- suhara/cis6930-fall2021 - Course materials for Fall 2021 "CIS6930 Topics in Computing for Data Science" at New College of Florida
- logics-of-blue/book-decision-analysis-and-forecast - 書籍「意思決定分析と予測の活用 基礎理論からPython実装まで」のサンプルコードとデータ
- kwakaba/mlpr-class
- oreilly-japan/RecommenderSystems - 『推薦システム実践入門』のリポジトリ
- pytorch/opacus - Training PyTorch models with differential privacy
- TUMLegalTech/bva-citation-prediction
- recsyslab/recsys-python
- ProsusAI/finBERT - Financial Sentiment Analysis with BERT
- chakki-works/chABSA-dataset - chakki's Aspect-Based Sentiment Analysis dataset
- yuyay/DEIM2022_XAI_tutorial
- tomomano/learn-aws-by-coding - コードで学ぶAWS入門
- parrt/dtreeviz - A python library for decision tree visualization and model interpretation.
- giannisnik/segk - Learning Structural Node Representations using Graph Kernels
- snap-stanford/graphwave
- ferencberes/online-node2vec - Node Embeddings in Dynamic Graphs
- chokkan/mlnote - 機械学習帳
- Zziwei/Fairness-in-Cold-Start-Recommendation - Code for our SIGIR 2021 paper :'Fairness among New Items in Cold Start Recommender Systems'
- kinit-sk/yaudit-recsys-2021 - Supplementary material for the paper published at ACM RecSys 2021 and its extended version accepted to ACM TORS journal
- tensorflow/docs-l10n - Translations of TensorFlow documentation
- usaito/recsys2021-tutorial - https://sites.google.com/cornell.edu/recsys2021tutorial
- terrier-org/cikm2021tutorial
- arthurdouillard/deepcourse - Learn the Deep Learning for Computer Vision in three steps: theory from base to SotA, code in PyTorch, and space-repetition with Anki
- atsukoba/LabelEstimator - Simple Unsupervised Document Labeling with MeCab and Pretrained Doc2Vec Model and some experiments about `Doc2Vec.infer_vector()`
- stockmarkteam/bert-book - 「BERTによる自然言語処理入門: Transformersを使った実践プログラミング」サポートページ
- takitsuba/midoribon
- jkatsuta/18_4q_doc2vec - 18-4Q blog's note
- vespa-engine/sample-apps - Repository of sample applications for https://vespa.ai, the open big data serving engine
- nvedant07/Fairness-Through-Robustness - Official code for FAccT'21 paper "Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning" https://arxiv.org/abs/2006.12621
- fchollet/deep-learning-with-python-notebooks - Jupyter notebooks for the code samples of the book "Deep Learning with Python"
- Building-ML-Pipelines/building-machine-learning-pipelines - Code repository for the O'Reilly publication "Building Machine Learning Pipelines" by Hannes Hapke & Catherine Nelson
- google-research/vision_transformer
- successar/AttentionExplanation
- neomatrix369/awesome-ai-ml-dl - Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
- PacktPublishing/Mastering-Big-Data-Analytics-with-PySpark - Mastering Big Data Analytics with PySpark, Published by Packt
- hundredblocks/ml-powered-applications - Companion repository for the book Building Machine Learning Powered Applications
- oreilly-japan/ml-at-work - 電子書籍『仕事ではじめる機械学習』のサポートリポジトリです
- castorini/pygaggle - a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
- sonoisa/t5-japanese - 日本語T5モデル
- terrier-org/ecir2021tutorial
- mpkato/Statistical-Reform-in-IR - Statistical Reform in Information Retrieval? (Tetsuya Sakai, SIGIR Forum, 2014)
- MrMimic/data-scientist-roadmap - Toturials coming with the "data science roadmap" picture.
- aws/amazon-sagemaker-examples - Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
- stephenleo/adventures-with-ann - All the code for a series of Medium articles on Approximate Nearest Neighbors
- sbharadwajj/embarrassingly-simple-zero-shot-learning - This is a started/demo code for Zero-Shot-Learning via implementation of Embarrassingly simple ZSL (ICML 2015)
- oreilly-japan/deep-learning-from-scratch - 『ゼロから作る Deep Learning』(O'Reilly Japan, 2016)
- google-research/google-research - Google Research
- lukas/ml-class - Machine learning lessons and teaching projects designed for engineers
- wandb/awesome-dl-projects - This is a collection of the code that accompanies the reports in The Gallery by Weights & Biases.
- karl-zhao/benchmarking-gnns-pyg
- YutaroOgawa/pytorch_tutorials_jp - PyTorch公式チュートリアル(日本語翻訳版)の各ノートブックファイル(Google Colab用)です
- patil-suraj/onnx_transformers - Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.
- nekoumei/DocumentClassificationUsingBERT-Japanese
- marcotcr/checklist - Beyond Accuracy: Behavioral Testing of NLP models with CheckList
- Hironsan/awesome-embedding-models - A curated list of awesome embedding models tutorials, projects and communities.
- yoheikikuta/bert-japanese - BERT with SentencePiece for Japanese text.
- pytorch/tutorials - PyTorch tutorials.
- HOXOMInc/feature-engineering-book - 『機械学習のための特徴量エンジニアリング』のサンプルコード集
- maruyama097/coq-tutorial - coq-tutorial
- tarotez/pyml - Machine learning course using Python
- matsuolab-edu/dl4us
- langchain-ai/langchain - 🦜🔗 Build context-aware reasoning applications
- csienslab/RAD-domain-analysis - Investigating Advertisers’ Domain-changing Behaviors and Their Impacts on Ad-blocker Filter Lists
- c4ra/kaken_parse_grants_masterxml - KAKENデータベースからXMLファイルをダウンロードして、課題ごとに必要な項目をローカルのデータベースに保存します。
- YutaroOgawa/pytorch_advanced - 書籍「つくりながら学ぶ! PyTorchによる発展ディープラーニング」の実装コードを配置したリポジトリです
-
Kotlin
- WorksApplications/elasticsearch-sudachi - The Japanese analysis plugin for elasticsearch
-
MDX
- openai/openai-cookbook - Examples and guides for using the OpenAI API
-
Makefile
- ymmt2005/grpc-tutorial - gRPC tutorial for Japanese readers
- golang-standards/project-layout - Standard Go Project Layout
- instill-ai/instill-core - 🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
-
Markdown
- OAI/OpenAPI-Specification - The OpenAPI Specification Repository
-
Mojo
- modularml/mojo - The Mojo Programming Language
-
OpenEdge ABL
- sunfanyunn/graph-classification - A collection of graph classification methods
- declare-lab/multimodal-deep-learning - This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.
-
OpenQASM
- Qiskit/qiskit-metapackage - Qiskit is an open-source SDK for working with quantum computers at the level of circuits, algorithms, and application modules.
-
R
- ghmagazine/cibook
- sansan-inc/lighthergm - Fit, Simulate, and Diagnose Hierarchical Exponential-Family Models for Networks in A Scalable Way
-
Roff
- pyenv/pyenv - Simple Python version management
- odashi/small_parallel_enja - 50k English-Japanese Parallel Corpus for Machine Translation Benchmark.
-
Ruby
- twitter/twurl - OAuth-enabled curl for the Twitter API
-
Rust
- qdrant/qdrant - Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
- meilisearch/meilisearch - A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
- jjfiv/fastrank - My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".
- kampersanda/tongrams-rs - Rust library providing fast language model queries in compressed space
- daac-tools/vaporetto - 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
- mooreniemi/suntan - A utility to dump Elasticsearch Lucene shards into Tantivy.
- valeriansaliou/sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
- mosuka/bayard - A full-text search and indexing server written in Rust.
- timvisee/advent-of-code-2021 - 🎄 My Advent of Code solutions in Rust. http://adventofcode.com/2021
- huggingface/tokenizers - 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
- TheAlgorithms/Rust - All Algorithms implemented in Rust
- pola-rs/polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
- quickwit-oss/tantivy-py - Python bindings for Tantivy
- quickwit-oss/tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
- denoland/deno - A modern runtime for JavaScript and TypeScript.
- shargri-la/shargri-la - A transaction-level sharding simulator for protocol testing against users' behavior on a sharded blockchain.
- lindera/lindera - A multilingual morphological analysis library.
- daac-tools/daachorse - 🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.
-
SCSS
- kn1cht/doctor-funding-calendar - Calendar of doctoral students funding programs
-
Scala
- twitter/the-algorithm - Source code for Twitter's Recommendation Algorithm
- aditya-grover/node2vec
- dstlry/dstlr - scalable knowledge graph construction from unstructured text
- apache/spark - Apache Spark - A unified analytics engine for large-scale data processing
-
Shell
- MorinoseiMorizo/jparacrawl-finetune - An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.
- yahoojapan/vespa-tutorial - Japanese tutorial for Vespa
- krishnanlab/PecanPy_benchmarks
- neologd/mecab-ipadic-neologd - Neologism dictionary based on the language resources on the Web for mecab-ipadic
- mrowa44/emojify - Emoji on the command line :scream:
- p1ass/mdtopdf - 🐳 Pandoc docker image for converting markdown to PDF using TeX typesetting
-
TeX
- HugoBlox/theme-academic-cv - 🎓 无需编写任何代码即可轻松创建漂亮的学术网站 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.
- HarisIqbal88/PlotNeuralNet - Latex code for making neural networks diagrams
-
TypeScript
- posit-dev/positron - Positron, a next-generation data science IDE
- minosvasilias/toolformer-zero - React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.
- askrella/whatsapp-chatgpt - ChatGPT + DALL-E + WhatsApp = AI Assistant :rocket: :robot:
- aws-amplify/amplify-js - A declarative JavaScript library for application development using cloud services.
- elastic/search-ui - Search UI. Libraries for the fast development of modern, engaging search experiences.
- slackapi/node-slack-sdk - Slack Developer Kit for Node.js
- adenohitu/hisui - 競技プログラミングサポートツール Support tool for Competitive programming (AtCoder GUI)
- kazuyaseki/figma-to-react - Simple generator of React code from Figma
- neo4j/graphql - A GraphQL to Cypher query execution layer for Neo4j and JavaScript GraphQL implementations.
- Tatamo/atcoder-cli - AtCoder command line tools
- mil-tokyo/webdnn - The Fastest DNN Running Framework on Web Browser
- Detaysoft/react-chat-elements - Reactjs chat elements chat UI, react chat components
- microsoft/vscode - Visual Studio Code
- Hirai0827/Dynamis
- ryo-ma/github-profile-trophy - 🏆 Add dynamically generated GitHub Stat Trophies on your readme
- kamranahmedse/developer-roadmap - Interactive roadmaps, guides and other educational content to help developers grow in their careers.
-
Vue
- emoji-gen/web-main - :tada: Ultimate Emoji Generator
-
LLVM
- shumaikunkun/Hanoi - 61種類のプログラミング言語でハノイの塔を実装【どなたでもプルリクお待ちしています☺️】
Programming Languages
Categories
Sub Categories
Keywords
deep-learning
76
machine-learning
75
python
53
pytorch
49
nlp
31
information-retrieval
25
natural-language-processing
25
tensorflow
20
data-science
19
recommender-system
17
transformers
17
neural-network
14
ai
14
search-engine
14
bert
14
golang
13
artificial-intelligence
13
search
11
rust
11
reinforcement-learning
11
computer-vision
11
elasticsearch
11
deep-neural-networks
11
gpt
11
language-model
11
collaborative-filtering
10
chatgpt
10
knowledge-graph
10
llm
9
graph-neural-networks
9
dataset
9
transformer
9
recommendation
9
awesome-list
8
ranking
8
typescript
8
recommendation-system
8
mlops
8
react
8
word2vec
8
go
8
deep-reinforcement-learning
7
java
7
matrix-factorization
7
ml
7
representation-learning
7
research
7
awesome
7
gpt-3
6
recommender-systems
6