Projects in Awesome Lists by amazon-science
A curated list of projects in awesome lists by amazon-science .
https://github.com/amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Last synced: 14 May 2025
https://github.com/amazon-science/chronos-forecasting
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
artificial-intelligence forecasting foundation-models huggingface huggingface-transformers large-language-models llm machine-learning pretrained-models time-series time-series-forecasting timeseries transformers
Last synced: 12 May 2025
https://github.com/amazon-science/auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
chain-of-thought gpt-3 gpt3-prompts gpt3-resources large-language-models prompt-engineering reasoning
Last synced: 15 May 2025
https://github.com/amazon-science/ragchecker
RAGChecker: A Fine-grained Framework For Diagnosing RAG
Last synced: 14 May 2025
https://github.com/amazon-science/siam-mot
SiamMOT: Siamese Multi-Object Tracking
computer-vision multi-object-tracking video-analysis
Last synced: 05 Apr 2025
https://github.com/amazon-science/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
computer-vision few-shot object-detection pretraining
Last synced: 16 May 2025
https://github.com/amazon-science/refchecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
Last synced: 15 May 2025
https://github.com/amazon-science/earth-forecasting-transformer
Official implementation of Earthformer
Last synced: 05 Apr 2025
https://github.com/amazon-science/RefChecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
Last synced: 04 Apr 2025
https://github.com/amazon-science/sccl
Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021
Last synced: 06 Apr 2025
https://github.com/amazon-research/sccl
Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021
Last synced: 05 Apr 2025
https://github.com/amazon-science/esci-data
Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search
Last synced: 12 Apr 2025
https://github.com/amazon-science/prompt-pretraining
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
Last synced: 19 Jul 2025
https://github.com/amazon-science/refined
ReFinED is an efficient and accurate entity linking (EL) system.
entity-extraction entity-linking entity-resolution nlp pytorch
Last synced: 13 Apr 2025
https://github.com/amazon-science/unconditional-time-series-diffusion
Official PyTorch implementation of TSDiff models presented in the NeurIPS 2023 paper "Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting"
diffusion-models neurips neurips-2023 pytorch time-series time-series-forecasting
Last synced: 08 Jul 2025
https://github.com/amazon-science/RAGChecker
RAGChecker: A Fine-grained Framework For Diagnosing RAG
Last synced: 15 Aug 2025
https://github.com/amazon-science/auction-gym
AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online advertising auctions.
advertising machine-learning real-time-bidding reinforcement-learning
Last synced: 30 Apr 2025
https://github.com/amazon-science/spot-diff
Project for <SPot-the-Difference Self-Supervised Pre-training for Anomaly Detection and Segmentation> (ECCV 2022)
Last synced: 30 Jun 2025
https://github.com/amazon-science/video-contrastive-learning
Video Contrastive Learning with Global Context, ICCVW 2021
computer-vision contrastive-learning iccv-2021 self-supervised-learning video-understanding
Last synced: 03 May 2025
https://github.com/amazon-science/gan-control
This package provides a pythorch implementation of "GAN-Control: Explicitly Controllable GANs", ICCV 2021.
Last synced: 18 Jul 2025
https://github.com/amazon-science/cceval
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
Last synced: 24 Jul 2025
https://github.com/amazon-science/tanl
Structured Prediction as Translation between Augmented Natural Languages
deep-learning natural-language-processing pytorch
Last synced: 03 May 2025
https://github.com/amazon-science/long-short-term-transformer
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
online-action-detection video-analysis video-transformer
Last synced: 09 Oct 2025
https://github.com/amazon-science/crossnorm-selfnorm
CrossNorm and SelfNorm for Generalization under Distribution Shifts, ICCV 2021
computer-vision domain-adaptation domain-generalization iccv-2021 model-robustness natural-language-processing normalization
Last synced: 03 May 2025
https://github.com/amazon-science/mix-generation
MixGen: A New Multi-Modal Data Augmentation
data-augmentation data-efficiency multimodal pretraining vision-language
Last synced: 03 Jul 2025
https://github.com/amazon-science/mintaka
Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)
Last synced: 03 May 2025
https://github.com/amazon-science/tabsyn
Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space""
Last synced: 20 Sep 2025
https://github.com/amazon-science/wqa_tanda
This repo provides code and data used in our TANDA paper.
Last synced: 27 Jan 2026
https://github.com/amazon-science/exponential-moving-average-normalization
PyTorch implementation of EMAN for self-supervised and semi-supervised learning: https://arxiv.org/abs/2101.08482
computer-vision normalization self-supervised-learning semi-supervised-learning
Last synced: 03 May 2025
https://github.com/amazon-science/meta-q-learning
Code for the paper "Meta-Q-Learning"( ICLR 2020)
deep-learning meta-learning multi-task-learning reinforcement-learning
Last synced: 03 May 2025
https://github.com/amazon-science/glass-text-spotting
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
attention deep-learning detection ocr text-spotting
Last synced: 03 Sep 2025
https://github.com/amazon-science/codesage
CodeSage: Code Representation Learning At Scale (ICLR 2024)
Last synced: 07 Apr 2025
https://github.com/amazon-science/datatuner
Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper
Last synced: 04 Oct 2025
https://github.com/amazon-science/semimtr-text-recognition
Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)
computer-vision consistency-regularization contrastive-learning deep-learning ocr pytorch scene-text-recognition self-supervised-learning semi-supervised-learning text-recognition
Last synced: 14 Jun 2025
https://github.com/amazon-science/auto-rag-eval
Code repo for the ICML 2024 paper "Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation"
evaluation genai llm machine-learning
Last synced: 28 Jun 2025
https://github.com/amazon-science/small-baseline-camera-tracking
A dataset to facilitate the research of Structure-from-Motion (SfM) for movie and TV shows.
Last synced: 03 Mar 2026
https://github.com/amazon-science/bold
Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper
bert bert-model bias fairness-ml gpt-2 language-model nlg nlg-dataset nlp text-generation
Last synced: 12 Feb 2026
https://github.com/amazon-science/omni-detr
PyTorch implementation of Omni-DETR for omni-supervised object detection: https://arxiv.org/abs/2203.16089
object-detection omni-supervised-learning semi-supervised-learning weakly-supervised-learning
Last synced: 09 Oct 2025
https://github.com/amazon-science/fraud-dataset-benchmark
Repository for Fraud Dataset Benchmark
Last synced: 03 May 2025
https://github.com/amazon-science/progressive-coordinate-transforms
Progressive Coordinate Transforms for Monocular 3D Object Detection, NeurIPS 2021
3d-detection kitti-dataset neurips-2021 waymo-open-dataset
Last synced: 03 May 2025
https://github.com/amazon-science/masked-diffusion-lm
Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"
Last synced: 02 Apr 2026
https://github.com/amazon-science/repoformer
Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)
Last synced: 27 Feb 2026
https://github.com/amazon-science/redset
Redset is a dataset containing three months worth of user query metadata that ran on a selected sample of instances in the Amazon Redshift fleet. We provide query metadata for 200 provisioned and serverless instances each.
Last synced: 24 Aug 2025
https://github.com/amazon-science/embert
Code for EmBERT, a transformer model for embodied, language-guided visual task completion.
Last synced: 03 May 2025
https://github.com/amazon-science/qa-dataset-converter
Code from the paper "What do Models Learn from Question Answering Datasets?" (EMNLP 2020)
Last synced: 03 May 2025
https://github.com/amazon-science/probconserv
Datasets and code for results presented in the ProbConserv paper
conservation-laws downstream-tasks partial-differential-equations porous-media-flow shock-capturing uncertainty-quantification
Last synced: 20 Aug 2025
https://github.com/amazon-science/dq-bart
DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022)
Last synced: 15 Oct 2025
https://github.com/amazon-science/semi-vit
PyTorch implementation of Semi-supervised Vision Transformers
machine-learning semi-supervised-learning vision-transformer
Last synced: 17 Oct 2025
https://github.com/amazon-science/gluonmm
A library of transformer models for computer vision and multi-modality research
computer-vision iccv-2021 multimodality pytorch transformer video
Last synced: 03 May 2025
https://github.com/amazon-science/fact-graph
Implementation of the paper "FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations (NAACL 2022)"
abstractive-summarization factuality
Last synced: 03 May 2025
https://github.com/amazon-science/dstc11-track2-intent-induction
DSTC 11 Track 2: Intent Induction from Conversations for Task-Oriented Dialogue
Last synced: 03 May 2025
https://github.com/amazon-science/azcausal
Causal Inference in Python
causal-inference did panel sdid
Last synced: 05 Mar 2026
https://github.com/amazon-science/long-tailed-ood-detection
Official implementation for "Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition" (ICML'22 Long Presentation)
Last synced: 03 May 2025
https://github.com/amazon-science/proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems (https://arxiv.org/abs/2104.07777)
Last synced: 12 Feb 2026
https://github.com/amazon-science/transformers-data-augmentation
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
bart bert bert-model data-augmentation gpt
Last synced: 03 May 2025
https://github.com/amazon-science/amazon-multilingual-counterfactual-dataset
Last synced: 24 Feb 2026
https://github.com/amazon-science/crossmodal-contrastive-learning
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
computer-vision contrastive-learning multi-modality natural-language-processing transformers video video-captioning video-text-retrieval
Last synced: 04 Sep 2025
https://github.com/amazon-science/robust-tableqa
Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer models. 2) LI-RAGE is a robust framework for open-domain TableQA which addresses several limitations. (ACL 2023)
Last synced: 13 Aug 2025
https://github.com/amazon-science/hyperbolic-embeddings
Code for hyperboloid embeddings for knowledge graph entities
Last synced: 07 Mar 2026
https://github.com/amazon-science/recode
Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"
code-generation large-language-models nlp robustness
Last synced: 03 May 2025
https://github.com/amazon-science/llm-code-preference
Training and Benchmarking LLMs for Code Preference.
code-generation llm-evaluation llm-training llms-benchmarking
Last synced: 07 Oct 2025
https://github.com/amazon-science/tubelet-transformer
This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection
action-detection ava jhmdb transformer tubelet-transformer tuber ucf
Last synced: 03 May 2025
https://github.com/amazon-science/replay-based-recurrent-rl
Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"
meta-learning multi-task-learning reinforcement-learning
Last synced: 04 Jul 2025
https://github.com/amazon-science/contraclm
[ACL 2023] Code for ContraCLM: Contrastive Learning For Causal Language Model
contrastive-learning generative-ai gpt-2 llm nlp
Last synced: 28 Jun 2025
https://github.com/amazon-science/unified-ept
A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021
efficient iccv-2021 pyramid semantic-segmentation transformers
Last synced: 03 May 2025
https://github.com/amazon-science/creating-and-correcting-novel-ml-model-errors
Last synced: 03 May 2025
https://github.com/amazon-science/bartgraphsumm
Implementation of the paper "Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters (NAACL 2021)"
Last synced: 19 Jun 2025
https://github.com/amazon-science/peft-design-spaces
Official implementation for "Parameter-Efficient Fine-Tuning Design Spaces"
Last synced: 07 Oct 2025
https://github.com/amazon-science/carbon-assessment-with-ml
CaML: Carbon Footprinting of Household Products with Zero-Shot Semantic Text Similarity
Last synced: 03 May 2025
https://github.com/amazon-science/piperag
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)
Last synced: 19 Sep 2025
https://github.com/amazon-science/multiatis
Data and code for the paper "End-to-End Slot Alignment and Recognition for Cross-Lingual NLU" (Accepted to EMNLP 2020)
Last synced: 03 May 2025
https://github.com/amazon-science/adaslot
Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]
object-centric object-centric-learning
Last synced: 03 May 2025
https://github.com/amazon-science/contextualunderstanding-contrastivedecoding
Enhancing contextual understanding in large language models through contrastive decoding
Last synced: 19 Sep 2025
https://github.com/amazon-science/textadain-robust-recognition
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
deep-learning handwriting-recognition ocr pytorch regularization scene-text-recognition shortcut-learning text-recognition
Last synced: 03 May 2025
https://github.com/amazon-science/c2f-seg
Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).
Last synced: 03 May 2025