Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with foundation-models
A curated list of projects in awesome lists tagged with foundation-models.
https://github.com/hpcaitech/colossalai
Making large AI models cheaper, faster and more accessible
ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism
Last synced: 09 Nov 2024
https://github.com/hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism
Last synced: 27 Oct 2024
https://github.com/haotian-liu/llava
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
chatbot chatgpt foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava multi-modality multimodal vision-language-model visual-language-learning
Last synced: 16 Dec 2024
https://github.com/microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
beit beit-3 bitnet deepnet document-ai foundation-models kosmos kosmos-1 layoutlm layoutxlm llm minilm mllm multimodal nlp pre-trained-model textdiffuser trocr unilm xlm-e
Last synced: 16 Dec 2024
https://github.com/haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
chatbot chatgpt foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava multi-modality multimodal vision-language-model visual-language-learning
Last synced: 25 Oct 2024
https://github.com/luodian/otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
artificial-inteligence chatgpt deep-learning embodied-ai foundation-models gpt-4 instruction-tuning large-scale-models machine-learning multi-modality visual-language-learning
Last synced: 19 Dec 2024
https://github.com/Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
artificial-inteligence chatgpt deep-learning embodied-ai foundation-models gpt-4 instruction-tuning large-scale-models machine-learning multi-modality visual-language-learning
Last synced: 24 Oct 2024
https://github.com/next-gpt/next-gpt
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
chatgpt foundation-models gpt-4 instruction-tuning large-language-models llm multi-modal-chatgpt multimodal visual-language-learning
Last synced: 18 Dec 2024
https://github.com/NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
chatgpt foundation-models gpt-4 instruction-tuning large-language-models llm multi-modal-chatgpt multimodal visual-language-learning
Last synced: 24 Oct 2024
https://github.com/opengvlab/ask-anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
big-model captioning-videos chat chatgpt foundation-models gradio langchain large-language-models large-model stablelm video video-question-answering video-understanding
Last synced: 18 Dec 2024
https://github.com/cluebenchmark/superclue
SuperCLUE: Comprehensive Benchmark for Chinese General-Purpose Large Models | A Benchmark for Foundation Models in Chinese
chatgpt chinese evaluation foundation-models gpt-4
Last synced: 20 Dec 2024
https://github.com/CLUEbenchmark/SuperCLUE
SuperCLUE: Comprehensive Benchmark for Chinese General-Purpose Large Models | A Benchmark for Foundation Models in Chinese
chatgpt chinese evaluation foundation-models gpt-4
Last synced: 28 Oct 2024
https://github.com/OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
big-model captioning-videos chat chatgpt foundation-models gradio langchain large-language-models large-model stablelm video video-question-answering video-understanding
Last synced: 29 Oct 2024
https://github.com/amazon-science/chronos-forecasting
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
artificial-intelligence forecasting foundation-models huggingface huggingface-transformers large-language-models llm machine-learning pretrained-models time-series time-series-forecasting timeseries transformers
Last synced: 19 Dec 2024
https://github.com/baaivision/eva
EVA Series: Visual Representation Fantasies from BAAI
foundation-models representation-learning vision-transformer
Last synced: 19 Dec 2024
https://github.com/baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
foundation-models representation-learning vision-transformer
Last synced: 28 Oct 2024
https://github.com/deepseek-ai/deepseek-vl
DeepSeek-VL: Towards Real-World Vision-Language Understanding
foundation-models vision-language-model vision-language-pretraining
Last synced: 21 Dec 2024
https://github.com/deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
foundation-models vision-language-model vision-language-pretraining
Last synced: 05 Nov 2024
https://github.com/autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
auto-labeling computer-vision deep-learning foundation-models grounding-dino image-annotation image-classification instance-segmentation labeling-tool machine-learning model-distillation multimodal object-detection pytorch segment-anything yolov5 yolov8
Last synced: 17 Dec 2024
https://github.com/kaiyangzhou/coop
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
foundation-models multimodal-learning prompt-learning
Last synced: 20 Dec 2024
https://github.com/baaivision/emu
Emu Series: Generative Multimodal Models from BAAI
foundation-models generative-pretraining-in-multimodality in-context-learning instruct-tuning multimodal-generalist multimodal-pretraining
Last synced: 19 Dec 2024
https://github.com/baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
foundation-models generative-pretraining-in-multimodality in-context-learning instruct-tuning multimodal-generalist multimodal-pretraining
Last synced: 26 Nov 2024
https://github.com/KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
foundation-models multimodal-learning prompt-learning
Last synced: 27 Oct 2024
https://github.com/tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
deep-learning evaluation foundation-models instruction-following large-language-models leaderboard nlp rlhf
Last synced: 17 Dec 2024
https://github.com/OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
action-recognition benchmark contrastive-learning foundation-models instruction-tuning masked-autoencoder multimodal open-set-recognition self-supervised spatio-temporal-action-localization temporal-action-localization video-clip video-data video-dataset video-question-answering video-retrieval video-understanding vision-transformer zero-shot-classification zero-shot-retrieval
Last synced: 28 Oct 2024
https://github.com/time-series-foundation-models/lag-llama
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
forecasting foundation-models lag-llama llama time-series time-series-forecasting time-series-prediction time-series-transformer timeseries timeseries-forecasting transformers
Last synced: 19 Dec 2024
https://github.com/deepseek-ai/janus
Janus-Series: Unified Multimodal Understanding and Generation Models
any-to-any foundation-models llm multimodal unified-model vision-language-pretraining
Last synced: 20 Dec 2024
https://github.com/deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
any-to-any foundation-models llm multimodal unified-model vision-language-pretraining
Last synced: 06 Dec 2024
https://tatsu-lab.github.io/alpaca_eval/
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
deep-learning evaluation foundation-models instruction-following large-language-models leaderboard nlp rlhf
Last synced: 28 Oct 2024
https://github.com/OFA-Sys/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
audio-language contrastive-loss foundation-models multimodal representation-learning vision-and-language vision-language vision-transformer
Last synced: 29 Nov 2024
https://github.com/mlmed/torchxrayvision
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
chest-radiographs chest-xray chest-xray-images cxr cxr-images dataset deep-learning foundation-models image-classification machine-learning medical medical-ai medical-application medical-image-analysis medical-image-processing medical-imaging pytorch torchxrayvision transfer-learning
Last synced: 15 Nov 2024
https://github.com/hazyresearch/meerkat
Creative interactive views of any dataset.
data-science foundation-models machine-learning ml pandas
Last synced: 15 Dec 2024
https://github.com/HazyResearch/meerkat
Creative interactive views of any dataset.
data-science foundation-models machine-learning ml pandas
Last synced: 29 Oct 2024
https://github.com/NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
ade20k backbone coco deep-learning foundation-models image-classification image-net object-detection pre-trained-model self-attention semantic-segmentation vision-transformer visual-recognition
Last synced: 28 Oct 2024
https://github.com/mrgiovanni/modelsgenesis
[MICCAI 2019 Young Scientist Award] [MEDIA 2020 Best Paper Award] Models Genesis
3d-model fine-tuning foundation-models pre-trained-model representation-learning self-supervised-learning transfer-learning
Last synced: 16 Dec 2024
https://github.com/zjunlp/KnowledgeEditingPapers
[Knowledge Editing] Must-read Papers on Knowledge Editing for Large Language Models.
awsome-list easyedit foundation-models knowledge-editing knowlm large-language-models model-editing natural-language-processing paper paper-list pre-trained-language-models pre-trained-model review rome survey
Last synced: 02 Nov 2024
https://github.com/hazyresearch/hyena-dna
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
foundation-models genomics language-models
Last synced: 15 Dec 2024
https://github.com/OpenRobotLab/PointLLM
[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds
3d chatbot foundation-models gpt-4 large-language-models llama multimodal objaverse point-cloud pointllm representation-learning vision-and-language
Last synced: 28 Oct 2024
https://github.com/foundationvision/groma
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
foundation-models grounding large-language-models llama llama2 llm mllm multimodal vision-language-model
Last synced: 21 Dec 2024
https://github.com/baaivision/tokenize-anything
[ECCV 2024] Tokenize Anything via Prompting
foundation-models multimodal promptable representation-learning
Last synced: 21 Dec 2024
https://github.com/mbzuai-oryx/groundinglmm
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks [CVPR 2024].
foundation-models llm-agent lmm vision-and-language vision-language-model
Last synced: 21 Dec 2024
https://mbzuai-oryx.github.io/groundingLMM/
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks [CVPR 2024].
foundation-models llm-agent lmm vision-and-language vision-language-model
Last synced: 30 Nov 2024
https://github.com/zubair-irshad/awesome-robotics-3d
A curated list of 3D vision papers relating to the robotics domain in the era of large models, i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, code, and related websites
3d benchmarks computer-vision diffusion-models foundation-models gaussian-splatting grasping llm manipulation navigation nerf pointclouds policy-learning pretraining robotics scene-graph simulations vision-language-model vlm
Last synced: 16 Nov 2024
https://github.com/baaivision/uni3d
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
3d-representation-learning foundation-models vision-transformers
Last synced: 15 Dec 2024
https://github.com/baaivision/Uni3D
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
3d-representation-learning foundation-models vision-transformers
Last synced: 28 Oct 2024
https://github.com/azure/gen-cv
Vision AI Solution Accelerator
azure-computer-vision cognitive-search-vector-store dalle-3 embeddings florence foundation-models generative-computer-vision image-search stable-diffusion
Last synced: 21 Dec 2024
https://github.com/Azure/gen-cv
Vision AI Solution Accelerator
azure-computer-vision cognitive-search-vector-store dalle-3 embeddings florence foundation-models generative-computer-vision image-search stable-diffusion
Last synced: 07 Nov 2024
https://github.com/vitae-transformer/remote-sensing-rvsa
The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"
deep-learning foundation-model foundation-models object-detection pytorch remote-sensing remote-sensing-foundation-model scene-classification self-supervised-learning semantic-segmentation transfer-learning vision-transformer
Last synced: 15 Dec 2024
https://github.com/mims-harvard/units
A unified multi-task time series model.
anomaly-detection classification ecg eeg few-shot forecasting foundation-models imputation multi-task prompt-tuning time-series unified-model zero-shot
Last synced: 02 Nov 2024
https://github.com/MMMU-Benchmark/MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
computer-vision deep-learning deep-neural-networks evaluation foundation-models large-language-models large-multimodal-models llm llms machine-learning multimodal multimodal-deep-learning multimodal-learning multimodality natural-language-processing question-answering stem visual-question-answering
Last synced: 08 Nov 2024
https://github.com/wisconsinaivision/vip-llava
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
chatbot clip cvpr2024 foundation-models gpt-4 gpt-4-vision llama llama2 llava multi-modal vision-language visual-prompting
Last synced: 15 Dec 2024
https://github.com/microsoft/aurora
Implementation of the Aurora model for Earth system forecasting
atmospheric-chemistry atmospheric-dynamics aurora-model deep-learning foundation-models
Last synced: 21 Dec 2024
https://github.com/Haiyang-W/GiT
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
foundation-models perception transformer unified vision-and-language vision-transformer
Last synced: 28 Oct 2024
https://github.com/FuxiaoLiu/LRV-Instruction?tab=readme-ov-file
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
chatgpt evaluation evaluation-metrics foundation-models gpt gpt-4 hallucination iclr iclr2024 llama llava multimodal object-detection prompt-engineering vicuna vision vision-and-language vqa
Last synced: 01 Nov 2024
https://github.com/huangwl18/language-planner
Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents"
artificial-intelligence codex deep-learning embodied-ai foundation-models gpt-3 in-context-learning knowledge-extraction language-model planning transformers
Last synced: 07 Nov 2024
https://github.com/Psycoy/MixEval
The official evaluation suite and dynamic data release for MixEval.
benchmark benchmark-mixture benchmarking-framework benchmarking-suite evaluation evaluation-framework foundation-models large-language-model large-language-models large-multimodal-models llm-evaluation llm-evaluation-framework llm-inference mixeval
Last synced: 16 Nov 2024
https://github.com/aws-samples/foundation-model-benchmarking-tool
Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.
bedrock benchmark benchmarking evaluation-metrics foundation-models g5 g6 g6e generative-ai inferentia llama2 llama3 p4d p5 sagemaker trainium
Last synced: 21 Dec 2024
https://github.com/azure/intelligent-app-workshop
Immersive workshop showcasing the remarkable potential of integrating SoTA foundation models to enhance product experiences and streamline backend workflows. Leverages Microsoft's Copilot stack, Semantic Kernel and Azure primitives to offer an engaging and comprehensive introduction to AI-infused app development and deployment
ai foundation-models gpt-35-turbo intelligent-agents intelligent-app llm ml mlops prompt-engineering semantic-kernel
Last synced: 21 Dec 2024
https://github.com/Azure/intelligent-app-workshop
Immersive workshop showcasing the remarkable potential of integrating SoTA foundation models to enhance product experiences and streamline backend workflows. Leverages Microsoft's Copilot stack, Semantic Kernel and Azure primitives to offer an engaging and comprehensive introduction to AI-infused app development and deployment
ai foundation-models gpt-35-turbo intelligent-agents intelligent-app llm ml mlops prompt-engineering semantic-kernel
Last synced: 04 Nov 2024
https://github.com/xyzforever/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
action-recognition bert deep-learning foundation-models masked-autoencoder pytorch self-supervised-learning video-representation-learning video-understanding
Last synced: 28 Nov 2024
https://github.com/zjysteven/lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, qwen-vl, qwen2-vl, phi3-v etc.
finetuning foundation-models instruction-tuning large-language-model large-multimodal-models llava llava-next multimodal multimodal-large-language-models qwen-vl vision-language visual-instruction-tuning
Last synced: 21 Dec 2024
https://github.com/om-ai-lab/rs5m
RS5M: a large-scale vision language dataset for remote sensing
foundation-models remote-sensing vision-and-language
Last synced: 06 Nov 2024
https://github.com/om-ai-lab/RS5M
RS5M: a large-scale vision language dataset for remote sensing
foundation-models remote-sensing vision-and-language
Last synced: 05 Nov 2024
https://github.com/yunqing-me/AttackVLM
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
adversarial-attack deep-generative-model foundation-models generative-ai image-to-text-generation large-language-models text-to-image-generation trustworthy-ai vision-language-model
Last synced: 02 Dec 2024
https://github.com/westlake-repl/MicroLens?tab=readme-ov-file
A Large Short-video Recommendation Dataset with Raw Text/Audio/Image/Videos (Talk Invited by DeepMind).
audio-recommendation foundation-models image-recommendation large large-language-models llm llm-recommendation short-video text-recommendation video video-generation video-generation-dataset video-recommendation video-understanding video-understanding-dataset
Last synced: 16 Nov 2024
https://github.com/vitae-transformer/rsp
The official repo for [TGRS'22] "An Empirical Study of Remote Sensing Pretraining"
change-detection classification deep-learning foundation-models imagenet object-detection pre-training remote-sensing semantic-segmentation transfer-learning
Last synced: 15 Dec 2024
https://github.com/westlake-repl/MicroLens
A Large Short-video Recommendation Dataset with Raw Text/Audio/Image/Videos (Talk Invited by DeepMind).
audio-recommendation foundation-models image-recommendation large large-language-models llm llm-recommendation short-video text-recommendation video video-generation video-generation-dataset video-recommendation video-understanding video-understanding-dataset
Last synced: 15 Nov 2024
https://github.com/vitae-transformer/mtp
The official repo for "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
change-detection classification deep-learning foundation-models object-detection pre-training remote-sensing semantic-segmentation transfer-learning
Last synced: 24 Nov 2024
https://github.com/ViTAE-Transformer/MTP
The official repo for "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
change-detection classification deep-learning foundation-models object-detection pre-training remote-sensing semantic-segmentation transfer-learning
Last synced: 05 Nov 2024
https://github.com/OxWearables/ssl-wearables
Self-supervised learning for wearables using the UK-Biobank (>700,000 person-days)
accelerometer deep-learning foundation-models human-activity-recognition pytorch self-supervised-learning wearable
Last synced: 06 Nov 2024
https://github.com/mazurowski-lab/finetune-SAM
This is an official repo for fine-tuning SAM to customized medical images.
finetune foundation-models medical-imaging sam
Last synced: 30 Nov 2024
https://github.com/aim-harvard/foundation-cancer-image-biomarker
[Nature Machine Intelligence 2024] Code and evaluation repository for the paper
cancer-imaging-research foundation-models medical-imaging representation-learning simclr
Last synced: 21 Dec 2024
https://github.com/salute-developers/gigaam
Foundational Model for Speech Recognition Tasks
emotion-recognition foundation-models self-supervised-learning speech-recognition
Last synced: 10 Nov 2024
https://github.com/microsoft/dpsda
Private Evolution: Generating DP Synthetic Data without Training [ICLR 2024, ICML 2024]
differential-privacy foundation-models private-evolution synthetic-data training-free
Last synced: 17 Dec 2024
https://github.com/microsoft/DPSDA
[ICLR 2024] Generating DP Synthetic Data without Training
differential-privacy foundation-models synthetic-data training-free
Last synced: 05 Nov 2024
https://github.com/sayakpaul/robustness-foundation-models
This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.
foundation-models representation-learning robustness
Last synced: 09 Nov 2024
https://github.com/ashleykleynhans/llava-docker
Docker image for LLaVA: Large Language and Vision Assistant
ai chatbot chatgpt docker docker-image foundation-models gpt-4 instruction-tuning llama llama-2 llama2 llava llm multimodal runpod vision-language-model visual-language-learning
Last synced: 25 Nov 2024
https://github.com/jieyuz2/taskmeanything
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.
benchmark evaluation foundation-models
Last synced: 18 Dec 2024
https://github.com/yasserben/CLOUDS
[CVPR 2024] Official Implementation of Collaborating Foundation models for Domain Generalized Semantic Segmentation
deep-learning detectron2 domain-adaptation domain-generalization foundation-models mask2former semantic-segmentation transformer
Last synced: 30 Nov 2024
https://github.com/SaberaTalukder/TOTEM
The official code 👩‍💻 for TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis
foundation-models representation-learning time-series time-series-analysis time-series-anomaly-detection time-series-forecasting time-series-foundation-model time-series-imputation tokenization
Last synced: 30 Aug 2024
https://github.com/kaiko-ai/eva
Evaluation framework for oncology foundation models (FMs)
evaluation-framework foundation-models machine-learning oncology
Last synced: 13 Nov 2024
https://github.com/rhysdg/vision-at-a-clip
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
clip foundation-models grounding-dino machine-learning onnx siglip tensorrt zero-shot-classification zero-shot-object-detection
Last synced: 27 Oct 2024
https://github.com/neuraloperator/coda-no
Codomain attention neural operator for single to multi-physics PDE adaptation.
foundation-models neural-operator pde-solver
Last synced: 09 Nov 2024
https://github.com/pnnl/cactus
LLM Agent that leverages cheminformatics tools to provide informed responses.
cheminformatics chemistry foundation-models llm llm-agent nlp science
Last synced: 25 Nov 2024
https://github.com/noodlefrenzy/promptgen
CLI for managing and generating Foundation Model prompts
chatgpt foundation-models gpt llms midjourney stable-diffusion
Last synced: 07 Nov 2024
https://github.com/wkentaro/yolo-world-onnx
ONNX models of YOLO-World (an open-vocabulary object detector).
computer-vision deep-learning foundation-models object-detection
Last synced: 08 Nov 2024
https://github.com/build-on-aws/amazon-bedrock-with-builder-and-command-patterns
A simple yet powerful implementation in Java that lets developers write straightforward code to create the API requests for the different foundation models supported by Amazon Bedrock.
amazon bedrock builder-pattern command-pattern foundation-models generative-ai java llm
Last synced: 07 Nov 2024
https://github.com/fedebotu/green-planet-transformers-3
MelXior: a Neural Weather Forecasting app distilling knowledge from models including GPT-3, DALL-E and FourCastNet
climate dalle2 foundation-models fourcastnet gpt3 hackaton openai transformers weather
Last synced: 06 Nov 2024
https://github.com/techthoughts2/pwshbedrock
pwshBedrock is a PowerShell module designed to simplify interaction with Amazon Bedrock foundation models. It enables users to send messages, retrieve responses, manage conversation contexts, generate images, and estimate costs. Supporting both InvokeModel and Converse API, it streamlines AI integration in PowerShell workflows.
ai ai21labs amazon-bedrock amazon-titan amazon-web-services anthropic-claude aws claude-3 cohere command-r command-r-plus foundation-models generative-ai jamba large-language-models meta-llama3 mistral-ai powershell powershell-module stability-ai
Last synced: 03 Dec 2024
https://github.com/jiayuww/spatialeval
[NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs
claude foundation-models gemini gpt-4o gpt-4v large-language-models llama3 machine-learning multimodal-deep-learning reasoning spatial-reasoning vision-language-models
Last synced: 03 Dec 2024
https://github.com/ycheng517/awesome-foundation-model-ros
A collection of ROS projects utilizing foundation models.
awesome awesome-list foundation-models robotics ros ros2
Last synced: 29 Oct 2024
https://github.com/rituyadav92/context-aware-change-detection-with-semi-supervised-learning_igarss23
Context Aware Change Detection With Semi Supervised Learning
change-detection flooding foundation-models landslide
Last synced: 30 Nov 2024
https://github.com/superbrucejia/awesome-semantic-textual-similarity
Awesome Semantic Textual Similarity: a curated list of Semantic Textual Similarity in Large Language Models and NLP
foundation-models large-language-models prompt-engineering prompt-similarity prompt-toolkit semantic-preserving-transformation semantic-similarity semantic-similarity-measures semantic-textual-similarity
Last synced: 09 Nov 2024
https://github.com/mbari-org/aipipeline
Library for running detection, clustering, or classification AI pipelines, plus performance monitoring
foundation-models image-classification object-detection
Last synced: 10 Dec 2024
https://github.com/mbari-org/fastapi-vss
RESTful API for vector similarity search. It uses the Python web framework FastAPI. This accelerates machine learning workflows that require vector similarity search using foundational models.
fastapi foundation-models image-classification vision-transformer
Last synced: 10 Dec 2024
https://github.com/microsoft/mattersim
MatterSim: A deep learning atomistic model across elements, temperatures and pressures.
ai4materials ai4science computational-materials-science foundation-models machine-learning machine-learning-force-field materials-science mlff
Last synced: 03 Dec 2024
https://github.com/agora-lab-ai/forestnet
A Deep Learning Framework for Quantifying Collective Forest Intelligence Through Multi-Variable Temporal-Spatial Analysis
ai amazon amazonforests bioinformatics biology biologyai bioml collective-behavior collective-intelligence forest-ai forests foundation-models greenery greenery-ai ml swarms
Last synced: 10 Nov 2024
https://github.com/superbrucejia/gsm8k-consistency
GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.
arithmetic-consistency arithmetic-reasoning factual-consistency foundation-models grade grade-school-math gsm8k large-language-models logical-consistency mathematical-reasoning prompt prompt-engineering prompt-perturbation prompt-toolkit reasoning self-consistency self-consistency-benchmark semantics-consistency semantics-preserving-transformations semantics-similar
Last synced: 09 Nov 2024
https://github.com/superbrucejia/awesome-mixture-of-experts
Awesome Mixture of Experts (MoE): A Curated List of Mixture of Experts (MoE) and Mixture of Multimodal Experts (MoME)
artificial-intelligence expert-network foundation-models gating-network large-language-model large-language-models large-vision-language-models llms llms-benchmarking llms-reasoning load-balancing mixtrure-of-multimodal-experts mixture-of-experts moe mome multimodal-learning sparse sparse-mixture-of-experts sparse-mixture-of-multimodal-experts sparse-moe
Last synced: 09 Nov 2024
https://github.com/himanshuvnm/foundation-model-large-language-model-fm-llm
This repository covers key tasks that underpin modern Generative AI concepts. In particular, it focuses on three coding exercises with Large Language Models. Further necessary details are given in the README.md file.
attention-is-all-you-need aws fine-tuning flan-t5 foundation-models generative-ai hate-speech-detection huggingface huggingface-transformers large-language-models lora low-rank-ada ml-m5-2xlarge peft-fine-tuning-llm python3 pytorch rlhf rnn-pytorch
Last synced: 06 Nov 2024