Projects in Awesome Lists tagged with foundation-model

A curated list of projects in awesome lists tagged with foundation-model.

https://github.com/guardrails-ai/guardrails

Adding guardrails to large language models.

ai foundation-model gpt-3 llm openai

Last synced: 16 Mar 2026

https://github.com/OpenGVLab/InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

chatgpt click draggan foundation-model gpt gpt-4 gradio husky image-captioning imagebind internimage langchain llama llm multimodal sam segment-anything vicuna video-generation vqa

Last synced: 27 Mar 2025

https://github.com/opengvlab/interngpt

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

chatgpt click draggan foundation-model gpt gpt-4 gradio husky image-captioning imagebind internimage langchain llama llm multimodal sam segment-anything vicuna video-generation vqa

Last synced: 14 May 2025

https://github.com/opengvlab/internimage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

backbone deformable-convolution foundation-model object-detection semantic-segmentation

Last synced: 10 Apr 2025

https://github.com/OpenGVLab/InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

backbone deformable-convolution foundation-model object-detection semantic-segmentation

Last synced: 20 Mar 2025

https://github.com/idea-research/grounding-dino-1.5-api

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

foundation-model grounding-dino object-detection open-set open-vocabulary-detection open-world zero-shot-object-detection

Last synced: 14 Apr 2025

https://github.com/IDEA-Research/Grounding-DINO-1.5-API

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

foundation-model grounding-dino object-detection open-set open-vocabulary-detection open-world zero-shot-object-detection

Last synced: 27 Sep 2025

https://github.com/opendrivelab/driveagi

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

autonomous-driving embodied-ai foundation-model general-artificial-intelligence large-dataset policy-learning video-dataset video-generation world-models

Last synced: 15 May 2025

https://github.com/ailab-cvc/seed

Official implementation of SEED-LLaMA (ICLR 2024).

foundation-model multimodal vision-language

Last synced: 09 Apr 2025

https://github.com/OpenDriveLab/DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

autonomous-driving embodied-ai foundation-model general-artificial-intelligence policy-learning

Last synced: 20 Mar 2025

https://github.com/Clay-foundation/model

The Clay Foundation Model - An open source AI model and interface for Earth

digital-elevation-model earth-observation embeddings foundation-model sentinel-1 sentinel-2

Last synced: 24 Sep 2025

https://clay-foundation.github.io/model/

The Clay Foundation Model - An open source AI model and interface for Earth

digital-elevation-model earth-observation embeddings foundation-model sentinel-1 sentinel-2

Last synced: 01 Aug 2025

https://cambridgeltl.github.io/visual-med-alpaca/

Visual Med-Alpaca is an open-source, multi-modal foundation model designed specifically for the biomedical domain, built on LLaMA-7B.

biomedical biomedical-image-processing foundation-model large-language-models multimodal

Last synced: 12 May 2025

https://github.com/opendrivelab/openscene

3D Occupancy Prediction Benchmark in Autonomous Driving

3d-occupancy autonomous-driving foundation-model

Last synced: 05 Apr 2025

https://github.com/OpenDriveLab/OpenScene

3D Occupancy Prediction Benchmark in Autonomous Driving

3d-occupancy autonomous-driving foundation-model

Last synced: 20 Mar 2025

https://github.com/spotify-research/llark

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

foundation-model multimodal music-information-retrieval

Last synced: 17 Mar 2025

https://github.com/chao1224/MoleculeSTM

Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)

clip computation-chemistry drug-discovery editing foundation-model molecule-editing moleculeclip moleculestm pretraining retrieval

Last synced: 09 May 2025

https://github.com/chao1224/moleculestm

Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)

clip computation-chemistry drug-discovery editing foundation-model molecule-editing moleculeclip moleculestm pretraining retrieval

Last synced: 13 Apr 2025

https://github.com/zhanghm1995/Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

3dgs adaptation autonomous-driving diffusion end-to-end-autonomous-driving foundation-model large-language-models nerf pre-training survey world-models

Last synced: 24 Jul 2025

https://github.com/med-air/Endo-FM

[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train

endoscopy foundation-model large-scale miccai2023 pre-train self-supervised video

Last synced: 16 Mar 2025

https://github.com/mahmoodlab/mil-lab

Feather - Lightweight supervised slide foundation models (ICML 2025)

deep-learning foundation-model histology pathology whole-slide-image

Last synced: 16 Feb 2026

https://github.com/naver/dune

Code repository for "DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers"

computer-vision foundation-model image-encoder knowledge-distillation vision-transformer

Last synced: 04 Apr 2026

https://github.com/sap-samples/btp-cap-genai-rag

Explore this repository for GenAI samples on SAP Business Technology Platform (SAP BTP). We provide examples for single and multitenant versions, showcasing integration of LLMs via SAP AI Core, LangChain in SAP CAP, and advanced techniques like Retrieval Augmented Generation (RAG).

4371 ai-core btp-use-case-factory cloud-foundry foundation-model genai generative-ai gpt hana kyma llm openai rag saas sample sample-code sap-btp sap-cap typescript vector-engine

Last synced: 31 Mar 2025

https://github.com/alan-turing-institute/robots-in-disguise

Information and materials for the Turing's "robots-in-disguise" reading group on fundamental AI research.

deep-learning diffusion-models foundation-model hut23 language-models large-language-models machine-learning nlp transformers

Last synced: 21 Aug 2025

https://github.com/BoevaLab/CancerFoundation

CancerFoundation: A single-cell RNA sequencing foundation model to decipher drug resistance in cancer

cancer foundation-model single-cell

Last synced: 09 May 2026

https://github.com/yjyddq/eoser-ass-rl

Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step"

foundation-model masked-diffusion-large-language-model reinforcement-learning

Last synced: 09 Oct 2025

https://github.com/automl/tempopfn

Official code release for the paper "TempoPFN: Synthetic Pre-training of Linear RNNs for Zero-shot Time Series Forecasting"

foundation-model synthetic-data-generation time-series-forecasting

Last synced: 28 Jan 2026

https://github.com/11yxk/SAM-LST

PyTorch implementation of the paper "Ladder Fine-tuning approach for SAM integrating complementary network".

fine-tuning foundation-model multi-organ-segmentation segment-anything

Last synced: 24 Jul 2025

https://github.com/thomasgust/molecumixer

Very incomplete right now, pretrained ARGVAET system for generating, classifying, and predicting the properties of molecules. I couldn't upload the dataset or checkpoints due to size constraints.

argvaet bioinformatics foundation-model generative-ai generative-pretraining gnn molecule neural-network pretraining pytorch rdkit

Last synced: 23 Oct 2025

https://github.com/itrummer/naturalminer

Mine data for patterns described in natural language

data-mining data-science foundation-model language-model nlp

Last synced: 07 Mar 2026

https://github.com/garystafford/genai_fiction_summary

Mastering Long Document Insights: Advanced Summarization with Amazon Bedrock and Anthropic Claude Foundation Model

anthropic-claude foundation-model generative-ai text-summarization

Last synced: 27 Mar 2025

https://github.com/chansigit/scgpt-modern

A drop-in modernization of bowang-lab/scGPT for Python 3.12 + torch 2.6 + flash-attn 3 (H100 sm_90a native). Original pretrained weights load unmodified — compatible, more modern, faster.

bioinformatics deep-learning flash-attention foundation-model genomics h100 hopper llm pytorch rna-seq scgpt single-cell single-cell-genomics transcriptomics

Last synced: 29 Apr 2026

https://github.com/pointcloudyc/Industrial3D

Industrial3D: A Terrestrial LiDAR Point Cloud Dataset and Cross-Paradigm Benchmark for Industrial Infrastructure

benchmark digital-construction foundation-model point-cloud scan-to-bim unsupervised-segmentation weakly-supervised-segmentation

Last synced: 20 Apr 2026