Projects in Awesome Lists tagged with multimodal-data
A curated list of projects in awesome lists tagged with multimodal-data .
https://github.com/scverse/muon
muon is a multimodal omics Python framework
anndata cite-seq mudata multi-omics multimodal-data multimodal-omics-analysis muon scanpy scatac-seq scrna-seq scverse
Last synced: 07 Jul 2025
https://github.com/google/space
Unified storage framework for the entire machine learning lifecycle
apache-arrow apache-parquet data-warehouse dataops dataset dml lakehouse machine-learning mlops multimodal multimodal-data olap ray tensorflow tensorflow-dataset
Last synced: 14 Jan 2026
https://github.com/machine-intelligence-laboratory/TopicNet
Interface for easier topic modelling.
bigartm-library custom-score document-representation modalities multimodal-data multimodal-learning pypi topic-modeling topic-modelling
Last synced: 03 May 2025
https://github.com/kyegomez/EXA-1
An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!
artificial-intelligence dataset gpt4 jax kosmos large-dataset large-language-models multimodal multimodal-data multimodality pytorch pytorch-implementation triton
Last synced: 28 Mar 2025
https://github.com/paccmann/fdsa
A fully differentiable set autoencoder
deep-learning multimodal-data set-autoencoder
Last synced: 03 Oct 2025
https://github.com/aclai-lab/soledata.jl
Manage logical datasets!
machine-learning multimodal-data unstructured-data
Last synced: 07 Jan 2026
https://github.com/dhchenx/mmkit-features
A multimodal architecture to build multimodal knowledge graphs with flexible multimodal feature extraction and dynamic multimodal concept generation
multimodal-data multimodal-feature multimodal-knowledge-graph
Last synced: 24 Jul 2025
https://github.com/pdx-labs/pdx
Prompt Engineering and Dev-Ops toolkit for applications powered by Language Models
anthropic anthropic-claude cohere gpt-3 gpt-4 llm llmops llms multimodal multimodal-data openai prompt prompt-engineering prompt-toolkit
Last synced: 06 Oct 2025
https://github.com/eurus-holmes/tumor2graph
Tumor2Graph: a novel Overall-Tumor-Profile-derived virtual graph deep learning for predicting tumor typing and subtyping.
cluster cnn gcn gnn graph-deep-learning multimodal-data multimodal-graphs tcga-data
Last synced: 02 May 2025
https://github.com/kyegomez/odin
SOTA Classification at scale for UAVs, Drones, and much more
computer-vision multimodal multimodal-data multimodal-deep-learning swarm-intelligence
Last synced: 19 Aug 2025
https://github.com/laminetourelab/drug-discovery
I am working on the discovery of new potential therapeutic tagets using Machine learning on multi-omics data.
autoencoder-classification deep-learning drug-discovery graph-convolutional-networks longitudinal-data machine-learning multimodal-data neural-network ngs-pipeline pytorch regularization-methods target
Last synced: 18 Jun 2025
https://github.com/fork123aniket/multi-round-vlm-powered-multimodal-conversational-ai-navigation-bot
Streamlit App Combining Vision, Language, and Audio AI Models
conversational-agent conversational-ai conversational-bot conversational-interface generative-ai internvl internvl2 multimodal multimodal-data multimodal-deep-learning multimodal-large-language-models multimodal-learning vision-language vision-language-learning vision-language-model vision-language-models vision-language-navigation vision-language-transformer
Last synced: 19 Feb 2026
https://github.com/sitamgithub-msit/streamlit-app-builder
A Streamlit-based AI assistant generates custom Streamlit app code from user-provided images or text using the Google Gemini model.
code-generation gemini-15-pro generative-ai llm-tracing multimodal-data multimodal-large-language-models python streamlit wandb weave
Last synced: 07 May 2025
https://github.com/mims-harvard/optimuskg
A modern multimodal knowledge graph with type-specific metadata across biomedical domains.
biomedical graph-ai heterogeneous-graphs knowledge-graph multimodal-ai multimodal-data neo4j ontology python
Last synced: 02 May 2026
https://github.com/fork123aniket/agentic-rag-story-generation-with-multimodal-genai
Multimodal Agentic GenAI Workflow – Seamlessly blends retrieval and generation for intelligent storytelling
agentic-ai agentic-rag agentic-workflow generative-ai generative-ai-model internvl2 multimodal multimodal-data multimodal-deep-learning multimodal-large-language-models multimodal-learning story-generation vision-language vision-language-learning vision-language-model vision-language-transformer
Last synced: 14 Oct 2025
https://github.com/sitamgithub-msit/vidiqa
VidiQA: Demo for MiniCPM-V 2.6 to answer questions about videos using natural language.
gradio gradio-interface huggingface-spaces huggingface-transformers minicpm-v multilingual-models multimodal-data multimodal-deep-learning python question-answering
Last synced: 07 May 2025
https://github.com/sitamgithub-msit/well-being
Reducing neonatal and under-5 mortality rates via an AI-driven awareness platform with a Gradio app, Gemini API integration, and essential project utilities. #AIForGood
artificial-intelligence chatbot gemini-15-pro gemini-api generative-ai gradio huggingface-spaces multimodal-data multimodal-large-language-models
Last synced: 15 Jun 2025
https://github.com/mdh266/speech2image
A Streamlit App For Speech To Image
docker generative-ai google-api google-cloud multimodal-data replicate-api speech-to-text streamlit
Last synced: 09 May 2026
https://github.com/distant-viewing/dvscripts
Tutorials and scripts for doing computational analysis with visual and multimodal data
computer-vision multimodal-data python scripts sound-analysis textbook
Last synced: 16 Oct 2025