Projects in Awesome Lists tagged with grounding
A curated list of projects in awesome lists tagged with grounding.
https://github.com/simular-ai/Agent-S
Agent S: an open agentic framework that uses computers like a human
agent-computer-interface ai-agents computer-automation computer-use grounding gui-agents in-context-reinforcement-learning memory mllm planning retrieval-augmented-generation
Last synced: 07 May 2025
https://github.com/mees/calvin
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
computer-vision deep-learning grounding manipulation natural-language-processing pytorch robotics vision vision-and-language vision-language
Last synced: 18 Jan 2026
https://github.com/foundationvision/groma
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
foundation-models grounding large-language-models llama llama2 llm mllm multimodal vision-language-model
Last synced: 04 Apr 2025
https://github.com/BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curation, in a standardized general environment with minimal requirements.
ai ai-agent ai-agents-framework computer-control cradle foundation-agent gcc general-computer-control generative-ai grounding large-language-models llm lmm multimodality personoid vision-language-model vlm
Last synced: 07 May 2025
https://github.com/allenai/lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
decision-making grounding language-agent maths planning question-answering reasoning web-agent
Last synced: 13 Oct 2025
https://github.com/mbzuai-oryx/video-llava
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
grounding llm lmm transcription video video-conversation video-grounding
Last synced: 10 Apr 2025
https://github.com/mbzuai-oryx/Video-LLaVA
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
grounding llm lmm transcription video video-conversation video-grounding
Last synced: 24 Jul 2025
https://github.com/mees/hulc2
[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data
computer-vision deep-learning grounding manipulation natural-language-processing pytorch robotics vision vision-and-language vision-language
Last synced: 18 Jan 2026
https://github.com/manuvarkey/earthing
A Python library for the design of earthing networks in electrical substations.
earth-grid earthing electrical-engineering grounding grounding-grid ieee-80 ieee80 substation substation-earthing substation-grounding
Last synced: 13 Apr 2025
https://github.com/cennest/ground-cite
An open-source Python package to strengthen Gemini citations: validated, controlled, reliable, and structured citations
broken-citations brokengeminicitations gemini gemini-404-urls gemini-invalid-urls googlegrounding grounding invalid-citations irrelevant-citations irrelevantgeminicitations missinggeminicitations structured-citations
Last synced: 14 Jan 2026
https://github.com/phanxuanquang/gemini.net
A lightweight SDK enabling integration of Gemini for .NET in a simple way
csharp dotnet dotnet-core gemini gemini-2-0-flash gemini-api gemini-net grounding net8 sdk
Last synced: 03 Apr 2025
https://github.com/yaniv-golan/proof-engine
AI agent skill that creates formal, verifiable proofs of claims — every fact computed or cited, never asserted
agent-skills ai-agent chatgpt citations claude claude-code-plugin cursor fact-checking formal-verification grounding hallucination-prevention llm python reproducible-research trustworthy-ai verification
Last synced: 25 Apr 2026
https://github.com/coffeevampir3/grounding-sam2-demo
A simple demo for using the Grounding DINO and Segment Anything v2 models together
demo-app gradio grounding grounding-dino machine-learning segment segment-anything segmentation
Last synced: 14 Apr 2025
https://github.com/gammahazard/locate-anything
Sleek, mobile-friendly web UI for NVIDIA LocateAnything-3B — open-vocabulary object detection & grounding on your own GPU, via one docker compose up.
bounding-boxes computer-vision cuda docker fastapi gpu grounding locate-anything machine-learning nvidia object-detection ocr open-vocabulary-detection react self-hosted tailwindcss typescript vision-language-model web-ui
Last synced: 28 May 2026
https://github.com/lazauk/aoai-cognitivesearch-azd
Adapting original Azure OpenAI sample from https://github.com/Azure-Samples/azure-search-openai-demo for newer GPT4-compatible "Chat Completion" syntax.
ai azure chatgpt cognitive-search gpt-4 grounding openai
Last synced: 05 May 2026