An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by modelscope

A curated list of projects in awesome lists by modelscope .

https://github.com/modelscope/funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 16 May 2025

https://github.com/modelscope/facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Last synced: 14 May 2025

https://github.com/modelscope/diffsynth-studio

Enjoy the magic of Diffusion models!

Last synced: 12 Jan 2026

https://github.com/modelscope/DiffSynth-Studio

Enjoy the magic of Diffusion models!

Last synced: 02 May 2025

https://github.com/modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

cv deep-learning machine-learning multi-modal nlp python science speech

Last synced: 14 Mar 2026

https://github.com/modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).

deepseek-r1 deploy embedding grpo internvl liger llama llama4 llm lora megatron multimodal omni open-r1 peft qwen2-vl qwen3 qwen3-moe rft sft

Last synced: 28 Feb 2026

https://github.com/modelscope/agentscope

Start building LLM-empowered multi-agent applications in an easier way.

agent chatbot distributed-agents drag-and-drop gpt-4 gpt-4o large-language-models llama3 llm llm-agent mcp multi-agent multi-modal

Last synced: 14 May 2025

https://github.com/modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 24 Mar 2025

https://github.com/modelscope/funclip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

gradio gradio-python-llm llm speech-recognition speech-to-text subtitles-generator video-clip video-subtitles

Last synced: 13 May 2025

https://github.com/modelscope/FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

gradio gradio-python-llm llm speech-recognition speech-to-text subtitles-generator video-clip video-subtitles

Last synced: 11 Sep 2025

https://github.com/modelscope/clearervoice-studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

audio bandwidth-extension deep-learning noise-suppression pytorch speaker-extraction speech speech-enhancement speech-separation speech-super-resolution

Last synced: 14 May 2025

https://github.com/modelscope/3d-speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

3d-speaker campplus cnceleb eres2net language-identification modelscope rdino speaker-diarization speaker-verification voxceleb

Last synced: 14 May 2025

https://github.com/modelscope/evalscope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

evaluation llm performance rag vlm

Last synced: 05 Jan 2026

https://github.com/modelscope/KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

modelscope speech speech-synthesis tts

Last synced: 11 Oct 2025

https://github.com/modelscope/scepter

SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.

aigc generative-model lar-gen scedit stylebooth

Last synced: 15 May 2025

https://github.com/modelscope/kan-tts

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

modelscope speech speech-synthesis tts

Last synced: 04 Apr 2025

https://github.com/modelscope/richdreamer

[CVPR2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo:https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC

Last synced: 09 Apr 2025

https://github.com/modelscope/funcodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

audio-generation audio-quantization codec encodec speech-synthesis speech-to-text tts voicecloning

Last synced: 05 Apr 2025

https://github.com/modelscope/awesome-deep-reasoning

Collect every awesome work about r1!

collection deepseek grpo o1 qwen r1 reasoning rl

Last synced: 14 Jun 2025

https://github.com/modelscope/FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

audio-generation audio-quantization codec encodec speech-synthesis speech-to-text tts voicecloning

Last synced: 26 Oct 2025

https://github.com/modelscope/motionagent

MotionAgent is your AI assistent to convert ideas into motion pictures.

Last synced: 06 Apr 2025

https://github.com/modelscope/dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

cpu cuda guided-decoding llm llm-inference native-engine

Last synced: 12 Apr 2025

https://github.com/modelscope/trinity-rft

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

agent llm rlhf

Last synced: 14 Jun 2025

https://github.com/modelscope/modelscope-studio

A third-party component library based on Gradio.

ant-design ant-design-x gradio gradio-custom-component modelscope python ui

Last synced: 16 May 2025

https://github.com/modelscope/mcpbench

The evaluation benchmark on MCP servers

benchmark database mcp mcp-server websearch

Last synced: 14 Jun 2025

https://github.com/modelscope/lite-sora

An initiative to replicate Sora

Last synced: 02 May 2025

https://github.com/modelscope/AgentEvolver

AgentEvolver: Towards Efficient Self-Evolving Agent System

agent llm reinforcement-learning self-evolving

Last synced: 28 Nov 2025

https://github.com/modelscope/mcp-central

Collection of model-centric MCP servers

Last synced: 25 Jun 2025

https://github.com/modelscope/imagepulse

Open Image Curation Tools

Last synced: 14 Jun 2025

https://github.com/modelscope/ReMe

Last synced: 24 Sep 2025

https://github.com/modelscope/langchain-modelscope

Langchain integration for ModelScope

Last synced: 14 Jun 2025

https://github.com/modelscope/r-chain

Last synced: 14 Jun 2025

https://github.com/modelscope/comfyscope

Collection of various Comfy components.

Last synced: 24 Aug 2025

https://github.com/modelscope/modelscope-mcp-server

ModelScope MCP Server (in development)

agent aigc fastmcp llm mcp mcp-server modelscope

Last synced: 23 Jul 2025

https://github.com/modelscope/ClearerVoice-Studio

ClearVoice

Last synced: 02 May 2025

https://github.com/modelscope/katz

Last synced: 14 Jun 2025

https://github.com/modelscope/rm-gallery

A One-Stop Reward Model Platform

Last synced: 23 Jul 2025