Projects in Awesome Lists by modelscope
A curated list of projects in awesome lists by modelscope.
https://github.com/modelscope/funasr
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper
Last synced: 16 May 2025
https://github.com/modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Last synced: 14 May 2025
https://github.com/modelscope/diffsynth-studio
Enjoy the magic of Diffusion models!
Last synced: 12 Jan 2026
https://github.com/modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
Last synced: 02 May 2025
https://github.com/modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
cv deep-learning machine-learning multi-modal nlp python science speech
Last synced: 14 Mar 2026
https://github.com/modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).
deepseek-r1 deploy embedding grpo internvl liger llama llama4 llm lora megatron multimodal omni open-r1 peft qwen2-vl qwen3 qwen3-moe rft sft
Last synced: 28 Feb 2026
https://github.com/modelscope/agentscope
Start building LLM-empowered multi-agent applications in an easier way.
agent chatbot distributed-agents drag-and-drop gpt-4 gpt-4o large-language-models llama3 llm llm-agent mcp multi-agent multi-modal
Last synced: 14 May 2025
https://github.com/modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper
Last synced: 24 Mar 2025
https://github.com/modelscope/funclip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.
gradio gradio-python-llm llm speech-recognition speech-to-text subtitles-generator video-clip video-subtitles
Last synced: 13 May 2025
https://github.com/modelscope/data-juicer
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
chinese data-analysis data-science data-visualization dataset gpt gpt-4 instruction-tuning large-language-models llama llava llm llms multi-modal nlp opendata pre-training pytorch streamlit synthetic-data
Last synced: 13 May 2025
https://github.com/modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.
gradio gradio-python-llm llm speech-recognition speech-to-text subtitles-generator video-clip video-subtitles
Last synced: 11 Sep 2025
https://github.com/modelscope/modelscope-agent
MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration
agent agentic-insight assistantapi chatbot code data-science data-science-assistant deep-research gpts llm multi-agents multimodal-large-language-models open-gpts qwen rag
Last synced: 08 Jul 2025
https://github.com/modelscope/ms-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
agent android-application assistantapi chatbot chatglm-4 code codexgraph data-science data-science-assistant gpts llm mobile-agent mobile-agents multi-agents multimodal-large-language-models open-gpts qwen rag
Last synced: 06 Feb 2026
https://github.com/modelscope/clearervoice-studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
audio bandwidth-extension deep-learning noise-suppression pytorch speaker-extraction speech speech-enhancement speech-separation speech-super-resolution
Last synced: 14 May 2025
https://github.com/modelscope/3d-speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
3d-speaker campplus cnceleb eres2net language-identification modelscope rdino speaker-diarization speaker-verification voxceleb
Last synced: 14 May 2025
https://github.com/modelscope/evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
evaluation llm performance rag vlm
Last synced: 05 Jan 2026
https://github.com/modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework. Please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
modelscope speech speech-synthesis tts
Last synced: 11 Oct 2025
https://github.com/modelscope/scepter
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
aigc generative-model lar-gen scedit stylebooth
Last synced: 15 May 2025
https://github.com/modelscope/kan-tts
KAN-TTS is a speech-synthesis training framework. Please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
modelscope speech speech-synthesis tts
Last synced: 04 Apr 2025
https://github.com/modelscope/richdreamer
[CVPR2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo: https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
Last synced: 09 Apr 2025
https://github.com/modelscope/adaseq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
bert chinese-nlp crf entity-typing information-extraction multi-modal-ner named-entity-recognition natural-language-processing natural-language-understanding ner nlp pytorch relation-extraction sequence-labeling token-classification word-segmentation
Last synced: 16 May 2025
https://github.com/modelscope/AdaSeq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
bert chinese-nlp crf entity-typing information-extraction multi-modal-ner named-entity-recognition natural-language-processing natural-language-understanding ner nlp pytorch relation-extraction sequence-labeling token-classification word-segmentation
Last synced: 18 Mar 2025
https://github.com/modelscope/funcodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.
audio-generation audio-quantization codec encodec speech-synthesis speech-to-text tts voicecloning
Last synced: 05 Apr 2025
https://github.com/modelscope/awesome-deep-reasoning
Collect every awesome work about r1!
collection deepseek grpo o1 qwen r1 reasoning rl
Last synced: 14 Jun 2025
https://github.com/modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.
audio-generation audio-quantization codec encodec speech-synthesis speech-to-text tts voicecloning
Last synced: 26 Oct 2025
https://github.com/modelscope/motionagent
MotionAgent is your AI assistant to convert ideas into motion pictures.
Last synced: 06 Apr 2025
https://github.com/modelscope/dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
cpu cuda guided-decoding llm llm-inference native-engine
Last synced: 12 Apr 2025
https://github.com/modelscope/trinity-rft
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLMs).
Last synced: 14 Jun 2025
https://github.com/modelscope/modelscope-studio
A third-party component library based on Gradio.
ant-design ant-design-x gradio gradio-custom-component modelscope python ui
Last synced: 16 May 2025
https://github.com/modelscope/mcpbench
The evaluation benchmark on MCP servers
benchmark database mcp mcp-server websearch
Last synced: 14 Jun 2025
https://github.com/modelscope/adadet
AdaDet: A Development Toolkit for Object Detection based on ModelScope
computer-vision deep-learning detection detection-toolkit domain-specific-models face-detection keypoint-detection machine-learning object-detection ocr python tracking
Last synced: 12 Jul 2025
https://github.com/modelscope/promptscope
Enjoy easier conversations with LLM
gpt-4 in-context-learning large-language-models llms multi-modal prompt prompt-engineering
Last synced: 14 Jun 2025
https://github.com/modelscope/AgentEvolver
AgentEvolver: Towards Efficient Self-Evolving Agent System
agent llm reinforcement-learning self-evolving
Last synced: 28 Nov 2025
https://github.com/modelscope/mcp-central
Collection of model-centric MCP servers
Last synced: 25 Jun 2025
https://github.com/modelscope/langchain-modelscope
Langchain integration for ModelScope
Last synced: 14 Jun 2025
https://github.com/modelscope/comfyscope
Collection of various Comfy components.
Last synced: 24 Aug 2025
https://github.com/modelscope/modelscope-mcp-server
ModelScope MCP Server (in development)
agent aigc fastmcp llm mcp mcp-server modelscope
Last synced: 23 Jul 2025