Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by modelscope

A curated list of projects in awesome lists by modelscope .

https://github.com/modelscope/facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Last synced: 29 Oct 2024

https://github.com/modelscope/funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 01 Nov 2024

https://github.com/modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

cv deep-learning machine-learning multi-modal nlp python science speech

Last synced: 28 Oct 2024

https://github.com/modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 29 Oct 2024

https://github.com/modelscope/agentscope

Start building LLM-empowered multi-agent applications in an easier way.

agent chatbot distributed-agents drag-and-drop gpt-4 gpt-4o large-language-models llama3 llm llm-agent multi-agent multi-modal

Last synced: 29 Oct 2024

https://github.com/modelscope/ms-swift

Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

agent deploy dpo internvl liger llama llama3 llava llm lora megatron minicpm-v modelscope multimodal peft pre-training qwen2 qwen2-vl reflection sft

Last synced: 01 Nov 2024

https://github.com/modelscope/funclip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

gradio gradio-python-llm llm speech-recognition speech-to-text subtitles-generator video-clip video-subtitles

Last synced: 09 Oct 2024

https://github.com/modelscope/data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! ๐ŸŽ ๐Ÿ‹ ๐ŸŒฝ โžก๏ธ โžก๏ธ๐Ÿธ ๐Ÿน ๐Ÿทไธบๅคงๆจกๅž‹ๆไพ›ๆ›ด้ซ˜่ดจ้‡ใ€ๆ›ดไธฐๅฏŒใ€ๆ›ดๆ˜“โ€ๆถˆๅŒ–โ€œ็š„ๆ•ฐๆฎ๏ผ

chinese data-analysis data-science data-visualization dataset gpt gpt-4 instruction-tuning large-language-models llama llava llm llms multi-modal nlp opendata pre-training pytorch sora streamlit

Last synced: 13 Oct 2024

https://github.com/modelscope/modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

agent chatglm-4 gpts llm multi-agents open-gpts qwen

Last synced: 15 Oct 2024

https://github.com/modelscope/DiffSynth-Studio

Enjoy the magic of Diffusion models!

Last synced: 02 Aug 2024

https://github.com/modelscope/FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

audio-generation audio-quantization codec encodec speech-synthesis speech-to-text tts voicecloning

Last synced: 11 Oct 2024

https://github.com/modelscope/scepter

SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.

Last synced: 30 Oct 2024

https://github.com/modelscope/evalscope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

evaluation llm performance

Last synced: 13 Aug 2024