Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by shibing624
A curated list of projects in awesome lists by shibing624 .
https://github.com/shibing624/pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。
csc error-correction error-detection kenlm macbert4csc pycorrector spelling-errors t5
Last synced: 31 Jul 2024
https://github.com/shibing624/text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec
Last synced: 01 Aug 2024
https://github.com/shibing624/medicalgpt
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练、有监督微调、RLHF(奖励建模、强化学习训练)和DPO(直接偏好优化)。
chatgpt dpo gpt llama llm medical medicalgpt
Last synced: 02 Aug 2024
https://github.com/shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练、有监督微调、RLHF(奖励建模、强化学习训练)和DPO(直接偏好优化)。
chatgpt dpo gpt llama llm medical medicalgpt
Last synced: 01 Aug 2024
https://github.com/shibing624/textgen
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。
bart bert chatglm chatgpt gpt2 llama seq2seq t5 text-generation textgen xlnet
Last synced: 02 Aug 2024
https://github.com/shibing624/parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
chinese-speech-recognition chinese-speech-synthesis parrot pinyin2hanzi speech-recognition text-to-speech-python3 tts
Last synced: 04 Aug 2024