{"id":23611609,"url":"https://github.com/leafyeexyz/digitallife","last_synced_at":"2025-05-12T19:09:17.208Z","repository":{"id":235778125,"uuid":"791237662","full_name":"LeafYeeXYZ/DigitalLife","owner":"LeafYeeXYZ","description":"一个具有长时记忆和 Live2d 形象的\"数字生命\" / A digital life with long-term memories and live2d body","archived":false,"fork":false,"pushed_at":"2025-04-30T08:40:58.000Z","size":300693,"stargazers_count":25,"open_issues_count":1,"forks_count":3,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-05-12T19:08:39.889Z","etag":null,"topics":["ai","ai-agents","javascript","live2d","live2d-web","memory","openai","psychology","react","stt","tauri","tts","typescript","web-api","web-speech-api"],"latest_commit_sha":null,"homepage":"https://being.leafyee.xyz","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/LeafYeeXYZ.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-04-24T10:53:09.000Z","updated_at":"2025-05-09T17:34:38.000Z","dependencies_parsed_at":"2024-08-12T11:05:11.281Z","dependency_job_id":"1da3a022-56e7-4e6c-aa0b-3b88fd031348","html_url":"https://github.com/LeafYeeXYZ/DigitalLife","commit_stats":null,"previous_names":["leafyeexyz/counselorleaf","leafyeexyz/cyberleaf","leafyeexyz/digitallife"],"tags_count":12,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LeafYeeXYZ%2FDigitalLife","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LeafYeeXYZ%2FDigitalLife/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LeafYeeXYZ%2FDigitalLife/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LeafYeeXYZ%2FDigitalLife/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/LeafYeeXYZ","download_url":"https://codeload.github.com/LeafYeeXYZ/DigitalLife/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":253805861,"owners_count":21967053,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","ai-agents","javascript","live2d","live2d-web","memory","openai","psychology","react","stt","tauri","tts","typescript","web-api","web-speech-api"],"created_at":"2024-12-27T16:16:27.275Z","updated_at":"2025-05-12T19:09:17.201Z","avatar_url":"https://github.com/LeafYeeXYZ.png","language":"TypeScript","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Digital Life / 数字生命\n\n一个有自我概念、长时记忆的\"数字生命\"; 长时记忆包括对话内记忆、对话间记忆、自我概念、用户画像; 所有推理功能都可在 `3000` 总 `tokens` 内完成, 对端侧小模型友好, 也可在设置中提高 `tokens` 上限. 同时支持 Web 和桌面端、支持多种推理、TTS、STT 服务.\n\n如果您想快速体验本项目, 可在 \u003chttps://deepseek.com\u003e 等平台注册一个免费账号, 获取 `API Key`, 并访问 \u003chttps://being.leafyee.xyz\u003e, 在应用内设置 `Endpoint`、`API Key`、`Model Name` 即可使用 (也可直接本地运行 `ollama`, 见[4 开发和部署](#4-开发和部署)).\n\n![示意图](./readme/intro.png)\n\n- [Digital Life / 数字生命](#digital-life--数字生命)\n - [1 项目说明](#1-项目说明)\n - [2 长时记忆](#2-长时记忆)\n - [2.1 现有做法](#21-现有做法)\n - [2.2 本项目设计](#22-本项目设计)\n - [3 使用说明](#3-使用说明)\n - [3.1 聊天界面](#31-聊天界面)\n - [3.2 语音服务](#32-语音服务)\n - [3.3 自定义界面](#33-自定义界面)\n - [3.4 云存储](#34-云存储)\n - [3.5 记忆](#35-记忆)\n - [4 开发和部署](#4-开发和部署)\n - [4.1 桌面端](#41-桌面端)\n - [4.2 Web 端](#42-web-端)\n - [4.3 环境变量](#43-环境变量)\n - [5 Live2d 模型版权声明](#5-live2d-模型版权声明)\n - [6 参考文献](#6-参考文献)\n\n\u003e `v1.4.0 引用信息`: `叶一杉. (2025). DigitalLife: 具有长时记忆和自我概念的AI虚拟人 (v1.4.0) [Computer software]. GitHub. https://github.com/LeafYeeXYZ/DigitalLife/tree/v1.4.0`\n\n## 1 项目说明\n\n本项目希望如*流浪地球2* (\u003chttps://digitallife.ac.cn\u003e) 那样, 创造一个有自己的情感、记忆的\"数字生命\". 并在现阶段通过 `Live2d` 技术, 使其有更好的交互效果 (如[Neuro Sama](https://www.bilibili.com/video/BV1Db1WYgESn/?share_source=copy_web\u0026vd_source=f73d7b1cc6b3e028bd1d6a660f91c4f1)). 相比[同类项目](https://github.com/t41372/Open-LLM-VTuber), 本项目使用更为简单 (直接在 \u003chttps://being.leafyee.xyz\u003e 即可体验), 且记忆功能更为强大.\n\n关于长时记忆, 本项目参考了多篇论文, 设计了一个独特而创新的记忆系统. 详见[2 长时记忆](#2-长时记忆).\n\n| 模块 | 可选项 | 说明 |\n| :------------: | :-------------------------------------------------------------------------------: | :-------------------------------------------------------------------------------------------------------------------------: |\n| 推理模型 | 任意兼容 `OpenAI` 规范的服务 | 默认使用 `ollama`, 可在应用内修改 `Endpoint`、`API Key`、`Model Name` 等 |\n| 嵌入模型 | `jina-embeddings-v3` | 用于记忆提取, 需在应用内设置官方 `API Key` 或自部署 `/server/vector.py`\u003cbr /\u003e注意: 自部署版本的向量维度暂时只支持 `1024` 维 |\n| 语音生成 `TTS` | [`F5 TTS` 服务](https://github.com/jianchang512/f5-tts-api) | 默认服务地址为 `'http://127.0.0.1:5010/api'`, 可在应用内修改 |\n| | [`Fish Speech` 服务](https://speech.fish.audio/zh/inference/#http-api) | 默认服务地址为 `'http://127.0.0.1:8080'`, 可在应用内修改 |\n| 语音输入 `STT` | [Web Speech API](https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API) | 使用浏览器内置的 `STT` 功能 |\n| 动态形象 | `Live2d` | 使用 \u003chttps://github.com/hacxy/l2d\u003e 项目 |\n| 其他 | 天气信息 | 默认关闭, 可在应用内设置[和风天气 API Key](https://dev.qweather.com/)开启 |\n| | 时间信息 | 模型可获知当前设备时间、对话开始时间、首次相遇时间 |\n| | 图片输入 | 🚧 WIP |\n| | 图片输出 | 🚧 WIP |\n| | 文件输入 | 🚧 WIP, 可能会使用 \u003chttps://github.com/microsoft/markitdown\u003e |\n| 记忆云备份 | S3 | 在记忆备份界面填写相关信息即可使用 |\n\n\u003e `TTS` 和 `STT` 服务默认关闭, 可在应用内的设置中开启\n\n## 2 长时记忆\n\n### 2.1 现有做法\n\n**MemoryBank: Enhancing Large Language Models with Long-Term Memory** (Zhong et al., 2023): 通过记忆库 (MemoryBank) 实现长时记忆. 具体来讲, 将用户的每批对话生成摘要并按时间顺序存储在记忆库中, 过程中更新用户画像. 对话时, 将用户输入用向量搜索的方式与记忆库中的摘要进行匹配, 将提取到的摘要、用户画像、当前对话上下文输入给模型; 参考艾宾浩斯遗忘曲线, 按照时间和重复次数来确定记忆遗忘率.\n\n![](./readme/refer-3.png)\n\n**Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models** (Wang et al., 2024): 通过递归生成摘要/记忆来实现长时记忆. 具体来讲, LLMs首先记忆小的对话上下文, 然后递归地使用之前的记忆和后续的上下文来生成新的记忆.\n\n![](./readme/refer-1.png)\n\n**\"My agent understands me better\": Integrating Dynamic Human-like Memory Recall and Consolidation in LLM-Based Agents** (Hou et al., 2024): 通过与人类记忆类似的\"线索回忆\"和\"记忆巩固\"机制实现长时记忆. 具体来讲, 将用户输入向量化, 并存入/更新数据库 (新增/加强记忆), 随后在数据库中进行相关性查询, 根据相关性、记忆巩固强度、记忆存储的时间三个因子, 找到能回忆起的记忆并发送给模型.\n\n![](./readme/refer-2.png)\n\n**A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts** (Lee et al., 2024): 并非直接的长时记忆方案, 而是一个处理长文本的方法. 具体来讲, 将长文本分为多个片段, 每个片段生成一个摘要并将其与原文关联, 根据任务和所有摘要决定要查询哪些片段; 以上过程均由模型自主完成 (原文给出了每步的提示词). 最后将得到的精简信息输入给模型.\n\n![](./readme/refer-5.png)\n\n**MemGPT: Towards LLMs as Operating Systems** (Packer et al., 2024): 借鉴虚拟内存的思想, 让模型通过函数调用读取外部记忆. 具体来讲, 将模型的上下文分为主上下文 (包含系统提示词、记忆调用结果、当前对话上下文) 和外部上下文 (记忆); 每次对话时, 模型自主进行函数调用, 读写外部上下文, 更新主上下文的记忆调用结果, 并通过更新后的主上下文进行推理.\n\n![](./readme/refer-4.png)\n\n### 2.2 本项目设计\n\n长时记忆模块的要解决的核心问题为多次对话间AI回复的连贯性. 本项目设计了\"用户画像\"、\"自我概念\"、\"日记\"和\"记忆检索\"机制：在每次对话结束后, AI模型将基于本次对话全文, 生成/更新使用者的\"用户画像\" (类似于 Zhong et al., 2023)、自身的\"自我概念\" (即AI模型对自己的认知, 在与用户的对话中学到的知识等)、对本次对话中提到的事件的总结.\n \n生成的\"总结\"将被词嵌入模型向量化后存入用户专属的记忆库. 在用户的对话过程中, 当AI接收到用户输入时, 可以选择通过 Function Calling 调取记忆, 并给出想要调取的记忆的文字描述；系统将先把AI给出的描述同样进行向量化, 之后在记忆库中找出相关性最高的数条记忆并返回给AI模型；AI模型得到记忆后, 生成对用户的回复.\n\n当前判断\"一次对话结束\"的标准为: 用户主动结束对话, 或连续数小时 (当前默认为八小时) 无新增对话. 综上所述, 处理逻辑如下图所示：\n \n\u003e 处理逻辑示意图\n\n![](./readme/model-1.png)\n\n![](./readme/model-2.png)\n\n## 3 使用说明\n\n请打开网页 (如果是 Web 端) 或应用 (如果是桌面端), 在右下角找到 `设置`, 选择 `推理服务设置`, 输入 `推理服务地址` (即你所用的大模型提供商的 API 地址), `推理服务密钥` (即你所用的大模型提供商的 API Key), `推理服务模型` (即你需要使用的模型的名称); 之后, 在右下角找到 `设置`, 选择 `嵌入服务设置`, 再在 \u003chttps://jina.ai/embeddings/\u003e 获取一个免费的 `API Key`, 填入 `嵌入服务密钥` 即可. 如下图所示.\n\n| 推理服务设置 | 嵌入服务设置 |\n| :------------------------: | :------------------------: |\n| ![](./readme/config-1.png) | ![](./readme/config-2.png) |\n\n设置后, 回到聊天界面即可开始聊天.\n\n### 3.1 聊天界面\n\n聊天分为 `文字语音聊天` 和 `连续语音聊天`, 前者以文字输入为主 (如果开启语音识别服务, 也可以使用语音转文字), 后者是和模型进行连续语音对话, 无需进行任何文字输入. 要是用 `连续语音聊天`, 必须在设置中开启语音识别.\n\n\u003e 连续语音对话界面\n\n![](./readme/intro-1.png)\n\n### 3.2 语音服务\n\n在 `设置` -\u003e `语音服务设置` 中, 可以选择要使用的语音合成和语音识别服务, 在开启并设置了正确的语音服务地址 (见[1 项目说明](#1-项目说明)) 后, 模型将可以以语音进行回复 (在 `文字语音聊天` 中, 可以手动点击播放; 在 `连续语音对话` 中, 会自动播放), 你也可以进行语音输入.\n\n\u003e 语音服务设置界面\n\n![](./readme/intro-2.png)\n\n### 3.3 自定义界面\n\n你可以自定义数字生命的 `live2d` 模型和背景图片, 只需在 `设置` -\u003e `自定义设置` 中选择模型/上传图片即可. 你还可以选择背景图片的显示区域 (局部/全屏).\n\n\u003e 自定义设置界面\n\n![](./readme/intro-3.png)\n\n### 3.4 云存储\n\n你可以将本地的配置和记忆上传到指定的 `S3` 存储服务中, 只需在 `设置` -\u003e `云存储设置` 中设定 `S3` 相关的凭证即可.\n\n\u003e 云存储设置界面\n\n![](./readme/intro-4.png)\n\n### 3.5 记忆\n\n你可以在 `记忆` -\u003e `名字和自我` 中查看数字生命关于自己和关于你的记忆, 或在 `记忆` -\u003e `日记本` 中查看数字生命的\"日记\", 即对每次对话的总结和感受. 要导入和导出记忆, 请前往 `记忆` -\u003e `导入和导出`; 如果设置了云存储服务, 你还可以在那里把记忆上传到云端.\n\n| 名字和自我 | 日记本 | 导入和导出 |\n| :-----------------------: | :-----------------------: | :-----------------------: |\n| ![](./readme/intro-5.png) | ![](./readme/intro-6.png) | ![](./readme/intro-7.png) |\n\n## 4 开发和部署\n\n本项目同时支持 Web 和桌面端, 所有设置均可在应用内完成, 无需修改代码. LLM 推理使用 `OpenAI SDK` 实现, 您可以使用任何兼容的服务, 如 `deepseek` 和 `ollama`. 默认情况下, 本项目使用 `ollama` 服务和 `qwen2.5:7b` 模型, 可在应用内设置中指定其他模型/服务\n\n\u003e 如果使用 `ollama`, 请提前安装 `ollama` 并运行 `ollama pull qwen2.5:7b` 下载模型, 并启动 `ollama` 服务. 在 Web 端时, 您可能需要手动设置 `ollama` 的 `CORS` 策略以避免请求被浏览器拦截: 首先设置本地的 `OLLAMA_ORIGINS` 环境变量为 `\"*\"`、在终端中运行 `echo $OLLAMA_ORIGINS` 确认设置成功、在终端中运行 `ollama serve` 启动服务 (即使进行了上述设置, 仍然可能会在 `Safari` 中遇到 `CORS` 问题, 请尝试使用 `Chrome` 浏览器)\n\n### 4.1 桌面端\n\n```bash\n# 克隆项目\ngit clone https://github.com/LeafYeeXYZ/DigitalLife.git\ncd DigitalLife\n# 安装依赖\nbun install\n# 运行\nbun dev:tauri\n# 构建\nbun build:tauri\n```\n\n### 4.2 Web 端\n\n请注意 Web 端构建后的输出目录为 `/dist-web` 而非 `/dist`\n\n```bash\n# 克隆项目\ngit clone https://github.com/LeafYeeXYZ/DigitalLife.git\ncd DigitalLife\n# 安装依赖\nbun install\n# 运行\nbun dev:web\n# 构建\nbun build:web\n```\n\n### 4.3 环境变量\n\n| 环境变量名 | 默认值 | 说明 |\n| :--------------------: | :-----: | :--------------------------------: |\n| `VITE_DEBUG_COMPONENT` | `'off'` | 仅当设为 `'on'` 时, 会显示调试组件 |\n\n## 5 Live2d 模型版权声明\n\n本项目内的所有模型皆为在 B 站发布的免费模型, 请勿直接复制使用, 如有需要可至原地址获取\n\n## 6 参考文献\n\n- Hou, Y., Tamoto, H., \u0026 Miyashita, H. (2024). \"My agent understands me better\": Integrating Dynamic Human-like Memory Recall and Consolidation in LLM-Based Agents. Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 1–7. https://doi.org/10.1145/3613905.3650839\n- Lee, K.-H., Chen, X., Furuta, H., Canny, J., \u0026 Fischer, I. (2024). A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts (arXiv:2402.09727). arXiv. https://doi.org/10.48550/arXiv.2402.09727\n- Packer, C., Wooders, S., Lin, K., Fang, V., Patil, S. G., Stoica, I., \u0026 Gonzalez, J. E. (2024). MemGPT: Towards LLMs as Operating Systems (arXiv:2310.08560). arXiv. https://doi.org/10.48550/arXiv.2310.08560\n- Wang, Q., Ding, L., Cao, Y., Tian, Z., Wang, S., Tao, D., \u0026 Guo, L. (2024). Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models (arXiv:2308.15022). arXiv. https://doi.org/10.48550/arXiv.2308.15022\n- Zhong, W., Guo, L., Gao, Q., Ye, H., \u0026 Wang, Y. (2023). MemoryBank: Enhancing Large Language Models with Long-Term Memory (arXiv:2305.10250). arXiv. https://doi.org/10.48550/arXiv.2305.10250\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fleafyeexyz%2Fdigitallife","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fleafyeexyz%2Fdigitallife","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fleafyeexyz%2Fdigitallife/lists"}