{"id":13487887,"url":"https://github.com/GZTipDM/TipDM","last_synced_at":"2025-03-27T23:32:03.950Z","repository":{"id":36129503,"uuid":"192271135","full_name":"GZTipDM/TipDM","owner":"GZTipDM","description":"TipDM建模平台，开源的数据挖掘工具。","archived":false,"fork":false,"pushed_at":"2022-12-16T10:36:48.000Z","size":27272,"stargazers_count":221,"open_issues_count":52,"forks_count":129,"subscribers_count":15,"default_branch":"master","last_synced_at":"2024-10-30T23:36:00.842Z","etag":null,"topics":["bigdata","data-analysis","data-analysis-python","data-mining","graph-schedule","machine-learning","tensorflow","workflow"],"latest_commit_sha":null,"homepage":null,"language":"JavaScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/GZTipDM.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-06-17T03:48:23.000Z","updated_at":"2024-10-19T05:46:07.000Z","dependencies_parsed_at":"2023-01-16T23:31:04.192Z","dependency_job_id":null,"html_url":"https://github.com/GZTipDM/TipDM","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GZTipDM%2FTipDM","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GZTipDM%2FTipDM/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GZTipDM%2FTipDM/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GZTipDM%2FTipDM/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/GZTipDM","download_url":"https://codeload.github.com/GZTipDM/TipDM/tar.gz/refs/heads/master","host":{"name":"GitHub
","url":"https://github.com","kind":"github","repositories_count":245944020,"owners_count":20697945,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bigdata","data-analysis","data-analysis-python","data-mining","graph-schedule","machine-learning","tensorflow","workflow"],"created_at":"2024-07-31T18:01:05.888Z","updated_at":"2025-03-27T23:31:58.902Z","avatar_url":"https://github.com/GZTipDM.png","language":"JavaScript","funding_links":[],"categories":["JavaScript"],"sub_categories":[],"readme":"# Introduction #\nTipDM建模平台，是由广东泰迪智能科技股份有限公司研发并开源的数据挖掘工具，TipDM建模平台提供数据丰富的数据预处理、\n数据分析与数据挖掘组件，帮助广大中小企业快速建立数据挖掘工程，提升数据处理的效能。同时，我们也在积极\n推动大数据挖掘社区建设，构建校企对接桥梁，为企业精准推送优质大数据挖掘人才；在产业需求的基础上推动高\n校的人才培养工作。\n\n# Documentation #\n[使用文档](http://python.tipdm.org/bzzx/index.jhtml?n=%E6%93%8D%E4%BD%9C%E6%96%87%E6%A1%A3)\n\n# Communication #\n[社区交流](http://bbs.tipdm.org)\n\n# Features #\n1. 基于Python，用于数据挖掘建模。\n2. 使用直观的拖放式图形界面构建数据挖掘工作流程，无需编程。\n3. 支持多种数据源，包括CSV文件和关系型数据库。\n4. 支持挖掘流程每个节点的结果在线预览。\n5. 提供5大类共40种算法组件，包括数据预处理、分类、聚类等数据挖掘算法。\n6. 支持新增/编辑算法组件，自定义程度高。\n7. 提供众多公开可用的数据挖掘示例工程，一键创建，快速运行。\n8. 
Provides a full-featured community forum with data mining learning resources (data, code, models, and more).\n\n# Screenshot #\n![Screenshot 1](https://images.gitee.com/uploads/images/2019/0617/112412_0a4abed4_4964548.jpeg \"1.jpg\")\n![Screenshot 2](https://images.gitee.com/uploads/images/2019/0617/112438_656d0053_4964548.jpeg \"2.jpg\")\n![Screenshot 3](https://images.gitee.com/uploads/images/2019/0617/112450_a0ff4eb8_4964548.jpeg \"3.jpg\")\n![Screenshot 4](https://images.gitee.com/uploads/images/2019/0617/112509_238a7067_4964548.jpeg \"4.jpg\")\n\n# Development #\n## Environment dependencies ##\n- [Oracle JDK 1.8.x or later](http://www.oracle.com/technetwork/java/javasebusiness/downloads/java-archive-downloads-javase6-419409.html); for installation details, see the [JDK installation guide](https://docs.oracle.com/javase/8/docs/technotes/guides/install/install_overview.html)\n- [Apache Maven 3.x or later](http://maven.apache.org); for installation details, see the [Maven installation guide](https://maven.apache.org/install.html)\n- [Apache Tomcat 8.x or later](http://tomcat.apache.org)\n- [PostgreSQL 9.4.x or later](http://www.postgresql.org/download/)\n- [Python 3.6.x or later](https://www.python.org/)\n- [Redis 3.2.x or later](https://redis.io/)\n- [Alibaba DataX](https://github.com/alibaba/DataX)\n- IntelliJ IDEA (optional; you can also use another IDE such as Eclipse or NetBeans); for installation details, see the [IntelliJ installation guide](https://www.jetbrains.com/help/idea/installing-and-launching.html)\n\n### Installing the Java development environment ###\nDownload and install JDK 1.8.x and Apache Maven, then set the JAVA_HOME and PATH environment variables, for example by appending the following to ~/.bashrc (how environment variables are set differs between operating systems, so adjust this for your own setup):\n\n    echo 'export JAVA_HOME=~/jdk_1.8.0_171' \u003e\u003e ~/.bashrc  \n    echo 'export PATH=$JAVA_HOME/bin:~/apache-maven-3.3.9/bin:$PATH' \u003e\u003e ~/.bashrc\n    . 
~/.bashrc\n\nRun the following commands to check that Java and Maven are installed correctly and at the right versions:\n\u003e tipdm: ~ devp$ javac -version  \n\u003e javac 1.8.0_171  \n\u003e tipdm: ~ devp$ mvn -version  \n\u003e Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; 2015-11-11T00:41:47+08:00)   \n\nIf a command returns \"-bash: xxx: command not found\", or the reported version is lower than TipDM requires, check that the dependency is installed correctly and that the corresponding environment variables have taken effect.\n\n### Installing Python ###\n\n   Download Python 3.6.x and configure its environment variables.\n   \n   Required libraries:\n   \n   \tarch==4.3.1\n\tdocx==0.2.4\n\tgensim==3.6.0\n\tgraphviz==0.10.1\n\tjieba==0.38\n\tjieba-fast==0.53\n\tKeras==2.2.4\n\tmatplotlib==2.2.0\n\tnumpy==1.14.2\n\tpandas==0.23.4\n\tpdfminer3k==1.3.1\n\tpyclust==0.2.0\n\tpydot==1.2.4\n\tpython-docx==0.8.10\n\tscikit-learn==0.19.1\n\tscipy==0.19.1\n\tSQLAlchemy==1.2.0\n\tstatsmodels==0.9.0\n\ttensorflow==1.12.0\n\tthulac==0.2.0\n\twordcloud==1.5.0\n\t\n#### Installing the dependencies in bulk ####\nPaste the list above into a file named requirements.txt (**any file name works**), open a command line, cd to the directory that contains the file, and run:\n\n\tpip install -r requirements.txt\n   \t\n### Installing PostgreSQL ###\n  \n   Download PostgreSQL 9.4.x and install it. See also the [PostgreSQL Chinese community](http://www.postgres.cn).\n## Quick start ##\n### Building the project ###\n#### backend ####\nDownload the source code and import it into your IDE (Eclipse or IDEA) as a Maven project.\n##### Data initialization #####\nA PostgreSQL server must be running on the local machine and listening on port 5432 of 127.0.0.1 (a default PostgreSQL installation and initialization already does this). Then, as the PostgreSQL administrator (usually the Linux account that initialized the database; here it is the postgres account), run the .sql scripts in the $TipDM_HOME/WEB-INF/classes/sql/ directory to initialize the metadata:\n\n    psql -h 127.0.0.1 -p 5432 -U postgres -d tipdm_DB -f initData.sql\n    psql -h 127.0.0.1 -p 5432 -U postgres -d tipdm_DB -f quartz_postgres.sql\n\n##### System configuration #####\nConfiguration files:\n\n\tsysconfig/database.properties\t\t\tDatabase configuration\n\tsysconfig/dbSupport.config\t\t\tDatabase types the system can support\n\tsysconfig/system.properties\t\t\tSystem settings\n\tsysconfig/redis.properties\t\t\tRedis\n\tPyConnection.xml\t\t\t\tPython service (this file is in the parent directory of sysconfig)\n\n##### Compilation #####\ncd to the root of the source tree and build with Maven. The source is organized as follows:\n\n\tframework-common\t\tCommon module\n\tframework-model  \t\tData model\n\tframework-persist \t\tData persistence\n\tframework-service \t\tService\n\ttipdm-server  
\t\t\tBackend service\n\nIf you see\n\u003e BUILD SUCCESS  \n\u003e Total time: ...  \n\nthe build succeeded, and the generated binary package is in the $HOME/target/ directory.\n\n##### Deployment #####\nFor deployment details, see [IntelliJ IDEA – Run / debug web application on Tomcat](https://www.mkyong.com/intellij/intellij-idea-run-debug-web-application-on-tomcat/)\n\n# FAQ #\n[http://python.tipdm.org/bzzx/index.jhtml?n=%E5%B8%B8%E8%A7%81%E9%97%AE%E9%A2%98](http://python.tipdm.org/bzzx/index.jhtml?n=%E5%B8%B8%E8%A7%81%E9%97%AE%E9%A2%98)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FGZTipDM%2FTipDM","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FGZTipDM%2FTipDM","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FGZTipDM%2FTipDM/lists"}