{"id":13399189,"url":"https://github.com/jacksu/utils4s","last_synced_at":"2025-05-16T13:03:07.735Z","repository":{"id":41380767,"uuid":"43056772","full_name":"jacksu/utils4s","owner":"jacksu","description":"scala、spark使用过程中，各种测试用例以及相关资料整理","archived":false,"fork":false,"pushed_at":"2019-02-09T02:17:18.000Z","size":10778,"stargazers_count":1086,"open_issues_count":5,"forks_count":433,"subscribers_count":128,"default_branch":"master","last_synced_at":"2025-04-09T08:04:28.685Z","etag":null,"topics":["akka","breeze","json4s","scala","scala-demo","scala-spark","spark","spark-streaming"],"latest_commit_sha":null,"homepage":"http://jacksu.github.io/utils4s","language":"Scala","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jacksu.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2015-09-24T09:15:03.000Z","updated_at":"2025-03-20T06:44:25.000Z","dependencies_parsed_at":"2022-09-05T08:50:35.333Z","dependency_job_id":null,"html_url":"https://github.com/jacksu/utils4s","commit_stats":null,"previous_names":[],"tags_count":2,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jacksu%2Futils4s","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jacksu%2Futils4s/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jacksu%2Futils4s/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jacksu%2Futils4s/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jacksu","download_url":"https://codeload.github.com/jacksu/utils4s/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254535826,"owners_count":22087398,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["akka","breeze","json4s","scala","scala-demo","scala-spark","spark","spark-streaming"],"created_at":"2024-07-30T19:00:35.034Z","updated_at":"2025-05-16T13:03:07.716Z","avatar_url":"https://github.com/jacksu.png","language":"Scala","readme":"\u003ch1 id=\"id1\"\u003eutils4s\u003c/h1\u003e\n\n公众号：\n![公众号](picture/datacube.jpg)\n\n[![Build Status](https://travis-ci.org/jacksu/utils4s.svg?branch=master)](https://travis-ci.org/jacksu/utils4s)[![Join the chat at https://gitter.im/jacksu/utils4s](https://badges.gitter.im/Join%20Chat.svg)](https://gitter.im/jacksu/utils4s?utm_source=badge\u0026utm_medium=badge\u0026utm_campaign=pr-badge\u0026utm_content=badge)\n\n*   [utils4s](#id1)\n    *   [scala语法学习](#id2)\n    *   [common库](#id21)\n    *   [BigData库](#id22)\n        *   [Spark](#id221)\n            *   [Spark core](#id2211)\n            *   [Spark Streaming](#id2212)\n            *   [Spark SQL](#id2213)\n            *   [Spark 机器学习](#id2213)\n            *   [Spark Zeppelin](#id2214)\n            *   [Spark 其它](#id2215)\n        *   [ES](#id222)\n    *   [贡献代码步骤](#id23)\n    *   [贡献者](#id24)\n\n**Issues 中包含我们平时阅读的关于scala、spark好的文章，欢迎推荐**\n\nutils4s包含各种scala通用、好玩的工具库demo和使用文档，通过简单的代码演示和操作文档，各种库信手拈来。\n\n**同时欢迎大家贡献各种好玩的、经常使用的工具库。**\n\n[开源中国地址](http://git.oschina.net/jack.su/utils4s)\n\nQQ交流群 `432290475（已满），请加530066027` \u003ca target=\"_blank\" href=\"http://shang.qq.com/wpa/qunwpa?idkey=cea04391fbc7ae1c55b78476174b518a0ba3fdc040cd6abeadd1e85826c51f3b\"\u003e\u003cimg border=\"0\" src=\"http://pub.idqqimg.com/wpa/images/group.png\" alt=\"Scala Spark\" title=\"Scala Spark\"\u003e\u003c/a\u003e 或者点击上面gitter图标也可以参与讨论\n\n[作者博客专注大数据、分布式系统、机器学习，欢迎交流](http://www.jianshu.com/users/92a1227beb27/latest_articles)\n\n微博：[**jacksu_**](http://weibo.com/jack4s)\n\n\u003ch2 id=\"id2\"\u003escala语法学习\u003c/h2\u003e\n\n说明：scala语法学习过程中，用例代码都放在scala-demo模块下。\n\n[利用IntelliJ IDEA与Maven开始你的Scala之旅](https://www.jianshu.com/p/ecc6eb298b8f)\n\n[快学scala电子书](http://vdisk.weibo.com/s/C7NmUN3g8gH46)(推荐入门级书)\n\n[scala理解的比较深](http://hongjiang.info/scala/)\n\n[scala99问题](http://aperiodic.net/phil/scala/s-99/)\n\n[scala初学者指南](https://windor.gitbooks.io/beginners-guide-to-scala/content/introduction.html)(这可不是初学者可以理解的欧，还是写过一些程序后再看)\n\n[scala初学者指南英文版](http://danielwestheide.com/scala/neophytes.html)\n\n[scala学习用例](scala-demo)\n\n[scala入门笔记](http://blog.djstudy.net/2016/01/24/scala-rumen-biji/)\n\n[Databricks风格](https://github.com/databricks/scala-style-guide)\n\n[scala/java 通过maven编译（Mixed Java/Scala Projects）](http://davidb.github.io/scala-maven-plugin/example_java.html)\n\n\u003ch2 id=\"id21\"\u003ecommon库\u003c/h2\u003e\n\n[日志操作](log-demo)（[log4s](https://github.com/Log4s/log4s)）\n\n[单元测试](unittest-demo)（[scalatest](http://www.scalatest.org)）\n\n[日期操作](lamma-demo)（[lama](http://www.lamma.io/doc/quick_start)）（注:只支持日期操作，不支持时间操作）\n\n[日期时间操作](nscala-time-demo)（[nscala-time](https://github.com/nscala-time/nscala-time)）（注：没有每月多少天，每月最后一天，以及每年多少天）\n\n[json解析](json4s-demo)（[json4s](https://github.com/json4s/json4s)）\n\n[resources下文件加载用例](resources-demo)\n\n[文件操作](file-demo)（[better-files](https://github.com/pathikrit/better-files)）\n\n[单位换算](analysis-demo)（[squants](https://github.com/garyKeorkunian/squants)）\n\n[线性代数和向量计算](breeze-demo)([breeze](https://github.com/scalanlp/breeze))\n\n[分布式并行实现库akka](akka-demo)([akka](http://akka.io))\n\n[Twitter工具库](twitter-util-demo)（[twitter util](https://github.com/twitter/util)）\n\n[日常脚本工具](manger-tools)\n\n\u003ch2 id=\"id22\"\u003eBigData库\u003c/h2\u003e\n\n\u003ch3 id=\"id221\"\u003eSpark\u003c/h3\u003e\n\n\u003ch4 id=\"id2211\"\u003eSpark core\u003c/h4\u003e\n[spark远程调试源代码](http://hadoop1989.com/2016/02/01/Spark-Remote-Debug/)\n\n[spark介绍](http://litaotao.github.io/introduction-to-spark)\n\n[一个不错的spark学习互动课程](http://www.hubwiz.com/class/5449c691e564e50960f1b7a9)\n\n[spark 设计与实现](http://spark-internals.books.yourtion.com/index.html)\n\n[aliyun-spark-deploy-tool](https://github.com/aliyun/aliyun-spark-deploy-tool)---Spark on ECS\n\u003ch4 id=\"id2212\"\u003eSpark Streaming\u003c/h4\u003e\n\n[Spark Streaming使用Kafka保证数据零丢失](spark-knowledge/md/spark_streaming使用Kafka保证数据零丢失.md)\n\n[spark streaming测试用例](sparkstreaming-demo)\n\n[spark streaming源码解析](https://github.com/proflin/CoolplaySpark)\n\n[基于spark streaming的聚合分析(Sparkta)](https://github.com/Stratio/Sparkta)\n\n\u003ch4 id=\"id2213\"\u003eSpark SQL\u003c/h4\u003e\n\n[spark DataFrame测试用例](spark-dataframe-demo)\n\n[Hive Json加载](hive-json-demo)\n\n[SparkSQL架构设计和代码分析](https://github.com/marsishandsome/SparkSQL-Internal)\n\n\u003ch4 id=\"id2213\"\u003eSpark 机器学习\u003c/h4\u003e\n\n[spark机器学习源码解析](https://github.com/endymecy/spark-ml-source-analysis)\n\n[KeyStoneML](http://keystone-ml.org)\nKeystoneML is a software framework, written in Scala, from the UC Berkeley AMPLab designed to simplify the construction of large scale, end-to-end, machine learning pipelines with Apache Spark.\n\n[spark TS](spark-timeseries-demo)\n\n\u003ch4 id=\"id2214\"\u003eSpark zeppelin\u003c/h4\u003e\n\n[**Z-Manager**](https://github.com/NFLabs/z-manager)--Simplify getting Zeppelin up and running\n\n[**zeppelin**](https://github.com/apache/incubator-zeppelin)--a web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more.\n\n[**helium**](http://s.apache.org/helium)--Brings Zeppelin to data analytics application platform\n\n\u003ch4 id=\"id2215\"\u003eSpark 其它\u003c/h4\u003e\n\n[spark专题在简书](http://www.jianshu.com/collection/6157554bfdd9)\n\n[databricks spark知识库](https://aiyanbo.gitbooks.io/databricks-spark-knowledge-base-zh-cn/content/)\n\n[spark学习知识总结](spark-knowledge)\n\n[Spark library for doing exploratory data analysis in a scalable way](https://github.com/vicpara/exploratory-data-analysis/)\n\n[图处理(cassovary)](https://github.com/twitter/cassovary)\n\n[基于spark进行地理位置分析(gagellan)](https://github.com/harsha2010/magellan)\n\n[spark summit east 2016 ppt](http://vdisk.weibo.com/s/BP8uNBea_C2Af?from=page_100505_profile\u0026wvr=6)\n\n\u003ch3 id=\"id222\"\u003eES\u003c/h3\u003e\n\n[ES 非阻塞scala客户端](https://github.com/sksamuel/elastic4s)\n\n\u003ch3 id=\"id223\"\u003eBeam\u003c/h3\u003e\n[Apache Beam：下一代的数据处理标准](http://geek.csdn.net/news/detail/134167)\n\u003ch2 id=\"id23\"\u003e贡献代码步骤\u003c/h2\u003e\n1. 首先 fork 我的项目\n2. 把 fork 过去的项目也就是你的项目 clone 到你的本地\n3. 运行 git remote add jacksu git@github.com:jacksu/utils4s.git 把我的库添加为远端库\n4. 运行 git pull jacksu master 拉取并合并到本地\n5. coding\n6. commit后push到自己的库( git push origin master )\n7. 登陆Github在你首页可以看到一个 pull request 按钮,点击它,填写一些说明信息,然后提交即可。\n1~3是初始化操作,执行一次即可。在coding前必须执行第4步同步我的库(这样避免冲突),然后执行5~7既可。\n\n\u003ch2 id=\"id24\"\u003e贡献者\u003c/h2\u003e\n[jjcipher](https://github.com/jjcipher)\n\n","funding_links":[],"categories":["Tutorials and courses","Learning Scala","Scala"],"sub_categories":["Devops","Misc."],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjacksu%2Futils4s","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjacksu%2Futils4s","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjacksu%2Futils4s/lists"}