{"id":23130098,"url":"https://github.com/reycn/data-analytics-in-julia","last_synced_at":"2025-05-07T09:44:35.622Z","repository":{"id":267579990,"uuid":"901647190","full_name":"reycn/data-analytics-in-julia","owner":"reycn","description":"Notebooks for data analysis in social science using Julia, replicating frequent analytical steps in Python \u0026 R.","archived":false,"fork":false,"pushed_at":"2025-04-11T07:14:02.000Z","size":5145,"stargazers_count":9,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-11T07:49:59.559Z","etag":null,"topics":["data","data-analysis","data-science","data-visualization","julia"],"latest_commit_sha":null,"homepage":"https://data-julia.rongxin.me","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/reycn.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-12-11T03:50:13.000Z","updated_at":"2025-04-11T07:14:06.000Z","dependencies_parsed_at":"2024-12-11T07:22:32.032Z","dependency_job_id":"4766c91d-72f4-4c33-9d23-e658c8765e0e","html_url":"https://github.com/reycn/data-analytics-in-julia","commit_stats":null,"previous_names":["reycn/data-analytics-in-julia"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reycn%2Fdata-analytics-in-julia","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reycn%2Fdata-analytics-in-julia/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reycn%2Fdata-analytics-in-julia/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reycn%2Fdata-analytics-in-julia/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/reycn","download_url":"https://codeload.github.com/reycn/data-analytics-in-julia/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252853305,"owners_count":21814505,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","data-analysis","data-science","data-visualization","julia"],"created_at":"2024-12-17T10:12:37.839Z","updated_at":"2025-05-07T09:44:35.617Z","avatar_url":"https://github.com/reycn.png","language":"Jupyter Notebook","readme":"# Data Analytics in Julia\n- **[🔗 Read the book online](https://data-julia.rongxin.me)**\n- By [**Rongxin Ouyang**](https://rongxin.me/cv), PhD student in Computational Communication, NUS\n\n![](./image/cover.png)  \n![](../image/cover.png)  \n*(Generated by GPT-4o)*\n\n## Scope\nThis short book provides a practical guide for data analysis in social science using Julia. It replicates common analytical steps in the field.\n\nBecause of its speed.\n\n## Outline\n\n- [✅ Chapter 1. Installation](https://data-julia.rongxin.me/data-analysis-in-julia/1.installation.basics.jl)\n  - ✅ Why do we need Julia\n  - ✅ How to install Julia\n  - ✅ How to install Julia as a Jupyter kernal for notebooks\n  - ✅ The basics of operations, data structures, packages, methods, and define functions\n- [✅ Chapter 2. Data Loading and Selection](https://data-julia.rongxin.me/data-analysis-in-julia/2.data.loading.selection.jl)\n  - ✅ Load a dataframe from a local file, an online link, and a common datasets; or create it from scratch\n  - ✅ Select by rows, columns, and conditions.\n- [✅ Chapter 3. Transformation and calculation](https://data-julia.rongxin.me/data-analysis-in-julia/3.transform.calculate.jl)\n  - ✅ Split and combine\n  - ✅ Grouping\n  - ✅ Sorting\n  - ✅ Transforming between long / wide tables\n  - ✅ Find / fill / drop missing values\n- [✅ Chapter 4. Pipeline and Useful Packages](https://data-julia.rongxin.me/data-analysis-in-julia/4.pipeline.tools.jl)\n  - ✅ Data pipeline\n  - ✅ Manipulate strings\n  - ✅ Network\n- [✅ Chapter 5.1 Models and Tests](https://data-julia.rongxin.me/data-analysis-in-julia/5.1.models.jl)\n   1. ✅ Common parametric tests (t-tests and ANOVA)\n   2. ✅ Regression (multi-variate regression and fixed effects models)\n   3. ✅ Path Analysis\n      1. ✅ Mediation\n      2. ✅ Moderation\n      3. ✅ Conditional Path Analysis\n- [✅ Chapter 5.2 Models and Tests (continued)](https://data-julia.rongxin.me/data-analysis-in-julia/5.2.models.jl)\n  \n   1. 🚧 / ✅ Counterfactual Framework\n      1. 🚧 / ✅ Instrumental Variables\n      2. 🚧 / ✅ Regression Discontinuity Design\n      3. 🚧 / ✅ Difference-in-Difference\n      4. 🚧 / ✅ Synthetic Control\n      5. 🚧 / ✅ Synthetic Difference-in-Difference\n- [✅ Chapter 6. Visualization](https://data-julia.rongxin.me/data-analysis-in-julia/6.visualize.jl) (ggplot2-like)\n  - ✅ Scatterplot, barplot, lineplot, and histogram\n  - ✅ Styles and themes\n  - ✅ Multiple-plots in facets\n\n- [✅ Chapter 7. Using R and Python in Julia](https://data-julia.rongxin.me/data-analysis-in-julia/7.r.and.python.in.julia.jl)\n  - ✅ Using R functions and R code blocks in Julia\n  - ✅ Using Python functions and Python code blocks in Julia\n\n- [✅ Chapter 8. Performance Optimization](https://data-julia.rongxin.me/8.performance.jl)\n  - ✅ Tips for increasing the speed\n  - ✅ Profiling tool and visualization\n\n- [✅ Appendix. Codes for plotting](https://data-julia.rongxin.me/8.plot.and.notebooks)\n  - ✅ All codes used for plotting\n\n\n## License\nThis work is licensed under a\n[Creative Commons Attribution-NonCommercial 4.0 International License][cc-by-nc].\n\n[![CC BY-NC 4.0][cc-by-nc-image]][cc-by-nc]\n\n[cc-by-nc]: https://creativecommons.org/licenses/by-nc/4.0/\n[cc-by-nc-image]: https://licensebuttons.net/l/by-nc/4.0/88x31.png\n[cc-by-nc-shield]: https://img.shields.io/badge/License-CC%20BY--NC%204.0-lightgrey.svg\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Freycn%2Fdata-analytics-in-julia","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Freycn%2Fdata-analytics-in-julia","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Freycn%2Fdata-analytics-in-julia/lists"}