{"id":27786188,"url":"https://github.com/angel-ml/papers-we-learn","last_synced_at":"2026-03-08T08:35:37.275Z","repository":{"id":43081853,"uuid":"273132652","full_name":"Angel-ML/papers-we-learn","owner":"Angel-ML","description":"Papers from the computer science and implemented by angel","archived":false,"fork":false,"pushed_at":"2020-09-03T07:59:04.000Z","size":43888,"stargazers_count":26,"open_issues_count":0,"forks_count":6,"subscribers_count":6,"default_branch":"master","last_synced_at":"2023-03-03T03:36:25.881Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Angel-ML.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-06-18T03:26:58.000Z","updated_at":"2023-01-30T19:22:36.000Z","dependencies_parsed_at":"2022-09-10T21:30:54.963Z","dependency_job_id":null,"html_url":"https://github.com/Angel-ML/papers-we-learn","commit_stats":null,"previous_names":[],"tags_count":null,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Angel-ML%2Fpapers-we-learn","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Angel-ML%2Fpapers-we-learn/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Angel-ML%2Fpapers-we-learn/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Angel-ML%2Fpapers-we-learn/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Angel-ML","download_url":"https://codeload.github.com/Angel-ML/papers-we-learn/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":251737482,"owners_count":21635605,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-04-30T15:57:43.018Z","updated_at":"2026-03-08T08:35:32.254Z","avatar_url":"https://github.com/Angel-ML.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# papers-we-learn\n旨在归纳Angel高性能分布式机器学习与图计算平台实践过程中所借鉴的计算机领域论文，理论与实践相结合\n## distributed systems\n收录angel实现分布式参数服务器、以及机器学习、图计算框架时借鉴的论文\n1. **Resilient Distributed Datasets A Fault-Tolerant Abstraction for In-Memory Cluster Computing(Spark)**.  [paper](https://www.usenix.org/system/files/conference/nsdi12/nsdi12-final138.pdf)\n2. **Parameter Server for Distributed Machine Learning(Parameter Server)**. [paper](https://pdfs.semanticscholar.org/30e9/4e24d67994c5a8e2f20f852a51d28a720de2.pdf)\n3. **Angel a new large-scale machine learning system(Angel).** [paper](https://watermark.silverchair.com/nwx018.pdf?token=AQECAHi208BE49Ooan9kkhW_Ercy7Dm3ZL_9Cf3qfKAc485ysgAAApgwggKUBgkqhkiG9w0BBwagggKFMIICgQIBADCCAnoGCSqGSIb3DQEHATAeBglghkgBZQMEAS4wEQQM7WLJn4lqzHtLysYMAgEQgIICS1NYL7L5PcMcKzXEss8zpE5bJGQM8Lp_eK177hnLQYZj8Po6_25C0NaYCld_YGmKREgMhLy498bUT5vRKrUncFOkkVFmfLIwxZ-vUg0Czgo63lZWZYgjHBe8jt0zDKVlvWt4lIgP0DDqJvmXjzqRXS3uw8CPHPevqXpFpBwqw5UU-Txky0YxP_VWRuF3qpXMSqqk8nfiTCracFXuwNO2VSOibKcjGUTA-iiI2jzwd2VbKZij1pl9FLBdvtf-cay203nXfHdMbks2h9aETkKGNinch2B46oBifczW8aw4-OVsKTS2SSgt0FqLFDPr-_lzEEYKSXhkSFkQPSpRemrAuTQr9cOPWjk5WK9tMP862Ld19d7hDy6Q7_mawu0SKkjiS6pfXQ6EpJlck97BwibHUYdlDGsSWwi7mMcsPIxxiGNFU7SoDuHTXDyiOSskg6HezlGwyZ5ZC_BMWrzW3auluw5SrL_o87NRhkSSnDJ5W5jszzvjjTbGLDGWjsusHieXOzVkJUiSmUF3kkr8wz_DV2fpjffdRgsxBkeSMPEhpnmYIaP-uB9CRnbhqFTy7FzscG0GgrgvhGI8CEd255b0HyYLLrdhUYQ-2zZnP0QokWtoVadt8L1VLRQFqut76VlaId8s8mDwqnONMYaDdXjxyRcg1nIW7hH3eAI_eYfEDF2xipmuMrv8uo4eY_aiiAhJYGg7O9oesspWGS9dVkP0TuujwnRweNo3CKyGarux1re9ysLu105TNUFM3NfCDCal5My2m6cPfvUnYrEZ)\n4. **Pregel A System for Large-scale Graph Processing(Pregel).** [paper](https://kowshik.github.io/JPregel/pregel_paper.pdf)\n5. **Distributed GraphLab A Framework for Machine Learning and Data Mining in the cloud(GraphLab).** [paper](http://vldb.org/pvldb/vol5/p716_yuchenglow_vldb2012.pdf)\n6. **PowerGraph Distributed Graph-Parallel Computation on Natural Graphs(PowerGraph).** [paper](https://www.usenix.org/system/files/conference/osdi12/osdi12-final-167.pdf)\n7. **Graphx Unifying Data-Parallel and Graph-Parallel Analytics(Graphx).** [paper](https://endymecy.gitbooks.io/spark-graphx-source-analysis/content/docs/graphx.pdf)\n8. **PSGraph How Tencent trains extremely large-scale graphs with Spark(PSGraph).** [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=\u0026arnumber=9101887)\n\n## graph mining\n收录angel实现的传统图算法\n1. **The PageRank Citation Ranking Bringing Order to the Web(Pagerank).** [paper](http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=6B71249025955741845EB201BE407DD2?doi=10.1.1.38.5427\u0026rep=rep1\u0026type=pdf)\n2. **The H-index of a network node and its relation to degree and coreness(kcore hindex).** [paper](https://scinapse.io/papers/2261717057)\n3. **HyperAnf Approximating the Neighbourhood Function of Very Large Graphs on a Budget(HyperAnf).** [paper](https://arxiv.org/pdf/1011.5599.pdf)\n4. **Centralities in Large Networks Algorithms and Observations(Closeness).** [paper](https://www.cs.cmu.edu/~ukang/papers/CentralitySDM2011.pdf)\n\n## graph embedding \n收录angel实现的图表示学习相关论文\n1. **DeepWalk Online Learning of Social Representations(DeepWalk).** [paper](https://arxiv.org/pdf/1403.6652.pdf)\n2. **LINE Large-scale Information Network Embedding(LINE).** [paper](https://arxiv.org/pdf/1503.03578.pdf)\n3. **Metapath2Vec Scalable Representation Learning for Heterogeneous Networks(Metapath2Vec).** [paper](https://ericdongyx.github.io/papers/KDD17-dong-chawla-swami-metapath2vec.pdf)\n\n## graph neural network\n收录angel已实现或即将实现的图神经网络相关论文\n1. **Graph Convolutional Neural Networks for Web-Scale Recommender Systems(Pinsage).** [paper](https://arxiv.org/pdf/1806.01973.pdf)\n2. **How Powerful Are Graph Neural Networks(GNN and WL Test).** [paper](https://arxiv.org/pdf/1810.00826.pdf)\n3. **Inductive Representation Learning On Large Graphs(Graphsage).** [paper](https://arxiv.org/pdf/1706.02216.pdf)\n4. **Semi-Supervised Classification With Graph Convolutional Networks(GCN)**. [paper](https://arxiv.org/pdf/1609.02907.pdf)\n5. **DeepTrax Embedding Graphs of Financial Transactions(Financial)**.  [paper](https://arxiv.org/pdf/1907.07225.pdf)\n6. **Graph Neural Networks A Review of Methods and Applications(GNN综述)**. arxiv 2019  [paper](https://arxiv.org/pdf/1812.08434.pdf)\n\n\n## machine learning\n收录angel已实现或即将实现的机器学习算法论文\n1. **Ad Click Prediction ~ a View from the Trenches(FTRL).** [paper](https://static.googleusercontent.com/media/research.google.com/zh-CN//pubs/archive/41159.pdf)\n2. **An Introduction to Logistic Regression Analysis and Reporting.** [paper](https://datajobs.com/data-science-repo/Logistic-Regression-[Peng-et-al].pdf)\n3. **Attentional Factorization Machines ~ Learning the Weighted of Feature Interactions via Attention Networks(AttentionFM).** [paper](https://www.ijcai.org/Proceedings/2017/0435.pdf)\n4. **Factorization Machines(FM).** [paper](https://www.csie.ntu.edu.tw/~b97053/paper/Rendle2010FM.pdf)\n5. **DeepFM A Factorization-Machine based Neural Network for CTR Prediction(DeepFM).** [paper](https://www.ijcai.org/Proceedings/2017/0239.pdf)\n6. **xDeepFM~ Combining Explicit and Implicit Feature Interactions for Recommender Systems(xDeepFM).** [paper](https://arxiv.org/pdf/1803.05170.pdf)\n7. **LDA ~ A Robust and Large-scale Topic Modeling System(LDA).** [paper](http://www.vldb.org/pvldb/vol10/p1406-yu.pdf)\n8. **Product-based Neural Networks for User Response Prediction(PNN).** [paper](https://arxiv.org/pdf/1611.00144.pdf)\n9. **Space-Efficient Online Computation of Quantile Summaries(Quantile Summaries).** [paper](http://infolab.stanford.edu/~datar/courses/cs361a/papers/quantiles.pdf)\n10. **Web-Scale K-Means Clustering(Kmeans).** [paper](https://www.eecs.tufts.edu/~dsculley/papers/fastkmeans.pdf)\n11. **Wide \u0026 Deep Learning for Recommender Systems(DeepAndWide).** [paper](https://arxiv.org/pdf/1606.07792.pdf)\n12. **XGBoost ~ A Scalable Tree Boosting System(XGBoost).** [paper](https://arxiv.org/pdf/1603.02754.pdf)\n13. **Deep \u0026 Cross Network for Ad Click Predictions(DCN).**[paper](https://arxiv.org/pdf/1708.05123.pdf)\n\n## nlp\n\n自然语言处理相关的算法论文\n\n1. **Distributed Representations of Words and Phrases and their Compositionality (Word2Vec).** [paper](https://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fangel-ml%2Fpapers-we-learn","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fangel-ml%2Fpapers-we-learn","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fangel-ml%2Fpapers-we-learn/lists"}