https://github.com/ashishps1/awesome-engineering-articles

A curated collection of 300+ engineering blog articles from top tech companies. Learn how the best engineering teams solve real-world problems at scale.
List: awesome-engineering-articles
ai data-engineering database distributed-systems engineering infrastructure ml system-design
Last synced: 4 months ago
Host: GitHub
URL: https://github.com/ashishps1/awesome-engineering-articles
Owner: ashishps1
License: mit
Created: 2026-02-15T09:31:32.000Z (4 months ago)
Default Branch: main
Last Pushed: 2026-02-15T09:35:39.000Z (4 months ago)
Last Synced: 2026-02-15T15:42:10.888Z (4 months ago)
Topics: ai, data-engineering, database, distributed-systems, engineering, infrastructure, ml, system-design
Homepage: https://algomaster.io
Size: 42 KB
Stars: 8
Watchers: 0
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README

# Awesome Engineering Articles

> A curated collection of 300+ engineering blog articles from top tech companies. Learn how the best engineering teams solve real-world problems at scale.

---

## Table of Contents

[Airbnb](#airbnb) · [Amazon Science](#amazon-science) · [Atlassian](#atlassian) · [Booking.com](#bookingcom) · [Canva](#canva) · [Coinbase](#coinbase) · [Discord](#discord) · [DoorDash](#doordash) · [Dropbox](#dropbox) · [eBay](#ebay) · [Facebook/Meta](#facebookmeta) · [Figma](#figma) · [GitHub](#github) · [Google Research](#google-research) · [Instagram](#instagram) · [Instacart](#instacart) · [LinkedIn](#linkedin) · [Lyft](#lyft) · [Netflix](#netflix) · [Notion](#notion) · [PayPal](#paypal) · [Pinterest](#pinterest) · [Quora](#quora) · [Reddit](#reddit) · [Salesforce](#salesforce) · [Shopify](#shopify) · [Slack](#slack) · [Snap](#snap) · [Spotify](#spotify) · [Stripe](#stripe) · [Tinder](#tinder) · [Twitch](#twitch) · [Twitter/X](#twitterx) · [Uber](#uber) · [Walmart](#walmart)

---

## Airbnb

> [Airbnb Engineering Blog](https://medium.com/airbnb-engineering)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Embedding-Based Retrieval for Airbnb Search](https://medium.com/airbnb-engineering/embedding-based-retrieval-for-airbnb-search-aabebfc85839) | Search, AI/ML | 2025 |
| 2 | [How Airbnb improved page performance using HTTP Streaming](https://medium.com/airbnb-engineering/improving-performance-with-http-streaming-ba9e72c66408) | Performance, Frontend | 2023 |
| 3 | [Airbnb's Data Framework for faster and more reliable read-heavy workloads](https://medium.com/airbnb-engineering/riverbed-optimizing-data-access-at-airbnbs-scale-c37ecf6456d9) | Data Engineering, Performance | 2023 |
| 4 | [Avoiding Double Payments in a Distributed Payments System](https://medium.com/airbnb-engineering/avoiding-double-payments-in-a-distributed-payments-system-2981f6b070bb) | Payments, Architecture | 2019 |

## Amazon Science

> [Amazon Science Blog](https://www.amazon.science/blog)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Training code generation models to debug their own outputs](https://www.amazon.science/blog/training-code-generation-models-to-debug-their-own-outputs) | AI/ML | 2025 |
| 2 | [The technology behind Amazon's GenAI-powered shopping assistant, Rufus](https://www.amazon.science/blog/the-technology-behind-amazons-genai-powered-shopping-assistant-rufus) | AI/ML | 2024 |
| 3 | [Ensuring that customers don't miss out on trending products](https://www.amazon.science/blog/ensuring-that-customers-dont-miss-out-on-trending-products) | Search, AI/ML | 2023 |
| 4 | [From structured search to learning-to-rank-and-retrieve](https://www.amazon.science/blog/from-structured-search-to-learning-to-rank-and-retrieve) | Search, AI/ML | 2023 |
| 5 | [Invalidating robotic ad clicks in real time](https://www.amazon.science/blog/invalidating-robotic-ad-clicks-in-real-time) | Security, Data Engineering | 2023 |
| 6 | [Using large language models (LLMs) to synthesize training data](https://www.amazon.science/blog/using-large-language-models-llms-to-synthesize-training-data) | AI/ML | 2023 |
| 7 | [Lessons learned from 10 years of DynamoDB](https://www.amazon.science/blog/lessons-learned-from-10-years-of-dynamodb) | Databases | 2022 |
| 8 | [Using graph neural networks to recommend related products](https://www.amazon.science/blog/using-graph-neural-networks-to-recommend-related-products) | AI/ML, Search | 2022 |

## Atlassian

> [Atlassian Engineering Blog](https://www.atlassian.com/blog/developer)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [How Atlassian Scaled and Enhanced Throughput in the Jira Export Service](https://www.atlassian.com/blog/atlassian-engineering/boosting-performance-how-we-scaled-and-enhanced-throughput-in-the-jira-export-service) | Performance, Architecture | 2025 |
| 2 | [How one of Atlassian's critical services consistently gets above 99.9999% of availability](https://www.atlassian.com/blog/atlassian-engineering/atlassian-critical-services-above-six-nines-of-availability) | Infrastructure | 2022 |
| 3 | [How Atlassian made Git push over HTTPS faster for Bitbucket Cloud](https://www.atlassian.com/blog/atlassian-engineering/faster-git-push-over-https-for-bitbucket-cloud) | Performance | 2022 |
| 4 | [How Atlassian Revamped Confluence Cloud Search](https://www.atlassian.com/blog/atlassian-engineering/revamping-confluence-cloud-search) | Search | 2021 |
| 5 | [Caching JQL search in Jira Cloud](https://www.atlassian.com/blog/atlassian-engineering/reducing-jql-database-load-with-caches) | Caching, Databases | 2021 |
| 6 | [Scaling, rearchitecting, and decomposing Confluence Cloud](https://www.atlassian.com/blog/atlassian-engineering/scaling-rearchitecting-and-decomposing-confluence-cloud) | Architecture | 2020 |
| 7 | [Scaling Bitbucket's Database](https://www.atlassian.com/blog/atlassian-engineering/scaling-bitbuckets-database) | Databases | 2020 |
| 8 | [Atlassian's journey scaling low latency, multi-region services on AWS](https://www.atlassian.com/blog/atlassian-engineering/aws-scaling-multi-region-low-latency-service) | Infrastructure, Performance | 2019 |

## Booking.com

> [Booking.com Tech Blog](https://blog.booking.com/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Anomaly Detection in Time Series Using Statistical Analysis](https://medium.com/booking-com-development/anomaly-detection-in-time-series-using-statistical-analysis-cc587b21d008) | Observability, Data Engineering | 2025 |
| 2 | [How Booking Cut 20% of the Cloud Cost with a Single Code Change](https://medium.com/booking-com-development/use-compression-luke-cut-20-of-the-cloud-cost-with-a-single-code-change-510d14d96891) | Cost Optimization | 2025 |
| 3 | [The Engineering Behind Booking.com's High-Performance Ranking Platform](https://medium.com/booking-com-development/the-engineering-behind-booking-coms-ranking-platform-a-system-overview-2fb222003ca6) | Search, Performance | 2024 |
| 4 | [How Booking.com Leverages Graph Technology for Real-Time Fraud Detection and Prevention](https://medium.com/booking-com-development/leverage-graph-technology-for-real-time-fraud-detection-and-prevention-438336076ea5) | Security | 2024 |
| 5 | [How Booking.com Predicts Cancellations with Survival Modeling](https://booking.ai/predicting-cancellations-with-survival-modeling-a299af54249b) | AI/ML | 2024 |

## Canva

> [Canva Engineering Blog](https://www.canva.dev/blog/engineering/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Canva's continuous data platform](https://www.canva.dev/blog/engineering/snowpipe-streaming/) | Data Engineering | 2025 |
| 2 | [How Canva's drawing tool works](https://www.canva.dev/blog/engineering/behind-the-draw/) | Frontend | 2024 |
| 3 | [How Canva collects 25 billion events per day](https://www.canva.dev/blog/engineering/product-analytics-event-collection/) | Data Engineering | 2024 |
| 4 | [Canva's scalable and reliable content usage counting service](https://www.canva.dev/blog/engineering/scaling-to-count-billions/) | Architecture, Performance | 2024 |
| 5 | [How Canva saves millions annually in Amazon S3 costs](https://www.canva.dev/blog/engineering/optimising-s3-savings/) | Cost Optimization | 2023 |
| 6 | [How Canva scaled media uploads from Zero to 50 Million per day](https://www.canva.dev/blog/engineering/from-zero-to-50-million-uploads-per-day-scaling-media-at-canva/) | Performance, Infrastructure | 2022 |
| 7 | [Canva's fast and scalable reverse image search](https://www.canva.dev/blog/engineering/simple-fast-and-scalable-reverse-image-search-using-perceptual-hashes-and-dynamodb/) | Search, Databases | 2022 |
| 8 | [How Canva enables real-time collaboration with RSocket](https://www.canva.dev/blog/engineering/enabling-real-time-collaboration-with-rsocket/) | Messaging, Architecture | 2021 |

## Coinbase

> [Coinbase Engineering Blog](https://www.coinbase.com/blog/landing/engineering)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [How Coinbase Optimizes Network Requests](https://www.coinbase.com/blog/boosting-app-performance-strategies-to-optimize-network-requests) | Performance | 2024 |
| 2 | [Accelerating Deep Learning Adoption at Coinbase](https://www.coinbase.com/blog/accelerating-deep-learning-adoption-at-coinbase) | AI/ML | 2024 |
| 3 | [Lessons from launching Enterprise-grade GenAI solutions at Coinbase](https://www.coinbase.com/blog/lessons-from-launching-enterprise-grade-genAI-solutions-at-Coinbase) | AI/ML | 2024 |
| 4 | [How Coinbase Uses ML to Predict Traffic and Scale Databases](https://www.coinbase.com/blog/how-coinbase-is-using-machine-learning-to-predict) | AI/ML, Databases | 2024 |
| 5 | [Detecting Fraudulent Transactions at Coinbase](https://www.coinbase.com/blog/detecting-fraudulent-transactions-coinbase-scalable-blockchain-address-risk) | Security | 2023 |
| 6 | [Building a notification platform at Coinbase](https://www.coinbase.com/blog/building-a-notification-platform-at-coinbase) | Messaging, Architecture | 2022 |

## Discord

> [Discord Engineering Blog](https://blog.discord.com/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [How Discord Reduced Websocket Traffic by 40%](https://discord.com/blog/how-discord-reduced-websocket-traffic-by-40-percent) | Performance, Messaging | 2024 |
| 2 | [How Discord Stores Trillions of Messages](https://discord.com/blog/how-discord-stores-trillions-of-messages) | Databases | 2023 |
| 3 | [Pushing Discord's Limits with a Million+ Online Users in a Single Server](https://discord.com/blog/maxjourney-pushing-discords-limits-with-a-million-plus-online-users-in-a-single-server) | Performance, Infrastructure | 2023 |
| 4 | [How Discord uses ML to Build a Delightful Notification Experience](https://discord.com/blog/building-delightful-notifications-using-ml) | AI/ML, Messaging | 2022 |
| 5 | [How Discord Creates Insights from Trillions of Data Points](https://discord.com/blog/how-discord-creates-insights-from-trillions-of-data-points) | Data Engineering | 2021 |

## DoorDash

> [DoorDash Engineering Blog](https://careersatdoordash.com/engineering-blog/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [How DoorDash Uses LLMs to transcribe restaurant menu photos](https://careersatdoordash.com/blog/doordash-llm-transcribe-menu/) | AI/ML | 2025 |
| 2 | [How DoorDash leverages LLMs for better search retrieval](https://careersatdoordash.com/blog/how-doordash-leverages-llms-for-better-search-retrieval/) | AI/ML, Search | 2024 |
| 3 | [Building DoorDash's product knowledge graph with large language models](https://careersatdoordash.com/blog/building-doordashs-product-knowledge-graph-with-large-language-models/) | AI/ML, Data Engineering | 2024 |
| 4 | [DoorDash's in-house search engine](https://careersatdoordash.com/blog/introducing-doordashs-in-house-search-engine/) | Search | 2024 |
| 5 | [DoorDash's write-heavy scalable and reliable inventory platform](https://careersatdoordash.com/blog/how-doordash-designed-a-successful-write-heavy-scalable-and-reliable-inventory-platform/) | Architecture, Databases | 2023 |
| 6 | [DoorDash's scalable real-time event processing with Kafka and Flink](https://careersatdoordash.com/blog/building-scalable-real-time-event-processing-with-kafka-and-flink/) | Data Engineering | 2022 |
| 7 | [DoorDash's Lessons on Improving Performance on High-Traffic Web Pages](https://careersatdoordash.com/blog/doordashs-lessons-on-improving-performance-on-high-traffic-web-pages/) | Performance, Frontend | 2022 |
| 8 | [How DoorDash Applied Client-Side Caching to Improve Feature Store Performance by 70%](https://careersatdoordash.com/blog/how-we-applied-client-side-caching/) | Caching, AI/ML | 2022 |
| 9 | [Building a Unified Chat Experience at DoorDash](https://careersatdoordash.com/blog/building-a-unified-chat-experience-at-doordash/) | Messaging | 2022 |

## Dropbox

> [Dropbox Tech Blog](https://dropbox.tech/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [How Dropbox evolved its infrastructure through the messaging system model](https://dropbox.tech/infrastructure/infrastructure-messaging-system-model-async-platform-evolution) | Architecture, Messaging | 2025 |
| 2 | [Dropbox's scalable, consistent, metadata caching solution](https://dropbox.tech/infrastructure/meet-chrono-our-scalable-consistent-metadata-caching-solution) | Caching | 2024 |
| 3 | [Bringing AI-powered answers and summaries to file previews on the web](https://dropbox.tech/machine-learning/bringing-ai-powered-answers-and-summaries-to-file-previews-on-the-web) | AI/ML | 2024 |
| 4 | [Dropbox's ML-powered file organization](https://dropbox.tech/machine-learning/smart-move-ml-ai-file-organization-automation) | AI/ML | 2023 |
| 5 | [How Dropbox uses ML to identify date formats in file names](https://dropbox.tech/machine-learning/using-ml-to-identify-date-formats-in-file-names) | AI/ML | 2023 |
| 6 | [How Dropbox optimizes payments with machine learning](https://dropbox.tech/machine-learning/optimizing-payments-with-machine-learning) | AI/ML, Payments | 2021 |

## eBay

> [eBay Tech Blog](https://innovation.ebayinc.com/tech/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [How eBay Exports Billion-Scale Graphs on Transactional Graph Databases](https://innovation.ebayinc.com/stories/how-we-export-billion-scale-graphs-on-transactional-graph-databases/) | Databases | 2023 |
| 2 | [eBay's Personalized User-Based Ranking Model for Recommendations](https://innovation.ebayinc.com/stories/evolving-recommendations-a-personalized-user-based-ranking-model/) | AI/ML, Search | 2023 |
| 3 | [How Multimodal Embeddings Elevate eBay's Product Recommendations](https://innovation.ebayinc.com/stories/beyond-words-how-multimodal-embeddings-elevate-ebays-product-recommendations/) | AI/ML, Search | 2023 |
| 4 | [eBay's Blazingly Fast Billion-Scale Vector Similarity Engine](https://innovation.ebayinc.com/stories/ebays-blazingly-fast-billion-scale-vector-similarity-engine/) | Search, Performance | 2023 |
| 5 | [How eBay Handles Real-Time Push Notifications at Scale](https://innovation.ebayinc.com/stories/ebays-notification-streaming-platform-how-ebay-handles-real-time-push-notifications-at-scale/) | Messaging | 2022 |
| 6 | [Building a Deep Learning Based Retrieval System for Personalized Recommendations](https://innovation.ebayinc.com/stories/building-a-deep-learning-based-retrieval-system-for-personalized-recommendations/) | AI/ML, Search | 2022 |
| 7 | [How eBay Loads and Updates Over Ten-Billion-Vertex Graphs](https://innovation.ebayinc.com/stories/graphload-a-framework-to-load-and-update-over-ten-billion-vertex-graphs-with-performance-and-consistency/) | Databases, Performance | 2021 |
| 8 | [eBay's real-time and performant index service for its large-scale, in-house database platform](https://innovation.ebayinc.com/stories/ebays-global-secondary-indexes/) | Databases | 2021 |

## Facebook/Meta

> [Meta Engineering Blog](https://engineering.fb.com/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Indexing code at scale with Glean - Meta's open source system](https://engineering.fb.com/2024/12/19/developer-tools/glean-open-source-code-indexing/) | Search, Infrastructure | 2024 |
| 2 | [Inside Facebook's video delivery system](https://engineering.fb.com/2024/12/10/video-engineering/inside-facebooks-video-delivery-system/) | Video/Media | 2024 |
| 3 | [Meta's Sequence learning Model for personalized ads recommendations](https://engineering.fb.com/2024/11/19/data-infrastructure/sequence-learning-personalized-ads-recommendations/) | AI/ML | 2024 |
| 4 | [How Meta animates AI-generated images at scale](https://engineering.fb.com/2024/08/14/production-engineering/how-meta-animates-ai-generated-images-at-scale/) | AI/ML, Infrastructure | 2024 |
| 5 | [How Meta trains large language models at scale](https://engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/) | AI/ML, Infrastructure | 2024 |
| 6 | [Building Meta's GenAI Infrastructure](https://engineering.fb.com/2024/03/12/data-center-engineering/building-metas-genai-infrastructure/) | AI/ML, Infrastructure | 2024 |
| 7 | [RoCE networks for distributed AI training at scale](https://engineering.fb.com/2024/08/05/data-center-engineering/roce-network-distributed-ai-training-at-scale/) | Infrastructure, AI/ML | 2024 |
| 8 | [How Meta built the infrastructure for Threads](https://engineering.fb.com/2023/12/19/core-infra/how-meta-built-the-infrastructure-for-threads/) | Infrastructure | 2023 |
| 9 | [Building end-to-end security for Messenger](https://engineering.fb.com/2023/12/06/security/building-end-to-end-security-for-messenger/) | Security | 2023 |
| 10 | [Modernizing Meta's data platform](https://engineering.fb.com/2023/01/26/data-infrastructure/tulip-modernizing-metas-data-platform/) | Data Engineering | 2023 |
| 11 | [How Precision Time Protocol is being deployed at Meta](https://engineering.fb.com/2022/11/21/production-engineering/precision-time-protocol-at-meta/) | Infrastructure | 2022 |
| 12 | [Scaling data ingestion for machine learning training at Meta](https://engineering.fb.com/2022/09/19/ml-applications/data-ingestion-machine-learning-training-meta/) | AI/ML, Data Engineering | 2022 |
| 13 | [Meta's cloud gaming infrastructure](https://engineering.fb.com/2022/06/09/web/cloud-gaming-infrastructure/) | Infrastructure | 2022 |
| 14 | [Cache made consistent - How Meta handles cache invalidation](https://engineering.fb.com/2022/06/08/core-infra/cache-made-consistent/) | Caching | 2022 |
| 15 | [A highly available, strongly consistent storage service using chain replication](https://engineering.fb.com/2022/05/04/data-infrastructure/delta/) | Databases, Architecture | 2022 |
| 16 | [Making a distributed priority queue disaster-ready](https://engineering.fb.com/2022/01/18/production-engineering/foqs-disaster-ready/) | Architecture | 2022 |
| 17 | [How we built a general purpose key value store for Facebook with ZippyDB](https://engineering.fb.com/2021/08/06/core-infra/zippydb/) | Databases | 2021 |
| 18 | [Fully Sharded Data Parallel: faster AI training with fewer GPUs](https://engineering.fb.com/2021/07/15/open-source/fsdp/) | AI/ML, Infrastructure | 2021 |
| 19 | [How Facebook encodes your videos](https://engineering.fb.com/2021/04/05/video-engineering/how-facebook-encodes-your-videos/) | Video/Media | 2021 |
| 20 | [Scaling a distributed priority queue at Meta](https://engineering.fb.com/2021/02/22/production-engineering/foqs-scaling-a-distributed-priority-queue/) | Architecture | 2021 |
| 21 | [How machine learning powers Facebook's News Feed ranking algorithm](https://engineering.fb.com/2021/01/26/ml-applications/news-feed-ranking/) | AI/ML | 2021 |
| 22 | [How Meta scaled Live streaming for millions of viewers simultaneously](https://engineering.fb.com/2020/10/22/video-engineering/live-streaming/) | Video/Media, Performance | 2020 |

## Figma

> [Figma Engineering Blog](https://www.figma.com/blog/engineering/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [The infrastructure behind AI search in Figma](https://www.figma.com/blog/the-infrastructure-behind-ai-search-in-figma/) | Search, AI/ML | 2024 |
| 2 | [Speeding up file load times at Figma](https://www.figma.com/blog/speeding-up-file-load-times-one-page-at-a-time/) | Performance | 2024 |
| 3 | [Figma's LiveGraph: a real-time data system at scale](https://www.figma.com/blog/livegraph-real-time-data-at-scale/) | Architecture, Data Engineering | 2024 |
| 4 | [How Figma horizontally scaled Postgres to unlock nearly infinite scalability](https://www.figma.com/blog/how-figmas-databases-team-lived-to-tell-the-scale/) | Databases | 2024 |
| 5 | [How Figma improved performance and load time with incremental frame loading](https://www.figma.com/blog/incremental-frame-loading/) | Performance | 2024 |
| 6 | [How Figma reduced potential instability by scaling to multiple databases](https://www.figma.com/blog/how-figma-scaled-to-multiple-databases/) | Databases | 2023 |
| 7 | [The hidden challenges of autosave](https://www.figma.com/blog/behind-the-feature-autosave/) | Architecture | 2020 |
| 8 | [Figma's deep search to find the right files even faster](https://www.figma.com/blog/deep-search/) | Search | 2020 |

## GitHub

> [GitHub Engineering Blog](https://github.blog/engineering/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [How we improved push processing on GitHub](https://github.blog/engineering/architecture-optimization/how-we-improved-push-processing-on-github/) | Performance | 2024 |
| 2 | [How GitHub uses merge queue to ship hundreds of changes every day](https://github.blog/engineering/engineering-principles/how-github-uses-merge-queue-to-ship-hundreds-of-changes-every-day/) | Infrastructure | 2024 |
| 3 | [How GitHub Docs' new search works](https://github.blog/engineering/architecture-optimization/how-github-docs-new-search-works/) | Search | 2023 |
| 4 | [The technology behind GitHub's new code search](https://github.blog/engineering/architecture-optimization/the-technology-behind-githubs-new-code-search/) | Search | 2023 |
| 5 | [Scaling Git's garbage collection](https://github.blog/engineering/architecture-optimization/scaling-gits-garbage-collection/) | Performance | 2022 |
| 6 | [Improve Git monorepo performance with a file system monitor](https://github.blog/engineering/infrastructure/improve-git-monorepo-performance-with-a-file-system-monitor/) | Performance, Infrastructure | 2022 |
| 7 | [Partitioning GitHub's relational databases to handle scale](https://github.blog/engineering/infrastructure/partitioning-githubs-relational-databases-scale/) | Databases | 2021 |

## Google Research

> [Google Research Blog](https://research.google/blog/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Load balancing with random job arrivals](https://research.google/blog/load-balancing-with-random-job-arrivals/) | Infrastructure | 2025 |
| 2 | [Transformers in music recommendation](https://research.google/blog/transformers-in-music-recommendation/) | AI/ML, Search | 2024 |
| 3 | [Scaling multimodal understanding to long videos](https://research.google/blog/scaling-multimodal-understanding-to-long-videos/) | AI/ML, Video/Media | 2023 |
| 4 | [Answering billions of reporting queries each day with low latency](https://research.google/blog/answering-billions-of-reporting-queries-each-day-with-low-latency/) | Databases, Performance | 2023 |
| 5 | [Grammar checking at Google Search scale](https://research.google/blog/grammar-checking-at-google-search-scale/) | AI/ML, Search | 2023 |
| 6 | [World scale inverse reinforcement learning in Google Maps](https://research.google/blog/world-scale-inverse-reinforcement-learning-in-google-maps/) | AI/ML | 2023 |
| 7 | [Resolving code review comments with ML](https://research.google/blog/resolving-code-review-comments-with-ml/) | AI/ML | 2023 |

## Instagram

> [Instagram Engineering Blog](https://engineering.fb.com/tag/instagram/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Scaling the Instagram Explore recommendations system](https://engineering.fb.com/2023/08/09/ml-applications/scaling-instagram-explore-recommendations-system/) | AI/ML, Search | 2023 |
| 2 | [Reducing Instagram's basic video compute time by 94 percent](https://engineering.fb.com/2022/11/04/video-engineering/instagram-video-processing-encoding-reduction/) | Video/Media, Performance | 2022 |
| 3 | [Improving Instagram notification management with machine learning and causal inference](https://engineering.fb.com/2022/10/31/ml-applications/instagram-notification-management-machine-learning/) | AI/ML, Messaging | 2022 |
| 4 | [Building text animations for Instagram Stories](https://engineering.fb.com/2022/07/18/developer-tools/building-text-animations-for-instagram-stories/) | Frontend | 2022 |
| 5 | [Pushing the limits of compression in Facebook's mobile apps](https://engineering.fb.com/2021/09/13/core-infra/superpack/) | Mobile, Performance | 2021 |
| 6 | [How Instagram suggests new content](https://engineering.fb.com/2020/12/10/web/how-instagram-suggests-new-content/) | AI/ML, Search | 2020 |

## Instacart

> [Instacart Tech Blog](https://tech.instacart.com/)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Real-time Fraud Detection with Yoda and ClickHouse](https://tech.instacart.com/real-time-fraud-detection-with-yoda-and-clickhouse-bd08e9dbe3f4) | Security, Data Engineering | 2024 |
| 2 | [How Instacart Uses ML to Suggest Replacements for Out-of-Stock Products](https://tech.instacart.com/how-instacart-uses-machine-learning-to-suggest-replacements-for-out-of-stock-products-8f80d03bb5af) | AI/ML | 2024 |
| 3 | [Sequence models for Contextual Recommendations at Instacart](https://tech.instacart.com/sequence-models-for-contextual-recommendations-at-instacart-93414a28e70c) | AI/ML, Search | 2024 |
| 4 | [Supercharging Discovery in Search with LLMs](https://tech.instacart.com/supercharging-discovery-in-search-with-llms-556c585d4720) | AI/ML, Search | 2024 |
| 5 | [Optimizing search relevance at Instacart using hybrid retrieval](https://tech.instacart.com/optimizing-search-relevance-at-instacart-using-hybrid-retrieval-88cb579b959c) | Search | 2024 |
| 6 | [Instacart's Item Availability Architecture: Solving for scale and consistency](https://tech.instacart.com/instacarts-item-availability-architecture-solving-for-scale-and-consistency-f5661acb20a6) | Architecture | 2023 |
| 7 | [Instacart's one Deep Learning model for multiple surfaces](https://tech.instacart.com/one-model-to-serve-them-all-0eb6bf60b00d) | AI/ML | 2023 |
| 8 | [Distributed Machine Learning at Instacart](https://tech.instacart.com/distributed-machine-learning-at-instacart-4b11d7569423) | AI/ML, Infrastructure | 2023 |
| 9 | [How Instacart Uses Embeddings to Improve Search Relevance](https://tech.instacart.com/how-instacart-uses-embeddings-to-improve-search-relevance-e569839c3c36) | AI/ML, Search | 2022 |
| 10 | [The Journey to Real-Time Machine Learning at Instacart](https://tech.instacart.com/lessons-learned-the-journey-to-real-time-machine-learning-at-instacart-942f3a656af3) | AI/ML | 2022 |
| 11 | [How Instacart Uses ML-Driven Autocomplete to Help People Fill Their Carts](https://tech.instacart.com/how-instacart-uses-machine-learning-driven-autocomplete-to-help-people-fill-their-carts-9bc56d22bafb) | AI/ML, Search | 2022 |
| 12 | [How Instacart optimized its Logistics engine using ML](https://tech.instacart.com/dont-let-the-crow-guide-your-routes-f24c96daedba) | AI/ML | 2021 |
| 13 | [A simple search query correction heuristic for the resource-constrained](https://tech.instacart.com/avacado-or-avocado-4b4b78dc0698) | Search | 2020 |
| 14 | [Predicting the real-time availability of 200 million grocery items](https://tech.instacart.com/predicting-real-time-availability-of-200-million-grocery-items-in-us-canada-stores-61f43a16eafe) | AI/ML | 2018 |
| 15 | [How Instacart delivers on time](https://tech.instacart.com/how-instacart-delivers-on-time-using-quantile-regression-2383e2e03edb) | AI/ML | 2018 |

## LinkedIn

> [LinkedIn Engineering Blog](https://engineering.linkedin.com/blog)

| # | Article | Topics | Year |
|---|---------|--------|------|
| 1 | [Scalable federated learning at LinkedIn](https://www.linkedin.com/blog/engineering/machine-learning/scalable-federated-learning-at-linkedin) | AI/ML | 2025 |
| 2 | [Building a resilient DNS client for web-scale infrastructure](https://www.linkedin.com/blog/engineering/infrastructure/building-a-resilient-dns-client-for-web-scale-infrastructure) | Infrastructure | 2025 |
| 3 | [Journey of next generation control plane for data systems](https://www.linkedin.com/blog/engineering/infrastructure/journey-of-next-generation-control-plane-for-data-systems) | Infrastructure, Data Engineering | 2025 |
| 4 | [Candidate Generation in a Large Scale Graph Recommendation System](https://www.linkedin.com/blog/engineering/recommendations/candidate-generation-in-a-large-scale-graph-recommendation-system-people-you-may-know) | AI/ML, Search | 2024 |
| 5 | [Accelerating LinkedIn's My Network tab by reducing latency and improving UX](https://www.linkedin.com/blog/engineering/infrastructure/accelerating-linkedins-my-network-tab) | Performance, Frontend | 2024 |
| 6 | [Tuning Java for high-performance services](https://www.linkedin.com/blog/engineering/infrastructure/java-heap-memory-and-garbage-collection-tuning-for-high-performance-services) | Performance | 2024 |
| 7 | [LinkedIn OpenHouse for Big Data Management](https://www.linkedin.com/blog/engineering/data-management/taking-charge-of-tables--introducing-openhouse-for-big-data-mana) | Data Engineering | 2023 |
| 8 | [How LinkedIn Adopted A GraphQL Architecture for Product Development](https://www.linkedin.com/blog/engineering/architecture/how-linkedin-adopted-a-graphql-architecture-for-product-developm) | Architecture | 2023 |
| 9 | [How LinkedIn Is Using Embeddings to Up Its Match Game for Job Seekers](https://www.linkedin.com/blog/engineering/platform-platformization/using-embeddings-to-up-its-match-game-for-job-seekers) | AI/ML, Search | 2023 |
| 10 | [Building the Infrastructure for Delivering Labor Market Insights from LinkedIn Data](https://www.linkedin.com/blog/engineering/economic-graph/from-the-economic-graph-to-economic-insights-building-the-infra) | Data Engineering | 2023 |
| 11 | [Upscaling LinkedIn's Profile Datastore While Reducing Costs](https://www.linkedin.com/blog/engineering/data-management/upscaling-profile-datastore-while-reducing-costs) | Databases, Cost Optimization | 2023 |
| 12 | [Unifying Messaging Experiences across LinkedIn](https://www.linkedin.com/blog/engineering/messaging-notifications/unifying-messaging-experiences-across-linkedin) | Messaging | 2023 |
| 13 | [Applying multitask learning to AI models at LinkedIn](https://www.linkedin.com/blog/engineering/data-modeling/applying-multitask-learning-to-ai-models-at-linkedin) | AI/ML | 2022 |
| 14 | [Building a mutable dataset in data lake](https://www.linkedin.com/blog/engineering/data-management/opal-building-a-mutable-dataset-in-data-lake) | Data Engineering | 2022 |
| 15 | [Completing a member knowledge graph with Graph Neural Networks](https://www.linkedin.com/blog/engineering/knowledge/completing-a-member-knowledge-graph-with-graph-neural-networks) | AI/ML | 2021 |
| 16 | [Homepage feed multi-task learning using TensorFlow](https://www.linkedin.com/blog/engineering/feed/homepage-feed-multi-task-learning-using-tensorflow) | AI/ML | 2021 |
| 17 | [Evolving LinkedIn's analytics tech stack](https://www.linkedin.com/blog/engineering/analytics/evolving-linkedin-s-analytics-tech-stack) | Data Engineering | 2021 |
| 18 | [Scaling LinkedIn's Hadoop YARN cluster beyond 10,000 nodes](https://www.linkedin.com/blog/engineering/open-source/scaling-linkedin-s-hadoop-yarn-cluster-beyond-10-000-nodes) | Infrastructure, Data Engineering | 2021 |
| 19 | [HTTP/2 in infrastructure: Ambry network stack refactoring](https://www.linkedin.com/blog/engineering/optimization/http-2-in-infrastructure-ambry-network-stack-refactoring) | Infrastructure, Performance | 2021 |
| 20 | [Building a heterogeneous social network recommendation system](https://www.linkedin.com/blog/engineering/optimization/building-a-heterogeneous-social-network-recommendation-system) | AI/ML, Search | 2020 |

## Lyft

> [Lyft Engineering Blog](https://eng.lyft.com/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [From Big Data to Better Data: Ensuring Data Quality with Verity](https://eng.lyft.com/from-big-data-to-better-data-ensuring-data-quality-with-verity-a996b49343f6) | Data Engineering | 2023 |

| 2 | [Building Real-time Machine Learning Foundations at Lyft](https://eng.lyft.com/building-real-time-machine-learning-foundations-at-lyft-6dd99b385a4e) | AI/ML, Infrastructure | 2023 |

| 3 | [The Recommendation System at Lyft](https://eng.lyft.com/the-recommendation-system-at-lyft-67bc9dcc1793) | AI/ML, Search | 2023 |

| 4 | [lyft2vec — Embeddings at Lyft](https://eng.lyft.com/lyft2vec-embeddings-at-lyft-d4231a76d219) | AI/ML | 2023 |

| 5 | [Powering Millions of Real-Time Decisions with LyftLearn Serving](https://eng.lyft.com/powering-millions-of-real-time-decisions-with-lyftlearn-serving-9bb1f73318dc) | AI/ML, Infrastructure | 2023 |

| 6 | [Pricing at Lyft](https://eng.lyft.com/pricing-at-lyft-8a4022065f8b) | Architecture | 2022 |

| 7 | [ML Model Training Infrastructure built on Kubernetes](https://eng.lyft.com/lyftlearn-ml-model-training-infrastructure-built-on-kubernetes-aef8218842bb) | AI/ML, Infrastructure | 2021 |

| 8 | [Elasticsearch Optimizations at Lyft](https://eng.lyft.com/elasticsearch-optimizations-at-lyft-b555dc020932) | Search, Performance | 2021 |

| 9 | [How Lyft discovered OpenStreetMap is the Freshest Map for Rideshare](https://eng.lyft.com/how-lyft-discovered-openstreetmap-is-the-freshest-map-for-rideshare-a7a41bf92ec) | Data Engineering | 2021 |

| 10 | [Using Client-Side Map Data to Improve Real-Time Positioning](https://eng.lyft.com/using-client-side-map-data-to-improve-real-time-positioning-a382585ac6e) | Mobile | 2021 |

| 11 | [How Lyft predicts a rider's destination for better in-app experience](https://eng.lyft.com/how-lyft-predicts-your-destination-with-attention-791146b0a439) | AI/ML | 2020 |

| 12 | [A New Real-Time Map-Matching Algorithm at Lyft](https://eng.lyft.com/a-new-real-time-map-matching-algorithm-at-lyft-da593ab7b006) | AI/ML | 2020 |

## Netflix

> [Netflix Tech Blog](https://netflixtechblog.com/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Foundation Model for Personalized Recommendation](https://netflixtechblog.com/foundation-model-for-personalized-recommendation-1a0bd8e02d39) | AI/ML, Search | 2025 |

| 2 | [How Netflix processes billions of impressions daily](https://netflixtechblog.com/introducing-impressions-at-netflix-e2b67c88c9fb) | Data Engineering | 2025 |

| 3 | [Netflix's Distributed Counter Abstraction](https://netflixtechblog.com/netflixs-distributed-counter-abstraction-8d0c45eb66b2) | Architecture | 2024 |

| 4 | [Evolving Netflix's WebSocket proxy for the future](https://netflixtechblog.com/pushy-to-the-limit-evolving-netflixs-websocket-proxy-for-the-future-b468bc0ff658) | Messaging, Infrastructure | 2024 |

| 5 | [Netflix's Key-Value Data Abstraction Layer](https://netflixtechblog.com/introducing-netflixs-key-value-data-abstraction-layer-1ea8a0a11b30) | Databases | 2024 |

| 6 | [Netflix's TimeSeries Data Abstraction Layer](https://netflixtechblog.com/introducing-netflix-timeseries-data-abstraction-layer-31552f6326f8) | Databases | 2024 |

| 7 | [Recommending for Long-Term Member Satisfaction at Netflix](https://netflixtechblog.com/recommending-for-long-term-member-satisfaction-at-netflix-ac15cada49ef) | AI/ML | 2024 |

| 8 | [Maestro: Data/ML Workflow Orchestrator at Netflix](https://netflixtechblog.com/maestro-netflixs-workflow-orchestrator-ee13a06f9c78) | Data Engineering, AI/ML | 2024 |

| 9 | [Reverse Searching Netflix's Federated Graph](https://netflixtechblog.com/reverse-searching-netflixs-federated-graph-222ac5d23576) | Architecture, Search | 2024 |

| 10 | [Supporting Diverse ML Systems at Netflix](https://netflixtechblog.com/supporting-diverse-ml-systems-at-netflix-2d2e6b6d205d) | AI/ML, Infrastructure | 2024 |

| 11 | [Rebuilding Netflix Video Processing Pipeline with Microservices](https://netflixtechblog.com/rebuilding-netflix-video-processing-pipeline-with-microservices-4e5e6310e359) | Video/Media, Architecture | 2024 |

| 12 | [Building In-Video Search](https://netflixtechblog.com/building-in-video-search-936766f0017c) | Search, Video/Media | 2023 |

| 13 | [Streaming SQL in Data Mesh](https://netflixtechblog.com/streaming-sql-in-data-mesh-0d83f5a00d08) | Data Engineering | 2023 |

| 14 | [Migrating Netflix to GraphQL Safely](https://netflixtechblog.com/migrating-netflix-to-graphql-safely-8e1e4d4f1e72) | Architecture | 2023 |

| 15 | [Scaling Media Machine Learning at Netflix](https://netflixtechblog.com/scaling-media-machine-learning-at-netflix-f19b400243) | AI/ML, Video/Media | 2023 |

| 16 | [Building a Media Understanding Platform for ML Innovations](https://netflixtechblog.com/building-a-media-understanding-platform-for-ml-innovations-9bef9962dcb7) | AI/ML, Video/Media | 2023 |

| 17 | [Finding Cuts with Smooth Visual Transitions Using Machine Learning](https://netflixtechblog.com/match-cutting-at-netflix-finding-cuts-with-smooth-visual-transitions-31c3fc14ae59) | AI/ML, Video/Media | 2022 |

| 18 | [Machine Learning for Fraud Detection in Streaming Services](https://netflixtechblog.com/machine-learning-for-fraud-detection-in-streaming-services-b0b4ef3be3f6) | AI/ML, Security | 2022 |

| 19 | [Netflix's High-Throughput, Low-Latency Priority Queueing System](https://netflixtechblog.com/timestone-netflixs-high-throughput-low-latency-priority-queueing-system-with-built-in-support-1abf249ba95f) | Architecture, Performance | 2022 |

| 20 | [Rapid Event Notification System at Netflix](https://netflixtechblog.com/rapid-event-notification-system-at-netflix-6deb1d2b57d1) | Messaging | 2022 |

| 21 | [Building Netflix's Distributed Tracing Infrastructure](https://netflixtechblog.com/building-netflixs-distributed-tracing-infrastructure-bb856c319304) | Observability | 2020 |

## Notion

> [Notion Blog](https://www.notion.com/blog)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Building and scaling Notion's data lake](https://www.notion.com/blog/building-and-scaling-notions-data-lake) | Data Engineering | 2024 |

| 2 | [How we sped up Notion in the browser with WASM SQLite](https://www.notion.com/blog/how-we-sped-up-notion-in-the-browser-with-wasm-sqlite) | Performance, Frontend | 2024 |

| 3 | [The Great Re-shard: adding Postgres capacity (again) with zero downtime](https://www.notion.com/blog/the-great-re-shard) | Databases | 2023 |

| 4 | [Creating the Notion API](https://www.notion.com/blog/creating-the-notion-api) | Architecture | 2022 |

| 5 | [The data model behind Notion's flexibility](https://www.notion.com/blog/data-model-behind-notion) | Architecture, Databases | 2021 |

| 6 | [Lessons learned from sharding Postgres at Notion](https://www.notion.com/blog/sharding-postgres-at-notion) | Databases | 2021 |

## PayPal

> [PayPal Tech Blog](https://medium.com/paypal-tech)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Scaling PayPal's AI Capabilities with PayPal Cosmos.AI Platform](https://medium.com/paypal-tech/scaling-paypals-ai-capabilities-with-paypal-cosmos-ai-platform-e67a48e04691) | AI/ML, Infrastructure | 2024 |

| 2 | [Scaling Kafka to Support PayPal's Data Growth](https://medium.com/paypal-tech/scaling-kafka-to-support-paypals-data-growth-a0b4da420fab) | Data Engineering | 2023 |

| 3 | [JunoDB: PayPal's Key-Value Store](https://medium.com/paypal-tech/unlocking-the-power-of-junodb-paypals-key-value-store-goes-open-source-ee85f935bdc1) | Databases | 2023 |

| 4 | [Scaling Kubernetes to Over 4k Nodes and 200k Pods](https://medium.com/paypal-tech/scaling-kubernetes-to-over-4k-nodes-and-200k-pods-29988fad6ed) | Infrastructure | 2022 |

| 5 | [GraphQL at PayPal: An Adoption Story](https://medium.com/paypal-tech/graphql-at-paypal-an-adoption-story-b7e01175f2b7) | Architecture | 2021 |

| 6 | [How PayPal Uses Real-time Graph Database and Graph Analysis to Fight Fraud](https://medium.com/paypal-tech/how-paypal-uses-real-time-graph-database-and-graph-analysis-to-fight-fraud-96a2b918619a) | Security, Databases | 2021 |

| 7 | [Next-Gen Data Movement Platform at PayPal](https://medium.com/paypal-tech/next-gen-data-movement-platform-at-paypal-100f70a7a6b) | Data Engineering | 2021 |

| 8 | [Deploying Large-scale Fraud Detection Machine Learning Models at PayPal](https://medium.com/paypal-tech/machine-learning-model-ci-cd-and-shadow-platform-8c4f44998c78) | AI/ML, Security | 2021 |

## Pinterest

> [Pinterest Engineering Blog](https://medium.com/pinterest-engineering)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [How Pinterest improved Search Relevance using LLMs](https://medium.com/pinterest-engineering/improving-pinterest-search-relevance-using-large-language-models-4cd938d4e892) | AI/ML, Search | 2025 |

| 2 | [How Pinterest built its Text-to-SQL feature](https://medium.com/pinterest-engineering/how-we-built-text-to-sql-at-pinterest-30bad30dabff) | AI/ML, Data Engineering | 2024 |

| 3 | [Change Data Capture at Pinterest](https://medium.com/pinterest-engineering/change-data-capture-at-pinterest-7e4c357ac527) | Data Engineering | 2024 |

| 4 | [Real Time Anomaly Detection at Pinterest](https://medium.com/pinterest-engineering/warden-real-time-anomaly-detection-at-pinterest-210c122f6afa) | Observability | 2023 |

| 5 | [Improving Distributed Caching Performance and Efficiency at Pinterest](https://medium.com/pinterest-engineering/improving-distributed-caching-performance-and-efficiency-at-pinterest-92484b5fe39b) | Caching | 2022 |

| 6 | [How Pinterest Leverages Realtime User Actions to Boost Homefeed Engagement Volume](https://medium.com/pinterest-engineering/how-pinterest-leverages-realtime-user-actions-in-recommendation-to-boost-homefeed-engagement-volume-165ae2e8cde8) | AI/ML, Search | 2022 |

| 7 | [How Pinterest scaled the size of its ad corpus by 60x](https://medium.com/pinterest-engineering/how-we-scaled-the-size-of-pinterests-ad-corpus-by-60x-d6d5bfa6bf16) | Performance | 2021 |

| 8 | [The machine learning behind delivering relevant ads](https://medium.com/pinterest-engineering/the-machine-learning-behind-delivering-relevant-ads-8987fc5ba1c0) | AI/ML | 2021 |

## Quora

> [Quora Engineering Blog](https://engineering.quora.com/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Building Embedding Search at Quora](https://quoraengineering.quora.com/Building-Embedding-Search-at-Quora) | Search, AI/ML | 2024 |

| 2 | [Migrating a decade of Redshift usages to Trino at Quora](https://quoraengineering.quora.com/Migrating-a-decade-of-Redshift-usages-to-Trino-at-Quora) | Data Engineering | 2024 |

| 3 | [Trino at Quora Scale: Cost, Speed, and Reliability](https://quoraengineering.quora.com/Trino-at-Quora-Scale-Cost-Speed-and-Reliability) | Data Engineering, Performance | 2023 |

| 4 | [MySQL sharding at Quora](https://quoraengineering.quora.com/MySQL-sharding-at-Quora) | Databases | 2020 |

## Reddit

> [Reddit Engineering Blog](https://www.reddit.com/r/RedditEng/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Evolving Reddit's Media Infrastructure](https://www.reddit.com/r/RedditEng/comments/1k4o2mc/evolving_reddits_media_infrastructure/) | Video/Media, Infrastructure | 2025 |

| 2 | [Scaling our Apache Flink powered real-time ad event validation pipeline](https://www.reddit.com/r/RedditEng/comments/1ijcfge/scaling_our_apache_flink_powered_realtime_ad/) | Data Engineering | 2025 |

| 3 | [Scaling Reddit's ad-serving system](https://www.reddit.com/r/RedditEng/comments/1gzua17/scaling_ads_serving_find_and_eliminate_redundant/) | Performance | 2024 |

| 4 | [Product Candidate Generation for Reddit Dynamic Product Ads](https://www.reddit.com/r/RedditEng/comments/1gug4x9/product_candidate_generation_for_reddit_dynamic/) | AI/ML | 2024 |

| 5 | [Scaling Ads Pacing: from Singleton to Sharded](https://www.reddit.com/r/RedditEng/comments/1e5mhs3/scaling_ads_pacing_from_singleton_to_sharded/) | Architecture | 2024 |

| 6 | [Introducing a Global Retrieval Ranking Model in the Ads Funnel](https://www.reddit.com/r/RedditEng/comments/1d2wfsd/introducing_a_global_retrieval_ranking_model_in/) | AI/ML | 2024 |

| 7 | [Building an Experiment-Based Routing Service](https://www.reddit.com/r/RedditEng/comments/1c4pkql/building_an_experimentbased_routing_service/) | Architecture | 2023 |

| 8 | [The Reddit Media Metadata Store](https://www.reddit.com/r/RedditEng/comments/1avlywv/the_reddit_media_metadata_store/) | Video/Media, Databases | 2023 |

## Salesforce

> [Salesforce Engineering Blog](https://engineering.salesforce.com/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Scaling Real-Time Search to 30 Billion Queries with Sub-Second Latency and 0% Downtime](https://engineering.salesforce.com/scaling-real-time-search-to-30-billion-queries-with-sub-second-latency-and-0-downtime/) | Search, Performance | 2025 |

| 2 | [Scaling Agentic AI Powering 2 Billion Predictions Monthly](https://engineering.salesforce.com/agentforce-scaling-agentic-ai-for-enterprise-automation-observability-powering-2-billion-predictions-monthly/) | AI/ML | 2025 |

| 3 | [How Agentforce Data Library Powers RAG with 99.99% Uptime](https://engineering.salesforce.com/optimizing-ai-retrieval-how-agentforce-data-library-powers-rag-with-99-99-uptime/) | AI/ML | 2025 |

| 4 | [Secrets for Managing 100,000 Training and Metadata Requests Per Minute](https://engineering.salesforce.com/scaling-ai-systems-secrets-for-managing-100000-training-and-metadata-requests-per-minute/) | AI/ML, Infrastructure | 2024 |

| 5 | [Inside the Brain of Agentforce](https://engineering.salesforce.com/inside-the-brain-of-agentforce-revealing-the-atlas-reasoning-engine/) | AI/ML | 2024 |

| 6 | [How Salesforce Supports Millions of Users Seamlessly for GenAI](https://engineering.salesforce.com/scaling-generative-ai-how-salesforce-supports-millions-of-users-seamlessly/) | AI/ML, Infrastructure | 2024 |

| 7 | [Inside Salesforce's Scalable Time Series Forecasting AI Platform](https://engineering.salesforce.com/inside-salesforces-scalable-time-series-forecasting-ai-platform/) | AI/ML, Data Engineering | 2024 |

| 8 | [How Salesforce's Data Cloud Handles 250 Trillion Transactions Weekly](https://engineering.salesforce.com/the-unstructured-data-dilemma-how-data-cloud-handles-250-trillion-transactions-weekly/) | Data Engineering, Performance | 2024 |

## Shopify

> [Shopify Engineering Blog](https://shopify.engineering/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [How Shopify improved consumer search intent with real-time ML](https://shopify.engineering/how-shopify-improved-consumer-search-intent-with-real-time-ml) | AI/ML, Search | 2024 |

| 2 | [Horizontally scaling the Rails backend of Shop app with Vitess](https://shopify.engineering/horizontally-scaling-the-rails-backend-of-shop-app-with-vitess) | Databases, Performance | 2024 |

| 3 | [Improving Shopify App's Performance](https://shopify.engineering/improving-shopify-app-s-performance) | Performance, Mobile | 2024 |

| 4 | [Building a ShopifyQL Code Editor](https://shopify.engineering/building-a-shopifyql-code-editor) | Frontend | 2023 |

| 5 | [Creating a Flexible Order Routing System with Shopify Functions](https://shopify.engineering/creating-a-flexible-order-routing-system-with-shopify-functions) | Architecture | 2023 |

| 6 | [Using Server Sent Events to Simplify Real-time Streaming at Scale](https://shopify.engineering/server-sent-events-data-streaming) | Architecture, Data Engineering | 2022 |

| 7 | [Capturing Every Change From Shopify's Sharded Monolith](https://shopify.engineering/capturing-every-change-shopify-sharded-monolith) | Data Engineering, Architecture | 2021 |

## Slack

> [Slack Engineering Blog](https://slack.engineering/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [How Slack Optimizes its E2E Pipeline](https://slack.engineering/speedup-e2e-testing/) | Infrastructure | 2025 |

| 2 | [How Slack built enterprise search to be secure and private](https://slack.engineering/how-we-built-enterprise-search-to-be-secure-and-private/) | Search, Security | 2025 |

| 3 | [Advancing Our Chef Infrastructure](https://slack.engineering/advancing-our-chef-infrastructure/) | Infrastructure | 2024 |

| 4 | [How We Re-Architected Slack for Our Largest Customers](https://slack.engineering/unified-grid-how-we-re-architected-slack-for-our-largest-customers/) | Architecture | 2024 |

| 5 | [How Slack automatically detects stolen session cookies](https://slack.engineering/catching-compromised-cookies/) | Security | 2024 |

| 6 | [How a request flows, from a Slack user's perspective](https://slack.engineering/traffic-101-packets-mostly-flow/) | Architecture | 2023 |

| 7 | [Slack's Migration to a Cellular Architecture](https://slack.engineering/slacks-migration-to-a-cellular-architecture/) | Architecture | 2023 |

| 8 | [Real-time Messaging at Slack](https://slack.engineering/real-time-messaging/) | Messaging | 2023 |

| 9 | [How Slack traces the flow of notifications across systems](https://slack.engineering/tracing-notifications/) | Observability, Messaging | 2023 |

| 10 | [Slack's Unified end-to-end machine learning infrastructure to generate recommendations](https://slack.engineering/recommend-api/) | AI/ML | 2023 |

| 11 | [How We Design Our APIs at Slack](https://slack.engineering/how-we-design-our-apis-at-slack/) | Architecture | 2021 |

| 12 | [How we built an eventually-consistent data model to predict Slack Connect invites](https://slack.engineering/email-classification/) | AI/ML, Architecture | 2021 |

| 13 | [Migrating Millions of Concurrent Websockets to Envoy](https://slack.engineering/migrating-millions-of-concurrent-websockets-to-envoy/) | Infrastructure, Messaging | 2021 |

| 14 | [Scaling Datastores at Slack with Vitess](https://slack.engineering/scaling-datastores-at-slack-with-vitess/) | Databases | 2020 |

## Snap

> [Snap Engineering Blog](https://eng.snap.com/blog)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Bento - Snap's ML Platform](https://eng.snap.com/introducing-bento) | AI/ML, Infrastructure | 2025 |

| 2 | [Snap's Embedding-based Retrieval for its video recommendation system](https://eng.snap.com/embedding-based-retrieval) | AI/ML, Search | 2023 |

| 3 | [How Snap Speeds Up Feature Engineering for Recommendation Systems](https://eng.snap.com/speed-up-feature-engineering) | AI/ML, Data Engineering | 2022 |

| 4 | [How Snap leverages synthetic data to boost the development of ML models](https://eng.snap.com/synthetic-data-for-machine-learning) | AI/ML | 2022 |

| 5 | [Training Large-Scale Recommendation Models with TPUs](https://eng.snap.com/training-models-with-tpus) | AI/ML, Infrastructure | 2022 |

| 6 | [Machine Learning for Snapchat Ad Ranking](https://eng.snap.com/machine-learning-snap-ad-ranking) | AI/ML | 2022 |

## Spotify

> [Spotify Engineering Blog](https://engineering.atspotify.com/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [How Spotify Generated Millions of Content Annotations](https://engineering.atspotify.com/2024/10/how-we-generated-millions-of-content-annotations/) | AI/ML, Data Engineering | 2024 |

| 2 | [Spotify's Data Platform](https://engineering.atspotify.com/2024/05/data-platform-explained-part-ii/) | Data Engineering | 2024 |

| 3 | [The What, Why, and How of Mastering App Size](https://engineering.atspotify.com/2023/11/the-what-why-and-how-of-mastering-app-size/) | Mobile, Performance | 2023 |

| 4 | [How Spotify Automated Content Marketing to Acquire Users at Scale](https://engineering.atspotify.com/2023/11/how-we-automated-content-marketing-to-acquire-users-at-scale/) | AI/ML | 2023 |

| 5 | [How We Built Infrastructure to Run User Forecasts at Spotify](https://engineering.atspotify.com/2022/06/how-we-built-infrastructure-to-run-user-forecasts-at-spotify/) | Infrastructure, AI/ML | 2022 |

## Stripe

> [Stripe Engineering Blog](https://stripe.com/blog/engineering)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Stripe's system for tracking and validating money movement](https://stripe.com/blog/ledger-stripe-system-for-tracking-and-validating-money-movement) | Payments, Architecture | 2024 |

| 2 | [How Stripe Processed $1 Trillion in Payments with Zero Downtime](https://stripe.com/blog/how-stripes-document-databases-supported-99.999-uptime-with-zero-downtime-data-migrations) | Databases, Payments | 2023 |

| 3 | [How Stripe built its fraud prevention system](https://stripe.com/blog/how-we-built-it-stripe-radar) | Security, Payments | 2023 |

| 4 | [How Stripe builds interactive docs with Markdoc](https://stripe.com/blog/markdoc) | Frontend | 2022 |

| 5 | [Stripe's payments APIs: The first 10 years](https://stripe.com/blog/payment-api-design) | Architecture, Payments | 2020 |

## Tinder

> [Tinder Tech Blog](https://medium.com/tinder)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Tinder API Style Guide](https://medium.com/tinder/tinder-api-style-guide-part-1-081804a7ef40) | Architecture | 2024 |

| 2 | [Building Obsidian, Tinder's Design System](https://medium.com/tinder/building-obsidian-tinders-design-system-e127770d8e3f) | Frontend | 2023 |

| 3 | [How Tinder built its API Gateway](https://medium.com/tinder/how-we-built-the-tinder-api-gateway-831c6ca5ceca) | Architecture, Infrastructure | 2022 |

| 4 | [Scaling out Tinder Android Payment Flow using State Machine](https://medium.com/tinder/scaling-out-tinder-android-payment-flow-using-state-machine-e14ef0591b6) | Mobile, Payments | 2020 |

## Twitch

> [Twitch Engineering Blog](https://blog.twitch.tv/en/tags/engineering/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Ingesting Live Video Streams at Global Scale](https://blog.twitch.tv/en/2022/04/26/ingesting-live-video-streams-at-global-scale/) | Video/Media, Infrastructure | 2022 |

| 2 | [Breaking the Monolith at Twitch](https://blog.twitch.tv/en/2022/04/12/breaking-the-monolith-at-twitch-part-2/) | Architecture | 2022 |

| 3 | [Using Machine Learning to Review Emotes](https://blog.twitch.tv/en/2022/06/22/smarter-better-faster-using-machine-learning-to-review-emotes/) | AI/ML | 2022 |

| 4 | [Defense, threat modeling and High Availability at Twitch](https://blog.twitch.tv/en/2021/11/16/defend-your-castle-high-availability-for-high-stakes-cloud-services/) | Security, Infrastructure | 2021 |

## Twitter/X

> [Twitter/X Engineering Blog](https://blog.x.com/engineering/en_us)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Twitter's Recommendation Algorithm](https://blog.x.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm) | AI/ML, Search | 2023 |

| 2 | [How we scaled Reads On the Twitter Users Database](https://blog.x.com/engineering/en_us/topics/infrastructure/2023/how-we-scaled-reads-on-the-twitter-users-database) | Databases, Performance | 2023 |

| 3 | [Powering real-time data analytics with Druid at Twitter](https://blog.x.com/engineering/en_us/topics/infrastructure/2022/powering-real-time-data-analytics-with-druid-at-twitter) | Data Engineering | 2022 |

| 4 | [How we built Twitter's highly reliable ads pacing service](https://blog.x.com/engineering/en_us/topics/infrastructure/2021/how-we-built-twitter-s-highly-reliable-ads-pacing-service) | Architecture | 2021 |

| 5 | [Storing and retrieving millions of ad impressions per second](https://blog.x.com/engineering/en_us/topics/infrastructure/2021/storing-and-retrieving-millions-of-ad-impressions-per-second) | Databases, Performance | 2021 |

| 6 | [Processing billions of events in real time at Twitter](https://blog.x.com/engineering/en_us/topics/infrastructure/2021/processing-billions-of-events-in-real-time-at-twitter-) | Data Engineering | 2021 |

| 7 | [Logging at Twitter](https://blog.x.com/engineering/en_us/topics/infrastructure/2021/logging-at-twitter-updated) | Observability | 2021 |

| 8 | [Twitter's ads serving platform](https://blog.x.com/engineering/en_us/topics/infrastructure/2021/sharding-simplification-and-twitters-ads-serving-platform) | Architecture | 2021 |

## Uber

> [Uber Engineering Blog](https://eng.uber.com/)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Migrating Uber's Compute Platform to Kubernetes](https://www.uber.com/en-IN/blog/migrating-ubers-compute-platform-to-kubernetes-a-technical-journey/) | Infrastructure | 2025 |

| 2 | [MySQL At Uber](https://www.uber.com/en-IN/blog/mysql-at-uber/) | Databases | 2025 |

| 3 | [How Uber Uses Ray to Optimize the Rides Business](https://www.uber.com/en-IN/blog/how-uber-uses-ray-to-optimize-the-rides-business/) | AI/ML | 2025 |

| 4 | [How Uber Optimizes LLM Training](https://www.uber.com/en-IN/blog/open-source-and-in-house-how-uber-optimizes-llm-training/) | AI/ML | 2024 |

| 5 | [Natural Language to SQL Using Gen AI](https://www.uber.com/en-IN/blog/query-gpt/) | AI/ML | 2024 |

| 6 | [Lucene: Uber's Search Platform](https://www.uber.com/en-IN/blog/lucene-version-upgrade/) | Search | 2024 |

| 7 | [Uber's implementation of Live Activity on iOS](https://www.uber.com/en-IN/blog/live-activity-on-ios/) | Mobile | 2024 |

| 8 | [Odin: Uber's Stateful Platform](https://www.uber.com/en-IN/blog/odin-stateful-platform/) | Infrastructure | 2024 |

| 9 | [Kafka Tiered Storage at Uber](https://www.uber.com/en-IN/blog/kafka-tiered-storage/) | Data Engineering | 2024 |

| 10 | [Modernizing Logging at Uber with CLP](https://www.uber.com/en-IN/blog/modernizing-logging-with-clp-ii/) | Observability | 2024 |

| 11 | [How Uber ensures Apache Cassandra's tolerance for single-zone failure](https://www.uber.com/en-IN/blog/single-zone-failure-tolerance/) | Databases, Infrastructure | 2024 |

| 12 | [How LedgerStore Supports Trillions of Indexes at Uber](https://www.uber.com/en-IN/blog/how-ledgerstore-supports-trillions-of-indexes/) | Databases | 2024 |

| 13 | [Balancing HDFS DataNodes in the Uber DataLake](https://www.uber.com/en-IN/blog/balancing-hdfs-datanodes-in-the-uber-datalake/) | Data Engineering | 2024 |

| 14 | [How Uber Serves Over 40 Million Reads Per Second from Online Storage Using an Integrated Cache](https://www.uber.com/en-IN/blog/how-uber-serves-over-40-million-reads-per-second-using-an-integrated-cache/) | Caching, Performance | 2024 |

| 15 | [How Uber Optimized Cassandra Operations At Scale](https://www.uber.com/en-IN/blog/how-uber-optimized-cassandra-operations-at-scale/) | Databases | 2023 |

| 16 | [How Uber Optimizes the Timing of Push Notifications using ML and Linear Programming](https://www.uber.com/en-IN/blog/how-uber-optimizes-push-notifications-using-ml/) | AI/ML, Messaging | 2022 |

| 17 | [Deduping and Storing Images at Uber Eats](https://www.uber.com/en-IN/blog/deduping-and-storing-images-at-uber-eats/) | Infrastructure | 2022 |

| 18 | [Uber's Next Gen Push Platform on gRPC](https://www.uber.com/en-IN/blog/ubers-next-gen-push-platform-on-grpc/) | Messaging, Infrastructure | 2022 |

| 19 | [Uber's Highly Scalable and Distributed Shuffle as a Service](https://www.uber.com/en-IN/blog/ubers-highly-scalable-and-distributed-shuffle-as-a-service/) | Data Engineering | 2022 |

| 20 | [How Uber Predicts Arrival Times Using Deep Learning](https://www.uber.com/en-IN/blog/deepeta-how-uber-predicts-arrival-times/) | AI/ML | 2022 |

| 21 | [Real-Time Exactly-Once Ad Event Processing with Apache Flink, Kafka, and Pinot](https://www.uber.com/en-IN/blog/real-time-exactly-once-ad-event-processing/) | Data Engineering | 2021 |

## Walmart

> [Walmart Global Tech Blog](https://medium.com/walmartglobaltech)

| # | Article | Topics | Year |

|---|---------|--------|------|

| 1 | [Walmart's Cassandra CDC Solution](https://medium.com/walmartglobaltech/walmarts-cassandra-cdc-solution-6fc650031a3) | Data Engineering, Databases | 2022 |

| 2 | [Scaling the Walmart Inventory Reservations API for Peak Traffic](https://medium.com/walmartglobaltech/scaling-the-walmart-inventory-reservations-api-for-peak-traffic-9ba37833ef9d) | Performance, Architecture | 2022 |

| 3 | [A Markov Chain Formulation for the Grocery Item Picking Process](https://medium.com/walmartglobaltech/a-markov-chain-formulation-of-grocery-item-picking-process-54c65a3ec5b5) | AI/ML | 2021 |

| 4 | [How we rebuilt the Walmart Autocomplete Backend](https://medium.com/walmartglobaltech/how-we-rebuilt-the-walmart-autocomplete-backend-10efe71d624a) | Search, Performance | 2021 |

| 5 | [Building a Notification Framework for Microservice-based Application](https://medium.com/walmartglobaltech/building-a-notification-framework-for-microservice-based-application-6fe5ac9dfcee) | Messaging, Architecture | 2021 |

---

## Contributing

Contributions are welcome! Please read the following guidelines:

1. **Fork** the repository

2. **Add** your article suggestion as a new table row that meets these requirements:

   - Article must be from a well-known tech company's engineering blog

   - Include the article title, URL, topic tags, and year

3. **Submit** a pull request

Please ensure:

- The article is a genuine engineering blog post (not a marketing piece)

- The link is working and points to the original source

- The topic tags match the categories used in this list
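
As a rough illustration of the row format and the tag check above, here is a minimal Python sketch. The helper `format_row` and the `KNOWN_TOPICS` set are hypothetical and not part of this repository; the tag set is copied from tags that appear in the tables of this list and may be incomplete.

```python
# Hypothetical helper, not shipped with this repository.
# KNOWN_TOPICS is an assumption: tags observed in this list's tables.
KNOWN_TOPICS = {
    "AI/ML", "Architecture", "Caching", "Cost Optimization",
    "Data Engineering", "Databases", "Frontend", "Infrastructure",
    "Messaging", "Mobile", "Observability", "Payments",
    "Performance", "Search", "Security", "Video/Media",
}


def format_row(number, title, url, topics, year):
    """Return one markdown table row in the '# | Article | Topics | Year' layout."""
    # Reject tags that do not match the categories already used in the list.
    unknown = [t for t in topics if t not in KNOWN_TOPICS]
    if unknown:
        raise ValueError(f"unknown topic tags: {unknown}")
    # A cheap sanity check only; it does not verify that the link works.
    if not url.startswith("https://"):
        raise ValueError("URL should be an https link to the original post")
    return f"| {number} | [{title}]({url}) | {', '.join(topics)} | {year} |"


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(format_row(9, "Example Post", "https://example.com/post",
                     ["Databases", "Performance"], 2024))
```

The sketch only checks formatting and tags. Whether the post is a genuine engineering article and whether the link resolves still needs a manual check.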

## License

[MIT](LICENSE) - feel free to use this list however you'd like!

---

If you find this repository helpful, please consider giving it a star! It helps others discover these valuable engineering resources.