Welcome to the Ops in MAANG repository, a curated collection of articles, manuals, and research blogs from the engineering realms of top-tier technology companies - Meta(Facebook), Amazon (AWS), Apple, Netflix, and Google. This carefully curated repository combines a wealth of engineering and research insights, manuals, and projects directly sourced from the esteemed engineering blogs of MAANG giants. Whether you're a seasoned practitioner looking to fine-tune your skills or a newcomer eager to dive into the world of cutting-edge technology, this repository serves as your go-to guide.
Act now MAANG is waiting for you
- Why Rust for safe systems programming
- Using Rust in Windows
- Microsoft joins Rust Foundation
- Open-source Rust driver development platform
- An intern's experience with Rust
- Designing a COM library for Rust
- Microsoft Azure security evolution: Embrace secure multitenancy, Confidential Compute, and Rust
- The Safety Boat: Kubernetes and Rust
- Announcing Rust for Windows v0.9
- Rust/WinRT Public Preview
- Sustainability with Rust
- Why AWS loves Rust, and how we’d like to help
- Why AWS is the Best Place to Run Rust
- How Open Source Projects are Using Kani to Write Better Software in Rust
- How our AWS Rust team will contribute to Rust’s future successes
- AWS’ sponsorship of the Rust project
- Innovating with Rust
- Debugging a FUSE deadlock in the Linux kernel
- Netflix at Velocity 2015: Linux Performance Tools
- Linux Performance Analysis in 60,000 Milliseconds
- Extending Vector with eBPF to inspect host and container performance
- Predictive CPU isolation of containers at Netflix
-
Facebook open-sources new suite of Linux kernel components and tools
-
drgn: How the Linux Kernel Team at Meta Debugs the Kernel at Scale
-
Transparent memory offloading: more memory at a fraction of the cost and power
- Fixing Linux filesystem performance regressions
- Upgrading to RHEL7 with minimal interruptions
- Application Pauses When Running JVM Inside Linux Control Groups
- Optimizing Linux Memory Management for Low-latency / High-throughput Databases
- Don’t Let Linux Control Groups Run Uncontrolled
- Skyfall: eBPF agent for infrastructure observability
- Overcoming challenges with Linux cgroups memory accounting
- AI debugging at Meta with HawkEye
- BUILDING RESILIENT MONITORING AT META
- Dynolog: Open source system observability
- Below: a time travelling resource monitoring tool
- Resource Control Demo: Better Resource Control with Simulation
- Lessons Learned: Running Presto at Meta Scale
- Uncovering the Unknown Unknown
- Inside Meta's AI optimization platform for engineers across the company
- Using Chakra execution traces for benchmarking and network performance optimization
- Twitter’s Blobstore Hardware Lifecycle Monitoring and Reporting Service
- Powering real-time data analytics with Druid at Twitter
- Observability at Twitter: technical overview, part I
- Observability at Twitter: technical overview, part II
- Monitoring business performance data with ThirdEye smart alerts
- Analyzing anomalies with ThirdEye
- Smart alerts in ThirdEye, LinkedIn’s real-time monitoring platform
- InGraphs: Monitoring and Unexpected Artwork
- Open Sourcing Kafka Cruise Control
- Samza Aeon: Latency Insights for Asynchronous One-Way Flows
- inMesh: Real-Time Monitoring of Remote Sites
- InFlow - Making the LinkedIn network visible
- Burrow: Kafka Consumer Monitoring Reinvented
- Scaling the collection of self-service metrics
- Monitoring at Spotify: The Story So Far
- Analyzing Volatile Memory on a Google Kubernetes Engine Node
- Monitoring at Spotify: Introducing Heroic
- How LinkedIn handles merging code in high-velocity repositories
- Accelerating Code Delivery By 97% With Yarn Workspaces
- How LinkedIn automates cherry-picking commits to improve developer productivity
- Effective Code Reviews and File Ownerships
- Introducing HubCommander
- Towards true continuous integration: distributed repositories and dependencies
- Sapling: Source control that’s user-friendly and scalable
- Build faster with Buck2: Our open source build system
- Meta developer tools: Working at scale
- A Meta developer's workflow: Exploring the tools used to code at scale
- Faster Together: Uber Engineering’s iOS Monorepo
- Building Uber’s Go Monorepo with Bazel
- The Journey To Android Monorepo: The History Of Uber Engineering’s Android Codebase Organization
- How We Halved Go Monorepo CI Build Time
- Fleet Management at Spotify (Part 1): Spotify’s Shift to a Fleet-First Mindset
- Fleet Management at Spotify (Part 2): The Path to Declarative Infrastructure
- Fleet Management at Spotify (Part 3): Fleet-wide Refactoring
-
Open sourcing Kube2Hadoop: Secure access to HDFS from Kubernetes
-
Benchmarking Apache Samza: 1.2 million messages per second on a single node
-
Asynchronous Processing and Multithreading in Apache Samza, Part I: Design and Architecture
-
Asynchronous Processing and Multithreading in Apache Samza, Part II: Experiments and Evaluation
- Kubernetes And Kernel Panics
- Evolving Container Security With Linux User Namespaces
- Predictive CPU isolation of containers at Netflix
- Extending Vector with eBPF to inspect host and container performance
- Auto Scaling Production Services on Titus
- Titus, the Netflix container management platform, is now open source
- Updates on Netflix’s Container Management Platform
- The Evolution of Container Usage at Netflix
- Containerizing Apache Hadoop Infrastructure at Uber
- uBuild: Fast and Safe Building of Thousands of Container Images
- Containerizing the Beast – Hadoop NameNodes in Uber’s Infrastructure
- Devpod: Improving Developer Productivity at Uber with Remote Development
- Up: Portable Microservices Ready for the Cloud
- Introducing Makisu: Uber’s Fast, Reliable Docker Image Builder for Apache Mesos and Kubernetes
- Efficient and Reliable Compute Cluster Management at Scale
- Dockerizing MySQL at Uber Engineering
- Uber’s Highly Scalable and Distributed Shuffle as a Service
- Uber Engineering’s Micro Deploy: Deploying Daily with Confidence