Exploring the LLM-RAG Inference Architecture Stack
Posted on July 3, 2024
Get your work recognized: write a brag document
Posted on June 23, 2024
Unpacking How Ad Ranking Works at Pinterest
Posted on June 7, 2024
AI-Powered Conversion From Enzyme to React Testing Library at Slack
Posted on May 29, 2024
Portfolio Analysis at Scale: Running Risk and Analytics on 15+ Million Portfolios
Posted on May 15, 2024
Airbnb ML Feature Platform Chronon
Posted on April 25, 2024
Effective Performance Engineering at Twitter-Scale
Posted on April 3, 2024
Yelp Overhauls Its Streaming Architecture with Apache Beam and Apache Flink
Posted on March 23, 2024
AWS Lambda Code Start and Deep Dive
Posted on March 16, 2024
SQL DBs: Trino, Apache Hive, Apache Impala, Apache Drill
Posted on February 29, 2024
RAG Retrieval Augmented Generation
Posted on February 8, 2024
InfoQ Generally AI Episodes
Posted on February 3, 2024
Griffin v2 as Instacart’s Next-Gen ML Platform
Posted on January 7, 2024
Tips on How Staff Engineers Can Impact Incidents
Posted on January 6, 2024
ML Ops Platform at Cloudflare
Posted on December 9, 2023
Airflow Vs Flyte
Posted on November 21, 2023
Book "Staff Engineer"
Posted on November 19, 2023
Lessons Learned from Building LinkedIn AI Data Platform
Posted on August 9, 2023
People Search AI @LinkedIn
Posted on July 12, 2023
Lucene, Solr, and Elasticsearch
Posted on May 30, 2023
Million Dollar Lines of Code - an Engineering Perspective on Cloud Cost Optimization
Posted on March 12, 2023
Jetson Nano Real-Time Object Detection
Posted on January 21, 2023
How great leaders inspire action
Posted on September 10, 2022
Edge Computing Framework Compare
Posted on June 22, 2022
Small Kubernetes for local testing - k0s, MicroK8s, kind, k3s, k3d, and Minikube
Posted on February 21, 2022
ML Infrastructure+Orchestration Tooling (MLflow | KubeFlow | Sagemaker | MLeap)
Posted on December 29, 2021
Career and Leadership Advice for Engineers and Technical Professionals
Posted on November 7, 2021
SageMaker Pipeline vs MLFlow Details
Posted on June 19, 2021
Data Processing Infrastructure Tooling Quick Checkup
Posted on May 3, 2021
SageMaker Spark
Posted on April 16, 2021
Dagster Deep Dive
Posted on March 9, 2021
Lakehouse A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics
Posted on February 22, 2021
Machine learning in production
Posted on September 19, 2020
Career Development Gems
Posted on September 5, 2020
How Slack Works
Posted on July 7, 2020
Netflix Part-3: ML orchestration + recommendation
Posted on May 18, 2020
Netflix Part-2: Data processing
Posted on May 17, 2020
Netflix Part-1: PlayBack deep dive
Posted on May 16, 2020
2020 April - Refresh on all current AWS services
Posted on April 30, 2020
Building an AI-powered Battlesnake with reinforcement learning on Amazon SageMaker
Posted on March 27, 2020
Jackson Gabbard - Intro to Behavioural Interviews
Posted on March 25, 2020
Inside NGINX & How We Designed for Performance & Scale
Posted on March 24, 2020
UBER Machine Learning Platform - Michelangelo
Posted on March 12, 2020
UBER system design
Posted on January 5, 2020
Engineering Uber Predictions in Real Time with ELK
Posted on September 28, 2018
Real-Time Data Exploration and Analytics with Amazon Elasticsearch Service
Posted on December 10, 2017
Scaling Data Quality at Netflix
Posted on November 11, 2017
FB Research Crowd Intelligence Enhances Automated Mobile Testing
Posted on October 7, 2017
A Developer View into Spark Memory Model
Posted on October 3, 2017
Apache IoT Software Stack - Apache Ignite and Spark
Posted on September 23, 2017
Facebook Tuning Apache Spark for Large-Scale Workloads
Posted on September 14, 2017
Spark Dataframe and Dataset
Posted on September 9, 2017
GoDaddy Dashboard using ML LDA
Posted on August 17, 2017
Databricks Structured Streaming
Posted on August 10, 2017
How Netflix Uses Kinesis Streams to Monitor Applications and Analyze Billions of Traffic Flows
Posted on August 8, 2017
Google Kubernetes
Posted on August 1, 2017
Salesforce User Behavior Anomaly Detection
Posted on July 31, 2017
IBM BigDL
Posted on July 31, 2017
Spark Compute as a Service at Paypal
Posted on July 15, 2017
Random Walk on LargeScale Graphs with Spark (LinedIn)
Posted on July 3, 2017
Hive Bucketing in Apache Spark
Posted on June 26, 2017
Krux, a Salesforce company, is a Data Management Platform (DMP)
Posted on June 21, 2017
two sigma time series tool Flint
Posted on February 15, 2017
software multitenancy
Posted on February 14, 2017
Apache Spark Streaming ETL
Posted on February 6, 2017
DISTRIBUTED TRACING AT UBER
Posted on February 5, 2017
Hello World :)
Posted on February 4, 2017