Tech Reading and Notes
  • Posts
  • Tag Index
  • Search
  • About
Navigation bar avatar

Exploring DeepSeek-AI's Open-Source Contributions: Advanced AI Models and Infrastructure

Posted on March 1, 2025

Tags: LLM

Building Interactive Web Apps with Streamlit: A Comprehensive Guide

Posted on February 15, 2025

Kafka Testing

Posted on January 27, 2025

Tags: Docker Kafka Test

KEDA, Kubernetes based Event Driven Autoscaler

Posted on December 11, 2024

Tags: K8S Infrastructure

Introduction to Haystack: Open-Source NLP Framework for Building Search & QA Systems

Posted on November 29, 2024

Tags: haystack LLM ML

A Comprehensive Guide to Quantiacs Stock Market Data API

Posted on October 11, 2024

Introduction to LLaMA-Factory: A Framework for Fine-Tuning LLaMA Models

Posted on September 16, 2024

Tags: LLM

Introduction to AWS Powertools: Enhancing Serverless Applications

Posted on August 22, 2024

Tags: aws powertools lambda observability serverless

Exploring the LLM-RAG Inference Architecture Stack

Posted on July 3, 2024

Tags: LLM ML

Get your work recognized: write a brag document

Posted on June 23, 2024

Tags: Leadership

Unpacking How Ad Ranking Works at Pinterest

Posted on June 7, 2024

Tags: ML Ranking

AI-Powered Conversion From Enzyme to React Testing Library at Slack

Posted on May 29, 2024

Tags: LLM ML

Portfolio Analysis at Scale: Running Risk and Analytics on 15+ Million Portfolios

Posted on May 15, 2024

Tags: Analytics

Airbnb ML Feature Platform Chronon

Posted on April 25, 2024

Tags: ML

Effective Performance Engineering at Twitter-Scale

Posted on April 3, 2024

Tags: Performance

Yelp Overhauls Its Streaming Architecture with Apache Beam and Apache Flink

Posted on March 23, 2024

Tags: Leadership

AWS Lambda Code Start and Deep Dive

Posted on March 16, 2024

Tags: Performance

SQL DBs: Trino, Apache Hive, Apache Impala, Apache Drill

Posted on February 29, 2024

Tags: ML

RAG Retrieval Augmented Generation

Posted on February 8, 2024

Tags: ML

InfoQ Generally AI Episodes

Posted on February 3, 2024

Tags: ML

Griffin v2 as Instacart’s Next-Gen ML Platform

Posted on January 7, 2024

Tags: ML Infrastructure

Tips on How Staff Engineers Can Impact Incidents

Posted on January 6, 2024

Tags: Leadership

ML Ops Platform at Cloudflare

Posted on December 9, 2023

Tags: Infrastructure ML

Airflow Vs Flyte

Posted on November 21, 2023

Book "Staff Engineer"

Posted on November 19, 2023

Tags: Leadership

Lessons Learned from Building LinkedIn AI Data Platform

Posted on August 9, 2023

Tags: ML Infrastructure

People Search AI @LinkedIn

Posted on July 12, 2023

Tags: ML Search

Lucene, Solr, and Elasticsearch

Posted on May 30, 2023

Tags: Search

Million Dollar Lines of Code - an Engineering Perspective on Cloud Cost Optimization

Posted on March 12, 2023

Tags: Cost

Jetson Nano Real-Time Object Detection

Posted on January 21, 2023

Tags: JetsonNano

How great leaders inspire action

Posted on September 10, 2022

Tags: Leadership

Edge Computing Framework Compare

Posted on June 22, 2022

Tags: ML Infrastructure Edge

Small Kubernetes for local testing - k0s, MicroK8s, kind, k3s, k3d, and Minikube

Posted on February 21, 2022

Tags: K8S kubernetes k0s MicroK8s kind k3s k3d Minikube

ML Infrastructure+Orchestration Tooling (MLflow | KubeFlow | Sagemaker | MLeap)

Posted on December 29, 2021

Tags: ML Infrastructure

Career and Leadership Advice for Engineers and Technical Professionals

Posted on November 7, 2021

Tags: Career Culture

SageMaker Pipeline vs MLFlow Details

Posted on June 19, 2021

Tags: ML Infrastructure

Data Processing Infrastructure Tooling Quick Checkup

Posted on May 3, 2021

Tags: ETL Orchestration Infrastructure

SageMaker Spark

Posted on April 16, 2021

Tags: Dagster Code

Dagster Deep Dive

Posted on March 9, 2021

Tags: Dagster Code

Lakehouse A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics

Posted on February 22, 2021

Tags: Data

Machine learning in production

Posted on September 19, 2020

Tags: ML

Career Development Gems

Posted on September 5, 2020

Tags: Career

How Slack Works

Posted on July 7, 2020

Netflix Part-3: ML orchestration + recommendation

Posted on May 18, 2020

Tags: Netflix

Netflix Part-2: Data processing

Posted on May 17, 2020

Tags: Netflix

Netflix Part-1: PlayBack deep dive

Posted on May 16, 2020

Tags: Netflix

2020 April - Refresh on all current AWS services

Posted on April 30, 2020

Tags: ML

Building an AI-powered Battlesnake with reinforcement learning on Amazon SageMaker

Posted on March 27, 2020

Tags: ML

Jackson Gabbard - Intro to Behavioural Interviews

Posted on March 25, 2020

Tags: Career Interview

Inside NGINX & How We Designed for Performance & Scale

Posted on March 24, 2020

UBER Machine Learning Platform - Michelangelo

Posted on March 12, 2020

Tags: ML

UBER system design

Posted on January 5, 2020

Tags: Design

Engineering Uber Predictions in Real Time with ELK

Posted on September 28, 2018

Tags: ML

Real-Time Data Exploration and Analytics with Amazon Elasticsearch Service

Posted on December 10, 2017

Tags: AWS

Scaling Data Quality at Netflix

Posted on November 11, 2017

Tags: Netflex Spark

FB Research Crowd Intelligence Enhances Automated Mobile Testing

Posted on October 7, 2017

Tags: Infrastructure Testing

A Developer View into Spark Memory Model

Posted on October 3, 2017

Tags: Design Spark

Apache IoT Software Stack - Apache Ignite and Spark

Posted on September 23, 2017

Facebook Tuning Apache Spark for Large-Scale Workloads

Posted on September 14, 2017

Tags: Facebook Spark

Spark Dataframe and Dataset

Posted on September 9, 2017

GoDaddy Dashboard using ML LDA

Posted on August 17, 2017

Databricks Structured Streaming

Posted on August 10, 2017

Tags: Infrastructure Spark

How Netflix Uses Kinesis Streams to Monitor Applications and Analyze Billions of Traffic Flows

Posted on August 8, 2017

Tags: Infrastructure Netflix AWS

Google Kubernetes

Posted on August 1, 2017

Tags: Google Spark

Salesforce User Behavior Anomaly Detection

Posted on July 31, 2017

Tags: ML Spark

IBM BigDL

Posted on July 31, 2017

Tags: Infrastructure Spark

Spark Compute as a Service at Paypal

Posted on July 15, 2017

Tags: Infrastructure Spark

Random Walk on LargeScale Graphs with Spark (LinedIn)

Posted on July 3, 2017

Tags: Infrastructure

Hive Bucketing in Apache Spark

Posted on June 26, 2017

Tags: Infrastructure Spark Hive

Krux, a Salesforce company, is a Data Management Platform (DMP)

Posted on June 21, 2017

Tags: Tool Company

two sigma time series tool Flint

Posted on February 15, 2017

Tags: Infrastructure Flint

software multitenancy

Posted on February 14, 2017

Tags: Concept

Apache Spark Streaming ETL

Posted on February 6, 2017

Tags: Infrastructure Databricks

DISTRIBUTED TRACING AT UBER

Posted on February 5, 2017

Tags: Infrastructure Uber

Hello World :)

Posted on February 4, 2017

  • Email me
  • GitHub
  • LinkedIn
  • Google Scholar

Xin Ren  •  2025

Copyright © 2024 Xin Ren