Architecture Weekly Issue #182. Articles, books, and playlists on architecture and related topics. Split by sections, highlighted with complexity: 🤟 means hardcore, 👷‍♂️ is technically applicable right away, 🍼 - is an introduction to the topic or an overview. Now in telegram and Substack as well.

Highlights

BigQuery Column Metadata index 👷‍♂️

Petabytes warehouses are already no wonder, but now we're in the era of petabytes tables; with this scale even metadata tables become huge and we need metadata for metadata. Enter CMETA index in BigQuery: a mechanism to optimize queries for petabytes-sized tables.

Understanding the BigQuery column metadata (CMETA) index | Google Cloud Blog
BigQuery’s Column Metadata (CMETA) index delivers superior scalability and performance, with no maintenance overhead, and may reduce slot usage.

#bigdata

The CTO's Cost of Goods Sold 🍼

You might not be a CTO yet, but this piece provides a great example to software architects and engineering managers on cost of development. In my experience the ROI, profitability and other financial metrics are a mandatory part of quarterly and annual plannings, and are required to be clear for engineering leaders.

The CTO’s Cost of Goods Sold
Understanding COGS transformed how I approach sprint planning and roadmap discussions. Instead of treating all work equally, we now evaluate the financial impact of our choices.

#cost

Stop Saying RAG is Dead 👷‍♂️

Million token context window seem to kill RAG, but not unless you actually understand what RAG stands for. Grab the series explaining what is actually dead and how to think about information retrieval(IR).

Stop Saying RAG Is Dead – Hamel’s Blog
Why the future of RAG lies in better retrieval, not bigger context windows.

#ai

Follow-Up

Behind the livestreams 👷‍♂️

Interesting how the default feature of TV - live streaming - is being adopted by online streaming(cis!) platforms. Netflix shares their learnings on scaling CDNs, graceful degradation and many more topics.

#performance #scalability #casestudy

Data Engineering Design Patterns 👷‍♂️

The more you work with Data Pipelines, the more you need to understand the related patterns. Thanks to Nik, I got to know about the book covering exactly that: Data Engineering Design Patterns which covers data ingestion, error management, data flow, security, storage and many other topics. Great news: it's being distributed for free, grab your copy!

#book #dataengineering

Keeping Secrets Out of Logs 👷‍♂️

This problem is as old as the software development itself: logging passwords, access keys and API access tokens. And it does not go away! Allan Reyes does great job explaining the potential sources of leaked secrets in logs and provides methods to fight it.

Keeping Secrets Out of Logs
There’s no silver bullet, but if we put some “lead” bullets in the right places, we have a good shot at keeping sensitive data out of logs.

#security

Architecture Interview Tips 🍼

I recently conducted couple of mock system design interviews and I need to remind once again: there is a bunch of common mistakes people make during them. I have a video with foundational tips about Architecture Interview - grab it here.

Business Oriented System Design Course Cohort #7 is officially open!

Speaking of system design: I have got an entire course to help you design software solutions and eventually pass interviews. 10 hours of content packed lectures, engaging practice and the final work you will be proud to showcase as well as Credly(by Pearson)-based digital certificate proving your experience. More than 70 engineers already passed the course with amazing feedback and advanced their careers. New cohort starts on 1st of October. APPLY NOW.

Big thanks to Nikita, Constantin, Anatoly, Oleksandr, Dima, Pavel B, Pavel, Robert, Roman, Iyri, Andrey, Lidia, Vladimir, August, Roman, Egor, Roman, Evgeniy, Nadia, Daria, Dzmitry, Mikhail, Nikita, Dmytro, Denis and Mikhail for supporting the newsletter on Patreon!