Delivering Billions of Messages Exactly Once — The single requirement of all data pipelines is that they cannot lose data, but de-duplicating messages can be the trickier part.
Segment story
A pipeline for continuous data migration with the SQLizer API — With data holed up in third party apps everywhere, APIs are the sensible way to continuously migrate data to your own databases. The SQLizer API gives you essential conversion to SQL capability for your migration pipeline, all from your command line.
SQLizer sponsored
Do We Need Specialized Graph Databases? — A look through an academic paper where the authors ran a graph data management system benchmark suite over several popular graph databases with interesting results.
Using SQL Within Google Sheets — Did you know Google’s Web-based spreadsheet tool includes a ‘query’ function for using SQL against your spreadsheet? This is a guide to using it.
Ben Collins tutorial
All About Indexes: The Very Basics — Well placed and defined indexes will improve your database’s performance significantly. A popular article from last year.