#281 — November 22, 2019

Read on the Web

Database Weekly

ksqlDB: An Event Streaming Database for for Stream Processing Applications — A scalable, real-time event streaming database for Apache Kafka which makes stream processing accessible via a familiar, lightweight SQL syntax. Built in Java. Datanami has a more editorial writeup of the project too.

Confluent Inc.

12 Common Mistakes and Missed Optimization Opportunities in SQL — A lengthy list of SQL “dos and don’ts” covering topics like timezones, ranges, formatting queries, and unions.

Haki Benita

Your Data Is Your Business — PGX is a full-service database consultancy focused around PostgreSQL systems. Let us help you architect, build, troubleshoot and scale your PostgreSQL data systems on any platform or hosting environment.

PostgreSQL Experts, Inc. sponsor

EasyDB: A 'One-Click' Server-Free Database — A quick way to provision a temporary database (that’s basically a key/value store) and use it from JavaScript or Python (though the API is simple and HTTP based so it could be used from anywhere). Ideal for hackathons or quick once-off scripts, maybe.

Jake and Tyson

Aerospike Raises More Cash ($32M To Be Precise) — Aerospike is a real-time NoSQL database vendor and one of the first non-relational database vendors to embrace Intel’s Optane persistent memory.

Datanami

RedisInsight: The Redis GUI You’ve Been Looking For? — A free tool from Redis Labs that spawned from the acquisition of RDBTools, a popular GUI tool for working with and visualizating data stored in Redis.

Redis Labs

The Rise of SQL-Based Data Modeling and 'DataOps' — While NoSQL systems like HBase, Cassandra and MongoDB were rapidly becoming popular for managing data at one point, there’s now a resurgence of SQL-based systems, including in massively parallel data warehouse scenarios, a domain once restricted solely to high cost, enterprise use cases.

Anthony Thong Do

💻 Jobs

Data Pipelines, Reinvented. Find Your Place at Fivetran — Rooted in Oakland, we are a fast-growing company hiring across software engineering, SRE, product, and data analytics. Come join us.

Fivetran

Find a Job Through Vettery — Make a profile, name your salary, and connect with hiring managers from top employers. Vettery is completely free for job seekers.

Vettery

📒 Everything else

Sum Types for Relational Databases — A look at different ways to encode sum types.

Dmitry Olshansky

ETL with A Glue Python Shell Job: Loading Data From S3 to Redshift — AWS Glue offers tools for solving ETL challenges. A Glue Python Shell job is a perfect fit for ETL tasks with low to medium complexity and data volume.

Andreas Wittig

Leading with Commas — Ugly or Efficient? An Investigation Over 320 GB of SQL Code — This post is over two years old but has recently resurfaced on social media. It looks at the results of analyzing 320GB of public SQL code to see what the convention on using commas is.

Felipe Hoffa

Your DB Apps and APIs 25x Faster with Macrometa's Global Data Network — Store, cache, query & serve hot cloud data instantly with the Global Data Network for Web, Mobile, IoT & Analytics Apps.

Macrometa sponsor

Evaluating SQLite Performance by Testing All Parameters — This post is a few months old, but as a huge SQLite fan (and user) I’m sad I missed it, so here we go! What page size, cache size, locking mode, and journal mode are the most performant for a single thread accessing a single SQLite database?

Eric Draken

Unpacking Competitive Benchmark Claims — A look at a CockroachDB vs Yugabyte benchmark from the Cockroach Labs perspective.

Peter Mattis (Cockroach Labs)

An Interview on What Makes PostgreSQL Unique (Extensions) — Dimitri Fontaine interviewed Craig Kerstiens about the importance of extensions in the Postgres ecosystem and what his favorites are.

Craig Kerstiens

Labeling 1M Data Points/Week for OpenAI's GPT-2

Akshat Bubna

BadgerDB v2.0 Released — Badger is the Go-powered key-value store that powers much of DGraph’s capabilities.

DGraph

Scylla Getting Support for the Redis API? — This is still a little edgy (as we’re looking directly at a code commit here) but an interesting development nonetheless.

ScyllaDB