#252 — May 3, 2019

Read on the Web

Database Weekly

A Close Look at the Index INCLUDE Clause — The INCLUDE clause for CREATE INDEX is available in SQL Server, IBM Db2, and, most recently, PostgreSQL. It allows you to include extra columns’ data to be referred to in index-only scans withou being key columns of the index itself. Markus explains.

Markus Winand

Microsoft Bringing Azure SQL Database to the 'Edge' (and Arm) — Ahead of their annual Build conference, Microsoft has announced a preview of Azure SQL Database Edge, an effort to take Azure SQL Database to edge computing devices, including, Arm-based machines. Watch this space.

TechCrunch

A Health Check Playbook for Your Postgres Database — Your Postgres database is a living breathing thing. As data comes in and queries are executed, the composition of your database will change. This health check playbook for Postgres will help ensure things are working and in good health.

CITUS DATA, NOW PART OF THE MICROSOFT FAMILY sponsor

A Deep Dive into GIN Indexes — A truly ‘deep dive’ into Postgres’s Generalized Inverted Indexes which are suited for indexing composite values where elements within those composite values need to be looked up frequently (e.g. full text search).

Egor Rogov

Hazelcast Responds to Redis Labs' 2018 Benchmark — Drama alert! Last year, Redis Labs published a Redis vs Hazelcast benchmark that showed Redis having higher throughput and lower latency, but.. as often happens with benchmarks, Hazelcast considered it misleading and responds here, along with showing how Hazelcast really “outperforms Redis at scale.”

Greg Luck

💻 Jobs

Data & Applied Scientist (Sunnyvale, CA) — Passionate about machine learning and data mining at web scale? Find out more.

Microsoft

Have You Tried Vettery? — Vettery specializes in tech roles and is completely free for job seekers. Create a profile to get started.

Vettery

📒 Tutorials and Opinions

How to Get Started with Mapping GIS Data — This tutorial brings together a lot of neat things, including Leaflet.js, the Chicago Open Rideshare Dataset, Digital Ocean and Postgres.

Sam Cohen

Six Technical Challenges Developing a Distributed SQL Database — This is one for the implementers or anyone curious at the technology choices database implementers make behind the scenes.

Karthik Ranganathan (YugaByte)

Free Whitepaper: How to Deploy Open Source Databases — Learn from the experts how to deploy a variety of open source DBs using proven methodologies to get battle-tested results.

Severalnines sponsor

Consistent Data Partitioning through Global Indexing for Large Hadoop Tables at Uber — Performing updates of individual records in Uber’s over 100 petabyte Apache Hadoop data lake required building Global Index, a component that manages data bookkeeping and lookups at scale.

Nishith Agarwal and Kaushik Devarajaiah

In 2019, Disorderly Is the New Orderly (for Data)“Managing a data warehouse is anything but boring in this day and age. Having to add a whole new ‘room’ to your ‘data house’ at a moment’s notice has become the new norm, whereas implementing a rigid data infrastructure could be tantamount to a nail in your company’s coffin.”

Chris Stewart

Using Amazon Aurora to Seamlessly Increase Capacity of WordPress Database Backends — WordPress is hugely popular, powering 30 percent of all websites, and is commonly associated with the MySQL database. Amazon’s Aurora is compatible with MySQL and is therefore being used to make WordPress sites run even faster than before.

Amazon Web Services

Benchmarking Redis's Experimental Multi-Threaded I/O Feature

Filipe Oliveira

Having Lunch with PostgreSQL, MongoDB and JSON — A comparison of JSON handling in PostgreSQL and MongoDB, ultimately favoring Postgres.

Álvaro Hernández

🛠 Code and Tools

Masquerade: A PostgreSQL Proxy to Mask Data in Realtime — Let’s say you want developers, analysts, or others to use your database but in a privacy-friendly way.

Tonic

Praeco: Elasticsearch Alerting Made Simple — A Web-based GUI for ElastAlert for building alerts interactively. It can test alerts against historical data, show you logs, send notifications to numerous places (Slack, email, HTTP endpoints), etc.

ServerCentral, Inc.