Issue 151 — April 21, 2017
GeoMesa: An Open-Source Spatio-Temporal Database Layer
— GeoMesa provides spatio-temporal indexing on top of Accumulo, Bigtable, and Cassandra, as well as near real-time stream processing and spatial semantics on top of Kafka.
The GeoMesa Project
Amazon Redshift Spectrum: Exabyte-Scale In-Place Queries of S3 Data
— Spectrum makes it possible to run complex queries using Redshift on data stored on AWS S3 without any loading or data prep.
The Top Features Coming to SQL Server 2017
— A summary of what Microsoft has unveiled as coming to SQL Server 2017 from
to adaptive query optimization and
a built-in graph database.
High Available and Scalable Open Source Database - SiriDB
— The time series database SiriDB can scale on the fly, is robust by design and uses a unique mechanism to operate without indexes. SiriDB’s query language includes dynamic grouping for easy and fast analysis.
How to Calculate Multiple Aggregate Functions in a Single Query
— A look at alternatives to using a large collection of SQL queries to query the same data in different ways, such as pivots and grouping sets.
Architecture of Giants: Data Stacks at Facebook, Netflix, Airbnb, and Pinterest
— Simple event data infrastructure diagrams from several fast-scaling companies.
Why to Use A Relational Database for Time-Series Data
— NoSQL databases are commonly used to store time-series data, but the creator of
sets out a technical case for bringing time-series data into a relational setup.
Mike Freedman (Timescale)
Iguazio Re-Architects the Stack for Continuous Analytics
Oracle's Databases and Developer Tools Now Available on Docker
— Oracle’s main database product, WebLogic, Coherence, MySQL, and others are available in Docker containers on the Docker Store marketplace.
Whitepaper: 5 Steps to Agile DB Management
— Database Management is years behind software development. Here's how to bring your DBs up to speed.
Removing Duplicate Rows in Postgres
— What if you accidentally load data twice? A look at handling duplicate data and a resulting clean up.
Tips for Monitoring Redis
— Ways to get more info from Redis, such as on latency and slow commands.
Creating Role-Based Access Control in MongoDB
A Data Model for the Recruitment Process
MySQL Stored Procedures 101
Inside Anodot's Anomaly Detection System for Time-Series Data
Designing a Data Warehouse on Hadoop
Running Out of IDs
“Keep on using serial for most use cases and keep bigserial in your back pocket if a real need arises.”
Cassandra vs. MongoDB
— Considering the differences between Cassandra vs. MongoDB as the data store for your next project
BigQuery: One Store to Rule Them All
SQL Is The Perfect Interface
Apache ORC: High-Performance Columnar Storage for Hadoop