Experiences with approximating queries in Microsoft’s production big-data clusters

A random walk through Computer Science research, by Adrian Colyer

Experiences with approximating queries in Microsoft’s production big-data clusters Kandula et al., VLDB’19 I’ve been excited about the potential for approximate query processing in analytic clusters for some time, and this paper describes its use at scale in production. Microsoft’s big data clusters have 10s of thousands of machines, and are used by thousands of … Continue reading Experiences with approximating queries in Microsoft’s production big-data clusters

DDSketch: a fast and fully-mergeable quantile sketch with relative-error guarantees

DDSketch: a fast and fully-mergeable quantile sketch with relative-error guarantees Masson et al., VLDB’19 Datadog handles a large volume of metrics – some customers have endpoints generating over 10M points per second! For response times (latencies), reporting a simple metric such as ‘average’ is next to useless. Instead you want to understand what’s happening at different … Continue reading DDSketch: a fast and fully-mergeable quantile sketch with relative-error guarantees

SLOG: serializable, low-latency, geo-replicated transactions

IPA: invariant-preserving applications for weakly consistent replicated databases

IPA: invariant-preserving applications for weakly consistent replicated databases Balegas et al., VLDB’19 IPA for developers, happy days! Last week we looked at automating checks for invariant confluence, and extending the set of cases where we can show that an object is indeed invariant confluent. I’m not going to re-cover that background in this write-up, so … Continue reading IPA: invariant-preserving applications for weakly consistent replicated databases

Choosing a cloud DBMS: architectures and tradeoffs

Choosing a cloud DBMS: architectures and tradeoffs Tan et al., VLDB’19 If you’re moving an OLAP workload to the cloud (AWS in the context of this paper), what DBMS setup should you go with? (more…)