Jacob Tomlinson's profile picture Jacob Tomlinson
Home Blog Talks Newsletter About

Intro to distributed computing on GPUs with Dask in Python

BlazingSQL Webinars Online Webinar 60 minutes Abstract Video Additional Resources

RAPIDS is an end to end data science stack built entirely for CUDA GPUs. Faster analytics, at scale, for lower total cost of ownership. Dask natively scales Python and the RAPIDS ecosystem stack onto multiple servers and GPUs, supporting unprecedented scale. BlazingSQL is a distributed SQL engine built in Python. It performs incredibly fast SQL queries on the RAPIDS DataFrame and ensures optimal usage of GPU primitives.

BlazingSQL is built using the same libraries underpinning the RAPIDS ecosystem. As RAPIDS improves, so does BlazingSQL.

This workshop runs through the basics of using Dask alongside RAPIDS to perform out-of-core distributed GPU computations in Python. We cover high level APIs such as DataFrames and Arrays and then dive under the covers to explore delayed functions and distributed futures.