PyCon SE 2021 Track: Workshops

2021

List of videos

Workshop (Workshops - Day 1) - First steps to learn Pyspark

Abstract: PySpark is a distributed data processing engine widely used in data engineering and data science. Another way to think of PySpark is as a library that allows processing large amounts of data on a single machine or a cluster of machines. We will go through the basic concepts and operations so that you leave the workshop ready to continue learning on your own.

Workshop steps:

- Introduction: motivation, intro to parallel data processing, Spark's main concepts (transformations versus actions, DataFrames versus RDDs), and overall architecture, focusing on Spark SQL.
- Setup environment: there are two ways of executing the notebook with the exercises. The first is creating an account on Databricks Community and cloning the notebook. The alternative is running the notebook locally as described in the instructions.
- Exercises: a series of exercises covering Spark's main transformations (filter, select, groupBy) and ways to visualize them. The idea is to give people some time to complete each exercise and then solve it interactively.

For more details: https://pretalx.com/pycon-sweden-2021/talk/ABFJMT/

Speaker: Natalia Pipas
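
The distinction between transformations and actions mentioned in the abstract can be illustrated with a small pure-Python sketch. This is a toy model, not the PySpark API: the `LazyRows` class, its `count_by` method, and the sample rows are hypothetical names invented for illustration. The point it tries to show is that transformations such as filter and select only record what should happen, while an action triggers the actual work.

```python
# Toy model of lazy evaluation. NOT the PySpark API; all names are hypothetical.
from collections import Counter


class LazyRows:
    """Stand-in for a DataFrame: records steps and runs them only on an action."""

    def __init__(self, rows, steps=None):
        self._rows = rows
        self._steps = list(steps or [])

    # Transformations: return a new LazyRows and do no work yet.
    def filter(self, predicate):
        step = lambda rows: (r for r in rows if predicate(r))
        return LazyRows(self._rows, self._steps + [step])

    def select(self, *columns):
        step = lambda rows: ({c: r[c] for c in columns} for r in rows)
        return LazyRows(self._rows, self._steps + [step])

    def _run(self):
        # Chain the recorded steps over the source rows.
        rows = iter(self._rows)
        for step in self._steps:
            rows = step(rows)
        return rows

    # Actions: these execute the recorded steps.
    def collect(self):
        return list(self._run())

    def count_by(self, column):
        # Plays the role of a group-by followed by a count and a collect.
        return dict(Counter(r[column] for r in self._run()))


rows = [
    {"name": "Ada", "city": "Stockholm", "age": 36},
    {"name": "Bo", "city": "Malmö", "age": 17},
    {"name": "Cy", "city": "Stockholm", "age": 52},
]
calls = []


def is_adult(row):
    calls.append(row["name"])  # record each evaluation to observe laziness
    return row["age"] >= 18


adults = LazyRows(rows).filter(is_adult).select("name", "city")
calls_before_action = len(calls)  # nothing has been evaluated yet
result = adults.collect()         # first action: the steps run now
per_city = adults.count_by("city")  # second action: the steps run again
```

In this sketch each action re-runs the whole chain, which loosely mirrors how a distributed engine recomputes results unless they are cached.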

Workshop (Workshops - Day 1) - Airflow 2.0 for ML pipelines – design, implementation and management

Abstract: Airflow 2.0 brings many changes under the hood. This workshop gives an overview of the major updates in Airflow 2.0 compared with 1.x, of Airflow's main components and how they work, and a hands-on demo of implementing and managing an end-to-end machine learning pipeline. Without a pipeline in place, managing multiple machine learning stages in production can be difficult. The workshop shows a simplified process for managing Python-based ML projects using Airflow.

- 05 min: Introduction
- 05 min: Major changes in Airflow 2.0
- 05 min: Prerequisites setup overview
- 10 min: Walkthrough of different backend components
- 10 min: Different stages of a DAG file: steps and operators
- 10 min: Dynamic DAG creation to improve parallelism
- 15 min: How to trigger Airflow DAG runs
- 15 min: Debug and clear Airflow task errors
- 10 min: Overview of production-level Airflow-based architecture
- 05 min: Wrap-up questions

For more details: https://pretalx.com/pycon-sweden-2021/talk/JRCLRG/

Speaker: Alen Jacob
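
The idea behind dynamic DAG creation for parallelism can be sketched without Airflow itself. The example below is not Airflow code and does not use its API: it uses the standard-library `graphlib` module (Python 3.9+), and the task names, `build_pipeline`, and `parallel_batches` are hypothetical. It generates one training task per model in a loop and then shows which tasks a scheduler could run side by side.

```python
# Sketch of a dynamically built task graph. NOT Airflow code; names are hypothetical.
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def build_pipeline(model_names):
    """Map each task to the set of tasks it depends on."""
    deps = {
        "extract_data": set(),
        "clean_data": {"extract_data"},
    }
    # Dynamic part: one independent training task per model.
    for name in model_names:
        deps[f"train_{name}"] = {"clean_data"}
    # The final task waits for every training task.
    deps["select_best"] = {f"train_{name}" for name in model_names}
    return deps


def parallel_batches(deps):
    """Group tasks into batches whose members have no unmet dependencies."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()
    batches = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # tasks that could run in parallel
        batches.append(ready)
        sorter.done(*ready)  # mark the batch finished to unlock successors
    return batches


batches = parallel_batches(build_pipeline(["linear", "forest", "boosting"]))
for number, batch in enumerate(batches, start=1):
    print(number, batch)
```

Because the three training tasks depend only on `clean_data`, they land in the same batch, which is the kind of parallelism a workflow scheduler can exploit.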

Workshop (Workshops - Day 1) - Writing Python extensions in Rust

Abstract: We often have to write Python extensions, particularly in C, to perform system operations or to speed up calculations. But writing safe C code is always difficult, even for an experienced developer. This is why writing Python extensions in Rust is becoming more popular among developers who care about speed and security at the same time. In this workshop we will learn how to create a Python module using Rust. No previous Rust experience is required.

If you use a recent version of cryptography in any of your projects (which you most probably do), you are using one of the most powerful and trusted Python modules, part of which is written in Rust.

We will go through a given git repository and build a Python extension module step by step.

Outline:

- Initial module creation
- Single function
- Functions with arguments
- Help documentation
- Functions to read files
- Exception generation
- Dictionaries, lists
- Creating your own class
- A module with some real-life work done in Rust

We will follow prewritten code for most of the session, and I will ask you to modify it as exercises along the way.

For more details: https://pretalx.com/pycon-sweden-2021/talk/DJ7LWR/

Speaker: Kushal Das

Workshop (Workshops - Day 2) - Zero To Hero Tutorial on a Deep Learning Classification Task

Abstract: This workshop is a zero-to-hero tutorial on how to solve a classification task using deep learning. The tutorial kicks off with a simple classification task on synthetic data, first in low and then in high dimension. Then we tackle a harder classification task based on Fashion-MNIST, a well-known dataset containing images of clothes. Apart from solving the classification task itself, we will show how to generate and analyze embedding vectors that can be used to solve downstream tasks different from the original classification problem on which the model was trained.

Finally, we will face a more advanced type of classification problem, namely predicting links on a graph using Graph Neural Networks. Link prediction will be demonstrated on an open-source dataset that contains information about collaborations among authors of scientific papers.

The goal of this workshop is to show how we can use Python to solve the aforementioned tasks, taking into account both the data science aspects and the engineering and project lifecycle aspects. In particular, the Python packages that we are going to cover are PyTorch, PyTorch Lightning, and Deep Graph Library.

For more details: https://pretalx.com/pycon-sweden-2021/talk/993BDA/

Speakers: Georgios Deligiorgis, Marco Trincavelli, David Andersson

Workshop (Workshops - Day 2) - Build an answering machine with Flask 📞

Abstract: Join Carolina & Victoria, developers at 46elks, for a code-along workshop 👩🏻‍💻 We will be building an answering machine with Flask. Using Python & 46elks you can set up your very own answering machine.

What you need to follow this code-along:

- A 46elks account, [here's a link](https://46elks.se/register/pycon-2021) with some credits to test your answering machine
- A computer and excitement to code some cool stuff 👩🏻‍💻

We will be coding together for about 60 minutes and then we'll answer any questions you might have (literally, ask us anything), or just hang out and get to know new developer friends 🥳

For more details: https://pretalx.com/pycon-sweden-2021/talk/PP8L7D/

Speakers: Carolina J. Säll, Victoria Wagman
