List of videos

Solving Data Problems in Management Accounting — Alexander CS Hendorf, Lucas-Raphael Müller
[EuroPython 2023 — South Hall 2B on 2023-07-20] https://ep2023.europython.eu/session/solving-data-problems-in-management-accounting Controllers deal with numbers all day long. They have to check a lot of data from different sources. Often the reports contain erroneous or missing data. Identifying outliers and suspicious data is time-consuming. This presentation will introduce a Small Data Problem-End2End workflow using statistical tools and machine learning to make controllers' jobs easier and help them be more productive. We will demonstrate how we used amongst others, - [scipy](https://scipy.org/) - [pandera](https://pandera.readthedocs.io/en/stable/) - [dirty cat](https://dirty-cat.github.io/stable/) - [nltk](https://www.nltk.org/) - [fastnumbers](https://pypi.org/project/fastnumbers/) to create a self-improving system to automate the screening of reports and report outliers in advance so that they can be eliminated more quickly. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch
High Volume PDF Text Extraction using Python Open-Source Tools — Harald Lieder
[EuroPython 2023 — South Hall 2A on 2023-07-20] https://ep2023.europython.eu/session/high-volume-pdf-text-extraction-using-python-open-source-tools All major companies have huge amounts of (mostly PDF) documents that contain important - even critically important - information, that does no longer exist anywhere else in their data stores. Reports, once generated for shareholders and legal or financial authorities, may still be useful for developing longterm forecasts or triggering company management decisions. By definition, documents are intended for human perception, and as such contain unstructured data from an information technology perspective. Therefore, tools to extract PDF text content (mostly, but not only text) from millions of pages have become important vehicles to recreate structured information. This presentation talks about extraction "need for speed" in this Big Data scenario, the need for integration with OCR capabilities and presents an open-source toolset which combines both, top-of-the-class performance and maximum extraction detail. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch
The Python's stability promise — Cristián Maureira-Fredes
[EuroPython 2023 — Forum Hall on 2023-07-21] https://ep2023.europython.eu/session/the-pythons-stability-promise Many modules you use and love have a portion of their implementation written in other languages, and for that a Python extension need to be made. Python offers a C-API that allow people extending the language, and being a nice glue-language, C is also a bridge to many other languages as well. So if everything is simple, what's the deal with stability? Changes in the C-API might break the functionality in older versions, so PEP 387 saves the day with a policy for backward compatibility. Starting from Python 3.2, the Limited API was introduced, which defined a subset of Python's C-API that it's promised that if used, the code can be compiled in one version, and run in many others as well. Also, having a Stable ABI compatible wheel, allow you to only have one-wheel-per-OS, and not one-wheel-per-python-version, which can simplify your release process. This talk will introduce the Limited API concept, and provide the necessary information to include it in your project. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch
How well do we understand our Universe? Let’s Python it out! — Eirini Angeloudi, Regina Sarmiento
[EuroPython 2023 — Terrace 2B on 2023-07-20] https://ep2023.europython.eu/session/how-well-do-we-understand-our-universe-lets-python-it-out As our understanding of the Universe is expanding, the desire to model the physics that govern cosmic evolution is more evident than ever, driving the emergence of cosmological simulations that model the Universe from the beginning of time till present day. In combination with Machine Learning, they allow for an unprecedented capability; one can train AI models on simulations, where the evolution history of galaxies is available, that can in turn be applied on real galaxies. In this work, we propose the use of Python as a ML tool, through the popular library Tensorflow, to quantify the impact of different cosmological models on the derivation of the history of galaxies. Python accompanies us at every step of the way, from creating the datasets and training the probabilistic neural networks to the visualization of the results, as we attempt to shed light on the cosmic past of galaxies, surpassing the unshakeable reality that we can only observe them at a specific moment in time. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch
The Standard Library Tour — Mia Bajić
[EuroPython 2023 — South Hall 2B on 2023-07-20] https://ep2023.europython.eu/session/the-standard-library-tour Are you tired of writing complicated code only to discover that Python has tools in its standard library that could have made your life easier? Join us for a tour of the standard library where we'll dive into less-known modules that do well-known things and well-known modules that do less-known things. This talk is tailored to beginners or anyone who wants to learn more about Python's standard library. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch
Give your program Appeal! — Larry Hastings
[EuroPython 2023 — South Hall 2B on 2023-07-20] https://ep2023.europython.eu/session/give-your-program-appeal This talk presents Appeal, a new library for command-line parsing in Python. Appeal avoids the cumbersome APIs and repetition endemic to the currently prevalent libraries in this space by leveraging Python's own function call interface. This talk will familiarize the audience with Appeal, its motivation, its approach, and its expressive power, and show them how to use Appeal in their own programs. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch
Don’t Panic! A Developer’s Guide to Security — Sebastiaan Zeeff
[EuroPython 2023 — South Hall 2A on 2023-07-20] https://ep2023.europython.eu/session/dont-panic-a-developers-guide-to-security As a developer, you play a crucial role in the security of your projects. At the same time, it can be difficult to know if what you’re doing is enough. Luckily, you don’t have to be a security expert to contribute to the security of your projects. Instead, you can use industry standards as a guide for your approach to security. In this talk, I will introduce you to a framework that is especially accessible to developers, the [OWASP DevSecOps Maturity Model](https://owasp.org/www-project-devsecops-maturity-model/), and help you get started with a systematic approach to improving the security of your projects. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch
Responding to Earthquakes using Machine Learning and Racing through Time — Merve Noyan
[EuroPython 2023 — Terrace 2B on 2023-07-20] https://ep2023.europython.eu/session/responding-to-earthquakes-using-machine-learning-and-racing-through-time Right after the devastating earthquakes in Turkey, there has been a massive flow of tweets and posts from survivors and their relatives, calling for help. There was a need to extract the data, make it meaningful and open to public, so we have come up with afetharita.com. The machine learning part of the application is completely based on open-source tools in Python and I will go through the pipeline and the process. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch
Working in Units: How to Decouple the Database and Domain Layers in Python — Alvaro Duran
[EuroPython 2023 — North Hall on 2023-07-21] https://ep2023.europython.eu/session/working-in-units-how-to-decouple-the-database-and-domain-layers-in-python A crucial element of architecting a software application for scale is the collaboration of domain experts and developers. For that to happen, the application must separate the domain layer— where elements that represent the real world reside—from the infrastructure layer—where these elements are translated into precise software processes. Within the Fintech team at Kiwi.com, we are rearchitecting a critical service to accept more payment providers. As part of this refactor, we are adopting the Unit of Work pattern to disentangle domain entities from the database processes that represent them. This way, domain experts can share their knowledge with developers more easily, and developers can find opportunities for optimization without the involvement of domain experts in the process. Attendees will gain a solid understanding of how to implement the UoW pattern in their Python applications, how it fits into the broader context of DDD, and how to prepare their code for future growth. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License http://creativecommons.org/licenses/by-nc-sa/4.0/
Watch