PyCon Nigeria Annual Conference

Using Polars for data manipulation and analysis

speaker-foto

Ojeifo Oziegbe

I am a data scientist passionate about geospatial projects and love using AI for good, especially in Sub-Saharan Africa.

Description

Polars is a modern, blazingly fast, and intuitive data manipulation library for Python. This workshop aims to introduce participants to Polars, focusing on its efficient data cleaning, transformation, and analysis capabilities. Attendees will learn how to leverage Polars to streamline their data processing workflows, enabling them to work with large datasets effortlessly.

Abstract

In the era of big data, efficient data manipulation and analysis have become crucial for businesses, researchers, and individuals alike. Polars, a cutting-edge data manipulation library for Rust and Python, offers a powerful solution to address these challenges. This workshop will provide a comprehensive introduction to Polars, enabling attendees to leverage its capabilities for data manipulation and analysis effectively.

The workshop will begin with an overview of Polars, highlighting its key features, performance advantages, and its integration with the Python ecosystem. Participants will learn how to install and set up Polars, ensuring a smooth start to their data manipulation journey.

Next, the workshop will delve into the core concepts of Polars, such as lazy and eager evaluation, data structures (Series and DataFrames), and data manipulation operations. Attendees will gain hands-on experience with data loading, cleaning, transforming, and reshaping using Polars' intuitive syntax and powerful functions.

The workshop will also cover advanced topics, including data visualization, time-series analysis, and integration with other Python libraries like pandas and NumPy. Participants will learn how to leverage Polars' exceptional performance for large datasets, enabling them to handle complex data processing tasks efficiently.

Throughout the workshop, real-world examples and use cases will be explored, allowing attendees to apply their newly acquired skills to practical scenarios. Participants will also have the opportunity to ask questions, share their experiences, and engage in discussions with the instructor and fellow attendees.

By the end of the workshop, attendees will have a solid understanding of Polars and its capabilities, empowering them to streamline their data manipulation and analysis workflows. They will be equipped with the knowledge and skills to leverage Polars in their projects, whether for data science, analytics, or software development.

Audience level: Intermediate or Advanced