Introduction to Python for Data-analysis
Introduction to Python for Data-analysis
Worldwide, Python is the most popular language for data science. Python is free and open source, allowing professionals worldwide to continuously update and add functionality. Although Python is a general programming language at its core, it offers numerous modules specifically designed for data analysis, data science, and machine learning. At the UG, both R and Python are widely used for data science. R is generally preferred for statistics and visualization, while Python is favored for larger programs and machine learning, including neural networks, deep learning, and large language models.
In this beginner's course, we will guide you through the basics of importing data, cleaning and restructuring data, visualizing and summarizing data, and finally applying statistical models to your data.
Who Should Attend?
This introductory course is designed for anyone who wants to (re)learn the basics of the Python programming language necessary for data handling.
This course is suitable for beginners who are new to Python and those who have some experience with Python but never had the opportunity to grasp the fundamentals.
Prerequisits
None, but a dream of what you’d like to do with Python would be nice.
Please note that Python has a steep learning curve. Expect you’ll need 3-4 hours between sessions to prepare for the next one and plan accordingly. Please be kind to yourself and allow yourself at least 48 hours to complete the course.
What do you learn?
✅ Preparing your Python journey
✅ Getting started with Numpy
✅ Reading and writing data with Pandas
✅ Data transformations with Pandas
✅ Exploratory data analysis with Mathplotlib: Graphics
✅ Exploratory data analysis: descriptive statistics
✅ Statistical tests (how to run the tests, not how to interpret them)
✅ What's more (other useful packages and more classical Programming)
Result
At the end of the course, you will not only be able to work with Python, but you will also be able to expand your knowledge for your own specific work field.
If you want to learn more about general programming using Python you can follow the course Introduction to Programming using Python either before or after this course.
The content of this Python for Data-analysis course will be used as a prerequisite for a Python for Machine Learning course (under construction; mail Theo van Mourik (t.j.van.mourik rug.nl if you want to be kept up to date on the progress).
Interactive Learning Experience
This course relies heavily on highly interactive (online or hybrid) sessions where we review what you’ve done in the reader. During a review the teacher will share his screen and go through the code asking you by voting and chatting to find the error or complete the code. These reviews are used to rehearse material, show tips and tricks, warn for common mistakes, explain error messages, show how to use the helpfiles and the program (IDE) in general, and overall to motivate you to keep up the pace. Participants report they are highly involved during these sessions and the course is consistently highly evaluated. On average this course is rated with an 8.2 (10% gives a 10!) by students, PHD’s and other employees alike.
More information
-
For more information on this or similar courses, please mail the coordinator, Theo van Mourik (t.j.van.mourik rug.nl).
Last modified: | 25 November 2024 1.55 p.m. |