Skip to ContentSkip to Navigation
Research PhD courses

Practical Computing for Biologists

Organizers
Aim of the course

At the end of the course, the student is able to:

1. Explain and summarize the origin(s) and properties of big data; evaluate and solve challenges in analyses of big data in biology.
2. Use high performance computing clusters in a Linux/Unix environment: examine directory structure & permissions, create and modify files & directories as well as install and execute software.
3. Evaluate, extract and explore information contained in large, complex data files employing basic functions and options included in standard 'nix analysis software.
4. Analyse large data sets and visualize results in Python; integrate external Python libraries in own analyses
5. Design, implement own custom pipelines using scripting languages (e.g., BASH, R and/or Python) to explore, analyse, and visualize data; defend the specific programmatic choice made, critically evaluate the performance and areas of improvement in developed pipeline

Contents & Structure

Practical Computing for Biologists (PCfB) introduces students to general computational tools in order to enable to design and execute efficient computations. PCfB presents a broad range of open-source, free and flexible computational tools applicable to geneticists, molecular biologists, ecologists, oceanographers, physiologists, or anyone with an interest or need for efficient means to handle and analyse large data sets in their research. PCfB emphasizes the practical application of computational tools and methods toward real-life analyses.

PCfB covers data­-centered computing in a Unix/Linux environment. PCfB introduces the basics of a 'nix environment, such as; remote installation and execution of software. Students will be familiar with command line tools to explore and analyze data as well as the use of scripting languages such as Python to (a) code custom analyses, (b) design effective pipelines with existing software, (c) data visualisation, (d) versioning with git and (e) reproducible analyses with markdown.

Topics addressed in PCfB will employ practical example from different research fields, e.g., Next Generation Sequencing (NGS) data in genetics and molecular biology, as well as remote sensing and oceanographic data widely used in spatial ecological and evolutionary biology.

The course comprises of short lectures in a Q & A format addressing the new concepts introduced in daily text book readings along with discussion and execution of relevant examples. Afternoons are aimed at practical computer exercises. The two first weeks include individual graded assignments. During the last week, students will conduct a project assignment individually or in small groups implementing aimed the skills acquired during the course towards, solving real-life analyses. Students will present their pipeline, results and reflect on the project in an oral presentation to the entire class during the last days of the course.

General Information
Lecturers
Required knowledge & preparation
  • The course is highly recommended for students planning to take other, computational courses in the Master program, such as Meta-analyses in Ecology (WMBY013-05), Principles of Population Genetics in Natural Populations (WMMB005-05), Practical Modeling for Biologists (WMBY009-05), Mathematical Models in Ecology and Evolution (WMEV013-06), and in general for students in evolutionary biology and genetics/genomics.
  • During the first two weeks, presence to the course is mandatory since the students will have a combination of short lectures, tutorials and personal exercises. The final day of the course is project presentation day, and mandatory, since each student will present their project to the class.
  • Course relies to some extent on each student having access to a suitable laptop.
Course material
  • Computing Skills for Biologists. Princeton University Press. 2019 - S. Allesina & M. Wilmes ISBN: 9780691167299.
  • Course material provided to students via Brightspace including also the course website (to be announced)
  • Bioinformatics Data Skills: Reproducible and Robust Research with Open Source Tools. O'Reilly Media; 1st Edition 2015 - Vince Buffalo ISBN: 978-1449367374 (€ 32,00)
Course credits
5 ECTS
Location
Linneausborg, Nijenborgh 7, Groningen
Duration & date
15-1-2024 till 04-02-2024, Full time course
Costs
Participation fee is € 275 for all participating GELIFES PhD students. PhD students from other institution may be welcome to participate if there is sufficient room in the course. The course fee for external participants is € 350. Potential travelling & housing costs are not included and are for the student.
Participants
Maximum number of students will be fixed to 25
Information
  • Per Palsboll (Genomics Research in Ecology and Evolution in Nature, Linnaeusborg, Nijenborgh 7, 9747 AG Groningen

  • Corine Eising (Research School Ecology and Evolution, Linnaeusborg, Nijenborgh 7, 9747 AG Groningen, phone: 0031 (0) 50 363 9140, 2046 or 8357)
Registration

Master students: please register through the Ocasys portal
PhD students: please register by filling out the registration form.

Payment, Registration
and Cancellation
Please take note of our general course information on payment, registration and cancellation, applicable to all registrations.

Last modified:07 March 2024 3.29 p.m.