Astronomical Data Science

Faculteit Science and Engineering
Jaar 2021/22
Vakcode WMAS007-05
Vaknaam Astronomical Data Science
Niveau(s) master
Voertaal Engels
Periode semester II b

Uitgebreide vaknaam Astronomical Data Science
Leerdoelen During this course students will become familiar with a number of concepts and languages, including the role of XML, UML, SADT, SQL and R. Throughout the course a number of practical exercises will be given and a key aim is to enable the student to reach a level where she/he can apply the tools to their own research problems. At the end of the course the student should be able to plan the optimal data processing during her/his research, to select necessary tools, to use Virtual Observatory for the data mining and publishing her/his own results, to create a system for managing and developing code and to check code quality.

After having taken this course you should be able to efficiently search and access astronomical databases, know the basics of how to access databases both through on-line interfaces and by scripting. You will also be able to apply up-to-date statistical techniques to your data and use these to mine large datasets for scientifically interesting information.
Omschrijving Modern astronomical observations produce data volumes comparable to the data volumes in high energy physics, lifesciences and public media - up to the hundred PetaByte domain. The course is targeting problems of Big Data handling such as storage, processing, databasing, validation and exploration of very large data sets. It reviews data processing for a range of current astronomical wide field imaging survey instruments such as OmegaCAM (optical), VISTA (near infrared), Lofar (radio) which already supersede by an order of magnitude the data volumes of current ESO- VLT telescopes and the famous Sloan Digital Sky. The data handling of current (GAIA) and future astronomical Space missions (Euclid) will be discussed.

Apart from practical problems of dealing with Big Data the course describes how processing systems for such BigData missions work and how to design and create information systems dealing with the research analysis and dissemination of PetaByte data volumes. Todays Open Science and FAIR principles (Findable, assessable, Interoperable and Re-usable) will be discussed.

The course starts with an introduction to astronomical information systems and how to use them to obtain astrometrically and photometrically calibrated astronomical images. This is followed by lectures on the scientific use of astronomical databases and information systems, including the European and International Virtual Observatory and the Astro-WISE information system. The course will cover information system theory and its practical applications. We will discuss the design of scientific information systems, the principle concepts and how they work and connect users to large databases, parallel computers, data archives, networks and Grid infrastructure. It will be explained how users can do their research with the Virtual Observatory and during the practical assignments exercises on the use of astronomical information systems will be done.
Uren per week 140
Onderwijsvorm Hoorcollege (LC), Werkcollege (T)
(Hours per week: 2 * 2x45 min + 1 * werkcollege = 6 hour)
Toetsvorm Mondeling tentamen (OR), Opdracht (AST), Verslag (R)
(2 mandatory werkcollege tasks (done/not done, if done 1.5 points) – 30%; paper (0.0 -5.0 points) – 50%; answers during exam (0.0-2.0 points) – 20%)
Vaksoort master
Coördinator L. Wang
Docent(en) L. Wang
Opmerkingen The lectures are coordinated by Prof Valentijn – several lectures will be given by experts in the field (a.o. Prof Peletier, Dr Belikov). We plan to organize a BigData visualization lecture in DOTliveplanetarium.

This course was registered last year with course code WMAS16000
Opgenomen in
Opleiding Jaar Periode Type
MSc Astronomy: Quantum Universe  (Astrophysics Core Courses) - semester II b keuze
MSc Astronomy: Science, Business and Policy  (Astrophysics Core Cours) - semester II b verplicht