
Denk je dat spraaktechnologie ophoudt bij spraakassistenten? Denk nog eens na. Van gezondheidszorg tot cyberbeveiliging: spraaktechnologie verandert de wereld en geeft vorm aan de toekomst.
Leeuwarden is een bruisende, middelgrote studentenstad en de hoofdstad van Friesland, een regio die bekend staat om zijn innovatie en meertalige karakter. De stad is de thuisbasis van start-ups, gevestigde tech-bedrijven, game-ontwikkelaars, onderzoeksinstituten en taalgerichte NGO's. Spraaktechnologie is door de provincie erkend als een belangrijk gebied met regionaal belang, wat betekent dat je zult studeren op een plek waar jouw vakgebied echt wordt gewaardeerd.
Het curriculum is opgedeeld in vier blokken, waar elk blok voortbouwt op de vorige.
| Semesters | ||||
|---|---|---|---|---|
| VakkenVakkencatalogus > | 1a | 1b | 2a | 2b |
| Introduction to Speech Technology (5 EC) This course will explain the basics of speech synthesis and recognition. It will briefly touch upon the history of speech recordings and the technology that comes along with speech, and the history of speech recognition and synthesis. You will become familiar with several voice technology applications, such as voice assistants, smart speakers, open-source speech recognizers and synthesizers, among others. The (speech) resources needed for creating speech technology applications will be addressed. You will acquire essential knowledge on data management requirements, licensing and privacy issues. This course will also show you how human and contextual factors affect the interaction between people and speech technology systems. Finally, you get acquainted with models that study the user acceptance of speech technology systems. During the course, you will work on an interesting speech tech project. | ||||
| Programming (5 EC) In this course, you will learn how to program in Python for voice technology. The code used by voice technology experts needs to be written so that it both achieves the purpose for which it is designed, but also is reusable and has replicable results. You will learn to adjust your code in response to reviews and be encouraged to reuse code of others. This course is split evenly into two units. The first unit provides the essentials of programming. For example, you'll learn how to work with data organized into lists, dataframes and numpy.ndarrays and apply mathematical operations. The second unit explores the use of Python for data science in general and voice technology in particular. This unit builds on the content in unit 1. For instance, you'll learn how to execute mathematical operations on numpy.ndarrays as well as get first-hand experience using seaborn and matplotlib to visualize your data, data distributions and results. You will also do some hands-on work with speech and language data. | ||||
| Speech Sounds (5 EC) This course provides the fundamentals from phonetics and phonology. Therefore, we will address a few aspects of the phonetics and phonology of English and many other languages, from Aymara to Xhosa. We also cover aspects of anatomy and physiology of the vocal tract and ear, discuss how the International Phonetic Alphabet reflects the diversity of speech sounds, and consider applied issues relating to accented speech, speech perception, speech pathologies, and whispered speech, among other topics. Most importantly, we will leverage theoretical knowledge from phonetics and phonology and relate it to voice technologies. You will develop a Lab Book in which completed speech analysis and processing assignments are organized. This Lab Book will be a useful resource not only for other courses and your thesis project but for your career in voice technology after the completion of this master's program. Additionally, together with peers, you will work on a group research project relating to speech production / perception analysis and processing. | ||||
| Machine Learning (5 EC) This course teaches you to design computational models for specific tasks and problems in a data driven manner. One challenge is to ensure that any model you develop is replicable by peers. Although there is no fool-proof method, you'll learn how to reliably validate and adapt your model in a standard way that is widely accepted by data scientists. You'll work with Python to create classical machine learning models and more modern neural network architectures to process tabular data, images, text and, most prominently, sound. These will lay the foundation for the speech synthesis and recognition courses. | ||||
| Speech Recognition I (5 EC) Since 1952 when the first speech recognizer Audrey was invented, Automatic Speech Recognition has developed in leaps and bounds. This advancement accelerated particularly after the 2010s when Deep Neural Networks (DNNs) were introduced in speech engineering. Consequently, many commercial products integrate speech recognition and some approach a human recognition level. This course provides an introduction to such speech recognition technologies. To learn how speech recognition systems work, you'll make your own speech recognizer from scratch! At each step, you will gain experience with technologies in chronological order to achieve a deep understanding of the foundation upon which the state-of-the-art is built. You will simulate the product development process and make an ASR application using your hand-made speech recognizer, which you present in a demonstration session in the final week. | ||||
| Speech Synthesis I (5 EC) Speech synthesis has come a long way since its beginnings from a niche field with limited interest and high entry requirements. It is now a large field with people of widely varying expertise producing essential components of very successful commercial products. The success of voices like Alexa and Siri build on years of work on speech modelling and parametrization. In this course you will learn the theoretical and practical foundations of speech synthesis. The course is divided into four units, in each you will be given an assignment and/or a quiz. | ||||
| Research Design (5 EC) This course is dedicated to the design of your Master's thesis. We will focus on research design and experimental protocol. This is a highly interactive course and includes hands-on training, in-class group exercises, and individual reflection to help you pursue your interests in a rigorous, scientific way. To help streamline your educational experience, you will develop three deliverables in this course: 1) A paper based on independent research; 2) A software demonstrator prototype which demonstrates the outcomes of your research; and 3) A scientific poster related to the paper and demonstrator prototype which will be presented in a poster session. | ||||
| Speech Recognition II (5 EC) In Speech Recognition II you will deepen your knowledge for practical speech recognition use cases. The course is organized into three units. In the first you will learn about the impact of Deep Neural Networks (DNNs) on the HMM-based framework.You will become (re)acquainted with the DNN and learn about the state-of-the-art speech recognition frameworks. You will also become familiar with speech recognition toolkits and interfaces. The second unit concerns building speech recognition systems for under-resourced languages and/or in multilingual contexts. The final unit concerns speech recognition technologies around you. In that unit, scholars and professionals will present on unique applications. At the end of the course, you will write a term paper and present on it. Through Speech Recognition I and II, you will acquire familiarity with many speech recognizers, will know the challenges that speech scientists face, have ideas of how to improve the speech recognition framework, and may come up with interesting ASR applications. | ||||
| Speech Synthesis II (5 EC) State of the art systems, based on advanced neural modelling techniques, are bridging the quality and naturalness gap while still offering flexibility and controllability. Such systems are capable of modelling challenging heterogeneous data, i.e. data that contains multiple sources of variation such as speakers and languages, non-ideal recording conditions, expressive and spontaneous speech. In this course, you will learn how deep neural networks can generate speech from text, the advanced techniques that allow such systems to handle heterogeneous data and to be controllable and how they can be applied in different case scenarios. You will learn how to work with advanced tools for generating speech and consolidate knowledge by designing an experiment which answers a research question or showcases a new product. | ||||
| Thesis Project (15 EC) The thesis forms the aptitude test for the Speech Technology MSc. In the course Thesis Design (block 3), students have written a research proposal and a related paper with a literature overview, research problem, research questions, appropriate methods for data collection and analysis and a planning for block 4. In this block students elaborate this further, based on the feedback they received from the instructor, and develop it into a thesis. Additionally, students will develop further their demonstrator prototype, modifying it from a proof-of-concept to a more polished demonstrator (it is also permitted that a student starts over with a completely new demonstrator in the event that the prototype from the Thesis Design course fell short of his/her expectations or if the student wants to tackle a different issue for other reasons). This demonstrator should be related to the experiment of the thesis study, or it can also be an application that is built based upon the outcomes of the thesis study. | ||||
| Specifieke eisen | Extra informatie |
|---|---|
| vooropleiding |
Studenten met een bachelordiploma in Taalwetenschap (Linguistics), Kunstmatige Intelligentie (Artificial Intelligence) of Informatica (Computer/Computing Science) krijgen directe toegang tot het programma. Studenten met een ander diploma kunnen in aanmerking komen voor toelating via een geschiktheidsbeoordeling, waarbij mogelijk een pre-masterprogramma moet worden gevolgd. |
| taaltoets cijfer |
Voldoende beheersing van de Engelse taal is vereist, behalve voor moedertaalsprekers van het Engels uit de volgende landen: Australië, Canada, Ierland, Nieuw-Zeeland, Nederland, Verenigde Staten, Verenigd Koninkrijk. Om uw Engelse taalvaardigheid aan te tonen, kunt u één van de volgende documenten overleggen:
Vrijstelling kan worden verleend door de Toelatingscommissie. |
| Type student | Deadline | Start opleiding |
|---|---|---|
| Nederlandse studenten | 01 juli 2026 | 01 september 2026 |
| 01 juli 2027 | 01 september 2027 | |
| EU/EEA studenten | 01 mei 2026 | 01 september 2026 |
| 01 mei 2027 | 01 september 2027 | |
| non-EU/EEA studenten | 01 mei 2026 | 01 september 2026 |
| 01 mei 2027 | 01 september 2027 |
LET OP: jouw aanmelddeadline wordt bepaald door het land waar je je diploma hebt behaald, niet door je nationaliteit. Bijvoorbeeld: 'Nederlandse studenten' betekent studenten met een Nederlands diploma; 'EU/EER-studenten' betekent studenten met een diploma uit een EU/EER-land; 'niet-EU/EER-studenten' betekent studenten met een diploma uit een niet-EU/EER-land.
| Specifieke eisen | Extra informatie |
|---|---|
| vooropleiding |
Students with a Bachelor's degree in Linguistics, Artificial Intelligence or Computer/Computing Science will have direct access to the programme. Applicants with other degrees may qualify for admission through an eligibility assessment, which may require completion of a pre-master's programme. |
| taaltoets cijfer |
Sufficient English language proficiency is required, except for native speakers of the English language from the following countries: Australia, Canada, Ireland, New Zealand, The Netherlands, United States, United Kingdom. To prove your English language proficiency, you can provide one of the following documents:
An exemption can be given by the Admission Board. |
| Examen | Minimum score |
|---|---|
| C1 Gevorderd (voormalig CAE) | C1 |
| C2 Bekwaamheid (voormalig CPE) | C2 |
| IELTS gemiddelde score | 6.5 |
| IELTS luistervaardigheid | 6 |
| IELTS leesvaardigheid | 6 |
| IELTS schrijfvaardigheid | 6 |
| IELTS spreekvaardigheid | 6 |
| TOEFL internetversie | 90 |
The MSc Speech Technology allows direct entry for anyone with a bachelor's degree from a recognized university and a sincere interest in the topic. That said, a Bachelor's degree in Computer Science, Artificial intelligence and Applied Linguistics (and familiarity with Python) would be an asset.
| Type student | Deadline | Start opleiding |
|---|---|---|
| Nederlandse studenten | 01 juli 2026 | 01 september 2026 |
| 01 juli 2027 | 01 september 2027 | |
| EU/EEA studenten | 01 mei 2026 | 01 september 2026 |
| 01 mei 2027 | 01 september 2027 | |
| non-EU/EEA studenten | 01 mei 2026 | 01 september 2026 |
| 01 mei 2027 | 01 september 2027 |
| Nationaliteit | Jaar | Kosten | Vorm |
|---|---|---|---|
| EU/EER | 2025-2026 | € 2601 | voltijd |
| niet EU/EER | 2025-2026 | € 21400 | voltijd |
| EU/EER | 2026-2027 | € 2695 | voltijd |
| niet EU/EER | 2026-2027 | € 22200 | voltijd |
Praktische informatie voor:
De MSc. Speech Technology combineert programmeren, machine learning en taalkunde. Dit geeft je zeldzame en veelgevraagde vaardigheden die nuttig zijn in de huidige arbeidsmarkt. Afgestudeerden gaan aan de slag in spraak laboratoria bij grote techbedrijven in Europa en daarbuiten, of doen academisch onderzoek.
Tijdens de opleiding delen gastsprekers uit de industrie hun expertise en geven ze studenten inzicht in diverse carrièrepaden, van de ontwikkeling van spraakherkenners en synthetische stemmen tot automatische detectie van spraakpathologieën en toepassingen in robotica en dialoogsystemen. De spraaktechnologie sector groeit snel en biedt volop kansen voor afgestudeerden
De MSc Speech Technology is onderdeel van de onderzoeksafdeling “Language, Technology and Culture”. Dit internationale en multidisciplinaire team bestaat uit PhD- en postdoctorale onderzoekers die zich bezighouden met baanbrekende onderwerpen zoals spraakherkenning van ziekten, meertaligheid en de hersenen, spraaksynthese voor minder gebruikte talen en nog veel meer.