Warning:
Programme is being prepared.
Degree programme objectives
This program will focus on data science from the perspective of the five fundamental pillars of OSEMN (Obtain, Scrub, Explore, Model, and iNterpret), i.e., (1) obtaining new data through scientific studies and measurements, as well as acquiring existing data from databases, servers, and the internet; (2) data management, data cleaning, imputing missing observations, standardizing data into a predefined format, and statistical programming; (3) exploratory data analysis and visualization; (4) computational statistics and statistical modeling; (5) interpretation of results. Students will learn to use SQL, R, Python, and SAS in data science. They will also learn to design scientific and clinical studies, select statistical units, analyze real-world data, write about numerical results, and present findings. The program is designed for students interested in advanced working with real-world data and statistics. Graduates will find opportunities in various fields that collect and analyze data.Upon completing this program, students can apply their knowledge in various scientific fields focused on data collection and analysis. These fields include medicine (clinical and medical research, epidemiology, disease diagnostics), geography, ecology, and hydrology (climate change, weather forecasting, analysis of extreme events such as floods), as well as economics and finance (econometric analysis, risk management, and portfolio theory in financial markets).
It is a new two-year follow-up master's program that will partially build upon and innovate specific courses from the master's program in Applied Mathematics and incorporate courses from the Faculty of Informatics.
Studies
- ObjectivesThis program will focus on data science from the perspective of the five fundamental pillars of OSEMN (Obtain, Scrub, Explore, Model, and iNterpret), i.e., (1) obtaining new data through scientific studies and measurements, as well as acquiring existing data from databases, servers, and the internet; (2) data management, data cleaning, imputing missing observations, standardizing data into a predefined format, and statistical programming; (3) exploratory data analysis and visualization; (4) computational statistics and statistical modeling; (5) interpretation of results. Students will learn to use SQL, R, Python, and SAS in data science. They will also learn to design scientific and clinical studies, select statistical units, analyze real-world data, write about numerical results, and present findings. The program is designed for students interested in advanced working with real-world data and statistics. Graduates will find opportunities in various fields that collect and analyze data.
Upon completing this program, students can apply their knowledge in various scientific fields focused on data collection and analysis. These fields include medicine (clinical and medical research, epidemiology, disease diagnostics), geography, ecology, and hydrology (climate change, weather forecasting, analysis of extreme events such as floods), as well as economics and finance (econometric analysis, risk management, and portfolio theory in financial markets).
It is a new two-year follow-up master's program that will partially build upon and innovate specific courses from the master's program in Applied Mathematics and incorporate courses from the Faculty of Informatics.
- Learning Outcomes
After successfully completing his/her studies the graduate is able to:
- explain and apply advanced concepts of statistical inference, estimation and testing techniques, apply probability theory to understand the methodology;
- recognize common violations of classical assumptions, assess the impact of assumption violations, apply diagnostic tests and techniques, use alternative modelling approaches;
- understand the concept of flexible data modelling, apply and adapt non-linear and machine learning methods for flexible data modelling, evaluate model performance, identify and mitigate overfitting;
- understand the principles of predictive modelling and applications in decision-making, construct predictive models, train, validate and test the models, evaluate model performance, incorporate uncertainty and risk, optimize the models;
- understand the characteristics and representation of various types of complex data, including space-time data and sequential data, preprocess and clean such data, implement specific statistical and machine learning models;
- differentiate between various model architectures, choose and justify suitable statistical techniques for various datasets, apply systematic approaches for model selection, identify and explain the model components;
- understand key components of reports and principles of effective data visualization, create high-quality visualizations, summarize and interpret statistical results of a model, organize reports logically.
- Occupational Profiles of GraduatesGraduates of the Master's program in Statistical Data Science will possess analytical, statistical, and programming skills and a broad knowledge of modern mathematical and statistical modeling. Their significant market applicability lies in the planning, simulation, evaluating, and comparing scenarios for projects and their sustainability. Graduates' analytical skills, ability to model various scenarios, simulate their future impacts, and choose the optimal approach represent a valuable contribution to organizations in many industries.
The expected areas of employment for graduates include:
- research and evaluation of the impact of vaccines and medications,
- analysis and planning of environmental impacts, water use, and ensuring sustainable extraction,
- financial institutions, banking, and insurance, developing mathematical and statistical models for financial markets and trading
- climate modeling, climate change, greenhouse gas emission analysis, forecasts, and adaptation strategies,
- innovations and research in clean energy technologies and renewable energy, energy efficiency modeling,
- analyzing production flows in industrial enterprises and designing and evaluating new methods that will be sustainable and effective in production,
- analyzing and planning accessible and sustainable transportation systems, public and shared transportation,
- analyzing and planning methods for monitoring the impacts of sustainable development on tourism,
- a support for research projects in scientific institutions, working with large datasets, and designing experiments,
- public administration bodies, including ministries of the environment and industry and trade, non-governmental organizations, or think tanks focused on sustainability.
- Practical TrainingInternships are not part of this study program.
- Goals of ThesesCompleting and defending a master's thesis is a mandatory part of the Statistical Data Science study program. By working on the master's thesis, the student demonstrates the ability to navigate the topic of the thesis, conduct specialized work under the guidance of their supervisor, and deliver both written and oral presentations. Guidelines for the preparation of the master's thesis are specified in the Dean's Measure 3/2019: Guidelines for drawing up Bachelor's, Master's and Advanced Master's theses at the Faculty of Science of Masaryk University.
- Access to Further StudiesA graduate of the Master's study program may (upon meeting the admission requirements) continue their studies in the doctoral program in Mathematics and Statistics (specialization in Probability, Statistics, and Mathematical Modeling, among others) or another related doctoral program in fields such as Applied Mathematics, Statistics, and Mathematics, both at Czech and foreign universities.