Description of Courses of Master of Science in Data Science & Analytics
Course Code: DSA5001 Statistical Methods and Probability Credits: 3 |
Course Rationale
Data scientists often have to describe and evaluate evidence based on the verities of information. For this, they often describe and simulate the samples to understand how the population behaves and for that they make assumptions about the underlying probability distributions of the variable. This course will help them to make propositions about the entire population of the data. This is one of the core reasons why statistical inference is important among the data scientists.
Course Contents:
Descriptive and Inferential Statistics; Variables and Types of Data; Sampling; Frequency distributions; Histograms; Other types of charts and graphs; Scatter plots; Measures of central tendency; Measures of variation; Basic probability; Conditional probability, multiplication rule; Probability distributions; Expected value and variance; Binomial distribution; Normal distribution; Applications of the normal distribution; Central Limit Theorem. Collecting, summarizing and visualizing data; Distribution of sampling statistics; Point estimation and confidence intervals; Hypothesis testing; Inference with two populations; Maximum likelihood; Nonparametric methods.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs |
PLO1 |
PLO2 |
PLO3 |
PLO4 |
PLO5 |
PLO6 |
PLO7 |
PLO8 |
PLO9 |
CLO 1 |
√ |
|
|
|
|
|
|
|
|
CLO 2 |
|
√ |
|
|
|
|
|
|
|
CLO 3 |
|
√ |
|
|
|
|
|
|
|
CLO 4 |
|
|
|
√ |
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs
|
CLO Description
|
Domain/Level
|
Assessment Tool
|
CLO1
|
Describe descriptive statistics
|
Analyze
|
Term and Final Examination
|
CLO2
|
Solve problems by using probabilistic approach.
|
Apply
|
Assignment, Term and Final Examination
|
CLO3
|
Explain the basic ideas and methods about fundamental principles for statistical inference.
|
Analyze
|
Term and Final Examination
|
CLO4
|
perform point estimation, hypothesis testing (parametric and non-parametric) and interval estimation under a large variety of situation.
|
Apply
|
Assignment, Term and Final Examination
|
Course Code: DSA5002 |
Course Title: Programming for Data Science |
Credits: 3 |
Course Rationale
This is the computing course which will help the students have the basic knowledge of data and statistics needed to go for computation. This programming software Python/ R provides the user a wide scope of data manipulation and analysis of interest. The course will not only help the students to learn the basic statistical calculations but also emphasize programming so that they may write their own program for advanced statistical analysis.
Course Content
Understand and utilize Python/ R/RStudio; Understand basic data types and data structures in Python/R. Introductory concepts of Python/R, Getting data into Python/R (Familiarize and load data files into Python/R, basic data manipulation; Sub-setting, Basic statistical calculations; Visualize datasets using low-level and high-level plots in Python/R. Concept of Array and matrix, Inference, random sampling; Regression, Programming with R; Writing R functions, Control structures.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs | PLO1 | PLO2 | PLO3 | PLO4 | PLO5 | PLO6 | PLO7 | PLO8 | PLO9 |
CLO 1 | √ |
|
|
|
|
|
|
|
|
CLO 2 |
|
|
|
|
|
√ |
|
|
|
CLO 3 |
|
|
√ |
|
|
|
|
|
|
CLO 4 |
|
√ |
|
|
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO1 | Understand basic data structures in R | Understand | Term and Final Examination |
CLO2 | Getting data into R (data import, export) | Analyze | Term and Final Examination |
CLO3 | Visualize datasets using plots | Understand | Term and Final Examination |
CLO4 | Writing R functions, Control structures | Apply | Assignment, Term and Final Examination |
Course Code: DSA5003 |
Course Title: Database Management Systems |
Credits: 3 |
Course Rationale:
The objective of the course is to present an introduction to database management systems, emphasizing how to organize, maintain and retrieve - efficiently and effectively - information from a DBMS.
Course Contents:
Database System- concepts and architecture; The Relational Model; Conceptual Modeling Data Definition Languages and SQL; Relational Algebra and SQL; Database Design and Normalization; Triggers and Active Databases; Physical Data Organization and Indexing; Dynamic Multi-level indexing using B Trees and B+ Trees, Query Processing; Transaction Processing; Design Coding and Testing.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
√
|
|
|
|
|
|
|
|
|
CLO 2
|
|
|
|
√
|
|
|
|
|
|
CLO 3
|
|
|
|
√
|
|
|
|
|
|
CLO 4
|
|
|
√
|
|
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLO Description | Domain/Level | Assessment Tool | |
CLO 1 | Describe the fundamental elements of relational database management systems. | Understand | Term and Final Examination |
CLO 2 | Explain the basic concepts of the relational data model | Understand | Assignment, Term Examination |
CLO 3 | Design models to represent simple database application scenarios | Analyze | Term and Final Examination |
CLO 4 | Improve the database design. | Analyze | Term and Final Examination |
Course Code: DSA5004 |
Course Title: Regression Analysis |
Credits: 3 |
Course Rationale:
Real-life events are related to each other make up a system; one or more events influence one or more other events. To understand such a system regression modeling are a great tool. This course also puts emphasis on the justification of a model that is built by incorporating different residual-based model diagnostic measures.
Course Contents:
Simple and multiple regression model and theirestimation, assumptions, residual analysis, Interaction in regression models, Reduced and full models, F test for general linear hypothesis; Dummy variables and interpretation of model parameters; Multicollinearity: sources, diagnostic measures, remedy measures, ridge regression; Influence Measures:Detection of outliers by residual analysis, influence statistics- Cook’s D, Cov Ratio etc. Resistant fitting techniques such as M Estimate, MM-Estimate for data sets with outliers; Model Selection, All possible regression, forward selection, backward elimination, and stepwise algorithm; Polynomial Regression Models; Non-linear Regression, kernel estimator.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO1
|
√
|
|
|
|
√
|
|
|
|
|
CLO2
|
|
|
|
|
√
|
|
|
|
|
CLO3
|
|
|
√
|
√
|
|
|
|
|
|
CLO4
|
|
|
|
|
√
|
√
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO1 | Build simple, multiple, polynomial and non-linear regression models | Create | Assignments, Term and final examinations |
CLO2 | Evaluate an estimated regression model’s performance through rigorous statistical tests and measures | Evaluate | Assignments, Term and final examinations |
CLO3 | Diagnose model assumptions through visualization of residuals and different statistical measures | Analyze | Assignments, Term and final examinations |
CLO4 | Select best models among all possible regression models through step-by-step systematic methods. | Analyze | Assignments, Term and final examinations |
Course Code: DSA5005 |
Course Title: Multivariate Statistical Analysis |
Credits: 3 |
Course Rationale:
In many observational or designed studies, observations are collected simultaneously on more than one variable on each experimental unit. With the availability of inexpensive, fast and efficient computing resources and statistical packages there has been a growth in the application of these techniques. This course introduces the student to various multivariate data analysis tools and dimension reduction techniques for high dimensional data. Thus, this course is important who wants to build carrier in data scientist.
Course Contents:
Applications of multivariate techniques, Organization of data, Distance; Basics of linear algebra including matrices, singular value decomposition, Random vectors and matrices, Mean vectors and Covariance matrices; The multivariate normal density and its properties; Hotelling T2 and likelihood ratio tests, confidence regions and simultaneous comparisons of component means. Large sample inference about a population mean vector; Comparisons of several multivariate means; Introduction and concepts of principal components and its application; Factor analysisand methods of estimation, applications; Cluster analysis, Hierarchical and Non-Hierarchical Methods; separation and classification for two populations; profiles analysis, repeated measures designs and growth curves.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs | PLO1 | PLO2 | PLO3 | PLO4 | PLO5 | PLO6 | PLO7 | PLO8 | PLO9 |
CLO 1 | √
|
|
|
|
|
|
|
|
|
CLO 2 | √
|
|
|
|
|
|
|
|
|
CLO 3 |
|
|
|
|
√
|
|
|
|
|
CLO 4 |
|
|
|
|
|
√
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs
|
CLO Description
|
Domain/Level
|
Assessment Tool
|
CLO 1
|
Understand the principles and characteristics of the multivariate data analysis, including strengths and weaknesses.
|
Understand
|
Term and Final Examination
|
CLO 2
|
Select appropriate techniques of multivariate analysis for a multivariate dataset.
|
Understand
|
Term and Final Examination
|
CLO 3
|
Apply the multivariate methods in the framework of the multivariate analysis.
|
Apply
|
Term and Final Examination
|
CLO 4
|
use the statistical software to analyze data and to make proper interpretations of the results.
|
Analyze
|
Term and Final Examination
|
Course Code: DSA5006 |
Course Title: Machine Learning |
Credits: 3 |
Course Rationale:
Machine learning methods are probabilistic methods used to make computers self-learn from a given data and improve over that over the time when more new data are provided. This course provides knowledge on some of these machine learning methods.
Course Contents:
Basic concepts. Supervised, unsupervised and reinforcement learning.
Support vector machine; Decision tree; random forest. K-NN, Kernel based regression. Bagging and boosting. Naïve Bayes, Bayesian network. Artificial neural network, Convolutional neural network.Basic idea of Deep Learning; Natural language processing; Text and image recognition.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs | PLO1 | PLO2 | PLO3 | PLO4 | PLO5 | PLO6 | PLO7 | PLO8 | PLO9 |
CLO1 | √
|
||||||||
CLO2 | √
|
√
|
√
|
||||||
CLO3 | √
|
√
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs
|
CLO Description
|
Domain/Level
|
Assessment Tool
|
CLO1
|
Compare and contrast supervised and unsupervised learning
|
Examine
|
Assignments, Term and final examinations
|
CLO2
|
Apply machine learning methods such as neural networks and Bayesian networks etc. on different data to teach a machine on that matter.
|
Apply
|
Assignments, Term and final examinations
|
CLO3
|
Train a machine for natural language processing, text recognition, image detection etc.
|
Apply
|
Assignments and project work
|
Course Code: DSA5007 |
Course Title: Big Data & Cloud Computing |
Credits: 3 |
Course Rationale:
Today’s technology allows a lot of data to be acquired easily. To make decisions based on these data, one needs to know how to store, manage, and process or analyze such data keeping in mind the complexity of the data and the current constraint of processing power. This course provides such knowledge.
Course Contents:
Definition and Features of Big Data, Challenges of Big Data; Cloud Computing, Relationship between Cloud Computing and IoT; Types of Data Sources, Sampling, Types of Data Elements, Big Data Generation, Enterprise Data, IoT Data, Internet Data, Bio-Medical Data, Data Collection, Data Transportation, Data Pre-Processing, Visual Data Exploration and Exploratory Statistical Analysis; Storage System for Massive Data, Distributed Storage System, Storage Mechanism for Big Data, Database Technology, Design Factors, Database Programming; Multi-Core Versus Distributed Systems, Distributed Algorithms, Distributed Hash Tables, Bulk Synchronous Parallel (BSP), MapReduce Paradigm; Big Data Search and Retrieval, K-Means Clustering, Social Network Clustering—Topology Discovery, Clustering Algorithm to find Network Topologies, Social Network Condensation, Text Sentiment Mining, Big Data Mining and Analysis Tools.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs | PLO1 | PLO2 | PLO3 | PLO4 | PLO5 | PLO6 | PLO7 | PLO8 | PLO9 |
CLO1 | √ | √
|
|||||||
CLO2 | √ | √ | √ | √ | |||||
CLO3 | √ | ||||||||
CLO4 | √ |
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs
|
CLO Description
|
Domain/Level
|
Assessment Tool
|
CLO1
|
Acquire, store and manage big data
|
Examine
|
Assignments or project work
|
CLO2
|
Analyze big data by means of visualization and statistical and probabilistic methods
|
Analyze
|
Assignments, Term and final examinations
|
CLO3
|
Reduce the dimension of a data for efficient analysis
|
Apply
|
Assignments, Term and final examinations
|
CLO4
|
Apply knowledge on big data on different fields, where such data are available.
|
Apply
|
Assignments or project work
|
Course Code: DSA5099 |
Course Title: Research Project |
Credits: 7 |
Course Rationale
After successful completion of 22 credits of the core courses, a student shall have to register for a research project, which is compulsory and will continue for the remaining one or two semesters. The primary purpose of this research project is to apply the knowledge that the student gained during her/his coursework to a practical problem and to gain firsthand experience in doing research. The experience gained in the research project will help the graduates in securing positions in industries as researchers, analysts, and data scientists. Furthermore, this project also can help the students to get admissions for higher studies, such as PhD into national and international universities.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
|
√
|
√
|
√
|
√
|
√
|
√
|
|
|
CLO 2
|
√
|
|
|
|
|
|
|
|
|
CLO 3
|
|
|
|
|
|
|
√
|
√
|
|
CLO 4
|
|
|
|
|
|
|
|
√
|
√
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO1 | Explain and analyze statistical and data-oriented problems independently and confidently. | Analyze |
Project paper and presentation |
CLO2 | Develop critical thinking through observation and knowledge. | Evaluate | |
CLO3 | Develop communication skills in the scientific community | Examine | |
CLO4 | Apply their knowledge in different fields to contribute to society through research activities. | Apply |
Course Code: DSA5011 |
Course Title: AI and Deep Learning |
Credits: 3 |
Course Rationale:
The main purpose of this course is to provide the most fundamental knowledge to the students so that they can understand what the AI is and be able to understand the possible applications of AI in real systems. Thus, this course advances the knowledge of machine learning methods, focuses on the advanced neural network-based learning algorithms.
Course Contents:
An introduction to the basic principles, techniques, and applications of Artificial Intelligence; Coverage includes knowledge representation, logic, inference, problem solving, search algorithms, game theory, perception, learning, planning, and agent design; Programming in AI language tools; Potential areas of further exploration include expert systems, neural networks, fuzzy logic, robotics, natural language processing, and computer vision; Artificial Neural Network. Deep feedforward, convolutional, recurrent, and graph neural networks; generative adversarial networks.Learning from data, performance evaluation and network visualization.Handling sequential and relational data.Representations and analysis for images, text and structured data.
Mapping of course learning outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO1
|
√
|
|
|
|
|
|
|
|
|
CLO2
|
|
|
√
|
|
|
|
|
|
|
CLO3
|
|
|
|
|
√
|
|
|
|
|
CLO4
|
|
|
√
|
|
|
|
|
|
|
Mapping of the course learning outcomes (CLOs) with the teaching-learning & assessment strategy
CLOs | CLO description | Domain/Level | Assessment tool |
CLO1 | Demonstrate fundamental understanding of the AI and its foundations. | Understand | Term and final examination |
CLO2 | Apply basic principles of AI in solutions that require problem solving, inference, perception, knowledge representation, and learning. | Apply | Assignments, Term and final examination |
CLO3 | Demonstrate awareness and a fundamental understanding of various applications of AI techniques in intelligent agents, expert systems, artificial neural networks and other machine learning models. | Apply | Assignments, Term and final examination |
CLO4 | Demonstrate proficiency in applying scientific method to models of machine learning. | Analyze | Assignments, Term and final examination |
Course Code: DSA5021 |
Course Title: Time Series Analysis & Forecasting |
Credits: 3 |
Course Rationale
This course will provide basic knowledge on time dependent observations. Students will get the idea of time dependent models. The emphasis on choosing on appropriate model for a given time dependent data set will lead to forecast the future phenomena.
Course Content
Descriptive techniques, trend, seasonality, the correlogram. White noise (WN), Transformation to stationarity, Stationary Time series with practical examples; Probability models for time series: stationarity. Moving average (MA), Autoregressive (AR), ARMA, ARIMA, SARIMA models with applications to economics, engineering and biomedical sciences; Estimating the autocorrelation function and fitting ARIMA models; Forecasting: Exponential smoothing, Forecasting from ARIMA models; Stationary multivariate models: Stationary multivariate models with application to real life data. Dynamic simultaneous equations models, Vector autoregression (VAR) models, Granger causality, Impulse response functions, Variance decompositions, Structural VAR models; Nonstationary Multivariate models: Nonstationary Multivariate models with examples. Spurious regression, Cointegration, Granger representation theorem, Vector error correction models (VECMs), Structural VAR models with cointegration, testing for cointegration, estimating the cointegrating rank, estimating cointegrating vectors; Stationary processes in the frequency domain: The spectral density function, the periodogram, spectral analysis with Empirical aspects of spectral analysis State-space models: Dynamic linear models and the Kalman filter with applications of filter. Regression for Rates; Empty Cells and Sparseness in Modeling Contingency Tables.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs | PLO1 | PLO2 | PLO3 | PLO4 | PLO5 | PLO6 | PLO7 | PLO8 | PLO9 |
CLO 1 |
|
√
|
|
|
|
|
|
|
|
CLO 2 |
|
|
|
|
√
|
|
|
|
|
CLO 3 |
|
|
|
√
|
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs
|
CLO Description
|
Domain/Level
|
Assessment Tool
|
CLO1
|
Understand the time dependent data set
|
Understand
|
Assignment
|
CLO2
|
Derive the properties of time dependent models and choose appropriate model
|
Analyze
|
Term and Final Examination
|
CLO3
|
Compute forecasts for a variety of models
|
Apply
|
Assignment, Term and Final Examination
|
Course Code: DSA5041 |
Course Title: Applied Econometrics |
Credits: 3 |
Course Rationale
Econometrics is a rapidly developing branch of economics which, broadly speaking, aims to give empirical content to economic relations. Econometrics can be defined generally as the application of mathematics and statistical methods to the analysis of economic data. The objective of this course is to prepare students for basic empirical work in economic data. Thus, the course is important for students who want to make a career as data analyst in applied economics and related fields.
Course Content
Types, nature and sources of Econometrics data, Methodology of Econometrics; Heteroscedasticity, Multicollinearity and Autocorrelation: Concept and nature, Sources and consequences, Detection and Remedial Measures; Model Specification: Consequences of under and over specification, model selection criteria; Non-linear models: Estimation and application of Cobb-Doglas production function and other nonlinear functions of econometric data; Simultaneous equation models : Simultanious equation bias, Inconsistance of OLS estimations, Types and rules of identification, Estimation of simultanious estimation methods: Methods of idirect least square(ILS) and two stage least square(2SLS); Panel Data Models: Fixed Effects and Random Effects models in econometrics; Dynamic Econometric Model: Lag Models, Meaning of Dynamic model, Autoregressive and distributed lag models and estimation.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
√
|
|
|
|
|
|
|
|
|
CLO 2
|
|
√
|
|
|
|
|
|
|
|
CLO 3
|
√
|
|
|
|
|
|
|
|
|
CLO 4
|
|
|
|
|
√
|
|
|
|
|
CLO 5
|
|
|
|
|
|
√
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO 1 | Explain core concepts and methods in econometrics where the classical linear regression model gets more attention. | Understand | Term and Final Examination |
CLO 2 | Basic knowledge of how to detect and treat violations of the OLS method. | Understand | Term and Final Examination |
CLO 3 | Understand about nonlinear econometric models, Simultaneous equation models, Fixed Effects, Random Effects, and Dynamic Models. | Understand | Term and Final Examination |
CLO 4 | Apply appropriate econometric methods in various econometric data. | Apply | Term and Final Examination |
CLO 5 | Use statistical software to implement the various modes taught employing secondary data and demonstrate ability to analyse and assess empirical results. | Analyze | Term and Final Examination |
Course Code: DSA5043 |
Course Title: Business Process Analytics |
Credits: 3 |
Course Rationale
The goal is to develop an understanding of modern business processes and to introduce the student to several process management tools including simulation. This course focuses on the utilization of problem-solving techniques applied to the functional areas of business under risk and uncertainty. The students will be expected to apply the methodology to improve processes in their personal or professional life.
Course Contents:
Introduction to ARIS: BPM methodology and ARIS; Models, objects, occurrences and relationship. Before modeling a process…create the object libraries: The ARIS metamodel and the object libraries (organizational chart and application portfolio); Object Properties. Model a process: Fundamentals of process modeling; EPC (Event-Driven-Process-Chain) methodolology; Process flow rules; Detailed process design; Process hierarchy and Model’s assignment; Connecting processes together - process interfaces; Top-Down approach; End to End process navigation. Basic Process Analysis: zAdmin/Configuration; Analysis; Find and report; Database Queries.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
√
|
|
|
|
|
|
|
|
|
CLO 2
|
|
√
|
|
|
|
|
|
|
|
CLO 3
|
|
√
|
|
|
|
|
|
|
|
CLO 4
|
|
|
|
|
√
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO 1 | Describe and apply business process modelling techniques | Understand | Term and Final Examination |
CLO 2 | Use appropriate business process modelling techniques to best understand a business context and need for an information system | Apply | Term and Final Examination |
CLO 3 | Design and analyse information systems using a business process' lenses | Apply | Term and Final Examination |
CLO 4 | Document business processes for information systems design and analysis | Analyze | Term and Final Examination |
Course Code: DSA5045 |
Course Title: Machine Learning for Finance |
Credits: 3 |
Course Rationale:
The purpose of this course is to introduce students to the theory and practice of supervised and reinforcement learning to big data problems in finance. This course emphasizes the various mathematical frameworks for applying machine learning in quantitative finance, such as quantitative risk modeling with kernel learning and optimal investment with reinforcement learning. Neural networks are used to implement many of these mathematical frameworks in finance using real market data.
Course Contents:
Introduction: History, Comparisons, Definitions, Applications in Finance, Linear regression, Ensemble Models, Time Series, Classification Models, overfitting, regularisation, loss functions, feature engineering, exploratory data analysis, data munging; Preprocessing: mismatch processing (type, length, missing, frequency, dimension), data augmentation (transforming, interacting, mapping, synthesising, extracting), feature selection (linear, non-linear, causal, conditional); Model Development: specification (handcrafted, supervised, unsupervised, reinforcement learning, hybrid model), optimisation (hyperparameter tuning, cross validation), results (accuracy, robustness, performance); Explanations: importance (permutation, conditional, relative), examples (prototypical, counterfactual, contrastive), visualisations (partial dependence, individual expectations, accumulated effects) ; Unsupervised Learning: clustering and dimensionality reduction; Deep learning: classification, regression, and generative models; Natural Language Processing; Reinforcement Learning; Asset Management: trading strategies (asset pricing, ranking), weight optimisation (ESG investing, unsupervised methods), risk management (recession prediction, regime switching, bet sizing).
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
|
√
|
|
|
|
|
|
|
|
CLO 2
|
|
|
√
|
|
|
|
|
|
|
CLO 3
|
|
√
|
|
|
|
|
|
|
|
CLO 4
|
|
|
|
√
|
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO 1 | Execute the basics of Neural Networks for point estimation from financial data. | Analyze | Term and Final Examination |
CLO 2 | Use the basics of Gaussian Processes for financial risk modeling. | Analyze | Term and Final Examination |
CLO 3 | Use the basics of Reinforcement Learning for optimal stochastic control problems in finance. | Analyze | Term and Final Examination |
CLO 4 | gain hands on experience working with real market data and implementing machine learning methods in Python or R. | Apply | Term and Final Examination |
Course Code: DSA5047 | Course Title: Data Analytics for Finance | Credits: 3 |
Course rational:
The aim of this module is to provide a comprehensive understanding of the theory and practice of data analysis in Finance and wider business domains. The course will leverage the wide use of Data Analytics software tools, such as Excel, Tableau, SPSS, STATA, R and Python among finance practitioners and develop their understanding of the Data Analysis functions and complement this with other analytics tools.
Course Contents:
Strategic Data Management in Finance: Challenges in Human Decision Making, Data Analytics Processes, Financial Big Data for Competitive Advantage; Strategic Data Management in Finance; Exploratory Data Analysis: Data Preparation, Sampling, Descriptive Statistics, Missing Values, Outliers, Data Distributions hypothesis tests and Confidence Intervals, Interpret outputs from statistical software to analyze patterns in financial data for signaling unexpected fluctuations e.g. Fraud Analysis, detecting anomaly transactions etc., Factor Analysis; Principal Component Analysis (PCA); Cluster Analysis, Regression and Big Data; Time Series analysis: Forecasting Revenue/Margin, Predicting Going Concern status, What-if analysis etc.; Machine Learning methods and their applications in Finance data; Evaluate ML techniques for analysis of high-volume financial data - Classifying fraud risk factors in financial transactions - Profiling customer buying and payment performance based on known or unknown attributes - Clustering financial data for internal control procedure; Model Evaluation: Misclassification Rate on a Hold-out Test Set, ROC Curves, Confusion Matrix; Data Ethics in Finance.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
|
√
|
|
|
|
|
|
|
|
CLO 2
|
|
|
|
√
|
|
|
|
|
|
CLO 3
|
|
|
|
|
√
|
|
|
|
|
CLO 4
|
|
|
|
|
√
|
|
|
|
|
CLO5
|
√
|
|
|
|
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs
|
CLO Description
|
Domain/Level
|
Assessment Tool
|
CLO 1
|
Discuss the strategic processes, benefits and challenges of applying Data Analysis techniques to improve decision making for competitive advantage
|
Apply
|
Term and Final Examination
|
CLO 2
|
Apply statistical techniques, including regression analysis and time series analysis, to business datasets and interpret results for various user groups.
|
Analyze
|
Term and Final
Examination
|
CLO 3
|
Evaluate the benefits and risks of utilizing machine learning techniques in a variety of business financial activities
|
Analyze
|
Term and Final
Examination
|
CLO 4
|
Interpret outputs from a variety of Data Analytics software tools, such as Excel, Tableau, SPSS, STATA, R and Python, and discuss the risks in making business recommendations from these outputs
|
Apply
|
Term and Final
Examination
|
CLO 5
|
Assess data protection, data privacy and other ethical issues associated with the utilization of large corporate datasets.
|
understand
|
Term and Final Examination
|
Course Code: DSA5061 |
Course Title: Environmental Data Analysis and Climate Change |
Credits: 3 |
Course rational:
The primary focus of this course is on probability-based statistical methods employed in the environmental, earth, and ecological sciences. Students will be exposed to the concepts behind these approaches, the computational techniques to implement them, and their application to common problems in environmental science. Attention throughout the course will be given to environmental and climate change applications, and in particular data and models.
Course Contents:
Pollution and its Importance, Why does Pollution happen, Pollutant Sources, Detail Study of Air and Water Pollution, Global Climate Change; Concept of Environmental Standards, Statistically Verifiable Ideal Standard (SVIS), Guard Point Standards, Standards along Cause-Effect Chain; Applications of Bernoulli, Poisson and Normal Process to Environmental problems; Environmental Sampling methods; Diversity: Measurement of diversity, Different diversity indices; Diffusion and Dispersion of Pollutants, Wedge Machine, Particle Frame Machine, Plume Model; Dilution of Pollutants, Deterministic Dilution, Theory of Successive Random Dilution (SRD), Application of SRD to Environmental Phenomena: Air Quality, Indoor Air Quality, Water Quality, Concentrations of Pollutants in Soils, Plants and Animals, Concentration in Food and Human Tissue; Statistical Theory of Rollback: Predicting concentrations after Source Control, Correlation, Previous Rollback Concepts, Environmental Transport Models in Air and Water; Environment and economics: Theory of Environmental Externalities, Coase Theorem, Environmental Welfare analysis, Trade and Environmental policy, Resource allocation over time, valuing the environment, Cost-benefit analysis, Allocation of resources, Renewable and Non-renewable resources; Machine Learning in Climate data, Prediction of weather and climate, Modeling the weather- Weather analysis-gathering data and analysing weather maps.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
|
√
|
|
|
|
|
|
|
|
CLO 2
|
|
|
|
|
√
|
|
|
|
|
CLO 3
|
|
|
|
|
√
|
|
|
|
|
CLO 4
|
|
|
|
√
|
|
|
|
|
|
CLO 5
|
|
|
|
|
√
|
√
|
|
|
|
CLO 6
|
|
|
|
|
√
|
√
|
√ |
|
|
CLO 7
|
|
|
|
|
√
|
√
|
√
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | Course learning outcomes | Domain/Level | Assessment Tool |
CLO 1 | Learn application areas of environmental statistics. | Recall | Term and Final Examination |
CLO 2 | Evaluate environmental sampling data statistically. | Evaluate | Term and Final Examination |
CLO 3 | Explain the distribution of data. | Examine | Term and Final Examination |
CLO 4 | Develop appropriate hypotheses about sample and population and test hypotheses. | Analyze | Term and Final Examination |
CLO 5 | Define statistical tests used in experimental design. | Analyze | Term and Final Examination |
CLO 6 | Solve statistical calculations and experimental design methods in statistical package programs. | Apply | Term and Final Examination |
CLO 7 | Machine learning applications in environmental data analysis. | Analyze | Term and Final Examination |
Course Code: DSA5063 |
Course Title: Bioinformatics |
Credits: 3 |
Course rational: The aim of this course is to provide an overview of the most common statistical methods for molecular genomics and transcriptomics data analysis, and to provide the necessary information for solving the complex biological problems and achieving the satisfactory score of sustainable development goal (SDG) index from the agriculture, health sectors and other sectors.
Course Contents:
Basic Cell Architecture, the Structure, Content and Scale of Deoxyribonucleic Acid (DNA), History of the Human Genome, Genes and Proteins. Knowledge of Various Databases and Bioinformatics Tools Available at these Resources, the Major Content of the Databases, Nucleic Acid Sequence Databases (Genbank, EMBL, DDBJ), Protein Sequence Databases(SWISS-PROT, Trembl, PIR, PDB), Genome Databases (NCBI, EBI, TIGR, SANGER); Genome Sequencing: DNA sequencing, RNA sequencing, Whole genome sequencing, Basic methods of Sequencing, High-throughput sequencing (HTS) methods, Genome assembly, Next Generation Sequencing (NGS) and application areas; Sequence Analysis: Various File Formats for Bio-Molecular Sequences: Genbank, Fasta, Gcg, Msf, Nbrf-Pir Etc., Basic Concepts of Sequence Similarity, Identity and Homology, BLAST and FASTA Algorithms, Various Versions of Basic BLAST and FASTA, Pairwise and Multiple Sequence Alignments-Concepts of Sequence Alignment, Needleman and Wuncsh, Smith and Waterman Algorithms for Pairwise Alignments, Progressive and Hierarchical Algorithms for MSA; Machine Learning in Bioinformatics: Introduction to Various Machine Learning Techniques and their Applications in Bioinformatics. Genetic Algorithms, Support Vector Machine, Neural Networks.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
√
|
|
|
|
|
|
|
|
|
CLO 2
|
|
|
|
√
|
|
|
|
|
|
CLO 3
|
|
|
|
|
|
√
|
|
|
|
CLO 4
|
|
|
|
|
|
|
√
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO1 | Understand statistical modeling and learn most common statistical methods for genome data analysis.
|
Understand | Term and Final Examination |
CLO2 | Select appropriate statistical algorithms for genome assembling and their analysis
|
Apply | Term and Final Examination |
CLO3 | Analyze genome datasets to provide the necessary information to solve the complex biological problems that are associated with the genetic factors
|
Analyze | Term and Final Examination |
CLO4 | Contribute to the development of agricultural, medicine and health sectors and other sectors. | Evaluate | Term and Final Examination |
Course Code: DSA5065 |
Course Title: Biostatistics & Epidemiology |
Credits: 3 |
Course rational: This course introduces epidemiology and biostatistics to the student with minimal training in public health, the biomedical sciences, and statistics. It is important for the student to be able to identify causal factors and modes of transmission, with the assistance of statistical tools and biomedical information, and reflect the primary aim of epidemiology.
Course Content:
Basic Concepts: Definition, Scope of Epidemiology, uses of Epidemiology; Types of Epidemiologic Studies: Cross Sectional, Cohort, Case-Control, Retrospective and Prospective, Clinical Trials, Community Intervention and Cluster Randomized Trials; Measures of Disease Frequency: Incidence and Prevalence Rates, Relation between Incidence and Prevalence, Case Fatality Rate, Risk Ratio, Rate Ratio, Risk Difference, Rate Difference, Mortality Measures, Standardized Mortality Ratio; Measures of Association between Disease and Risk Factor: Relative Risk, Attributable Risk, Odds Ratio; Screening, Properties of Screening Test: Sensitivity, Specificity, Negative and Positive Predictive Values; Biostatistics Scope of Biostatistics: Survivor Function, Hazard Function, their Inter Relationships; Censoring and Truncation; Type I, Type II and Random Censoring; Non-Parametric Methods of Estimating Survivor Functions: Life Table Method, Product Limit Method, Variance Estimates, Cumulative Hazard Function, Plots Involving Estimated Survivor and Hazard Functions; Inference Procedures for Exponential Distributions: One Parameter Exponential Distribution with Type I and Type II Censored Data, Comparison of Exponential Distributions; Exponential Regression Model: Method of Estimation, Tests of Hypothesis. Assignment and/or a mini project to be completed on the basis of the above topics by Microsoft-Excel and or R.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs | PLO1 | PLO2 | PLO3 | PLO4 | PLO5 | PLO6 | PLO7 | PLO8 | PLO9 |
CLO 1 |
|
√
|
|
|
|
|
|
|
|
CLO 2 |
|
|
|
√
|
|
|
|
|
|
CLO 3 |
|
|
|
√
|
√
|
√
|
|
|
|
CLO 4 |
|
|
|
√
|
√
|
√
|
|
|
|
CLO 5 |
|
|
|
√
|
√
|
√
|
|
|
|
CLO 6 |
|
|
|
|
|
|
√
|
|
|
CLO 7 |
|
|
|
|
|
|
√
|
|
√
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs
|
CLO Description
|
Domain
/Level |
Assessment Tool
|
CLO1
|
Apply the theoretical foundations of probability theory and distribution theory
|
Analyze
|
Term and Final Examination
|
CLO2
|
Derive the theoretical mathematics of statistical inferences
|
Analyze
|
Term and Final Examination
|
CLO3
|
Perform linear regression model fitting and diagnosis
|
Apply
|
Term and Final Examination
|
CLO4
|
Perform generalized linear regression model fitting and diagnosis
|
Apply
|
Term and Final Examination
|
CLO5
|
Perform ANOVA analysis and longitudinal data analysis
|
Apply
|
Term and Final Examination
|
CLO6
|
Interpret the results of statistical analysis to public health audience
|
Apply
|
Term and Final Examination
|
CLO7
|
Write scientific reports based on statistical analysis for effective collaboration with public health related scientists in epidemiology, health management and policy, environmental health sciences, nutrition, and health behavior and health education
|
Analyze
|
Term and Final Examination
|
Course Code: DSA 5071 |
Course Title: Generalized linear Model |
Credits: 3 |
Course Rationale:
The course will help the students to build statistical models where the response data may follow any particular probability distribution. The emphasis will be given in finding the link which will help the students derive the models. This course will also guide the students on the procedure of comparing the models and finding out the best fitted one.
Course Content:
Introduction to the Concepts of Modeling; Model Fitting, Exponential Family and Generalized Linear Models (Bernoulli, Binomial, Poisson, Exponential, Gamma, Normal, etc.); Properties of distributions in the exponential family, Random, Systematic and Link Functions, Poisson Regression; Maximum Likelihood Identity Link, Logit Link, Log Link, Parameter Estimation; Score Function and Information Matrix, Estimation, Residuals analysis for GLM, Quasi Likelihood Estimating Equations, Generalized Estimating Equations for Repeated Measures Data, Comparison between Likelihood and Quasi Likelihood Methods, Mixed Effect Models.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
√
|
|
|
|
|
|
|
|
|
CLO 2
|
|
|
|
|
√
|
|
|
|
|
CLO 3
|
|
|
|
√
|
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain
/Level |
Assessment Tool |
CLO1 | Understanding the probability distribution of the population from where the samples are derived | understand | Term and Final Examination |
CLO2 | Find the link to derive statistical model | Apply | Assignment, Term and Final Examination |
CLO3 | Compare the models to find the best one and get the inference from the model | Analyze | Assignment, Term and Final Examination |
Course Code: DSA 5073 |
Course Title: Categorical Data Analysis |
Credits: 3 |
Course Rationale
This course will introduce students to the categorical data and inference regarding the data. There will be emphasis on the understanding the model building of binary response data.
Course Content
Contingency table, Inference of contingency table, GLM for binary and count data, Inference of logistic regression; Nominal Responses: Baseline-Category Logit Models; Ordinal Responses: Cumulative Logit and link Models; Alternative Models for Ordinal Responses; Testing Conditional Independence in I×J×K Tables; Discrete-Choice Multinomial Logit Models; Loglinear Models for Two-Way Tables; Loglinear Models for Independence and Interaction in Three-Way Tables; Inference for Loglinear Models; Loglinear Models for Higher Dimensions; The Loglinear/Logit Model Connection;Modeling Ordinal Associations; Association Models; Association Models, Correlation Models, and Correspondence Analysis; Poisson Regression for Rates; Empty Cells and Sparseness in Modeling Contingency Tables.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
|
√
|
|
|
|
|
|
|
|
CLO 2
|
√
|
|
|
|
|
|
|
|
|
CLO 3
|
|
|
|
|
√
|
|
|
|
|
CLO 4
|
|
|
|
√
|
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO1 | Compute summary statistics of categorical data | analyze | Assignment |
CLO2 | Find the inter dependency of the binary bi-variate data | Analyze | Term and Final Examination |
CLO3 | Derive the models for binary response data | Apply | Assignment, Term and Final Examination |
CLO4 | Finding the estimates using the model | Apply | Assignment, Term and Final Examination |
Course Code: DSA5075 |
Course Title: Design of Experiment |
Credits: 3 |
Course Rationale
Experimental designs play an important role in process development and process troubleshooting in industry, business, agriculture and many others areas. Thus the course objective is to learn how to plan, design and conduct experiments efficiently and effectively, and analyze the resulting data to obtain valid conclusions.
Course Content
Basic concept, Principles of experimental design, Analysis of Variance for Unbalanced Data, Estimates, Confidence Intervals, Inference about Difference between Treatment Means, Multiple Comparisons, Effects and Tests of Departures from Assumptions Underlying the Analysis of Variance Model, Two Way Crossed Classification without Interaction, Model, Assumptions, Mean Squares and Expected Mean Squares, Fixed Effects, Random Effects, Mixed Effects, Tests, Two Way Crossed Classification with Interaction. Model, Assumption, Partition of SS, Mean Squares and Expectations, Fixed Effect, Random Effect and Mixed Effects, Tests, Models for Unbalanced Data. Two Way Nested (Hierarchical) Classification, Model, Assumptions, Fixed Effects, Random Effects and Mixed Effects, Estimation and Tests, Multivariate Analysis of Variance, Repeated Measures Data and ANOVA, Multilevel Models.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
√
|
|
|
|
|
|
|
|
|
CLO 2
|
√
|
|
|
|
|
|
|
|
|
CLO 3
|
|
|
|
√
|
|
|
|
|
|
CLO 4
|
|
|
|
|
√
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs | CLO Description | Domain/Level | Assessment Tool |
CLO 1 | Understand the experimental methods and major experimental designs, and think critically about their proper applications. | Understand | Term and Final Examination |
CLO 2 | Formulate hypotheses that can be tested using experiments.
|
Understand | Term and Final Examination |
CLO 3 | Apply different types of experimental design.
|
Apply | Term and Final Examination |
CLO 4 | Explain the output of analysis of variance (ANOVA) from different design.
|
Analyze | Term and Final Examination |
Course Code: DSA5077 |
Course Title: Actuarial Data Analysis |
Credits: 3 |
Course Rationale
Actuarial statistics is very important because it is essential to know underlying theories behind actuarial computations. In this course some fundamental concepts of actuarial statistics and its applications in insurance and others areas will be discussed. Thus, this course will help how statistical and mathematical models are used in pricing and valuing actuarial products and their real-life applications.
Course Content
The meaning and scope of actuarial science; role of actuaries in insurance, business and Economy; The basic deterministic model, Cashflows, Discount functions, Interest and discount rates, Balances and reserves, Change of discount function, Internal rates of return, Forward prices and term structure, Spreadsheet calculations; The life table: Basic definitions, Probabilities related to life table, Constructing the life table from the values of qx, Life expectancy, Choice of life tables; Life annuities: Calculating annuity premiums, The interest and survivorship discount function, Guaranteed payments, Deferred annuities with annual premiums, Gross premiums, Gender aspects, Spreadsheet calculations; Life insurance: Calculating life insurance premiums, Types of life insurance, Combined insurance–annuity benefits, Insurances viewed as annuities, Spreadsheet applications; Insurance and annuity reserves: Definition, The general pattern of reserves, Interest and mortality bases for reserves, Nonforfeiture values, Policies involving a ‘return of the reserve’, Premium difference and paid-up formulas, Universal life and variable annuities, Spreadsheet applications; Continuous payments: Introduction to continuous annuities, The force of discount, The constant interest case, Continuous life annuities, The force of mortality, Insurances payable at the moment of death, Premiums and reserves, The general insurance–annuity identity in the continuous case, Differential equations for reserves; Multiple-life contracts: Joint-life annuities and insurances, Moment of death insurances, The general two-life annuity contract, The general two-life insurance contract, Contingent insurances, Duration problem, Applications to annuity credit risk. Spreadsheet applications.
Mapping Course Learning Outcomes (CLOs) with the PLOs
CLOs
|
PLO1
|
PLO2
|
PLO3
|
PLO4
|
PLO5
|
PLO6
|
PLO7
|
PLO8
|
PLO9
|
CLO 1
|
√
|
|
|
|
|
|
|
|
|
CLO 2
|
√
|
|
|
|
|
|
|
|
|
CLO 3
|
|
|
|
√
|
|
|
|
|
|
CLO 4
|
|
|
|
|
√
|
|
|
|
|
Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy
CLOs
|
CLO Description
|
Domain/Level
|
Assessment Tool
|
CLO 1
|
Understand the basic concept of actuarial statistics and its applications in real field.
|
Understand
|
Term and Final Examination
|
CLO 2
|
Know the basic principles of actuarial science and actuarial mathematics.
|
Understand
|
Term and Final Examination
|
CLO 3
|
Acquire knowledge in actuarial mathematics/life and other contingencies to appear in actuarial professional examination.
|
Apply
|
Term and Final Examination
|
CLO 4
|
Analyze actuarial computations by using the actuarial statistics.
|
Analyze
|
Term and Final Examination
|