Description of courses M. S. in Data Science & Analytics

Home/MPS/Description of courses M. S. in Data Science & Analytics
  • Description of Courses of Master of Science in Data Science & Analytics

  • Course Code: DSA5001 Statistical Methods and Probability            Credits: 3

     

    Course Rationale

    Data scientists often have to describe and evaluate evidence based on the verities of information. For this, they often describe and simulate the samples to understand how the population behaves and for that they make assumptions about the underlying probability distributions of the variable. This course will help them to make propositions about the entire population of the data. This is one of the core reasons why statistical inference is important among the data scientists.

     

    Course Contents:

    Descriptive and Inferential Statistics; Variables and Types of Data; Sampling; Frequency distributions; Histograms; Other types of charts and graphs; Scatter plots; Measures of central tendency; Measures of variation; Basic probability; Conditional probability, multiplication rule; Probability distributions; Expected value and variance; Binomial distribution; Normal distribution; Applications of the normal distribution; Central Limit Theorem. Collecting, summarizing and visualizing data; Distribution of sampling statistics; Point estimation and confidence intervals; Hypothesis testing; Inference with two populations; Maximum likelihood; Nonparametric methods.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs

    PLO1

    PLO2

    PLO3

    PLO4

    PLO5

    PLO6

    PLO7

    PLO8

    PLO9

    CLO 1









    CLO 2









    CLO 3









    CLO 4










    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs
    CLO Description
    Domain/Level
    Assessment Tool
    CLO1
    Describe descriptive statistics
    Analyze
    Term and Final Examination
    CLO2
    Solve problems by using probabilistic approach.
    Apply
    Assignment, Term and Final Examination
    CLO3
    Explain the basic ideas and methods about fundamental principles for statistical inference.
    Analyze
    Term and Final Examination
    CLO4
    perform point estimation, hypothesis testing (parametric and non-parametric) and interval estimation under a large variety of situation.
    Apply
    Assignment, Term and Final Examination

    Course Code: DSA5002

    Course Title: Programming for Data Science

    Credits: 3

     

    Course Rationale

    This is the computing course which will help the students have the basic knowledge of data and statistics needed to go for computation. This programming software Python/ R provides the user a wide scope of data manipulation and analysis of interest. The course will not only help the students to learn the basic statistical calculations but also emphasize programming so that they may write their own program for advanced statistical analysis. 

    Course Content 

    Understand and utilize Python/ R/RStudio; Understand basic data types and data structures in Python/R. Introductory concepts of Python/R, Getting data into Python/R (Familiarize and load data files into Python/R, basic data manipulation; Sub-setting, Basic statistical calculations; Visualize datasets using low-level and high-level plots in Python/R. Concept of Array and matrix, Inference, random sampling; Regression, Programming with R; Writing R functions, Control structures. 

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs PLO1 PLO2 PLO3 PLO4 PLO5 PLO6 PLO7 PLO8 PLO9
    CLO 1







    CLO 2







    CLO 3







    CLO 4








    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO1 Understand basic data structures in R Understand Term and Final Examination
    CLO2 Getting data into R (data import, export) Analyze Term and Final Examination
    CLO3 Visualize datasets using plots Understand Term and Final Examination
    CLO4 Writing R functions, Control structures Apply Assignment, Term and Final Examination


    Course Code: DSA5003

    Course Title: Database Management Systems

    Credits: 3

     

    Course Rationale:

    The objective of the course is to present an introduction to database management systems, emphasizing how to organize, maintain and retrieve - efficiently and effectively - information from a DBMS.

    Course Contents:

    Database System- concepts and architecture; The Relational Model; Conceptual Modeling Data Definition Languages and SQL; Relational Algebra and SQL; Database Design and Normalization; Triggers and Active Databases; Physical Data Organization and Indexing; Dynamic Multi-level indexing using B Trees and B+ Trees, Query Processing; Transaction Processing; Design Coding and Testing.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    CLO 4









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

      CLO Description Domain/Level Assessment Tool
    CLO 1 Describe the fundamental elements of relational database management systems. Understand Term and Final Examination
    CLO 2 Explain the basic concepts of the relational data model Understand Assignment, Term Examination
    CLO 3 Design models to represent simple database application scenarios Analyze Term and Final Examination
    CLO 4 Improve the database design. Analyze Term and Final Examination

    Course Code: DSA5004

    Course Title: Regression Analysis          

    Credits: 3

     

    Course Rationale:

    Real-life events are related to each other make up a system; one or more events influence one or more other events. To understand such a system regression modeling are a great tool. This course also puts emphasis on the justification of a model that is built by incorporating different residual-based model diagnostic measures. 

    Course Contents:

    Simple and multiple regression model and theirestimation, assumptions, residual analysis, Interaction in regression models, Reduced and full models, F test for general linear hypothesis;  Dummy variables and interpretation of model parameters; Multicollinearity: sources, diagnostic measures, remedy measures, ridge regression; Influence Measures:Detection of outliers by residual analysis, influence statistics- Cook’s D, Cov Ratio etc. Resistant fitting techniques such as M Estimate, MM-Estimate for data sets with outliers; Model Selection, All possible regression, forward selection, backward elimination, and stepwise algorithm; Polynomial Regression Models; Non-linear Regression, kernel estimator.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO1

     
     
     

     
     
     
     
    CLO2
     
     
     
     

     
     
     
     
    CLO3
     
     


     
     
     
     
     
    CLO4
     
     
     
     


     
     
     


    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO1 Build simple, multiple, polynomial and non-linear regression models Create Assignments, Term and final examinations
    CLO2 Evaluate an estimated regression model’s performance through rigorous statistical tests and measures Evaluate Assignments, Term and final examinations
    CLO3 Diagnose model assumptions through visualization of residuals and different statistical measures Analyze Assignments, Term and final examinations
    CLO4 Select best models among all possible regression models through step-by-step systematic methods. Analyze Assignments, Term and final examinations

    Course Code: DSA5005

    Course Title: Multivariate Statistical Analysis

    Credits: 3


    Course Rationale:

    In many observational or designed studies, observations are collected simultaneously on more than one variable on each experimental unit. With the availability of inexpensive, fast and efficient computing resources and statistical packages there has been a growth in the application of these techniques. This course introduces the student to various multivariate data analysis tools and dimension reduction techniques for high dimensional data. Thus, this course is important who wants to build carrier in data scientist. 

    Course Contents:

    Applications of multivariate techniques, Organization of data, Distance; Basics of linear algebra including matrices, singular value decomposition, Random vectors and matrices, Mean vectors and Covariance matrices; The multivariate normal density and its properties; Hotelling T2 and likelihood ratio tests, confidence regions and simultaneous comparisons of component means. Large sample inference about a population mean vector; Comparisons of several multivariate means; Introduction and concepts of principal components and its application; Factor analysisand methods of estimation, applications; Cluster analysis, Hierarchical and Non-Hierarchical Methods; separation and classification for two populations; profiles analysis, repeated measures designs and growth curves.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs PLO1 PLO2 PLO3 PLO4 PLO5 PLO6 PLO7 PLO8 PLO9
    CLO 1








    CLO 2








    CLO 3








    CLO 4








    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs
    CLO Description
    Domain/Level
    Assessment Tool
    CLO 1
    Understand the principles and characteristics of the multivariate data analysis, including strengths and weaknesses.
    Understand
    Term and Final Examination
    CLO 2
    Select appropriate techniques of multivariate analysis for a multivariate dataset.
    Understand
    Term and Final Examination
    CLO 3
    Apply the multivariate methods in the framework of the multivariate analysis.
    Apply
    Term and Final Examination
    CLO 4
    use the statistical software to analyze data and to make proper interpretations of the results.
    Analyze
    Term and Final Examination

    Course Code: DSA5006

    Course Title: Machine Learning

    Credits: 3

     

    Course Rationale:

    Machine learning methods are probabilistic methods used to make computers self-learn from a given data and improve over that over the time when more new data are provided. This course provides knowledge on some of these machine learning methods. 

    Course Contents:

    Basic concepts. Supervised, unsupervised and reinforcement learning.

    Support vector machine; Decision tree; random forest. K-NN, Kernel based regression. Bagging and boosting. Naïve Bayes, Bayesian network. Artificial neural network, Convolutional neural network.Basic idea of Deep Learning; Natural language processing; Text and image recognition.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs PLO1 PLO2 PLO3 PLO4 PLO5 PLO6 PLO7 PLO8 PLO9
    CLO1
                   
    CLO2    


           
    CLO3      

           

    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs
    CLO Description
    Domain/Level
    Assessment Tool
    CLO1
    Compare and contrast supervised and unsupervised learning
    Examine
    Assignments, Term and final examinations
    CLO2
    Apply machine learning methods such as neural networks and Bayesian networks etc. on different data to teach a machine on that matter.
    Apply
    Assignments, Term and final examinations
    CLO3
    Train a machine for natural language processing, text recognition, image detection etc.
    Apply
    Assignments and project work

    Course Code: DSA5007

    Course Title: Big Data & Cloud Computing

    Credits: 3

     

    Course Rationale:

    Today’s technology allows a lot of data to be acquired easily. To make decisions based on these data, one needs to know how to store, manage, and process or analyze such data keeping in mind the complexity of the data and the current constraint of processing power. This course provides such knowledge.

    Course Contents:

    Definition and Features of Big Data, Challenges of Big Data; Cloud Computing, Relationship between Cloud Computing and IoT; Types of Data Sources, Sampling, Types of Data Elements, Big Data Generation, Enterprise Data, IoT Data, Internet Data, Bio-Medical Data, Data Collection, Data Transportation, Data Pre-Processing, Visual Data Exploration and Exploratory Statistical Analysis; Storage System for Massive Data, Distributed Storage System, Storage Mechanism for Big Data, Database Technology, Design Factors, Database Programming; Multi-Core Versus Distributed Systems, Distributed Algorithms, Distributed Hash Tables, Bulk Synchronous Parallel (BSP), MapReduce Paradigm; Big Data Search and Retrieval, K-Means Clustering, Social Network Clustering—Topology Discovery, Clustering Algorithm to find Network Topologies, Social Network Condensation, Text Sentiment Mining, Big Data Mining and Analysis Tools.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs PLO1 PLO2 PLO3 PLO4 PLO5 PLO6 PLO7 PLO8 PLO9
    CLO1            

     
    CLO2          
    CLO3                
    CLO4                

    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs
    CLO Description
    Domain/Level
    Assessment Tool
    CLO1
    Acquire, store and manage big data
    Examine
    Assignments or project work
    CLO2
    Analyze big data by means of visualization and statistical and probabilistic methods
    Analyze
    Assignments, Term and final examinations
    CLO3
    Reduce the dimension of a data for efficient analysis
    Apply
    Assignments, Term and final examinations
    CLO4
    Apply knowledge on big data on different fields, where such data are available.
    Apply
    Assignments or project work

    Course Code: DSA5099

    Course Title: Research Project

    Credits: 7

     

    Course Rationale

    After successful completion of 22 credits of the core courses, a student shall have to register for a research project, which is compulsory and will continue for the remaining one or two semesters. The primary purpose of this research project is to apply the knowledge that the student gained during her/his coursework to a practical problem and to gain firsthand experience in doing research. The experience gained in the research project will help the graduates in securing positions in industries as researchers, analysts, and data scientists. Furthermore, this project also can help the students to get admissions for higher studies, such as PhD into national and international universities.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    CLO 4









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO1 Explain and analyze statistical and data-oriented problems independently and confidently. Analyze



    Project paper and presentation
    CLO2 Develop critical thinking through observation and knowledge. Evaluate
    CLO3 Develop communication skills in the scientific community Examine
    CLO4 Apply their knowledge in different fields to contribute to society through research activities. Apply

    Course Code: DSA5011

    Course Title: AI and Deep Learning

    Credits: 3

     

    Course Rationale:

    The main purpose of this course is to provide the most fundamental knowledge to the students so that they can understand what the AI is and be able to understand the possible applications of AI in real systems. Thus, this course advances the knowledge of machine learning methods, focuses on the advanced neural network-based learning algorithms. 

    Course Contents:

    An introduction to the basic principles, techniques, and applications of Artificial Intelligence; Coverage includes knowledge representation, logic, inference, problem solving, search algorithms, game theory, perception, learning, planning, and agent design; Programming in AI language tools; Potential areas of further exploration include expert systems, neural networks, fuzzy logic, robotics, natural language processing, and computer vision;  Artificial Neural Network. Deep feedforward, convolutional, recurrent, and graph neural networks; generative adversarial networks.Learning from data, performance evaluation and network visualization.Handling sequential and relational data.Representations and analysis for images, text and structured data.

    Mapping of course learning outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO1









    CLO2









    CLO3









    CLO4









    Mapping of the course learning outcomes (CLOs) with the teaching-learning & assessment strategy

    CLOs CLO description Domain/Level Assessment tool
    CLO1 Demonstrate fundamental understanding of the AI and its foundations. Understand Term and final examination
    CLO2 Apply basic principles of AI in solutions that require problem solving, inference, perception, knowledge representation, and learning. Apply Assignments, Term and final examination
    CLO3 Demonstrate awareness and a fundamental understanding of various applications of AI techniques in intelligent agents, expert systems, artificial neural networks and other machine learning models. Apply Assignments, Term and final examination
    CLO4 Demonstrate proficiency in applying scientific method to models of machine learning. Analyze Assignments, Term and final examination

    Course Code: DSA5021

    Course Title: Time Series Analysis & Forecasting

    Credits: 3


    Course Rationale

    This course will provide basic knowledge on time dependent observations. Students will get the idea of time dependent models. The emphasis on choosing on appropriate model for a given time dependent data set will lead to forecast the future phenomena.

    Course Content 

    Descriptive techniques, trend, seasonality, the correlogram. White noise (WN), Transformation to stationarity, Stationary Time series with practical examples; Probability models for time series: stationarity. Moving average (MA), Autoregressive (AR), ARMA, ARIMA, SARIMA models with applications to economics, engineering and biomedical sciences; Estimating the autocorrelation function and fitting ARIMA models; Forecasting: Exponential smoothing, Forecasting from ARIMA models; Stationary multivariate models: Stationary multivariate models with application to real life data. Dynamic simultaneous equations models, Vector autoregression (VAR) models, Granger causality, Impulse response functions, Variance decompositions, Structural VAR models; Nonstationary Multivariate models: Nonstationary Multivariate models with examples. Spurious regression, Cointegration, Granger representation theorem, Vector error correction models (VECMs), Structural VAR models with cointegration, testing for cointegration, estimating the cointegrating rank, estimating cointegrating vectors; Stationary processes in the frequency domain: The spectral density function, the periodogram, spectral analysis with Empirical aspects of spectral analysis State-space models: Dynamic linear models and the Kalman filter with applications of filter. Regression for Rates; Empty Cells and Sparseness in Modeling Contingency Tables.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs PLO1 PLO2 PLO3 PLO4 PLO5 PLO6 PLO7 PLO8 PLO9
    CLO 1








    CLO 2








    CLO 3








    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs
    CLO Description
    Domain/Level
    Assessment Tool
    CLO1
    Understand the time dependent data set
    Understand
    Assignment
    CLO2
    Derive the properties of time dependent models and choose appropriate model
    Analyze
    Term and Final Examination
    CLO3
    Compute forecasts for a variety of models

    Apply
    Assignment, Term and Final Examination

    Course Code: DSA5041

    Course Title: Applied Econometrics

    Credits: 3

     

    Course Rationale

    Econometrics is a rapidly developing branch of economics which, broadly speaking, aims to give empirical content to economic relations. Econometrics can be defined generally as the application of mathematics and statistical methods to the analysis of economic data. The objective of this course is to prepare students for basic empirical work in economic data. Thus, the course is important for students who want to make a career as data analyst in applied economics and related fields.

    Course Content

    Types, nature and sources of Econometrics data, Methodology of Econometrics; Heteroscedasticity, Multicollinearity and Autocorrelation: Concept and nature, Sources and consequences, Detection and Remedial Measures; Model Specification: Consequences of under and over specification, model selection criteria; Non-linear models: Estimation and application of Cobb-Doglas production function and other nonlinear functions of econometric data; Simultaneous equation models : Simultanious equation bias, Inconsistance of OLS estimations, Types and rules of identification, Estimation of simultanious estimation methods: Methods of idirect least square(ILS) and two stage least square(2SLS); Panel Data Models: Fixed Effects and Random Effects models in econometrics; Dynamic Econometric Model: Lag Models, Meaning of Dynamic model, Autoregressive and distributed lag models and estimation. 

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    CLO 4









    CLO 5









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO 1 Explain core concepts and methods in econometrics where the classical linear regression model gets more attention. Understand Term and Final Examination
    CLO 2 Basic knowledge of how to detect and treat violations of the OLS method.  Understand Term and Final Examination
    CLO 3 Understand about nonlinear econometric models, Simultaneous equation models, Fixed Effects, Random Effects, and Dynamic Models. Understand Term and Final Examination
    CLO 4 Apply appropriate econometric methods in various econometric data.   Apply Term and Final Examination
    CLO 5 Use statistical software to implement the various modes taught employing secondary data and demonstrate ability to analyse and assess empirical results. Analyze Term and Final Examination

    Course Code: DSA5043

    Course Title: Business Process Analytics

    Credits: 3

     

    Course Rationale

    The goal is to develop an understanding of modern business processes and to introduce the student to several process management tools including simulation. This course focuses on the utilization of problem-solving techniques applied to the functional areas of business under risk and uncertainty. The students will be expected to apply the methodology to improve processes in their personal or professional life.

    Course Contents:

    Introduction to ARIS:  BPM methodology and ARIS; Models, objects, occurrences and relationship. Before modeling a process…create the object libraries: The ARIS metamodel and the object libraries (organizational chart and application portfolio); Object Properties. Model a process: Fundamentals of process modeling; EPC (Event-Driven-Process-Chain) methodolology; Process flow rules; Detailed process design; Process hierarchy and Model’s assignment; Connecting processes together - process interfaces; Top-Down approach; End to End process navigation. Basic Process Analysis: zAdmin/Configuration; Analysis; Find and report; Database Queries.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    CLO 4









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO 1 Describe and apply business process modelling techniques Understand Term and Final Examination
    CLO 2 Use appropriate business process modelling techniques to best understand a business context and need for an information system Apply Term and Final Examination
    CLO 3 Design and analyse information systems using a business process' lenses Apply Term and Final Examination
    CLO 4 Document business processes for information systems design and analysis Analyze Term and Final Examination

    Course Code: DSA5045

    Course Title: Machine Learning for Finance

    Credits: 3

     

    Course Rationale:

    The purpose of this course is to introduce students to the theory and practice of supervised and reinforcement learning to big data problems in finance. This course emphasizes the various mathematical frameworks for applying machine learning in quantitative finance, such as quantitative risk modeling with kernel learning and optimal investment with reinforcement learning. Neural networks are used to implement many of these mathematical frameworks in finance using real market data.

    Course Contents:

    Introduction: History, Comparisons, Definitions, Applications in Finance, Linear regression, Ensemble Models, Time Series, Classification Models, overfitting, regularisation, loss functions, feature engineering, exploratory data analysis, data munging; Preprocessing: mismatch processing (type, length, missing, frequency, dimension), data augmentation (transforming, interacting, mapping, synthesising, extracting), feature selection (linear, non-linear, causal, conditional); Model Development: specification (handcrafted, supervised, unsupervised, reinforcement learning, hybrid model), optimisation (hyperparameter tuning, cross validation), results (accuracy, robustness, performance); Explanations: importance (permutation, conditional, relative), examples (prototypical, counterfactual, contrastive), visualisations (partial dependence, individual expectations, accumulated effects) ; Unsupervised Learning: clustering and dimensionality reduction; Deep learning: classification, regression, and generative models; Natural Language Processing;  Reinforcement Learning; Asset Management: trading strategies (asset pricing, ranking), weight optimisation (ESG investing, unsupervised methods), risk management (recession prediction, regime switching, bet sizing).

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    CLO 4









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO 1 Execute the basics of Neural Networks for point estimation from financial data. Analyze Term and Final Examination
    CLO 2 Use the basics of Gaussian Processes for financial risk modeling. Analyze Term and Final Examination
    CLO 3 Use the basics of Reinforcement Learning for optimal stochastic control problems in finance. Analyze Term and Final Examination
    CLO 4 gain hands on experience working with real market data and implementing machine learning methods in Python or R. Apply Term and Final Examination
    Course Code: DSA5047 Course Title: Data Analytics for Finance Credits: 3

     

    Course rational: 

    The aim of this module is to provide a comprehensive understanding of the theory and practice of data analysis in Finance and wider business domains. The course will leverage the wide use of Data Analytics software tools, such as Excel, Tableau, SPSS, STATA, R and Python among finance practitioners and develop their understanding of the Data Analysis functions and complement this with other analytics tools.

    Course Contents:  

    Strategic Data Management in Finance: Challenges in Human Decision Making, Data Analytics Processes, Financial Big Data for Competitive Advantage; Strategic Data Management in Finance; Exploratory Data Analysis: Data Preparation, Sampling, Descriptive Statistics, Missing Values, Outliers, Data Distributions hypothesis tests and Confidence Intervals, Interpret outputs from statistical software to analyze patterns in financial data for signaling unexpected fluctuations e.g. Fraud Analysis, detecting anomaly transactions etc., Factor Analysis; Principal Component Analysis (PCA); Cluster Analysis, Regression and Big Data; Time Series analysis: Forecasting Revenue/Margin, Predicting Going Concern status, What-if analysis etc.; Machine Learning methods and their applications in Finance data; Evaluate ML techniques for analysis of high-volume financial data - Classifying fraud risk factors in financial transactions - Profiling customer buying and payment performance based on known or unknown attributes - Clustering financial data for internal control procedure; Model Evaluation: Misclassification Rate on a Hold-out Test Set, ROC Curves, Confusion Matrix; Data Ethics in Finance.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs 
    PLO1 
    PLO2 
    PLO3 
    PLO4 
    PLO5 
    PLO6 
    PLO7 
    PLO8 
    PLO9
    CLO 1








    CLO 2








    CLO 3








    CLO 4








    CLO5








    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs 
    CLO Description 
    Domain/Level 
    Assessment Tool 
    CLO 1
    Discuss the strategic processes, benefits and challenges of applying Data Analysis techniques to improve decision making for competitive advantage
    Apply
    Term and Final Examination
    CLO 2
    Apply statistical techniques, including regression analysis and time series analysis, to business datasets and interpret results for various user groups.
    Analyze
    Term and Final
    Examination
    CLO 3
    Evaluate the benefits and risks of utilizing machine learning techniques in a variety of business financial activities
    Analyze
    Term and Final
    Examination
    CLO 4
    Interpret outputs from a variety of Data Analytics software tools, such as Excel, Tableau, SPSS, STATA, R and Python, and discuss the risks in making business recommendations from these outputs
    Apply
    Term and Final
    Examination
    CLO 5
    Assess data protection, data privacy and other ethical issues associated with the utilization of large corporate datasets.
    understand
    Term and Final Examination

    Course Code: DSA5061

    Course Title: Environmental Data Analysis 

    and Climate Change

    Credits: 3

     

    Course rational:

    The primary focus of this course is on probability-based statistical methods employed in the environmental, earth, and ecological sciences. Students will be exposed to the concepts behind these approaches, the computational techniques to implement them, and their application to common problems in environmental science. Attention throughout the course will be given to environmental and climate change applications, and in particular data and models.

    Course Contents:

    Pollution and its Importance, Why does Pollution happen, Pollutant Sources, Detail Study of Air and Water Pollution, Global Climate Change; Concept of Environmental Standards, Statistically Verifiable Ideal Standard (SVIS), Guard Point Standards, Standards along Cause-Effect Chain; Applications of Bernoulli, Poisson and Normal Process to Environmental problems; Environmental Sampling methods; Diversity: Measurement of diversity, Different diversity indices; Diffusion and Dispersion of Pollutants, Wedge Machine, Particle Frame Machine, Plume Model; Dilution of Pollutants, Deterministic Dilution, Theory of Successive Random Dilution (SRD), Application of SRD to Environmental Phenomena: Air Quality, Indoor Air Quality, Water Quality, Concentrations of Pollutants in Soils, Plants and Animals, Concentration in Food and Human Tissue; Statistical Theory of Rollback: Predicting concentrations after Source Control, Correlation, Previous Rollback Concepts, Environmental Transport Models in Air and Water; Environment and economics: Theory of Environmental Externalities, Coase Theorem, Environmental Welfare analysis, Trade and Environmental policy, Resource allocation over time, valuing the environment, Cost-benefit analysis, Allocation of resources, Renewable and Non-renewable resources; Machine Learning in Climate data, Prediction of weather and climate, Modeling the weather- Weather analysis-gathering data and analysing weather maps.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    CLO 4









    CLO 5









    CLO 6








    CLO 7









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs Course learning outcomes Domain/Level Assessment Tool
    CLO 1 Learn application areas of environmental statistics. Recall Term and Final Examination
    CLO 2 Evaluate environmental sampling data statistically. Evaluate Term and Final Examination
    CLO 3 Explain the distribution of data. Examine Term and Final Examination
    CLO 4 Develop appropriate hypotheses about sample and population and test hypotheses. Analyze Term and Final Examination
    CLO 5 Define statistical tests used in experimental design. Analyze Term and Final Examination
    CLO 6 Solve statistical calculations and experimental design methods in statistical package programs. Apply Term and Final Examination
    CLO 7 Machine learning applications in environmental data analysis. Analyze Term and Final Examination

    Course Code: DSA5063

    Course Title: Bioinformatics 

    Credits: 3

    Course rational: The aim of this course is to provide an overview of the most common statistical methods for molecular genomics and transcriptomics data analysis, and to provide the necessary information for solving the complex biological problems and achieving the satisfactory score of sustainable development goal (SDG) index from the agriculture, health sectors and other sectors.

    Course Contents:

    Basic Cell Architecture, the Structure, Content and Scale of Deoxyribonucleic Acid (DNA), History of the Human Genome, Genes and Proteins. Knowledge of Various Databases and Bioinformatics Tools Available at these Resources, the Major Content of the Databases, Nucleic Acid Sequence Databases (Genbank, EMBL, DDBJ), Protein Sequence Databases(SWISS-PROT, Trembl, PIR, PDB), Genome Databases (NCBI, EBI, TIGR, SANGER); Genome Sequencing: DNA sequencing, RNA sequencing, Whole genome sequencing, Basic methods of Sequencing, High-throughput sequencing (HTS) methods, Genome assembly, Next Generation Sequencing (NGS) and application areas; Sequence Analysis: Various File Formats for Bio-Molecular Sequences: Genbank, Fasta, Gcg, Msf, Nbrf-Pir Etc., Basic Concepts of Sequence Similarity, Identity and Homology, BLAST and FASTA Algorithms, Various Versions of Basic BLAST and FASTA, Pairwise and Multiple Sequence Alignments-Concepts of Sequence Alignment, Needleman and Wuncsh, Smith and Waterman Algorithms for Pairwise Alignments, Progressive and Hierarchical Algorithms for MSA; Machine Learning in Bioinformatics: Introduction to Various Machine Learning Techniques and their Applications in Bioinformatics. Genetic Algorithms, Support Vector Machine, Neural Networks.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    CLO 4









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO1 Understand statistical modeling and learn most common statistical methods for genome data analysis.

    Understand Term and Final Examination
    CLO2 Select appropriate statistical algorithms for genome assembling and their analysis
     
    Apply Term and Final Examination
    CLO3 Analyze genome datasets to provide the necessary information to solve the complex biological problems that are associated with the genetic factors
     
    Analyze Term and Final Examination
    CLO4 Contribute to the development of agricultural, medicine and health sectors and other sectors. Evaluate Term and Final Examination

    Course Code: DSA5065

    Course Title: Biostatistics & Epidemiology

    Credits: 3

    Course rational: This course introduces epidemiology and biostatistics to the student with minimal training in public health, the biomedical sciences, and statistics. It is important for the student to be able to identify causal factors and modes of transmission, with the assistance of statistical tools and biomedical information, and reflect the primary aim of epidemiology. 

    Course Content:

    Basic Concepts: Definition, Scope of Epidemiology, uses of Epidemiology; Types of Epidemiologic Studies: Cross Sectional, Cohort, Case-Control, Retrospective and Prospective, Clinical Trials, Community Intervention and Cluster Randomized Trials; Measures of Disease Frequency: Incidence and Prevalence Rates, Relation between Incidence and Prevalence, Case Fatality Rate, Risk Ratio, Rate Ratio, Risk Difference, Rate Difference, Mortality Measures, Standardized Mortality Ratio; Measures of Association between Disease and Risk Factor: Relative Risk, Attributable Risk, Odds Ratio; Screening, Properties of Screening Test: Sensitivity, Specificity, Negative and Positive Predictive Values; Biostatistics Scope of Biostatistics: Survivor Function, Hazard Function, their Inter Relationships; Censoring and Truncation; Type I, Type II and Random Censoring; Non-Parametric Methods of Estimating Survivor Functions: Life Table Method, Product Limit Method, Variance Estimates, Cumulative Hazard Function, Plots Involving Estimated Survivor and Hazard Functions; Inference Procedures for Exponential Distributions: One Parameter Exponential Distribution with Type I and Type II Censored Data, Comparison of Exponential Distributions; Exponential Regression Model: Method of Estimation, Tests of Hypothesis. Assignment and/or a mini project to be completed on the basis of the above topics by Microsoft-Excel and or R.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs PLO1 PLO2 PLO3 PLO4 PLO5 PLO6 PLO7 PLO8 PLO9
    CLO 1








    CLO 2








    CLO 3








    CLO 4








    CLO 5








    CLO 6








    CLO 7








     

    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs
    CLO Description
    Domain
    /Level
    Assessment Tool
    CLO1
    Apply the theoretical foundations of probability theory and distribution theory
    Analyze
    Term and Final Examination
    CLO2
    Derive the theoretical mathematics of statistical inferences
    Analyze
    Term and Final Examination
    CLO3
    Perform linear regression model fitting and diagnosis
    Apply
    Term and Final Examination
    CLO4
    Perform generalized linear regression model fitting and diagnosis
    Apply
    Term and Final Examination
    CLO5
    Perform ANOVA analysis and longitudinal data analysis
    Apply
    Term and Final Examination
    CLO6
    Interpret the results of statistical analysis to public health audience
    Apply
    Term and Final Examination
    CLO7
    Write scientific reports based on statistical analysis for effective collaboration with public health related scientists in epidemiology, health management and policy, environmental health sciences, nutrition, and health behavior and health education
    Analyze
    Term and Final Examination

    Course Code: DSA 5071

    Course Title: Generalized linear Model

    Credits: 3

     

    Course Rationale:

    The course will help the students to build statistical models where the response data may follow any particular probability distribution. The emphasis will be given in finding the link which will help the students derive the models. This course will also guide the students on the procedure of comparing the models and finding out the best fitted one.

    Course Content:

    Introduction to the Concepts of Modeling; Model Fitting, Exponential Family and Generalized Linear Models (Bernoulli, Binomial, Poisson, Exponential, Gamma, Normal, etc.); Properties of distributions in the exponential family, Random, Systematic and Link Functions, Poisson Regression; Maximum Likelihood Identity Link, Logit Link, Log Link, Parameter Estimation; Score Function and Information Matrix, Estimation, Residuals analysis for GLM, Quasi Likelihood Estimating Equations, Generalized Estimating Equations for Repeated Measures Data, Comparison between Likelihood and Quasi Likelihood Methods, Mixed Effect Models.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain
    /Level
    Assessment Tool
    CLO1 Understanding the probability distribution of the population from where the samples are derived understand Term and Final Examination
    CLO2 Find the link to derive statistical model Apply Assignment, Term and Final Examination
    CLO3 Compare the models to find the best one and get the inference from the model Analyze Assignment, Term and Final Examination

    Course Code: DSA 5073

    Course Title: Categorical Data Analysis   

    Credits: 3

     

    Course Rationale

    This course will introduce students to the categorical data and inference regarding the data. There will be emphasis on the understanding the model building of binary response data.

    Course Content 

    Contingency table, Inference of contingency table, GLM for binary and count data, Inference of logistic regression; Nominal Responses: Baseline-Category Logit Models; Ordinal Responses: Cumulative Logit and link Models; Alternative Models for Ordinal Responses; Testing Conditional Independence in I×J×Tables; Discrete-Choice Multinomial Logit Models; Loglinear Models for Two-Way Tables; Loglinear Models for Independence and Interaction in Three-Way Tables; Inference for Loglinear Models; Loglinear Models for Higher Dimensions; The Loglinear/Logit Model Connection;Modeling Ordinal Associations; Association Models; Association Models, Correlation Models, and Correspondence Analysis; Poisson Regression for Rates; Empty Cells and Sparseness in Modeling Contingency Tables.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1









    CLO 2









    CLO 3









    CLO 4









    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO1 Compute summary statistics of categorical data analyze Assignment
    CLO2 Find the inter dependency of the binary bi-variate data Analyze Term and Final Examination
    CLO3 Derive the models for binary response data Apply Assignment, Term and Final Examination
    CLO4 Finding the estimates using the model Apply Assignment, Term and Final Examination

    Course Code: DSA5075

    Course Title: Design of Experiment

    Credits: 3

     

    Course Rationale

    Experimental designs play an important role in process development and process troubleshooting in industry, business, agriculture and many others areas. Thus the course objective is to learn how to plan, design and conduct experiments efficiently and effectively, and analyze the resulting data to obtain valid conclusions.

    Course Content

    Basic concept, Principles of experimental design, Analysis of Variance for Unbalanced Data, Estimates, Confidence Intervals, Inference about Difference between Treatment Means, Multiple Comparisons, Effects and Tests of Departures from Assumptions Underlying the Analysis of Variance Model, Two Way Crossed Classification without Interaction, Model, Assumptions, Mean Squares and Expected Mean Squares,  Fixed Effects, Random Effects, Mixed Effects, Tests, Two Way Crossed Classification with Interaction. Model, Assumption, Partition of SS, Mean Squares and Expectations, Fixed Effect, Random Effect and Mixed Effects, Tests, Models for Unbalanced Data. Two Way Nested (Hierarchical) Classification, Model, Assumptions, Fixed Effects, Random Effects and Mixed Effects, Estimation and Tests, Multivariate Analysis of Variance, Repeated Measures Data and ANOVA, Multilevel Models. 

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1








    CLO 2








    CLO 3








    CLO 4








    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs CLO Description Domain/Level Assessment Tool
    CLO 1 Understand the experimental methods and major experimental designs, and think critically about their proper applications. Understand Term and Final Examination
    CLO 2 Formulate hypotheses that can be tested using experiments.

    Understand Term and Final Examination
    CLO 3 Apply different types of experimental design.
     
    Apply Term and Final Examination
    CLO 4 Explain the output of analysis of variance (ANOVA) from different design.
     
    Analyze Term and Final Examination

    Course Code: DSA5077

    Course Title: Actuarial Data Analysis

    Credits: 3

     

    Course Rationale

    Actuarial statistics is very important because it is essential to know underlying theories behind actuarial computations. In this course some fundamental concepts of actuarial statistics and its applications in insurance and others areas will be discussed. Thus, this course will help how statistical and mathematical models are used in pricing and valuing actuarial products and their real-life applications.

    Course Content

    The meaning and scope of actuarial science; role of actuaries in insurance, business and Economy; The basic deterministic model, Cashflows, Discount functions, Interest and discount rates, Balances and reserves, Change of discount function, Internal rates of return, Forward prices and term structure, Spreadsheet calculations;  The life table: Basic definitions, Probabilities related to life table, Constructing the life table from the values of qx, Life expectancy, Choice of life tables; Life annuities: Calculating annuity premiums, The interest and survivorship discount function, Guaranteed payments, Deferred annuities with annual premiums, Gross premiums, Gender aspects, Spreadsheet calculations; Life insurance: Calculating life insurance premiums, Types of life insurance, Combined insurance–annuity benefits, Insurances viewed as annuities, Spreadsheet applications; Insurance and annuity reserves: Definition, The general pattern of reserves, Interest and mortality bases for reserves, Nonforfeiture values, Policies involving a ‘return of the reserve’, Premium difference and paid-up formulas, Universal life and variable annuities, Spreadsheet applications; Continuous payments: Introduction to continuous annuities, The force of discount, The constant interest case, Continuous life annuities, The force of mortality, Insurances payable at the moment of death, Premiums and reserves, The general insurance–annuity identity in the continuous case, Differential equations for reserves; Multiple-life contracts: Joint-life annuities and insurances, Moment of death insurances, The general two-life annuity contract, The general two-life insurance contract, Contingent insurances, Duration problem, Applications to annuity credit risk. Spreadsheet applications.

    Mapping Course Learning Outcomes (CLOs) with the PLOs

    CLOs
    PLO1
    PLO2
    PLO3
    PLO4
    PLO5
    PLO6
    PLO7
    PLO8
    PLO9
    CLO 1








    CLO 2








    CLO 3








    CLO 4








    Mapping Course Learning Outcomes (CLOs) with the Teaching-Learning & Assessment Strategy

    CLOs
    CLO Description
    Domain/Level
    Assessment Tool
    CLO 1
    Understand the basic concept of actuarial statistics and its applications in real field.
    Understand
    Term and Final Examination
    CLO 2
    Know the basic principles of actuarial science and actuarial mathematics.
    Understand
    Term and Final Examination
    CLO 3
    Acquire knowledge in actuarial mathematics/life and other contingencies to appear in actuarial professional examination.
    Apply
    Term and Final Examination
    CLO 4
    Analyze actuarial computations by using the actuarial statistics.
    Analyze
    Term and Final Examination