Data Science Syllabus 2022-23 S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 131U30C101: Probability Theory Course Objective: Students will learn how to: • Providing students with a formal treatment of probability theory. • Equipping students with essential tools for statistical analyses. • Fostering understanding through real-world statistical applications. Learning Outcomes: At the end of the course student will be able to: • • • Calculate probabilities by applying probability laws and theoretical results. Identify an appropriate probability distribution for a given discrete or continuous random variable and use its properties to calculate probabilities. Derive probability distributions of functions of random variables. Unit I Basic Probability Theory: Probability spaces, Conditional probability, Independence. Random Variables: Definition, Discrete random variables, Continuous random variables, Conditioning on an event, Functions of random variables, Generating random variables. Unit II Multivariate Random Variables: Discrete random variables, Continuous random variables, Joint distributions of discrete and continuous variables, Independence, Functions of several random variables, Generating multivariate random variables, Rejection sampling. Unit III Expectation: Definition, raw and central moments (definition and relationships), moment generation function and properties, characteristic function (definition and use only), Skewness and kurtosis using moments. Unit IV Hypothesis testing: The hypothesis-testing framework, Parametric testing, Nonparametric testing: The permutation test, Multiple testing. S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 Linear Regression: Linear models, Least-squares estimation, Overfitting. Textbooks: 1. Carlos Fernandez-Granda, Probability and Statistics for Data Science 2. Robert B. Ash, Basic Probability Theory, Dover Publication. TUTORIAL COMPONENT (1 Credit) Sr. No. Module 1 Demonstrate the concept of Conditional probability. 2 Apply Rejection sampling on sample dataset. 3 Find the Mean and variance of sample dataset. 4 Demonstrate the Random Processes. 5 Apply the Convergence of Random Processes. 6 Solve the problem using Markov Chains. 7 Demonstrate the Frequentist Statistics on dataset. 8 Demonstrate the Bayesian Statistics. 9 Explain the Hypothesis testing. 10 Solve the problem using Linear Regression. S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 131U30C102: Combinatorics Course Objective: Students will learn how to: • To fill the gaps in modern problems of Discrete mathematics. • To learn practical problem-solving skills, which can be later applied in algorithmic theory. • To develop fundamental knowledge of combinatorics and complexity. • To develop practical skills needed in modern logic. • To give practical knowledge, which is needed in many courses’ theoretical informatics. Learning Outcomes: At the end of the course student will be able to: • Understand the combinatorial features in real world situations and Computer Science applications. Compute a generating function and apply them to combinatorial problems Unit I Combinatorics Introduction. Enumeration: Basic Counting Techniques-product rule, sum rule. Permutations, Combinations, and the Binomial Theorem -Permutations, Combinations, The Binomial Theorem. Bijections and Combinatorial Proofs-Counting via bijections, Combinatorial. Counting with Repetitions- Unlimited repetition, sorting a set that contains repetition. Unit II Induction and Recursion: Recursively defined sequences, Basic induction, more advanced induction. The Generalized Binomial Theorem, using generating functions to count things. Generating Functions and Recursion- Partial fractions, factoring polynomials, using generating functions to solve recursively defined sequences. Unit III Graph Theory: Basics of Graph Theory, Basic definitions, terminology, and notation, Deletion, complete graphs, and the Handshaking Lemma, Graph isomorphisms. S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 Moving through graphs-Directed graphs, Walks and connectedness, Paths and cycles, Trees. Euler and Hamilton- Euler tours and trails, Hamilton paths and cycles. Graph Coloring- Edge coloring, Ramsey Theory, Vertex coloring. Planar graphs- Planar graphs, Euler's Formula, Map coloring. Unit IV Designs and Codes: Introduction, Error-correcting codes, Using the generator matrix for encoding, Using the parity-check matrix for decoding, Codes from designs. Textbooks: 1. William T Trotter, Applied Combinatorics. 2. Joy Morris, Combinatorics, University of Lethbridge. TUTORIAL COMPONENT (1 Credit) Sr. No. Module 1 Solve the problem using Permutations and Combinations. 2 Demonstrate Generating Functions and Recursion. 3 Explain Binomial Theorem. 4 Visualization using Directed graphs. 5 Demonstrate the Planer Graph. 6 Construct graph by Graph Coloring approach. 7 Demonstrate the Balanced Incomplete Block Designs. 8 Implement the Steiner and Kirkman triple systems. S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 9 Apply the generator matrix for encoding 10 Apply parity-check matrix for decoding. 131U30C103: Linear Algebra Course Objective: • • Linear Algebra plays a fundamental role in the theory of Data Science. This course aims at introducing the basic notions of vector spaces, Linear Algebra, and the use of Linear Algebra in applications to Data Science. Learning Outcomes: At the end of the course student will be able to: • • • • Understand the properties of Vector spaces. Use the properties of Linear Maps in solving problems on Linear Algebra. Demonstrate proficiency on the topics Eigenvalues, Eigenvectors, and Inner Product Spaces. Apply mathematics for some applications in Data Science. Unit I Introduction to Vector Spaces: Vector Spaces: Rn and Cn, lists, Fn and digression on Fields, Definition of Vector spaces, Subspaces, sums of Subspaces, Direct Sums, Span and Linear Independence, bases, dimension. Unit II Linear Maps: Definition of Linear Maps-Algebraic Operations on L (V, W), Null spaces and Injectivity, Range and Subjectivity, Fundamental Theorems of Linear Maps, Representing a Linear Map by a Matrix, Invertible Linear Maps, Isomorphic Vector spaces, Linear Map as Matrix Multiplication, Operators, Products of Vector Spaces, Product of Direct Sum, Quotients of Vector spaces. Unit III Eigenvalues, Eigenvectors, and inner product Spaces: S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 Eigenvalues and Eigenvectors, Eigenvectors and Upper Triangular matrices, Eigenspaces and Diagonal Matrices, Inner Products and Norms, Linear functionals on Inner Product spaces. Unit IV Mathematics Applied to Data Science: Singular value decomposition, Handwritten digits and simple algorithm, Classification of handwritten digits using SVD bases, Tangent distance, Text Mining. Textbooks: 1. Sheldon Axler, Linear Algebra Done Right, Second Edition, Springer,2017. 2. EldeĢn Lars, Matrix methods in data mining and pattern recognition, Society for Industrial and Applied Mathematics, 2007. 3. E. Davis, Linear algebra and probability for computer science applications, CRC Press, 2012. TUTORIAL COMPONENT (1 Credit) Sr. No. Module 1 Apply sums of Subspaces, Direct Sums. 2 Demonstrate Span and Linear Independence. 3 Perform Algebraic Operations on L(V,W). 4 Demonstrate Invertible Linear Maps. 5 Perform Invertible Linear Maps operations. 6 Apply Quotients of Vector spaces. 7 Demonstrate the use of Eigenvalues. 8 Demonstrate the use of Eigenvectors. 9 Perform Singular value decomposition. 10 Classification of handwritten digits using SVD bases S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 131U30C104: Programming through Python Course Objective: • To learn and understand Python programming basics and paradigm. • To learn and understand python looping, control statements and string manipulations. Learning Outcomes: At the end of the course student will be able to: • To understand why Python is a useful scripting language for developers. • To learn how to design and program Python applications. • To learn how to use lists, tuples, and dictionaries in Python programs. • To learn how to identify Python object types. To learn how to use indexing and slicing to access data in Python programs. Unit I Python Fundamentals: History of Python, Need of Python Programming, Applications Basics of Python, Working with Interactive mode, Working with Script mode, Python Character Set, Python Tokens, Keywords, Identifiers, Literals, Operators, Variables and Assignments, Input and Output in Python. String and List Manipulation: Introduction to Python String, Accessing Individual Elements, String Operators, String Slices, String Functions and Methods, Introduction to Python List, Creating List, Accessing List, Joining List, Replicating List, List Slicing. Unit II Tuples: Introduction to Tuple, Creating Tuples, Accessing Tuples, Joining Tuples, Replicating Tuples, Tuple Slicing. S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 Dictionaries: Introduction to Dictionary, accessing values in dictionaries, Working with dictionaries, Properties. Frozenset: Introduction to Frozenset, Creating Set and Frozenset, Accessing and Joining, Replicating and Slicing. Unit III Classes and Object-Oriented Programming: Abstract Data Types and Classes, Inheritance, Encapsulation and Information Hiding. Program Control Flow: Conditional Statements, the if Statement, the if-else Statement, the if-elif Statement, Nested if Statements, Python Indentation, Looping and Iteration, The For Loop, The While Loop, Loop else Statement, Nested Loops, Break and Continue, The Range Function. Functions: Introduction to Functions, Using a Functions, Python Function Types, Structure of Python Functions, User Defined Functions - Lambda function. Unit IV Numpy: NumPy Basics: Arrays and Vectorized Computation- The NumPy ndarray- Creating ndarrays- Data Types for ndarrays- Arithmetic with NumPy Arrays- Basic Indexing and Slicing - Boolean Indexing-Transposing Arrays and Swapping Axes Pandas: Introduction to pandas Data Structures: Series, DataFrame, Essential Functionality: Dropping EntriesIndexing, Selection, and Filtering- Function Application and Mapping- Sorting and Ranking. File Handling: Opening and Closing Files, Reading and Writing Files. Modules And Packages: Built-in Modules, Importing Modules in Python Programs, Working with Random Modules, User Defined Functions, Structure of Python Modules. Textbooks: 1) Mark Lutz, Learning Python ,5th Edition, O’Reilly publication 2) John Zelle, Python Programming: An introduction to Computer Science 3) Programming in python 3 by Developers Library S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 4) Tony Gaddis, Starting out with Python, Pearson publication 5) Kenneth A.Lambert ,Fundamentals of Python : First Program 6) Mark Lutz, Python pocket reference ,5th edition, O’Reilly publication PRACTICAL COMPONENT (1 Credit) Sr. No. Module 1 Write a program to demonstrate basic operators in python. 2 a) Write a program to compute distance between two points taking input from the user. b) Write a program add.py that takes 2 numbers as command line arguments and perform their sum. 3 Write a program to use split and join methods in the string and trace a birthday of a person with dictionary data structure. 4 a) Write a program to demonstrate list and tuple in python. b) Write a program using a for loop that loops over a sequence. 5 a) By considering the terms in the Fibonacci sequence whose values do not exceed four million, b) WAP to find the sum of the even numbers in sequence. 6 Write a program to count the numbers of characters in the string and store them in a dictionary data structure. 7 Perform File manipulations- open, close, read, write, append and copy from one file to another. 8 Write a program to demonstrate functions 9 Demonstrate manipulation of NumPy arrays- Indexing, Slicing, Reshaping, Joining and Splitting. 10 Write a program to create application for employee management using modules & packages S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 Ability Enhancement Compulsory Course (AECC) 131U30N101: Business Communication Course Objective: The aim of this course is to help the students to attain the following industry identified competency through various teaching learning experiences Communicate effectively to fulfil business requirements. Learning Outcomes: The theory, practical experiences and relevant soft skills associated with this course are to be taught and implemented, so that the student demonstrates the following industry-oriented COs associated with the above-mentioned competency • • • • • Avoid communication barriers for effective business communication. Make the relevant use of body language to communicate. Use audio – visual aids to communicate effectively and efficiently. Develop notices, memoranda, and reports in relevant formats. Draft different types of business letters, E-mails using correct formats. Unit I: Introduction to basic Communication, Describe the importance of business communication, Use different types of verbal and nonverbal communications, Parts of speech, Prepositions, Conjunctions, Direct-Indirect speech, Types of sentences, Assertive, Imperative, Interrogative, Exclamatory, Tenses, Past, Future, Present, Active and Passive voice, Collocation, Idioms and phrases, Vocabulary building Unit II: Technical skill in communication, Improve the writing skills in various types of letters, email in correct format, use different types of body language to improve the stage performance, Paragraph writing, Passage writing, Mail formation / Letter writing, Report writing, Speech, Welcome, Valedictory, Vote of Thanks Unit III: Ethics and Etiquettes and preparation of Resume and job application, to improve Emotional Quotient (EQ) and improves ethics, professional behaviour and etiquettes, Emotional intelligence, Body Language, Phone etiquettes, Dining etiquettes, Professional etiquettes, Technology etiquettes, Mail etiquettes, Resume Application S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 Unit IV: Personality Development and Various technical skills, improve creativity and workability, presentation skills and timely working and leadership qualities building, Creativity & Motivation Reading Skill, Presentation skills, Time management, Stress management, Leadership qualities, Team building, Decision Making, Preparatory steps for Job Interview, Interview techniques, Group Discussion techniques References Books – 1. Effective Communication Skill, M Ashraf Rizvi, Tata mc-Grill hill. 2. Communication skills, Sanjay kumar, Oxford University Press. 3. Personality Development and Soft Skills, Barun K. Mitra, Oxford University Press. Generic Elective (GE) 131U30G101: Web Programming using HTML & CSS Course Objective: The students will be able to design and develop static and dynamic web pages. Student will be familiarize with Client-Side Programming, Server-Side Programming Pages and learn Database Connectivity to web applications. Learning Outcomes: The students should be able to: • • • Design and develop dynamic web pages with good knowledge of designing and latest technical know-how's. Have a good understanding of Web Application Terminologies, Internet Tools other web services. Learn how to link and publish web sites. Unit I: Introduction, Why HTML5? Formatting text by using tags, using lists and backgrounds, Creating hyperlinks and anchors. Style sheets, CSS formatting text using style sheets, formatting paragraphs using style sheets. Creating navigational aids: planning site organization, creating text-based navigation bar, creating graphicsbased navigation bar, creating graphical navigation bar, creating image map Unit II: HTML5 Tables, Forms and Media: Creating tables: creating simple table, specifying the size of the table, specifying the width of the column, merging table cells, using tables for page layout, formatting tables: applying table borders, applying background and foreground fills, changing cell padding, spacing and alignment, creating user forms: creating basic form, using check boxes and option buttons, creating lists, additional input types in HTML5, Incorporating sound and video: audio and video in HTML5, HTML multimedia basics, embedding video clips, incorporating audio on web page. Unit III: Java Script: Introduction, Client-Side JavaScript, Server-Side JavaScript, JavaScript Objects, JavaScript Security, Operators: Assignment Operators, Comparison Operators, Arithmetic Operators, % (Modulus), ++(Increment), --(Decrement), -(Unary Negation), Logical Operators, String Operators, Special Operators, ?: S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 (Conditional operator), , (Comma operator), delete, new, this, void Statements: Break, comment, continue, delete, do...while, export, for, for...in, function, if...else, import, labelled, return, switch, var, while, with Core JavaScript (Properties and Methods of Each) : Array, Boolean, Date, Function, Math, Number, Object, String, regExp. Events and Event Handlers using JAVA Script: JavaScript Objects such as String, RegExp, Math, Date, Window, Navigator, History, Location, and Document. Storing and Retrieving Cookies Unit IV: Introduction to XML, benefits of XML, XML DTD, and Namespaces, declaring an elements and attributes. Displaying XML documents using CSS. Introduction to PHP: Essential PHP: Mixing HTML and PHP- comments – variables. Operators and flow control: Assignment operators- incrementing and decrementing values- string, bitwise, operator precedence-relational, logical operators- if, if-else, elseif statement, ternary operator, switch statement-for loops-while loops- do-while loops, for each loop. Reading data in web pages: Setting up web pages to communicate with PHP-handling text field, text areas, check boxes, radio buttons, list boxes, password controls. Working with database- basic SQL commands- creating MySQL database-table-insert- delete-update table. Sessions, Cookies: Setting, reading, deleting cookies, simple email creation and Sending, Storing data in sessions. JQuery: jQuery animation effects References Books – 1. 2. 3. HTML 5 Black Book, Covers CSS 3, JavaScript, XML, XHTML, AJAX, PHP and jQuery, 2ed, Dreamtech Press Web Programming and Interactive Technologies, scriptDemics, StarEdu Solutions India. PHP: A Beginners Guide, Vikram Vaswani, TMH PRACTICAL COMPONENT (1 credit) Sr. No. Module (Any 10) HTML & CSS 1 Design web pages for your college containing a description of the courses, departments, faculties, library etc, use href, list tag 2 Study of Table Tag 3 Study of form Tag with its elements Create user Student feedback form (use textbox, text area, checkbox, radio button, select box etc.) 4 Create your resume using HTML tags also experiment with colors, text, link , size and also other tags you studied 5 Design a web page of your hometown with an attractive background color, text color, an Image, font etc. (use internal CSS) S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 6 Study of External, Internal and Inline CSS Use External, Internal, and Inline CSS to format college web page that you created JavaScript 7 Write JavaScript code for a. Performing various mathematical operations such as calculating factorial / finding Fibonacci Series / Displaying Prime Numbers in each range / Evaluating Expressions / Calculating reverse of a number b. Validating the various Form Elements 8 Write JavaScript code for a. Demonstrating different JavaScript Objects such as String, RegExp, Math, Date b. Demonstrating different JavaScript Objects such as Window, Navigator, History, Location, Document, c. Storing and Retrieving Cookies XML 9 Design an XML document to store information about a student. The information must include USN, Name, and Name of the College, Branch, Year of Joining, and email id. Make up sample data for 3 students. Create a CSS style sheet and use it to display the document. 10 Create DTD for above XML File (Internal and External) PHP 11 Write a php program to display today’s date in dd-mm-yyyy format. 12 Write PHP scripts for a. Retrieving data from HTML forms b. Performing certain mathematical operations such as calculating factorial / finding Fibonacci Series / Displaying Prime Numbers in each range / Evaluating Expressions / Calculating reverse of a number c. Working with Arrays d. Working with Files (Reading / Writing) 13 Write PHP scripts for a. Working with Databases (Storing Records / Reprieving Records and Display them) b. Write a PHP Script for login authentication. Design an html form which takes username and password from user and validate against stored username and password in file. c. Write PHP Script for storing and retrieving user information from MySql table. Design a HTML page which takes Name, Address, Email and Mobile No. From user ( register.php ) Store this data in Mysql database / text file. Next page display all user in html table using PHP (display.php ) d. Storing and Retrieving Cookies e. Storing and Retrieving Sessions S K Somaiya College, Somaiya Vidyavihar University, Mumbai Data Science Syllabus 2022-23 Jquery 14 Design a webpage with some jQuery animation effects 15 Students must create a whole website which contains above topics in Website S K Somaiya College, Somaiya Vidyavihar University, Mumbai