CIS 528 Introduction to Big Data Science (Syllabus) Jongwook Woo, PhD jwoo5@calstatela.edu California State University, LA Computer and Information System Department Jongwook Woo Syllabus Jongwook Woo, Ph.D. Office: Simpson Tower, 604 Telephone: #604: (323) 343 - 2916; ST F517: (323) 343 - 5256 Email: jwoo5@calstatela.edu CIS528 Web Site: http://instructional1.calstatela.edu/jwoo5/classes/2015/fall/bigdata_sci/ Office Hours: Tuesday: 3:20 – 3:50 PM Thursday: 3:20 – 4:50 PM, Friday: 2:10 – 4:10 PM Jongwook Woo Homework 1 1.Should have NIS account 2.Should have email at CSULA 3. Need to leave a message at Moodle Due date Before the next lab starts at the third week – Section 4: Oct 2nd (Friday) Jongwook Woo NIS account Needed How To You need to apply NIS account to logon lab computer at CSULA 5% of HW1 Bring it on the second lab class. How to get MyCSULA account http://www.calstatela.edu/its/helpdesk/gettingstarted http://web.calstatela.edu/library/networkaccount.htm You need to leave a message “Hi Professor” at moodle page 5% of HW1 Jongwook Woo CSULA Email Account How To In order to communicate with the instructor interactively Email web site how to access https://mymail.calstatela.edu/ Login and password should be the same as NIS account How to forward CSULA email to your personal mail http://www.calstatela.edu/its/docs/pdf/forwarding_emails.pdf http://www.calstatela.edu/its/training/pdf/fwemail.pdf – You’d better right-click on the link to download the file instead of leftclick on it. Jongwook Woo Prerequisites Mastery over MS-Windows File Management (Windows Explorer) facilities. Fundamental Coding / Programming skill Unix (Linux) shell Jongwook Woo Course Objectives (Lecture) Identify Big Data that is unstructured data greater than tera-/petabytes Learn Hadoop, Machine Learning and Data Science Learn how to use Azure. Learn the fundamental theories and algorithms used to process, store, analyze, predict Big Data using Python, Hadoop, Machine Learning, and Data Analytics See the use cases and examples of Big Data Analysis and Science in business Jongwook Woo Course Objectives (Lab) With the hands on exercises Setup Hadoop, Pig, Hive, Spark MLib on Azure Practice how to write Hadoop, Pig, Hive, Spark MLib codes Practice Hadoop, Pig, Hive, Spark MLib codes Practice Hive, Data Analysis, Machine Learning codes Jongwook Woo Textbook Instructor’s lecture and lab materials will be posted at a web when the class starts. Related slides, pdf files, papers, web sites etc from the instructor Jongwook Woo Expectation for the Course Classroom SH C 344: Friday 10:00 -2:00 PM – will be changed to Lab classroom Students are expected to attend every class session For successful completion of Lecture/Lab example, assignments and tests – Know how to utilize the equipment or course web site If attendance is not possible, please contact the instructor beforehand to attend other sessions – – Check out the lab example in one week If you don’t, you wouldn’t catch up the class Not to be late – You will have penalties Memory Stick, email Students are expected to use the equipment of computer labs at CSULA for programming or project assignments No excuse not to complete HWs and Lab works for other classes and jobs Jongwook Woo Exams and Grading Policy Grading Policy Total: 100% – Class Activities (Lab, Attendance, Participation in Lab Class, Not late for Lab Class): 30% • 10%: Attendance • 20%: Lab Completeness – 2 or 3 Homeworks (Questions and Project Assignments, Team Building, Term Paper Abstract): 20% – Midterm Exam: 25% – Final Term Project Presentation: 25% Jongwook Woo • Term Project Team build: 2 % of 25% • Term Project Abstract: 5 % of 25% Exams and Grading Policy (Cont’d) Tentative Grade At the end of the quarter, you will have a score out of 100 percent. – This score will be used in a class curve to arrive at a letter grade. Normally but not guaranteed – >= 90 : A (A- or A) – >= 80 : B (B-,B,B+) – >= 70 : C (C-,C,C+) – >=60 : D (D-,D, D+) Jongwook Woo Others Use of email Email will be used only for short messages and sending attachments of less than one Mega Byte A Tentative Course Schedule See the Syllabus See the Course Website Jongwook Woo Others (Cont’d) Academic dishonesty Giving or Receiving solutions of Homework or Exams – The instructor can easily detect the copies – F on the assignment or Course – Cheating and Plagiarism, etc – Normally Individual not Team Assignment See the Course Website Jongwook Woo