60-574 machine learning - School of Computer Science

advertisement
COURSE INFORMATION Winter 2014
60-574 MACHINE LEARNING
School of Computer Science, University of Windsor
Instructor:
Dr. Alioune Ngom,
School of Computer Science.
Email: angom@cs.uwindsor.ca
Phone Extension: 3789
Room: 5107 LT
Office Hours: Thursday 08:30am to 12:00pm
PURPOSE OF THIS COURSE AND LEARNING RESOURCES:
Course Description and Objectives
Machine Learning (as well as Data Mining) has as its goal the automatic acquisition of knowledge from examples. Obtaining
concepts from examples is a typical learning task. The examples may be, e.g., news items selected by an internet user, and the
concept will be a description of user’s interests inferred from his selections. Or genetic expression knowledge can be ‘learned’
from time-series data of DNA chip results. A functional view of Machine Learning is that it performs data analysis and
extraction of knowledge from data. Many Machine Learning techniques are used in Data Mining.
The objective of this course is to provide an introduction to the basic principles, techniques, and applications of Machine
Learning. Programming assignments are used to help clarify basic learning concepts. The emphasis of the course is on teaching
the fundamentals, and not on providing a mastery of specific commercially available software tools or programming
environments.
This course is about the principles, design and implementation of learning agents --- programs that improve their performance
on some set of tasks with experience. A range of learning paradigms and methods will be covered, such as: Concept and
Decision Tree Learning, Neural and Evolutionary Learning, Probabilistic and Statistical Learning, Inductive and Analytical
Learning, and Instance-Based and Reinforcement Learning.
Upon successful completion of the course, you will have the capability to develop learning algorithms or systems, to analyze
their behavior and to empirically and theoretically estimate their performance. You will have an understanding of a variety of
practically useful machine learning algorithms and their use in data-driven knowledge discovery and program synthesis. You
will have designed and implemented several machine learning algorithms. You should also be able to adapt or combine some
of the key elements of existing machine learning algorithms to design new algorithms as needed. The emphasis of the course is
on machine learning algorithms that are useful in practice. However, we will cover some of the key theoretical results that have
significant practical implications.
Lectures
Lecture Hours and Location: Tuesday 08:30am to 11:20am --- To be Announced
Students will have the opportunity to meet with the instructor and with other students through the following:



Some formal lectures every week – see schedule for times.
One-to-one consultations with the instructor as required.
Discussion and interaction with other students.
Course notes, textbook and web page
Homepage: http://cs.uwindsor.ca/~angom
Recommended Textbook (no particular textbook is required):
 Tom M. Mitchell, Machine Learning, Mc-Graw-Hill, ISBN 0-07-042807-7, 1997
2



Ethem Alpaydin, Introduction to Machine Learning, MIT Press, ISBN 0-262-01211-1, 2004
Trevor Hastie, Robert Tibshirani and Jerome Friedman, The Elements of Statistical Learning: Data Mining, Inference,
and Prediction, Second Edition, Springer, ISBN 978-0-387-84857-0, 2009
Internet resources: Dr. Andrew Moore’s tutorials at http://www.autonlab.org/tutorials/, and, Dr. Ethem Alpaydin’s
slides at http://www.cmpe.boun.edu.tr/~ethem/i2ml/. And many others.
Software resources
WEKA can be downloaded from http://www.cs.waikato.ac.nz/~ml
Matlab, and other programming language compilers
Basic knowledge of Java, C or C++ is expected.
WORK TO BE UNDERTAKEN BY STUDENTS:
Preparation for lectures
Attendance at all lectures is highly recommended. Some of the concepts covered are difficult and
professors will attempt to present the concepts in such a way as to make them easier to understand.
Students should read the course notes and textbook ahead of lectures. A detailed schedule showing the topic
of each lecture is given later. Lectures are not substitutes for student reading. Students who do not read
ahead may find themselves lost in the lectures.
Evaluation Scheme
The final mark will consist of
1. Class Participation: 10%
2. Presentation: 15%
3. Assignments: 25%
4. Project: 50%
Students who wish to appeal a class-test, exam, or assignment mark should do so within a week of receiving the mark. If
disagreements between a student and the instructor persist, then the student should wait until s/he received his final
grade at the end of the semester and then follow the procedure outlined in the University Calendar for the appeal of
that grade. No remarking of class tests or the final exam will be undertaken unless a formal grade appeal is submitted
at the end of the semester after the student has received the final grade for the course. Numerical errors in adding
marks on class tests and the final exam will be corrected when identified. There are no make-up tests, so please do not
miss a test.
The University of Windsor uses a percentage marking and grading scale as of Fall 2013. Only raw % scores (e.g., 75%) are
assigned in course work and only raw scores are used in the computation of cumulative, major and sessional grade point
averages. The meaning of scores (or logical conversion of raw scores to grades) in transcripts are with a conversion scale
different from the one used previously. This new conversion scale can be found through www.uwindsor.ca/calendar, then
grading and marks/grades descriptors link. Only raw % scores are assigned in course work and the meaning of scores in
transcripts are:
 90
 85
 80
 77
 73
 70
 67
=100
< 90
< 85
< 80
< 77
< 73
< 70
A+
A
AB+
B
BC+
 63
 60
 57
 53
 50
0
<67
< 63
< 60
< 57
< 53
< 50
C
CD+
D
DF
In computing a student's average, grades from 0% to 22% are calculated as 22%. Grades from 23% to 40% calculated as 40%.
Grades from 40% to 49% are calculated as is into the student's average. All grades are recorded in the transcript as is. All
3
grades below 50% are considered failures. (see mark/grades descriptor page of calendar www.uwindsor.ca/calendar for
details).
Teaching Evaluation
Student Evaluation of Teaching (SET) forms will be administered during the last two weeks of the class schedule.
POLICY ON CHEATING:
The professor will put a great deal of effort into helping students to understand and to learn the material in the course.
However, he will not tolerate any form of cheating.
The professor will report any suspicion of cheating to the Director of the School of Computer Science. If sufficient evidence is
available, the Director will begin a formal process according to the University Senate Bylaws. The instructor will not negotiate
with students who are accused of cheating but will pass all information to the Director of the School of Computer Science.
The following behavior will be regarded as cheating (together with other acts that would normally be regarded as cheating in
the broad sense of the term):







Copying assignments
Allowing another student to copy an assignment
from you and present it as their own work
Copying from another student during a test or
exam
Referring to notes, textbooks, etc. during a test or
exam
Talking during a test or an exam
Not sitting at the pre-assigned seat during a test
or exam
Communicating with another student in any way
during a test or exam






Having access to the exam/test paper prior to the
exam/test
Asking a teaching assistant for the answer to a
question during an exam/test
Presenting another’s work as your own
Modifying answers after they have been marked
Any other behaviour which attempts unfairly to
give you an advantage over other students in the
grade-assessment process
Refusing to obey the instructions of the officer in
charge of an examination.
Students who are found guilty of any form of cheating will be given a grade of F- for the whole course.
Several University of Windsor students have been caught cheating during the last few years. In most cases the evidence was
sufficient to invoke a disciplinary process which resulted in various forms of punishment including letters of censure, loss of
marks, failing grades, and expulsions. As example, a student who copied a project from another student and presented it as his
own was expelled from the university. In course, 60-100, a student who copied answers during a final exam received a failing
grade.
Do not cheat, if you are caught and found guilty, you could be thrown out of the university and will have to explain why when
you go looking for a job.
Tentative course lecture schedule:
Week
1
2
3
4
5
6
7
8
9
10
11
12
Lecture Topic
Introduction to Machine Learning
Concept Learning and the General-to-Specific Ordering
Decision Tree Learning
Artificial Neural Networks
Evaluating Hypothesis
Bayesian Learning
Computational Learning Theory
Instance-Based Learning
Genetic Algorithms
Learning Set of Rules
Analytical Learning
Grammatical Learning and Hidden Markov Models
4
13
Reinforcement Learning
Download