End of Semester Report Spring 2005 Jeremy Morris May 2, 2005

advertisement
End of Semester Report
Spring 2005
Jeremy Morris
May 2, 2005
At the beginning of the semester I proposed to do two things. The
first was to alter the model I was using to determine relationships between
students at the university. The second was to create a web site where I was
going to document my results and progress on this project.
1
A More Realistic Model
The main aim of this project has been to see if we can establish whether or
not the University of Utah meets the criteria for a small world, as defined
by Duncan Watts. To do this, I have received some data from the university
where we can create a graph of students where we place an edge in the graph
between students that take at least on class together. We have shown that
this graph can be considered a small world. However, if we think about
what really happens in the classroom, it is not realistic to think that any
given student knows or interacts with, all other students in each course. So
we attempted to create a more realistic graph.
In this more realistic graph, we connect students with probability
p
kU
ni
(1)
Where p is a parameter chosen by us, ni is the number of students attending
the ith course and kU is the average class size in our data set (this number
is approximately 20). What this means is that in an average sized class,
a student will know a certain percentage of the people in that class. This
percentage is scaled with the size of the class such that in a larger class,
the student knows less people, and in a smaller class, the student will know
more. Experience tells us that this closely approximates reality.
1
We ran a few experiments to see what value of p would give us a realistic
number of average connections. It turns out that if we pick p = 0.40, we
get an average degree of 17, this is down from 208 if we connect students to
everyone in all their courses. For this graph we have the results
Graph Statistics
Number of Students in Graph
Avg Degree
Characteristic Path Length (L)
Clustering Coefficient (γ)
28517
17
4.52656
0.182163
This shows that, amazingly, the U can be considered a small world even
when we make drastic restrictions on how we connect the graph. It is also
amazing that the characteristic path length (which measures the typical
distance between nodes) only increased slightly from 2.88 to 4.52. This is a
very short distance for a disease to travel.
2
Documentation and the Web site
For a large part of the semester I have been working on a web site that will
document this project. The web site will eventually feature an explanation
of the project, results from the various experiments I have run, the software
I have written and an interactive section for students. Some of these things
are not ready for release and will be added sometime this summer.
The interactive section will allows students to enter their course schedule
so that they can tell a few things about themselves. After entering a course
schedule, students will be able to see their Google weight, their mean path
length and their clustering coefficient. There are links to definitions for all
of these statistics.
The web site can be found at
http://sundial.math.utah.edu/smallU/
I have had a very good experience with the REU program and thank
everyone involved for giving me the opportunity to participate. Thank you.
2
Download