Docs.com: Social File Sharing in Facebook

Proceedings of the Sixth International AAAI Conference on Weblogs and Social Media
Docs.com: Social File Sharing in Facebook
Shelly D. Farnham, Andrzej Turski, and Sameer Halai
FUSE Labs, Microsoft Research
{Shellyfa, andrzejt, sameerh}@microsoft.com
Abstract
Social file sharing systems that enable public or broadcast
file sharing have been examined in the enterprise space.
For example, a study of the file sharing system Cattail deployed internally at IBM found that adding sociality to file
sharing improved document reach and discoverability
(Shami, Muller and Millen, 2011). Users employed the social aspects of the system to upload and publicize their files,
annotate and track other people’s files, and discover and
share new files (Muller, Millen, and Feinberg, 2010).
However, the use of personal SNSs for file sharing has
yet to be studied. We might expect that users would not
care to share Microsoft Office documents in personal SNSs,
because they tend to be used for more productivity-oriented
activities. Further, users may prefer to not share to their
whole network, but rather only to a few friends, for which
other tools such as email would suffice. To examine these
issues we analyze the content of documents users choose to
share in Docs.com, how they use SNS settings to define
sharing and editing levels, and what impact such social file
sharing has on engagement and document reach.
Social networking sites provide a unique opportunity for users to engage in broadcast online file sharing and collaboration. In this study we examine the usage of Docs.com,
which integrates Microsoft Office web documents with Facebook. In reviewing the content of Docs.com documents,
we found they were equally created and shared for work,
personal productivity, and play. Users actively employed relationship levels in their Facebook networks to define access
to their documents. However, we also found the most
viewed documents were those shared to the entire public,
with Facebook notifications driving most of the traffic to
these public documents. Finally, through a factor analysis of
user activity four primary types of usage emerged – sharing,
creating, collaborating, and consuming. Those who created
and collaborated around documents had higher levels of engagement than those who just shared or consumed documents, interacting with more documents and returning on
more unique days.
Introduction
Social networking sites (SNSs) are increasingly prominent
tools for maintaining both close and distant social relationships. A recent Pew Internet and American Life report
found that 59% of Internet users have at least one SNS, out
of which 92% have a Facebook account (Hampton et al.,
2011). Facebook users have friended 48% of their real
world extended networks, including friends, coworkers,
family, neighbors, and old classmates.
SNS features such as broadcast updates, news feeds, and
activity notifications enable a more lightweight, passive
form of interaction than found in other forms of computermediated communication such as email (Farnham, 2008;
Hampton et al, 2008; Wohn et al, 2011). In the following
note we explore the extent to which users will leverage this
more passive context for social file sharing and online collaboration. In particular, we examine a system called
Docs.com, which provides tools for sharing and collaborating with Microsoft Office documents such as Word, Excel,
and Powerpoint in the context of Facebook.
Docs.com
Docs.com is a web application that allows users to create or
upload Microsoft Office documents online and then share
them with friends using Facebook. Docs.com was released
in June of 2010, and through steady linear growth over the
course of 15 months has accrued 945,000 authenticated users and 298,000 documents. The primary document types it
supports are Microsoft Word, Excel, Powerpoint, and
PDFs. Users sign in to Docs.com using Facebook’s registration and login system (Facebook Registration API,
2011). Once signed in, users may see, read and comment
on either public documents in the document gallery or any
documents their Facebook friends have shared. More importantly, users may create or upload their own online documents. They may edit these documents directly online, or
open them in a local client to edit. When viewed online in
Docs.com, the document is framed in the context of social
sharing features. See Figure 1. The user may set sharing
and editing levels within the Facebook network, including
Copyright © 2012, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
451
a) “Individual friends”, where the user specifies which Facebook friends have access, b) “All my friends”, which includes all of the user’s Facebook friends, c) “Group”, or
any member of a Facebook group to which the user belongs, d) “Everyone”, the entire Internet, and e) “Only me”,
viewable only by the user. At the time of sharing a document, a notification with a link to the document is sent to
the news feeds of appropriate friends or groups. The user
may also add tags to the document, and viewers or editors
may add comments on the document’s wall.
documents were Word, 18% PDFs, 15% Powerpoint, and
15% Excel. Amongst the top 2000 most viewed documents, PDFs were more popular at 35%, with Word at 35%,
Powerpoint at 15%, and Excel at 15%.
User Action
N
Percent
Join the system
75803
9.6%
Create document
41856
5.3%
Upload document
35339
4.5%
Start editing document
79140
10.1%
Save while editing document
56506
7.2%
Finish editing document
60571
7.7%
Share document
14758
1.9%
View document
322656
41.0%
86787
11.0%
Download document
Delete document
Total
12784
1.6%
786200
100.0%
Table 1. User actions collected during data collection period.
Types of Content Shared
In order to assess the types of documents users were sharing
in the context of Facebook, we performed a content analysis
of 400 public documents. Note 32% of all the documents
used were public. We excluded documents that had extremely high numbers of views and thus were not representative of the population (with z-scores over 5.0). We included for analysis 200 of the most viewed documents, and
200 documents selected using a random number generator.
All documents were reviewed twice, first to code for specific, descriptive types of use (e.g., recipe, or political essay),
and then to categorize them into the more general types of
content sharing that emerged. As can be seen from Table
2, the most prominent category of content was fun and entertainment (44%), with work-related documents a close second (38.7%). A third category was personal productivity
(11.3), which contained content relevant to accomplishing
non-work life tasks such as organizing finances or finding a
health clinic. The top viewed documents were less fun and
had more advertisements than those of our random sample.
Figure 1. A document in Docs.com may be viewed and edited
online, and then shared with Facebook friends.
Method
To analyze Docs.com usage, instrumentation data was collected from May 28 to August 7, 2011. The system logged
discrete user actions around documents as listed in Table 1.
For each action, we collected who performed the action,
what document the action was performed upon, the owner
of the document, and attributes specific to each action. For
example, for sharing a document, the system logged whether it was shared only with friends or publicly. When users
viewed a document, the system only logged when the user
first opened the document, not any subsequent clicking.
Use of Sharing Levels
We examined the sharing levels of the 74,671 documents
that were initially created or uploaded during our data collection period. About 45% of these documents were shared
for viewing, and 18% were shared for editing, meaning
55% remained private. The most popular form of sharing is
to everyone, followed by sharing to groups and sharing to
all friends. This suggests that if users share their documents in Facebook, they are casting a fairly broad net. See
Results
Over the 72-day time span of our data collection period
there were 86,897 authenticated users interacting with
104,199 unique documents. 72% of the documents were
new to Docs.com during this time span, with 54% being
newly created online, and 46% being uploaded. 52% of all
452
Table 3. If documents are shared to all friends, they are
shared to an average of 236 friends in the user’s network.
If they are shared to specific people, they are shared to an
average of 1.6 people, out of a maximum of five.
Content
Categories
Fun and entertainment
Work
Personal
productivity
Civic participation
Content
Examples
recipes, sports statistics, writing,
hobby how-tos
presentations, resumes, homework,
schedules
computer safety
tips, health information, forms
call to actions, political opinion articles
Percent
in Random
Sample
Percent
in Top
200
Viewed
44.0%
21.4%
38.7%
41.7%
11.3%
16.5%
4.2%
4.9%
Advertisement
hotel flyers, product information
1.8%
12.6%
Religious
written essays
0.0%
2.9%
can be seen from Figure 3, the majority of these document
views are generated from Facebook. Thus even the public
documents are leveraging the Facebook network to drive
traffic through notifications in users’ news feeds.
Figure 3. More document views are generated if the document is
shared more broadly in the network.
Table 2. Types of content in public documents.
Sharing Level
To View
To Edit
Everyone
12.7%
Group
14.0%
6.2%
Only friends
12.4%
1.2%
Specific people
4.2%
2.2%
Page admin
1.1%
7.7%
55.6%
82.7%
Only Me
Figure 4. Most document views for public documents are generated from links within Facebook notifications.
Types of Usage
To enable comparisons with Muller, Millen and Feinberg
(2010) who examined social file sharing in the enterprise,
we performed the same factor analysis method to reduce
our various activity measures to the most important types of
use, first aggregating the data at the level of the user. As
can be seen from Table 4, four factors emerged. Prominent
types of usage tended to be either a) sharing, uploading
documents to share them, b) creating, creating and editing
their own documents online, c) collaborating, viewing and
editing others’ documents, or d) consuming, viewing and
downloading public documents. Distinct from the findings
of Muller, Millen, and Feinberg (2010), creating and collaborating were important components of Docs.com usage
patterns.
Table 3. Percentage of sharing levels across documents for both
sharing for others to view, and sharing for others to edit.
Sharing Level and Document Reach
Document reach was measured as number views by the self,
friends, and unauthenticated public persons. Sharing levels
had a meaningful impact on document reach, as can be seen
from Figure 3. Documents shared with the user’s entire
network (only friends) or a group had higher levels of views
from friends. Documents generated much higher page
views on average when shared to the entire Internet (Everyone). In order to examine whether the page views for these
public documents came from Facebook notifications, we
examined the referring site for each document view. As
Individual scores were then saved for each factor to assess
which types of usage had the most impact on two measures
453
of engagement: the number of unique days the user was in
the system, and how many documents they interacted with.
As can be seen from Table 5, users who were more engaged
with sharing content interacted with more documents,
whereas those who created content and collaborated returned for more days. In other words, the use of the more
collaborative features corresponds with more engagement.
gration with Facebook to engage in broadcast, social file
sharing and online collaboration. We found that Docs.com
is well-used and continues to grow, that users actively used
the network-based access levels to passively share their
documents to all their friends and groups within Facebook,
and that notifications within Facebook drove most of the
traffic to users’ documents, especially for documents shared
to the public. We found four primary types of usage – sharing, creating, collaborating, and consuming – and that those
who created and collaborated around documents had higher
levels of engagement.
Factor
Eigenvalue
2.4
2.3
1.3
1.1
23.0%
22.9%
12.8%
11.1%
Sharing
Creating
Collaborating
Consuming
.032
.783
.046
.037
.940
.121
.018
.015
.035
.922
.044
-.016
-.006
-.085
.887
-.130
.721
.023
.011
-.016
.942
.153
.021
.016
.061
.317
.694
.275
.240
.826
.054
.000
-.024
-.003
.039
.755
.024
.012
-.001
.668
% Variance
Measure
Created doc
Uploaded
doc
Edited own
doc
Edited others' doc
Shared doc
to edit
Shared doc
to view
Viewed
friends doc
Viewed own
doc
Viewed public doc
Downloaded
doc
Although we might expect Microsoft Office documents
to be more productivity-oriented, we did find that a substantial percentage of Docs.com documents had fun and entertaining content, as might be expected from sharing within a
friend-oriented network such as Facebook. Nonetheless,
we also found many documents were focused on work and
personal productivity. All documents, whether for fun or
work, tended to be focused on sharing information. Unfortunately for this study we were only able to examine public
documents, and in the future expect to pursue a comparison
with private documents through questionnaires or interviews.
Our results show that users who effectively use social
media sharing tools can drive increased engagement with
their online content, and that SNSs provide a unique context
for a form of lightweight micropublishing – to the user’s
extended network.
Table 4. Factor loadings for types of usage. Sharing, creating,
collaborating, and consuming emerged as four types using principal component analysis with Varimax rotation. We considered
any cell with a value > .40, as shaded.
Unique Days
Active
Number
of docs
r
r
Sharing
.20
.64
Creating
.50
.38
Collaborating
.44
.29
Consuming
.13
.23
Type of Usage
References
Facebook
registration
API.
http://developers.facebook.com/docs/plugins/
Farnham, S. D. 2008. The Facebook Application Ecosystem:
Why Some Thrive – and Most Don’t. An O’Reilly Radar Report,
March 2008.
Farnham, S. D. and Churchill, E. F. 2011. Faceted Identity, Faceted Lives: Social and Technical Issues in Being Yourself Online.
In Proc. CSCW 2011, ACM Press, 359-368.
Hampton, K. N., Goulet, L. S., Rainie, L., Purcell, K. 2011. Social Networking Sites and Our Lives. A report of the Pew Internet
& American Life Project.
Muller, M., Millen, D. R., and Feinberg, J. 2010. Patterns of Usage in an Enterprise File-sharing Service: Publicizing, Discovering, and Telling the News. In Proc. CHI 2010, ACM Press, 763766.
Shami, N. S., Muller, M., and Millen, D. 2011. Browse and Discover: Social File Sharing in the Enterprise. In Proc. CSCW
2011, 295-304.
Wohn, D., Y., Lampe, C., Vitak, J., Ellison, N. 2011. Coordinating the Ordinary: Social Information Uses of Facebook by Adults.
In Proc. iConference, ACM Press, 340-34.
Table 5. Correlations between primary types of usage and
measures of engagement. Due to the large N, all r values are significant at p < .001, and should be interpreted as effect sizes
where .2 = weak, .5 = moderate, and .7 = strong.
Conclusion and Discussion
In this analysis we examined the usage patterns of
Docs.com to assess whether people will leverage its inte-
454