Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas Weigend (www.weigend.com) The Social Data Revolution, INFO 290A-3 (http://weigend.com/teaching/ischool) UC Berkeley, School of Information, Fall 2012 (http://ischool2012.wikispaces.com) Class2 This transcript: http://weigend.com/files/teaching/ischool/2012/weigend_ischool2012_transcript_2.doc Corresponding audio files: http://weigend.com/files/teaching/ischool/2012/weigend_ischool2012_audio_2_TMP.to.b e.compressed/0910_155700.WAV http://weigend.com/files/teaching/ischool/2012/weigend_ischool2012_audio_2_TMP.to.b e.compressed/0910_172102.WAV Containing folder of the whole series: http://weigend.com/files/teaching/ischool/2012 http://weigend.com/files/teaching/ischool/2012 1 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas: Sorry I was late. My car wasn't working and then trying to find Dr. Lifs (ph.) living in San Francisco took me too late to get here. What did you do so far today in those 25 minutes? Student: Went over video tasks, the Facebook page on Social Data Revolution, and Jerry and Carl went over – Andreas: Wonderful, there's no much left for me to do. Here's a plan for today. First I want to summarize what we did last class. I realize standing here for a couple of minutes how bad the acoustics are in this room. I had a couple people say it doesn't really carry well, so I'll try to do my best to speak as clearly as I can. After the summary I want to talk a bit about how many details you had about the different areas and get your input, your votes. Divya kindly prepared a Google form where we have twelve areas where each of you is asked then to vote tonight so I know how to prioritize who I will get to the class. In our usual tradition, the last half hour we have a guest come. Today it is a journalist, a woman how works for Marketplace. Not sure who of you knows Marketplace. It's (indiscernible) out of Los Angeles, and it was a great honor for me to be on it a couple of years ago, in the summary they had about that year. The last hour of the year, they took the favorite quotes of the entire year and they picked one of my which was a quote on privacy. "Privacy started with cities, and ended with Facebook." Anyway, we have a journalist come today at the end of the class, the last half-hour. The reason for today is that as you probably know the new iPhone will be released later this week, on Wednesday. We want to investigate together here how the DNA of Apple differs from the DNA of a Google differs from the DNA of Amazon when it comes to data. After we get your input she's going to take me to some quiet room and I'll produce a couple of minutes, and that will be our summary on the radio on Wednesday on (indiscernible) show called Marketplace. Divya, did you explain to them the colors here? You all should be able to do a bit of origami and fold this thing you have in front of you. Why? We talked about interaction last class. We talked about communication as one of the main pillars, remember? You came up with the wonderful dimensions of communication and I emphasized that dimensions are more important than the specific points in space. I care in teaching much more that people understand the space than knowing where things are. One of these communication dimensions was feedback. In most classes, people don't have a feedback loop. I can see you nod. I can see you doze off, and as you know about after half an hour half of the class has fallen asleep. Maybe the other half is having sexual fantasies. http://weigend.com/files/teaching/ischool/2012 2 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 So in order to improve that feedback loop, Divya kindly prepared these cubes. I asked her to fold them in front of you so that I can see the same color you see. If I'm seeing green in the room and you're seeing green in the room, then everything is going smoothly. If I'm not sure what's happening, then you put on yellow and when you have no idea what I'm talking about, then put it in red and I'll reboot. I basically would like to look at the room of green. If I see a bit of yellow somewhere I know I may need to ask the person what's not clear. If I see a room of red, then I know I lost everybody. Does that make sense? Low tech, but it works because you can see the same from the back as I see from the front. Any questions? That's the purpose of the origami exercise. Student: Do we have a pair of scissors that's handy? Divya: There's one, that one, and there's a piece of paper – Andreas: I know that I will walk you in an hour or so through the different things of the wiki but I wanted to get started with a summary of the last class because content is important. Let's get started. What is social data? Now all the cubes are turning into yellow and red. What's social data? Social data comes in two flavors. One is data individuals socialize, typically data that they actually create. I sometimes say data we create and share. To share is to care, as we learned in kindergarten. What it means is we express our humanity, who we are through sharing: what the best noodle shop is, or our favorite pictures, etc. Creating data, we contrast it against social media. My geolocation for instance, if anybody knew that on weigend.com/itinerary, you know exactly where I was and you could exactly triangulate when I would be arriving at class here today. That's an example of social data. I share my geolocation, weigend.com/itinerary. The second meaning of social data is the connectivity between people, sometimes called the social graph. An example for the social graph is who is my friend. I this case, mutually confirmed. Student: Just to clarify, you're talking about geolocation, does that mean – your geolocation wasn't actually your data (indiscernible)? Andreas: If only I know why I am, then I think using the term social is not the right term for that. We compare and contrast a few kinds of sharing. In the case of geolocation for instance, and that's something that I'll get back to at the very end of class today, in the case of geolocation why would I share that with a company? So far I've talked about peer-to-peer sharing or peer-to-word sharing. But why would I share my geolocation with a company? Any ideas? By the way, Divya made the wonderful point that if you put your name tags in front of you, then I don't have to say the dude over there with the beard, but I can pretend that I know your names. Over time I'll actually be learning your names. If http://weigend.com/files/teaching/ischool/2012 3 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 you have those things with the name, or if you don't but you have an extra sheet of paper? Divya: I just have the cubes. Andreas: Be creative, borrow a piece from somebody. So I'm putting names to faces and faces to names. Sorry about that. If you don't have a nametag just say your name. Omar: So companies based on location, companies in your area, for example Yelp. Andreas: Yes, one thing I really want to get across is that in many cases there is a tradeoff you need to make. It’s not that one thing is good and the other thing is bad; one thing is divine and the other thing is evil. The world is not black and white. If you want to get, for example in the Yelp example Yelp mentioned, if you wanted to find a restaurant nearby you have to tell the website, the app where you are. Otherwise, how would the poor website know what is nearby to you? So that's an example why we share geolocation with a company. Any other examples why sharing geolocation with individuals might be useful? Student: You might be able to hang out with others around you. Andreas: Absolutely, discovery. Student: If you go mountain biking in the hills of Napa and you don't come home, people will know where you are. Student: There was a movie last year (indiscernible), that movie was about a person who got trapped mountain biking. The person was stuck. (indiscernible) He had to amputate his one hand to get out of the place. (00:10:45.0 to 00:10:58.9). Andreas: Any other examples why sharing can be beneficial to you? Student: People might to it for intangible reasons, such as showing off or sort of social hierarchy. If they'd gone to a very exclusive club or something like that, they want all their peers and people in their social group to know where they've been so they can get a boost in the social hierarchy. Andreas: One other example, and I am waiting – after we went through the different topics, I'm waiting for your feedback to decide on what we want to do. One example here is dating. I've been on the board of a dating site, dating app, and I already put an assignment up which is called data and dating, or data and desire or something like that, at the bottom of the homeworks page. What that is, is an example of analysis, a site called OkCupid is doing on their data. There's a company in San Francisco called Skout and they had a marvelous media event five months ago where the CEO made it to the front page of the New York Times. Who doesn't want to be on the front page of the New York Times? It was a bit of an unfortunate situation. People hooked up on Skout but http://weigend.com/files/teaching/ischool/2012 4 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 one of them was below 16 and all kinds of bad things happened. So sometimes maybe there are pros and cons about sharing your geolocation. You might have known about the site, I think robme.com, where people from the ACLU, the Civil Liberties Union made the point that by sharing information about where you are and where you're not, you make it easier for people to actually break into your house. The site is pleaserobme.com. We'll get back to that at the end of class when we talk about the different data sources and what the pros and cons are, and boiling it down to two minutes for Marketplace. Those are our definitions of social data. Any questions? The second point was revolution. What's the story about that revolution thing? Why is it a revolution? Student: It's one of the only exponential technologies in the world right now. Andreas: By the way it's good to see you in person. Poor Ben. Every two hours he got an email from me that somebody dropped, or there was a typo in an email address you provided. It's good to see you in person again. Thank you trying to be on top of the class this year. The discrepancy by the way between the email list I downloaded from bare facts and the registrant list I downloaded from bare facts was about 40%, so it's pretty amazing we need a human being to manually check and I hope by the end of today we're pretty clear about who's in class and what people are doing. Back to the point you made. Student: I was just saying that's why it's revolutionary. It's exponential technology. Andreas: Yeah, it's one of the few things that does grow, doubling every one and a half years, which prompted Peter Hirschberg who used to work at Apple with me, last year when we talked to the United Nations to say data is the new oil. What we meant by that was like oil, data is a raw material. Actually, the word economic forum in (indiscernible) was called the most important asset class that we have in this day and age. It is a raw material. So if you find raw oil in your backyard, it's probably not doing you much good. It's probably making a lot of trouble. On the other hand, the world we know wouldn't run without oil. You have to refine oil, you have to refine data. One of the conversations I had with Carl here is what does a data scientist do. Think of him or her as a refinery dude, who takes this raw stuff coming in and then he puts actions out. He pipes actions out. That's basically what data scientists do. Dave: Does that complicate the issue of social data though? I took a data science course last semester. A lot of what you're doing is analyzing data which is not social with (indiscernible). It's like access logs to a server. It's not necessarily to do with my interactions with another individual or something that's even really being rendered access. It's different in some way. Two pieces of data you're putting together and analyzing it in the end. http://weigend.com/files/teaching/ischool/2012 5 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas: I snuck in the UN General Assembly speech that social data is the new oil. If you really think about where the data comes from, where the exponential growth is, it is maybe access logs but primarily data that is produced in the context of people. Sensors I carry around, geolocation being an example, so it's not really machineto-machine data, some refinery, some traffic light talking to another traffic light. It could be my car telling me that it's out of battery ahead of time. That would be cool, wouldn't it? It's really data people create where the exponential growth comes. Not only things like geolocation, which is one example, but think about photos, think about videos. Every day – how many pictures get uploaded a day on Facebook? Think for a moment but don't call it out. I want you to get an order of magnitude of feeling. How many pictures get uploaded a day? Just write something down. Max: I guess it's ten million if you don't count Instagram. Andreas: Ten million. Ray: 900 million. Andreas: More on this side. Student: Are you talking just in the U.S.? Andreas: Ray how did you come up with your number? Ray: I was estimating there are one billion Facebook users in the world. And then not everyone would upload a picture, not everyone is apt to, so we should be slightly below that. Andreas: I like the way he thinks. JJ: Eighty million, because there are 800 million Facebook users, I would think 400 of them are active users, that they check their Facebook daily and maybe only 20% of them actually post a photo, an average of one photo per day, so that gives me eighty billion. Andreas: How about those people who paste pictures of every single meal they have? We all love those pictures. JJ: That's why I'm saying it's an average. Average one, so only for the people who like upload. Andreas: Do you know the number? Student: Yeah, (indiscernible), the reason is like you said; there are a bunch of people, 25% of the user rate. http://weigend.com/files/teaching/ischool/2012 6 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Student: (00:20:22.9 to 00:20:45.8). Andreas: I think it is not about the specific number, it's how you think about it. And the way I think about it is most like Roy who says it's roughly a billion people, and I think it's between one out of ten or a tenth uploading ten pictures a day, something like that, so any number between 100 million and a billion is good enough. That's how I want you to think about this. Don't worry JJ specifically whether it's 20% or 22% or maybe 23%. Just worry about it's a billion people, roughly a picture a person, so it's roughly a billion photos a day. Or you could argue it's roughly one picture every week, so it's 100 million. Pictures in terms of bits are a pretty important part. Video is even more important if you count the bits, than just simply geolocation, longitude, and latitude which is quite informative data for the number of bits you have. Any other questions or comments on social data? Any other questions on the revolution? Student: (00:22:21.6 to 00:22:32.9). Andreas: Why do I call it a revolution? Because it's a billion people who think about it differently now compared to ten years ago. A billion people make a revolution. Think how people make purchasing decisions. They look at social data. They look at reviews other people posted. They didn't do that ten years ago. Think about information. Could we live a week without Google or Bing or Yahoo? It would be hard. Think about who we are. Think about who our friends are. Think about how we get information. I think it really is that a billion people do things differently from the way they did ten years ago. Student: You say the number gives the power to the Social Data Revolution? Andreas: It's a phase transition, that basically the norm has shifted. The norm used to be you keep your pictures private. Whereas now, the norm is it's really weird if you don’t have a Facebook account. Who of you don't use Facebook at least once a week? Student: I used it two weeks ago. Andreas: Welcome to the revolution. It's the mindset of people which has shifted. Who would take argument against that this is a deep shift, a cultural shift, what I call a revolution? It doesn’t impact your grade, so I welcome dissent. Dave. Dave: I know this class is about social data and our purpose is that, but there are so many other revolutions that are happening at the same time that are causing huge shifts in our own everyday practices, toward how we use technology, how we apply information. I think it's difficult to focus in on that individually, even the devices themselves become carriers of information and radically shifted that relationship we have. For example, I can think of lots of people in my own life that haven't been affected by the Social Data Revolution, like my parents for example, and even many of my siblings. We live here in the Bay area and we're http://weigend.com/files/teaching/ischool/2012 7 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 really interested in tech. It has affected them in some way but I don't know if it's a revolution for them. I can think of other more significant things, like even email is very significant in the long term and how they're able to keep in touch with people that were previously inaccessible to them. Maybe email is social data, I'm not sure. Andreas: Actually not in my definition. You can extract a social graph out of email, by knowing who communicates with whom. And Xobni is an example of a plug-in into Outlook which tries to figure out, by your communication patterns, of whether you should pay attention to somebody's email or whether you should deprioritize it. There are a number of ways to try and get to a social graph through email. But by very nature, email is not necessarily social. However, the former chief scientist of British Telecom, who now works at Salesforce did an experiment. He open sourced all the information, all the emails he got, within British Telecom. Indian dude, gets lots of emails, who was I think VP. He said, "Well just to be clear, everything I get anybody in the company can see." He said the amount of bickering immediately disappeared, dropped dramatically from "He didn't do what he was supposed to do," to nobody wrote those things anymore. People just did their job. We very much have a philosophy in class that we basically do things in the open. I think it's just a philosophy that we share things that pretty much these are all Word readable documents. I'm a bit careful with the editing because I don't want to be chasing if there are mistakes. People make mistakes all the time. Last night – was it last night Jeremy when we figured out that if I give people access to a Google folder on Google Drive then people can move stuff around on my Google Drive and that's so confusing. If you knew you put it there and it's not longer there. The only reason I'm clamping down every now and then on permissions is I don't want chaos to happen. A revolution does mean everybody is involved. Arab Spring, the Twitter Revolution, some people say is a big deal. I'm actually not sure how much of it really would have happened without Twitter. I don't know. But the fact is that the world is a different place from what it was before Facebook started. It's only been eight years. Dave: Are there other revolutions they could point to, the internet revolution maybe, a bit broader, or something around devices? I'm not saying it's replacing this definition but I'm wondering if there are other things that are concurrent or at a certain point are these revolutions? Andreas: Good point, I think the main thing is that we live in the cloud. We access information whether it's ours or other peoples'. We access it somewhere in the cloud. The phone is only a smart phone because it connects to a pretty smart cloud, not because it's particularly smart itself. I think that for me, as Kevin Kelly said it, I think nobody has said it better than him; we live in the time where the world got connected, irreversibly so. For me, that is ultimately from which http://weigend.com/files/teaching/ischool/2012 8 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 everything else flows. That's why we had at the first pillar in the last class connection, communication, and interaction as the building block of why we have the whole Social Data Revolution. Michael: I guess my counter argument to that would be then why isn't SMS perhaps the revolution (indiscernible), it was the first thing that connected individuals, around the world globally, and still today is one of the most prevalent (indiscernible). Andreas: That is precisely why last class we created that matrix and let's look at it. It's always ischool2012wikispaces.com. Then we go to Class1. If anybody doesn't know what I'm doing you're in trouble now. It's pretty straightforward. Here are the dimensions people had. Is this big enough or should I make it bigger? Student: It's a bit blurry. Andreas: What we had here is these number of dimensions, so what makes SMS different? First, SMS is a one-to-one, very different from the broadcasting, from sharing with the world. It's sharing to one person. Second, and I think that shows beautifully the difference in different countries, how the pricing works. Germany SMS is quite expensive. China is quite cheap compared to voice calls, so lots of people use SMS in China. But now SMS in turn is getting pushed out by data services, like WeChat where you now don't have to pay anymore, and you can also have a broadcasting ability. SMS was a very simple person-to-person – like a letter substitute. Maybe a similar price in Germany from sending a letter, maybe half the price – to Twitter, which is a transportation channel, which the world can listen to or not listen to. Who asked about SMS? Does this make sense that we didn't have an SMS revolution but we do have a Social Data Revolution? Michael: I understand the differences in the actual protocol, and the actual underlying communication differences between broadcast and one-to-one or one-to-group as opposed to one-to-many that Twitter or Facebook offers but I don't necessarily want to cut the importance of SMS short. I think for many reasons it has implications on how people communicate and use their mobile devices. So I guess my question is why one and not the other; why was one so important and the other one not so important. Andreas: One important difference I think is that SMS also changed the way people communicate. For instance, if you want to ask a date out for dinner, I think you're probably more likely to text them than to call them up because by texting them you give them the space that they might want to check with somebody else they'd rather go out with and if that other person can't go out tonight they will say nothing more important for me than having dinner with you. Otherwise they could say sorry I have a prior appointment. That was not a possibility in synchronist phone calls. You can always say hello, and pretend the call got interrupted, but that only works every so often. http://weigend.com/files/teaching/ischool/2012 9 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Michael: The changes with it, the idea that because of physical limitations with keyboards, you had new language evolving around SMS usage, new lingo. It's the same with chat and similar synchronist communication online. I guess my question would be SMS may have been the first step in broadcasting and social data may be the second step, but perhaps there's a third step, and that's the bigger revolution. I guess why are we calling one a revolution and the other not? Student: I think this is pulling elements out of a set. The set is the revolution and SMS would be an element of the revolution. Michael: For that to work then we need to consider email and SMS and those modes of communication as elements of social data, but we're saying by his definition that email and SMS are not social data. Student: You can extract social graph from email. Michael: You can from snail mail too but we don't consider it social data. Andreas: Actually that's a good distinction. Snail mail is actually very different because the one thing we want to do today is to look at the various and sundry traces we leave. The traces we leave, the bread crumbs we leave, and I just started when we talked about geolocation and we got a bit sidetracked; that is not the case with physical mail. That fact that you have to tell Amazon what page you want to see, as opposed to sitting in the bathroom at home and flipping through the pages and looking at the stuff, without anybody knowing what you're looking at. That is qualitatively different that you actually must make your desires known. You can of course pretend to have many desires, so the machine gets fooled, but in principle, that is really significantly different, that snail mail doesn't leave traces, versus any electronic communication leaves traces. A letter in the mail, a check in the mail is a good example of not leaving traces. Michael: And if I send a letter it leaves a trace. If I send a letter to you, and there's also startups now like Outbox that are digitizing snail mail and moving towards that. So perhaps those will be the next (indiscernible). Andreas: So there are always edge cases, but the fact is that physical stuff tends to not leave trails, whereas electronic stuff does leave trails. I talked in Aspen at Renaissance Weekend last weekend. I talked to Nick Patterson, who is one of the most famous cryptographers the world ever had. Nobody really disputes that all the emails that are coming into the countries, and probably also all the emails that are sent within the country are stored somewhere. That wasn't possible, to make copies of every single letter, just in case you catch somebody who does something which you don't think is a good thing; that you can actually track down who they get letters from and who they send letters to. Whether it's desired or not, civil liberties and all that, are a totally different question. After all, we're at Berkeley here. There was another question. http://weigend.com/files/teaching/ischool/2012 10 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Brendon: I had a lot of things I just wanted to jump in at this point. To me the Social Data Revolution is taking – we've been generating social data forever. It's the interaction between people that in the past because of the lack of technology we had to capture this, so any form of capturing social interaction is what the internet and Facebook and SMS has allowed us to do, and it's taking advantage of this increased communication and ability for – if you look at groups trying to interact and do stuff, it's only worked in smaller scales and trying to scale up becomes exponentially more difficult in this technology. And taking advantage of the data has allowed larger groups to function in a distributed way. So that's why I feel it's a revolution. Andreas: Yes, and I think one could argue that people have not been creating data all their life. It is actually a fairly new thing that we create that amount of data. When we were living in caves or our ancestors, we were very sparse in what we created. Maybe communicated where the berries are and where the bears are. Brendon: I claim that even something simple as a facial gesture is a bit of data that is created and then lost in the past. Now with something like videos recording – (indiscernible) facial recognition, people could analyze the facial expressions and keep that and turn it into a usable, tangible piece of data. I guess I would be more general in what I concern is data. Student: (00:39:05.0 to 00:41:22.9). Student: I have two things to say about why we find it is a revolution because this is active content. What I mean by active content is this content is not only (indiscernible) something now to the specific sender or receiver, but it's also generating some footprint that can be analyzed or (indiscernible). So it's like an ID. Every interaction (indiscernible) active content that can be used or analyzed for different reasons which was not possible before now. Coming to the second part, what he was mentioning about the market (indiscernible), I see that as a huge thing because of the central thing of like beforehand, the way the public opinion and everything used to be controlled, through the power centers like big newspapers or the government or someone, now every newspaper has a (indiscernible) on that news, (indiscernible) how useful you find the news is. So that factor of the fundamental structure of peoples' power structure is getting more democratic in this way, and I feel that's a huge thing about the revolution because the democracy, that thing that we all want (indiscernible). Andreas: So you do exist as opposed to what the registrar is claiming. He sent in screen shots, sent in pictures, and he still didn't appear on the list. Are you sure it's you? Who are you? Student: I (indiscernible) the definition of and looking also for this revolution, I just want to go back to the previous question which I think (indiscernible) asked. Is a http://weigend.com/files/teaching/ischool/2012 11 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 revolution something which is affected by numbers? What people are seeing is about change and you mentioned something like billions of people thinking differently (indiscernible) the definition is, even with the billions of people you have a good amount of people who are thinking in a particular direction. So it's actually the numbers, right? Andreas: It's the shift in mindset for me, what Anno here calls institutional norms or what other people call social norms. It is that flipping which I think is really exemplified by do you trust the person who is not on Facebook? And it's not just an age thing. A couple of days ago as I was driving in San Francisco on Market Street, I saw an Indian restaurant, and I told people this story that about a year ago I was walking with the bureau chief of The Economist, and I said, "I know that place. I love that place. I always get delivery from there," and I go in and there was a guy with a turban. I told him, "I love your food." He looks at me and says, "Why are you talking to me? Talk to your Twitter. Talk to your Facebook." That's when I realized how the world had changed. I think it's a shift in mindset, including the guy who runs the Bombay Cuisine, or Bombay Kitchen, or whatever it's called on Market Street. Let's move on. What I want to get through in the next 40 minutes or so is I want to get through the space of data sources where we leave stuff. There are two (indiscernible) approaches to this. One is the list Jeremy and Carl showed you earlier, which is this Our Data thing, where there are people. And there's the list which Divya produced, the Google Form. Have you (indiscernible) them where the form is yet? Divya: No. Andreas: We'll do that at the end of the class. Each of you has ten points and you get to distribute those ten points across the twelve categories we see here. This spreadsheet is a people-centric spreadsheet, because ultimately we want to get a couple of people in like we had Quentin Hardy in the last class. We have Ms. Kim coming today's class. Now I want to do a category or a topic-centric view. Here is how we came up with that perspective. The people do the – who is doing today's wiki? Is it you Dave? Dave: (indiscernible). Andreas: The three of you, awesome, so you'll get this file at the end. So don’t panic about it. But do panic about writing stuff down which is not on the file which you think is important. Fields, I want to have a discussion with you about fifteen fields or categories I have here. The first one, in random order pretty much, is DNA. DNA, some people socialize their DNA. For instance, Esther Dyson socialized her DNA. She came to class two years ago and when people said, "Wow, why http://weigend.com/files/teaching/ischool/2012 12 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 did you do this?" She didn't really have an answer. Why not? Then she asked people, "Why would you not do it," and all kinds of fears about potentially health insurance would cause problems down the road if something is in your DNA which doesn't look good. DNA is one thing where we don't leave much in terms of traces, but it certainly is data about the person that can be socialized, but not so much in the sense of traces. The second field is sickness, and I compare sickness to fitness. So sickness or medical data are in some degree data which are socially constructed. Diabetes is socially constructed, or obesity. Somebody says he is a certain weight or certain ratio between height and weight, and you're obese if you're beyond that ratio. The problem is that sickness is not really the goal. I don't know whether it's true or not, but I've heard that in China in ancient China, doctors got paid very differently from the way doctors get paid now. How do doctors get paid now? They get paid if they do something for you, like chop you up or something, surgery. Apparently in the Xiang dynasty it was the other way around. You paid your doctor for not doing stuff to you. If you're healthy (indiscernible), then your doctor gets paid and you're healthy. Great deal. If you're getting sick, then your doctor doesn't get paid, something like that. Does anybody know more about this? Makes sense, pretty cool. What doctors do, if you actually pay them to chop you up, what do they do? They chop you up. The example we had last class on call centers, if you pay people for completing calls, they complete calls whether they're really done or not. There's one element in sickness which patients like me is an example where people form communities. People share data about themselves, share what they do in order to get healthy again. One of your earliest examples are people share who are waiting for organ transplants, like where do they find somebody in China. You know the story about the bathtub, waking up with no kidney. That is the sickness and medical data. More interesting I think is the fitness or health data, wellbeing data. Student: Back to DNA and no traces, what do you mean by that? Andreas: Best I think done in contrast. What I mean by this is if I wear a Fitbit or do geolocation, if I use my phone, I constantly create interaction data. However, my DNA I'm born with is not really creating anything on the road. Student: I would beg to differ, being that I'm a (indiscernible) in my previous life. Whenever you go anywhere you leave traces of your DNA. Just by entering this classroom, hair drops. If there was someone coming in and sequencing that all the time, you're leaving traces of your DNA around. It's a form of geolocation that's more. It's just not being captured much currently so it's not something to be worried about, but as the DNA sequencers get to the point where it costs 100 bucks to http://weigend.com/files/teaching/ischool/2012 13 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 sequence this, the cost of the human genome sequencing has been dropping dramatically in the past three years. Student: (00:51:43.1 to 00:52:00.9). Andreas: I think there is the perspective which actually I really wanted to push, the perspective of economics for pretty much everything. When we talked about communication last class, I talked about the economics of communication, the fact that it's now free basically to communicate with anybody in the world anything, versus it used to be expensive or the fact that the communication costs used to reside for the sender, and now it's mainly the consumption costs which decides. The same is true here for yes, somebody could be going around here and trying to find some hairs I dropped, or some skin flakes that I'm dropping, but it is still costly to do it. Besides, it’s not the social norm. You think these are cleaners who come in the evening? What they're actually doing is checking out whatever they're checking out. Of course now, that would be really weird. How would you feel about that? Student: I would feel I wouldn’t be able to go anywhere without wearing a heavy layer. Andreas: Look at him, he already wears a cap. Student: And I shaved this morning. Andreas: Yes, it is possible. We talked two weeks ago about discovery in a legal sense. Something might be discoverable so here is – they call it discovery in law. You could go through big databases and find some shit, that you're violating somebody's patents and that's where the patent trolls make their money. But normal people and the revolution is really about us and not about the tail of the distribution, normal people tend to not do DNA analysis of the hair which is left on their table. Brendon: Are you wanting to focus on the current data revolution where it is at now, or where it's going to be in a generation from now? Andreas: I'm fully with you Brendon. I love going forward, so let's play with that. Let's play with we're all carrying our DNA analysis with us. Here I'm grabbing some skin flakes from you and hair from Ben. What will that world be like? Let's go with Brendon. Brendon: One possibility is if I really care about who I'm going to choose as a mate, I could decide – I have a specific gene. If I (indiscernible) gene, it's going to cause birth defects. I ask them if they have a gene so I can know whether or not to even contact that woman in the first place. http://weigend.com/files/teaching/ischool/2012 14 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas: You mean right now you're not requiring your potential mate to also do 23andMe and look for compatibility? Brendon: No, because of the ease of access, which is going to increase. It's not feasible or likely something that was done much now. Andreas: I think a good number of couples actually do that, that they find out what's the risk of – I'm not an expert in this but I've heard about it years ago. People do get checked on the probability of having children with problems based on DNA. Brendon: My mom did. She carried a gene – she wanted to make sure her daughter didn't. Student: There is actually a California state law that covers – (indiscernible) offered genetic counseling when you're pregnant, as a parent. Andreas: Good point, I like it. We potentially could not only sniff the digital exhaust but also sniff the DNA exhaust and based on that make decisions. Brendon: That's where the fear of insurance companies rejecting you based off of having a likelihood of having to (indiscernible). Andreas: That's why ultimately social norms and laws that follow the shifts in social norms, that tell insurance companies – for instance in the United States it is not legal to determine the fee for car insurance based on the zip code. You might think how random is that. A zip code has so much information on probability of your car getting stolen and stuff like this. The reason is that some people who made laws decided that the zip code is so highly correlated with race that essentially it is racial discrimination, or it could be interpreted as racial discrimination. Somebody who lives in a neighborhood with more break-ins is likely to be of a certain race and they don't want that. From a pure decision-making perspective it's the right thing, you could build better models, if you take that into account, if you take more information into account. However, from a regulation perspective, people decided this is not what they want. I'm not taking a side here but it's an interesting fact of showing there is data; if you objective function solely is to make predictions about your car getting broken into, then you would take other things into account which are highly correlated with race. But likely I think in this country we don't want that, so insurance companies are forced to make sub-optimal decisions from a risk prediction profile modeling perspective. Student: I was thinking potentially people may leave insurance companies and go to funds that are specifically set up, for example, your newborn child has a risk of getting cancer at the age of 40. You have 40 years, would you like to invest in a fund that would be invested for the next 40 years and we'll see how we can offset the risk and cost you might have to take out to deal with the cancer. That's one. http://weigend.com/files/teaching/ischool/2012 15 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 On the other hand I was thinking with the DNA example, DNA essentially at this point we can't modify it. On Facebook someone could say (indiscernible) is a creepy guy, but I could share cool posts or share cool pictures saying no I'm not. I'm a really cool guy. People may change their opinions based on that, but when they look at DNA, if someone sees I have some sort of disease they'll say you can't cover that up, you have a disease. Of course there will be people for example who are (indiscernible) or will not be able to mate at all. Then you would have services that would be able to modify your genes to say that when I was born I had all these diseases. Now I don't, so you can disregard my previous DNA. Look at my current DNA and see who I am. Student: DNA also adds implications, not just to you but to the people related to you. Andreas: Very good point. Student: My son has inherited half of my DNA. If I have brothers and sisters, by implication I may have DNA publicly available (indiscernible) be traced in way potentially that they have no control over. Student: All this (indiscernible) DNA of social data, the point Brendon mentioned of the fact that there (00:59:42.5 to 01:00:22.5). Andreas: I want to make you a deal. I want to spend one minute on each point and then we have one discussion. Otherwise I worry about our time running away. Is that okay? Fitness and health, I've been wearing devices which recorded and broadcast how much I'm walking and stuff. Fitbit I heard has a 9-digit revenue this year. That's a huge amount of money. I don't know, millions of devices which people wear that counts how many steps you're taking. Massive Health, I just met with Andrew who is their chief-something officer last week. Massive Health has an app called "The Eatery" which allows you to take pictures of your food and then you rate your food: fat versus fit. And other people rate your food. People tend to systematically over appreciate the healthiness of the food they eat and under appreciate the unhealthiness. 90% of all Americans think that they're better-than-average drivers. In France it's the other way around. Location, we talked about enough. Interactions, we clearly leave an amazing number of interactions data. We talked about email, phone, SMS, IM, etc. Purchases, we've talked surprisingly little about purchases. Just as you said, your child has the right – I'm not quite sure about genetics but your child has a 50% overlap with you. In the early days of Amazon, we thought we were really smart by aggregating purchase circles. Aggregating and showing the aggregate of what people buy with a certain domain name. Like the people at Microsoft all bought Java books and Microsoft didn't like Amazon socializing that aggregate information. They felt it was not something they wanted other companies to http://weigend.com/files/teaching/ischool/2012 16 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 know. Purchase data, there's much to be said. Any commerce and purchase – one-minute discussion of purchase data? As you can imagine I can talk about it for ten hours. Michael: Can you draw a distinction between payments and purchases? Andreas: We lump them together. Payments, you make the distinction because payments can also be for services. Michael: PayPal is more payments, not focused on purchases, whereas (indiscernible), Amazon, you make purchases and they have your purchases but that's it. There's definitely overlap between some of those. Mint is not about purchases. Andreas: Mint is not social, we just threw stuff in here and as I said I give it to you in the hope that it will grow by what other people add. Michael: You listed e-commerce payment platforms. That should be under payments because it's just a protocol for making purchases. One is a directed goal versus the means that you make the directed goal happen could be different. Payment systems and payment platforms are the way you actually make purchases. Andreas: Good distinction, thank you. Consumption, sometimes we use the term collaborative consumption and it means different things for different people. What I mean here is you share your consumption. For instance music, that it goes all the way: Turntable, .FM where somebody curates stuff for other people listening to it, or just passively sharing with the world on Skype or Facebook, what you're listing to now on Spotify. Links consumption and sharing, and of course video, where video is probably the least social of the three. These are what we call sharing platforms in the list of companies or what you're voting on at the end of the day. Comments on sharing or sharing platforms? Student: Maybe something about the newspapers or those things because that's also consumption of (indiscernible). Andreas: Let me jot this down here. It's really more about creation – blogs, if I write some stuff then it's not primarily about the consumption but it's more about the creation. Is that what you mean? Student: (indiscernible) so then it's a consumption from my perspective. Andreas: There used to be a site which was bought by Yahoo, which showed you who either currently or within a certain time window is on a given page or who has visited that page before. I'm not sure what happened to them. Student: MyBlogLog. Andreas: Who said that? Yeah, do you know what happened to them? http://weigend.com/files/teaching/ischool/2012 17 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Student: (indiscernible). Andreas: Too intimate. Student: (Indiscernible). Andreas: Photos, all aspects are there, from creation, sharing, the social sharing, consumption, tagging meaning metadata. Photos is a good example, I think it hits pretty much all of the things we're talking about. When you write the notes for class, let me star Photos because I think that's a good one to start with since everything is involved there. Michael: Consumption sharing platforms seems more like entertainment. The reason I bring it up is because you have Hotels and Airlines, which is really maybe a (indiscernible) category like travel. Where does something like (indiscernible) fit in with a sharing platform, but it's focused on the travel space. There's clearly overlaps. Andreas: Yes. TripAdvisor is reviews. Michael: It's reviews and they do other stuff now too. Andreas: The way we sliced it up is Hotels – in particular with the examples we have here, like Airbnb, which is a social business model applied to something which traditionally was not social. Hotels were not a social thing. Airbnb, you now import the reputation of somebody. Similarly Couchsurfing, Gogobot is Menlo Park based. I forgot exactly what they do. Michael: I guess I bring it up because I kind of heard someone say it's not just airlines, it's airlines, cars, hotels. Andreas: The margins on hotels is much higher than the margins on air travel for instance, however airlines is a different animal because it's more like you go somewhere else. This is not the perfect characterization but I think it's a good starting point to make you think about all the traces you leave. Again, what are we talking about right now? We're talking about the data people leave more or less knowingly and willingly, sometimes less knowingly and willingly. If you're staying in a hotel, if you're taking an airplane, if you take public transportation, Fastrack for instance. Fastrack, every month emails me by the minute, by the gate, when I crossed the Bay Bridge. Taxis, the Uber cab I took today I can probably pull up on Uber the exact route the dude took. That's typically what Uber provides you with. There are car markets, buying cars. Then there is not only the buying cars but also just buying space in a car, like Relay Rides. Relay Rides you rent a car from somebody else, or these car-sharing models where you offer somebody to share that personal space of yours, the car, day after day. http://weigend.com/files/teaching/ischool/2012 18 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Then of course parking, 30% of all the traffic in Los Angeles people are looking for parking spots. Those are all data people leave. How should you act on them? Parking, should you have demand price-based parking? If there's only one parking spot left in a one-kilometer square radius, that parking spot would be 200 bucks for the next hour. That sucks right, unless you're the one who does get one. These are all things where we leave traces of movement, of airlines, highlyverified information that it's really you on that passport and so on. Make sense, about creating social data? Dating, I want a weight of what your interests are, but I want to show you that I did put out in the homeworks tab what I think I would like you to do in one of the next weeks. The tab here on data and dating, that's the background and assignment, then a deliverable. Basically, having you read some stuff on the OkCupid blog. OkCupid was bought by Mash.com last year. Come up with some ideas about how you would analyze the data people create on dating sites and dating apps. Financial, it goes sort of with payments kind of. These are not mutually exclusive and collectively exhaustive categories. We leave transactions as we go to our ATM machine. Of course with our picture being taken, with the patterns of how we enter our code, with personalization – you wanted 500 bucks the last time. Is that what you want to do this time as well? Jobs, BranchOut being my favorite here, a half a billion people if you go for the friends of people who have the app installed. Knowing what they're interested in, knowing social behavior of them on Facebook. Are they ones who organize meetings, are they ones who show up, are they ones who don't show up? The last example here is real estate. Zillow in Seattle, Trulia in San Francisco, others taking public records, records everybody could have and socializing them. Now people make their decisions differently on what house they buy, how much they pay for it, than this previous nonsocial world where some real estate broker says I have a good deal for you. But you were basically blind. You didn't know whether you overpaid or not. Those were some examples here which we came up with. What are we missing? The next ten minutes if people we can take a quick bathroom break. At 5:20 we will start again. I would like you to have a conversation with your neighbor, or if you don't like your neighbor, with somebody else, on trying to figure out something you think is missing in this list. Where did we not come up or simply miss stuff about data people create, data people share? This is just a couple people thinking about it. What did we miss? Then at 5:20 we'll spend about ten minutes on complementing this list and then we'll get ready for our lady from Marketplace who will come here at 6. http://weigend.com/files/teaching/ischool/2012 19 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 (Break) Andreas: All right, as promised the rest of today looks like this. Ten minutes where we collect things from you which were missing, from 5:20 to 5:30. Then since a couple of people asked about the wiki, I will spend fifteen minutes – I've probably spent more than fifteen hours to set it up. I want to make sure we're on the same page and that you understand why I did things the way I did them. And by now I think you know that if you think some things can be done in a better way that what I had, then I'm more than happy to hear that. Then at 5:45 I will tell you the questions and we'll talk maybe fifteen minutes, and then our lady from Marketplace will join us for a half hour of her telling us what she wants to do and us discussing with her. Let's go back online here. 5:30 what's missing; 5:45 wiki; 6 is data etc. for Marketplace. What's missing? You were so eager to tell me in person and now nobody wants to talk in public. Acoustics in this room really sucks. Student: Government would be an interesting one. The White House has where you can sign a petition and if it gets past a certain mark the White House promises they'll respond to it. Andreas: Very good, the traditional – the oldest one there was a site where people could report their potholes, and put pressure – not what people at Berkeley think about pot. There's a hole in the street and people were saying it still isn't fixed, and they bubbled it up. There is in this Government 2.0 this Code for America, governmental code for Jen Pahlka (ph.). She gave a very good talk at TED, Jen Pahlka, Tim O'Reilly's girlfriend. She has a beautiful example about people in Boston taking ownership of fire hydrants. It makes sense that what's the big deal of you checking out the fire hydrant like once a week when you walk by, and how much does it save us from somebody being paid to walk the streets and checking out fire hydrants. You need to have the coordination and communication problem but whose fire hydrant is that? By the way, these are yellow right now. Are you okay? It's red now. Green, okay because after break I think everybody should be green. Government, Jan Pahlka Code for America. Student: Another one is Votizen. It's about socializing the democratic process. Andreas: I heard, is it Change.org I think is a very good example of providing a platform that makes it trivially easy for people to get a petition done. If you don’t like the opening hours of the local corner store, and you want to create a petition, change.org. If you don’t like the way the iSchool does their tea hour, change.org. It's very interesting how technology, how collective intelligence – we're missing the collective intelligence part here, that enabling of collective intelligence through data and collaboration. Student: Sunlight Foundation is good. http://weigend.com/files/teaching/ischool/2012 20 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas: Do you want to say what it is? Best disinfectant. Student: Their ultimate goal is to make government more transparent, so some of the really cool projects they've been working on is they'll take news stories and analyze them, and basically point out bias, so it's not one particular group or candidate. They do lots of things around tracking spending patterns, so where do campaign funds come from and what organizations are actually contributing those funds, so you can start to realize bias and why certain candidates might be for or against certain issues. Andreas: What else are we missing about social data, about trails, traces, bread crumbs people leave? Student: (indiscernible) for example the instant feedback (indiscernible) now or even people who have become reporters like on YouTube (indiscernible). Andreas: Okay. Student: I don’t know what category this falls under but things like (indiscernible) which uses crowdsourcing or (indiscernible). Andreas: Here's a story I heard in China, that people take captchas, those letters that are blurry that you have to answer, and put in front of something which people want, like porn. Now you have people solve the captchas for you so you can give them not necessarily copyright paid material, and you get all the captchas of the world captured, so interesting business models there. Collective intelligence. Student: One would be reviews and (indiscernible) IMDB chart of things you see (indiscernible). Andreas: How come we don’t have reviews yet, or do we have it implicitly somewhere? Student: It's under shared (indiscernible) I think. Andreas: The way to look at this is this is explicit data people provide in addition to the implicit data they have by via Spotify, sharing what song they're listening to. Student: Another thing, like the big places like (00:10:30.1 to 00:10:55.4). Andreas: It goes like the potholes, restrooms, things that are broken. Student: Crowdsourcing in general should be probably (indiscernible). Andreas: Crowdsourcing is often used in an active way, that you have a question you want to know how heavy is that (indiscernible) versus here we mean you provide a pipe that will make it trivially easy for people to contribute stuff that is broken: dirty toilets, holes in the street. Student: (00:11:30.1 to 00:11:42.0). http://weigend.com/files/teaching/ischool/2012 21 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas: Say this again, it's a hard mix between accent and acoustics here. Student: The active storytelling (indiscernible) that also has a danger of crowdsourcing because people are (indiscernible) in picture. Student: The Big Brother stories. Andreas: I have no television but tonight at 7 o'clock – I did an interview this morning with a public television station in D.C. and they posted something on the Facebook page. I never know what they eventually take. I forgot even what they asked. It was about price of privacy and about how you would frame that Kindle if you wanted an ad you pay more or equally well, if you're willing to see ads then you pay less. Should that depend on the situation you're in, and stuff like that. I have no TV but at 7 o'clock tonight on whatever this TV station is called here, KQD I think, you can watch something interesting about privacy and price of privacy and Kindle and whatever he's doing. I haven't heard from you yet. Student: You already mentioned about the (indiscernible) but I think maybe (indiscernible) because they not just provide (indiscernible) but it's more about product information. They don’t want to just share your kind of information of products. Andreas: Is it different from reviews? Student: Review is more about the activity after purchase, but the thing is the – the different thing is (indiscernible) is more about the activity before purchasing. Andreas: I'm down with that. It is really how social data informed different stages in the decision-making process in this funnel as traditional marketing has it, that eventually leads to purchase or not. It's interesting, it always shifts from one side to the other. Everybody on the right side now. JJ: We talked about the online marketplace. Andreas: Craigslist, absolutely. JJ: Maybe the AppStore as well, like Android and Apple (indiscernible) reviews. Andreas: Lots to be said about it. Jeremy: We touched a bit on this in the spreadsheet, but social analytics so based on the idea of social capital and (indiscernible). We use Clout (ph.) as the primary example for (indiscernible) online influence. Andreas: All I can say is the CEO of Clout this week is going to talk to Bill somewhere in the Seattle area. Interesting isn't it, some Bill. Don't know which Bill. Social capital. Student: I just want to follow up on Nikki (ph.). I thought it was interesting that (indiscernible) and all those guys, social data actually is increasing exposure http://weigend.com/files/teaching/ischool/2012 22 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 (00:15:54.8 to 00:16:22.6). I was thinking the other day that if you walk into a (indiscernible) you will either see (00:16:26.7 to 00:17:36.6). Andreas: Actually word of authority here, physical stores and online stores have almost nothing in common. The action space is entirely different. What you see as an experienced store manager walking into a physical store is quite a bit – I'm not an experienced store manager, but I hear that if some dude walks in there and sees how things are arranged in the shelves, they have a good feeling about what's going on. Secondly, comparing different stores with each other tells them again pretty much what's happening. They have no idea when just looking at the bits in the database at a place like Amazon. On the other hand, at Amazon and other ecommerce companies change prices, inventory, ranking within milliseconds, whereas in a physical store that's a lot of work to change the prices, so you don’t. So physical stores and online stores are more different than most people think. We have three more remarks and then we'll move to the wiki. Let's do Brendon, Michael, and then Ben. Brendon: Education, like (indiscernible). Andreas: Kickstarter, so education is interesting because one of the best examples I know about Kahn Academy (ph.) is that in a school in the Palo Alto, Los Altos, Los Gatos school district, by observing, by measuring, by instrumenting the students and their progress, they had the kids who were slightly ahead of the class teach those who already were in the red zone. So helping understand by instrumenting who is slightly ahead of the curve and can explain something they just got to somebody who is not quite getting it yet is where the whole Kahn Academy revolution really sits. Some other time I'm happy to talk about the vision I have for academia or the vision I don't have for academia. But here two more people. Michael: I was going to also bring up things from agriculture and things like that. That in turn (indiscernible) social data. For instance, they sell these little things now from Athomeplants. You put in your plant, it reminds you when to water your plant via Twitter, via social networks and things like that. I think maybe that's on instance, but I think a broader concept is the emerging of physical spaces (indiscernible). Andreas: Our bridging the physical and digital which we were dwelling on after the second bottle of wine. Final word from Ben. Ben: I was going to mention, it would probably fall under location; the website like Opentopia which has thousands of webcam feeds on it. These are some of them are security camera and some of the public webcams. Obviously you're unknowingly leaving that information. I know there's one in my hometown in the center square of downtown. http://weigend.com/files/teaching/ischool/2012 23 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas: How many of you think that when you rent a room on Airbnb that they in what looks like a motion detector have a webcam installed where they can remotely see what's happening in their bedroom while somebody else is there? Roy is laughing. Roy: Hopefully not. Andreas: Why not, my bedroom. It's interesting, we negotiate this boundary between private and public space. In Singapore by the way, hotels are public space. You should not be doing anything in a hotel room you wouldn’t do out there in a public space. Isn't that interesting? Singapore hotel rooms count from a police perspective as public space. If everything was recorded in your house, it's still your house, even if you're renting out an Airbnb. What does that mean? Who do you share it with, etc. Now ten minutes where I'll show you the wiki and why I did it the way I did things. Home page here is everybody in class should have write access to the wiki. If somebody doesn’t have write access, talk to me after class, or send me an email tonight. You edit by clicking on "edit". It's not good for many people to edit the same page, unlike Google Docs. Don't use it while you're bringing up the page in the next couple of days, which I'll look at on Thursday evening at 10 p.m. Don't use it contemporaneously because it doesn't really know how to lock files well. It's amazingly good for everything else. At the bottom of it is the grading policy, the middle of it. I said I'd discuss this in class too, so I'll do this now. 30% is the class wiki. Everybody as you're aware is responsible for form a team of two to three people and take ownership, bring the stuff up, so the first couple days after class, by Thursday at 10 p.m. it is up. The first week was hard. Where is the person who did it the first week? She's not here? It was hard for her because one dude didn't respond and the other dropped the class. I think it's the responsibility if you sign up, do it because otherwise it's hard on the person who thinks you're there and then you're not there for them. Everybody gets the same grade, and that's 30%. 30% is a task like the ones Sean outlined in the beginning. Or you can do a project or do a blog entry. It's negotiation with me. I'll evaluate this November 15. 25% are the homework assignments. They're not big deals. I used to call them dog food. The homework assignments are for you to reflect on something which is going to happen in the class. For instance if we have a guest speaker then I'll assign, for instance if it's a dating site I will assign you to spend some time looking at that. If we have the guy from Clout come, if that's of interest to you, then you should look up a couple of his Clout scores. It’s not a lot or work, maybe an hour or something like that. The purpose is to make the class more productive for you. 50% is the fudge factor for class participation. I want people to participate. We're in good shape on that. That's the home page. http://weigend.com/files/teaching/ischool/2012 24 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 FAQ, given we have no TA if you figure out something you didn't know at the beginning of class, maybe asking me, maybe asking somebody else, put it on the FAQ so the next person doesn't have to start from scratch but can look up things on the FAQ. My responsibility is I will create with Ben's help the Google group, iSchool2012, Googlegroups.com. That is the mailing list which we should be able; it's a finite piece of work in front of us, to get it harmonized tonight. I will create this by tomorrow morning. If you want to email the class, then that's the email for the class. The task list is growing. I'm not really in charge of it. We see that you are interested in the survey responses, so we should be talking about that and I'll give you some pointers. I'm happy to see that people are signing up for this. Sean introduced the video tasks. Jeremy used to do videos in his high school and knows all that stuff. He got my spare video camera and is willing to show people or share stuff with people to help them out. That is Jeremy. Tasks are being added there. We have Ben – as the course list. Catherine is not in the class but she's willing to help us. She worked at NBC before with some interview (indiscernible) stuff. Shreyas, where is he? Shreyas slaved over getting the wiki structure done and Guarav who is here despite other evidence, is helping with the coordination with iTunesU. Announcements, when I send out an email, and I'll try to do it reasonably well; I put a copy of what I sent out here on the announcements page, reverse chronological order so if you miss something or your spam filter eats it, this should be the relevant stuff. So far I've sent two emails, one before the first class, and one before the second class. Classes, you know how it works now. Thursday evening the three people who do the wiki should get together on Tuesday or Wednesday and figure out what the main thing is. What I didn't get to review today which I meant to review was the identity part and the recommendation systems part. Good job to the people who did it. It was not so good at the beginning but now it's very good. That happens at each class. Dinner signups, so instead of office hours I think it's more fun to have dinner together. We have figured out that tonight we have JJ, Divya who helped with class2, and Shohay (ph.) coming out for dinner, and Cory (ph.) you didn't put your email down so we can't reach you. It's sort of the backdrop of not having the email down. Four people is more than enough. Just sign up for the next week. We have Rachel. Who's Rachel? She's not here. Rachel is the only one for next week so far. You get the idea. Any questions about the wiki? Student: When can we anticipate the schedule will be finalized? Several classes coming up have two different dates. http://weigend.com/files/teaching/ischool/2012 25 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas: Very clear logic, tonight after you had time today to digest the topics, you vote having ten points in twelve categories. Here is what the form looks like. Divya, why don’t you, in the spirit of sharing, post the link on the doc. Fill it out tonight, distribute ten points. Let's not waste time on it now. You'll post it under Class2. Based on that, I will look at the people who I have in mind. I really want to do what you're interested in. And then we'll figure which dates. We have two Mondays where I have either this one or that one. I could do any of the days, but if you really want a person and they can only do that day, then that's the day we'll do. I think by next week, give me a week to figure this out. By Week 3 we should be done, knowing which dates. That's the price of flexibility we pay to get the people we want. Is everybody okay with that? For the moment, there's nothing besides the Monday 3:30 to 6:30 time slot, and there's no other class in that time slot, so it should be a finite burden for people to not know exactly for two classes when they will happen. Student: Follow-up question, if we're bringing speakers will the speakers be invited to dinner as well? Andreas: If they have time absolutely. Student: What's the probability of Reed? Andreas: I've had dinner a bunch of times with him. It's a question of whether you think that Netflix is an interesting company or not. I think it's a clear procedure. You post the link, you guys vote, you get everybody else's vote reflected after you voted, and then I'll look at it, go to the spreadsheet and make my calls and see whether I can find some matches. I put a bunch of readings up. These are ideas mainly from the students last year. I put a page up called "insights" which came again from last year's class. I had Tom Glocer, the CEO of Thomson-Reuters and the CTO of Thompson-Reuters come to class last year. They said why don’t you ask the students to put up what they learned, which they wished they'd know at the beginning of the quarter. So those are the insights. It's a wiki, so if you want to add stuff add stuff. If you want to add other stuff, add other stuff. If you want to throw something out because it's totally superseded, throw it out. Last part of today's class, switch gears. I have the following thesis. I would claim that in the future there will be different ways people think about the data they create and share, about their social data, about what we might call our data. People will vote with their clicks, with their wallets, of which country in a sense they want to live in, whether they want to live in a country that makes it very difficult for them to take their data out; whether they want to live in a country where all their data is public; whether they want to live in a country where they can look through other peoples' – poke through their living room, rifle through their drawers and figure out what they have and other people can do that with them; whether they want to live in a country where all information is sort of http://weigend.com/files/teaching/ischool/2012 26 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 heavily taxes, where if Carl wants to get my number that is possible but he has to pay somebody a dollar for that. I think people will make choices. These choices are largely determined between how companies look – when I say countries of course I mean companies – how these companies or countries deal with information. I want to give you four examples. Apple – Apple, clearly credited to have built the first app ecosystem, clearly has no idea what to do with the data they collect. They have our phone number, our email address, our apps, our access patterns, amazing amount of data but very little comes out of it. Facebook – brilliantly engineered to provide incentives again and again for people to create and share data. It's the only thing people do on Facebook, and of course consume, looking at the friends' pictures and stuff. Facebook's attitude is a brilliant communication platform, making it trivially easy for people to share. Identity provider – do you want to live in that country? Google – Google, it's brilliant how engineers – not social engineers but computer scientists – scrape every bit of information in the world. How when you want something you need to tell Google, but then you get amazing information back. Not so at Apple. The Android phone is amazing. I did bring my three phones here. The Android phone tells me how far it is home. It probably knows today that took Uber and probably will tell me that I'm cheaper by so-and-so dollars to take BART tonight, and will probably tell me when BART leaves after we're done with dinner, because it knows that I paid for dinner. That's the data universe that makes life amazingly simple. Of course, they know everything about us. We already talked about the iPhone. Then there's this messaging device called a Blackberry. Some of you might remember that from museums or something, but I use it every day. Those are different worlds here, and they different by how they treat data. It's fundamentally different. If you don't have an Android phone, I really invite you or am happy to give you some demos. It is amazing. I search for a flight number and next time on these Google Now, it will simply tell me when those flights are leaving, whether they're on time or not, and that stuff. It's amazing. Amazon – which I think is way underestimated by people – powers the ecosystem. I don’t have the exact numbers for Amazon Web Services, but the Kindle discussion I started about the primitivity now of giving people discounts if they're willing to look at ads, or charging people premium if they want an ad-free experience. What I want to invite you to do is now think with me about what those different worlds or countries or universes will look like. These different universes with the http://weigend.com/files/teaching/ischool/2012 27 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 different attitudes towards data, how do people interact, what data do they create? That's one discussion I want to have. Apple of course is one example of trying to build an ecosystem on one hand and on the other hand not having data DNA. What's difficult for others, for the Googles, the Facebooks, for the Amazons of the world? The context of this is I would like to introduce Queena Kim. We met at the iSchool at the Big Data event. Big Data girl Queena, she works for Marketplace where there's a radio show that airs on public radio, it's American Public Media. I love (indiscernible), is one of the most empowering people, when he focuses and asks you a question, you come up with answers which you never had though he would come up with. I think maybe you spend a couple of minutes telling us what you're thinking about in terms of the context of the new iPhone launch on Wednesday. Then we'll have a discussion and at the end I promised to go to a quiet room with her and talk into a microphone as I do all the time. And produce something which (indiscernible) will think is good enough to have on the air. Give us a bit of context. Queena: I think you did a breakdown. As you guys know, the iPhone 5 is supposedly coming out on Tuesday. I think we've been seeing a lot and hearing a lot about the ecosystem. Apple is the most obvious in the way they've done this with iTunes store and your phone being compatible with only their universe of content and sort of devices in a way, although it syncs to the PC. Anyway, the point is we're trying to think of an interesting way to cover the iPhone coming out that's beyond just it's got this new feature or that new feature. One of the things we're trying to wrap up and I'd love to get your examples here, is there's all these little fights that are happening to create the ecosystem and I think there are all these little streams that are happening and nobody knows how it's going to come together. For example it looks like the new iPhone will not have Google Maps on it because they want their own map. They want their own data. They want to be able to sell whatever it is that they want to sell that data back to you, and they don't want Google to be a part of this. Now Google has Android and they've got that universe, and now Google has Android Play, Google Play, and they're trying to create this iTunes-like atmosphere. Then you hear a little stream the other day that Kindle is going to start sending out advertisements on their Kindle because they're very heavily subsidized. Amazon is eating the cost of what it takes to make the Kindle, with the idea that if you buy it you'll start clicking on things to buy things from Amazon. That's how they'll make their money back. I'm having a hard time wrapping my head around when you buy these things, five or ten years from now, what is this world going to look like? Is it going to be – do you remember back enough with Microsoft and Apple. At one point Apple never really talked to Microsoft and Microsoft would never get away with it because it http://weigend.com/files/teaching/ischool/2012 28 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 would be considered monopoly but because Apple was small enough they wanted to stay very sealed off. It sort of backfired on them because corporations were like if you're not going to talk to anybody then how are we going to use you? Maybe these walls will disappear at some point. I called Andreas to sort of wrap my head around what will this world look like. We see it from Apple's point of view, that's the most obvious, but then Kindle introduced something interesting. Is it going to be more like networks where we all have these devices and Kindle is now ABC? And it'll give you content for free, or let you surf the web but now you've got to look at all their ads, like a TV model? I'm grappling with this too. Maybe a good way to start is where do you see these wars starting to happen? Student: I think a big difference, an unacknowledged difference between a lot of these tech companies is that essentially Apple is a hardware company. Google is an Advertising company, and Amazon is a media company. Andreas: A media company? Bezos would kill you. It's a tech company. Student: It's a tech company but I don't think that the products people are going to them for are technology. I use Amazon Web Services as technical underpinnings but if that disappeared tomorrow, I wouldn't say oh my God Amazon is no longer there. Student: What about the fact that advertising companies also (indiscernible). Google is an advertising company, but it's actually a media company. (indiscernible) subsidizing their advertising and it comes back the same way. Student: The difference (indiscernible), people say this all the time. On Facebook if you're not a customer you're the product. When we're thinking about the futures of these products and services, really who is generating revenue is really significant in terms of where (indiscernible) in the future. Queena: So how you make your money is going to determine how – what this universe will look like. Student: But you're talking purely from a business model perspective. Queena: I don't know what I'm talking about. I feel like this is something that's going around a lot, and it seems like at some point these companies want you to make a decision. Are you Apple and are you going to go for the iTunes and is that where you're going to get all your media and movies and is it going to replace Netflix and sooner or later because you buy an Apple device does that mean you're going to get all your media through Apple? I'm brainstorming too. I don't know. Student: I feel like each of these companies, and it's also Microsoft plays this game as well, where essentially it's about owning the user experience from one end of the spectrum to the other. Essentially vertical integration where you own every touch http://weigend.com/files/teaching/ischool/2012 29 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 point that a user interacts with your products and services. Essentially they view it as another opportunity to cash in. Queena: Will that be an exclusive relationship? Does that mean once you buy a Microsoft phone you can – they want to make it so all your touch points are Microsoft and you won't be able to – Student: Which is essentially what Apple has popularized, this closed system. Now everybody is rushing to build their own closed system because Apple proved it could work. Then I think there's always an ebb and flow with these things, where you have really desperate siloed things and then there's some standards, and people talk, and then things become less siloed and then more siloed. There's always ebb and flow back and forth between those. Student: Economically speaking, I think the most realistic picture of what this is all going to look like in the future from the dynamics of how the R&D expenditures have been greatly cut down; I believe more they're going to – these players try to polarize themselves and make subcategories within the industry, that they're going to get a lot of pushback from consumers. And also because of those economic dynamics our (indiscernible) is shrinking and business models all over the place across the globe right now have to adopt more open innovation, more open business models, that it is actually going to – instead of looking really segregated, as they inch towards this polarization will get a lot of kickback from the (indiscernible) and the consumers and eventually have to reconcile business models in some way to where it is very integrated. The Google Maps returns to the iPhone for example. Queena: The swing back and forth. It's funny, I remember a time when everything was like the promise of Web 2.0 was that everything was we're going to share, we'll all be friends, and we're all going to let each other see things and use our open source. To me, it's almost like the Wild West is becoming ranches. Maybe it's just a pendulum. Student: I think one of the fundamental changes of R&D is that it's going to be virtually impossible for them to return these vertically integrated structures. The business model is going to be obsolete. They won't be able to function. Student: Probably one of the biggest influences is going to be the legal aspect of software patents and copyright. With Kindle being a closed format, (indiscernible) I think the social revolution is going to give people a voice they really haven't had to pressure businesses to become (indiscernible) and I think companies are latching onto the social media without totally understanding or realizing (indiscernible). I think it's going to force change in the business model. We're going to see lots and lots of changes, experiments of different business models, like Kindle with advertising, it might stay or it (indiscernible). Like most predictions of what the future is going to be like, particularly the iPhone or http://weigend.com/files/teaching/ischool/2012 30 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 iTunesUniversity and these other things, I think we're going to see lots of experiments (indiscernible). Queena: I have a question. Do you feel if you use Apple iPhones or Androids, do you feel like you're too closed off with your music? Student: On that point, Mozilla is coming out with really open sourced mobile platform. (indiscernible) next year and then (indiscernible) so in terms of – people call Android open source but it’s not. Mozilla is actually pushing a really open source platform. Queena: Does it need to have a manufacturer to go behind it? Student: I don’t remember who they deal with but they are – they're pairing with carriers. I don't recall what manufacturers they said they're working with. Student: (indiscernible) going open source and so is the Eagle platform, which is owned by Intel now (indiscernible) Nokia's secondary platform. Student: I think the point is here that all these companies expect or calculate that they're going to make more money with content, not the device. The content and how the content is going about, that will make the companies sustainable in the market. That’s how (indiscernible) integrate all the companies together. The content market and the (indiscernible) assumption, the content market will take over the phone's usage for any other purpose like texting, calling, or using as a (indiscernible). It still has not been proven (00:52:02.9 to 00:52:15.5) Next thing is I think we need to understand that what are the revenue models, especially after the Facebook thing, like Facebook (indiscernible) people starting saying that no one (00:52:27.8 to 00:52:36.6). How does this Mozy Lab make revenue? (00:52:40.6 to 00:52:56.3). What is the sustainable revenue model for open platforms? That's what we need to understand. Queena: What the future is going to be since the revenue hasn't been sorted out; you're saying it's hard to figure out what the future is going to be. Student: Maybe it will be useful to start looking at the three companies that we have now and look at where the revenue comes from. Apple has this huge ecosystem built around apps and services. Hardware is where the revenue comes from. They make nothing, the developers, even though they (indiscernible) they don't make any product (00:53:32.2). Queena: I think it makes pretty good money, app market. Student: I think from Apple's perspective overall, it's about the integrated experience with their product and of course software and services come along with that. It's only in so far as selling more hardware. Student: That's what I'm saying (00:54:07.7 to 00:54:12.0). http://weigend.com/files/teaching/ischool/2012 31 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Student: I don't think Apple believes that. For instance Amazon and Google might say that content's king but I think Apple – Queena: Although now Amazon is making hardware. Google is going to be making hardware with Motorola, one assumes. Student: (00:54:30.9 to 00:56:32.8). Student: (00:56:34.7 to 00:56:45.0). Apple for instance has a really small R&D budget compared to Google, so if that's important or not. You could also measure a certain culture of the company is, maybe indicative of how the things are going forward. For instance, Apple's very focused on producing a few things done really well, as opposed to Google which releases lots of things. Microsoft will do lots of market surveys and understand where the market's going and what products (indiscernible) and then from that they'll build things and sort of (indiscernible). The culture and design process (indiscernible) and also obviously (indiscernible). Student: Just some further interesting things, I watched the Jeff Bezos keynote of the Kindle Fire last week. Part of (indiscernible) distinction between technology, devices, media and advertising was because of that keynote, where he was discussing we're cutting the price of this and how can I give this to you for so cheap. He said that if we were to sell you an expensive device that would just sit in the drawer it would be (indiscernible). We want to make our profits not from selling you the device but by selling you all the additional things that you'll do with that device, which I think is a very cool mindset. (indiscernible) to do that. But I think it relates to this kind of mission of (indiscernible), whether not vertical integration is something that matters for them. Queena: That's the question, if they're going to go out for everything, but it seems more and more they are. Everybody is bleeding into each other's territory. The most obvious example is Apple getting the maps. That was Google's space. Bing and Google now are like who's going to be on Kindle. It seems like there's all these little areas where they are starting to merge and yet trying to stay separate at the same time. Student: I think that speaks wonders about the (indiscernible) so all these guys that explode all these things and ideas, and now there's nothing more (indiscernible). Now there's only one thing left, and that's going to the market – the other guy's market space. One of the reasons I think Kindle makes (indiscernible) is the manufacturer (indiscernible) for example if you make – the first Kindles may cost you 400 dollars each, but the next thousand Kindles (00:59:58.0 to 01:01:09.9). Student: (01:01:14.5 to 01:01:26.5) because of the success of Mozilla, Apple, and Internet Explorer and Microsoft and there people, they were forced to act, to open their own standards and make it (indiscernible) so whenever I say this is a start of this technology, what we are seeing is that even (indiscernible) which eventually is http://weigend.com/files/teaching/ischool/2012 32 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 going to trickle down to (indiscernible) for the collective good (indiscernible) Mozilla, they would have to open up. If Apple were to (indiscernible). Queena: I'm also curious about – everybody (indiscernible) Apple, obviously they're more hardware. Other people have different strengths, but if everybody's going to be integrated, does that mean we all have to carry around five devices? If Kindle is putting everything on – if Amazon is putting everything on Kindle to sell and Google has Android, and Apple has iTunes, at some point does that mean we carry around five devices, or does that mean at some point maybe all these devices become more (indiscernible) like a TV? And it really becomes about content. Student: I think something we haven't discussed yet is cloud (indiscernible). Apple has the iCloud, Google as the Drive, and Amazon has Cloud Services. At the end of the day, when it comes to (indiscernible) we want what we want when we want it. So I'm going back to this openness. I think the winner in the space is going to be the company that is going to be able to (01:03:23.7 to 01:03:35.3) that's something we need to consider as we often have this content identity we want to access. Queena: (01:03:40.2 to Student: (... to 01:04:04.2). Queena: Does it feel like they're trying to do the opposite then? They're trying to push you into the products they're selling or that they want you to see. Does anybody feel that way at all? Student: I guess my first comment would be even cloud services, there's a huge distinction between how these companies manage those things so in Apple's ecosystem there's no such thing as a file essentially. Everything is derived in what you do in the application and it's only present within that application. It's very distributed as opposed to Google Drive is essentially taking what you have on the desktop, what you're familiar with, and putting it on the cloud. It's a really different architecture and the jury is still out on what is better and what customers prefer. The secondary thing I was going to bring up was I guess when you're looking at these companies, what the landscape is going to look like is really determine by the choices we're making today. Student: (01:05:12.6 to 01:05:42.8). People want to be able to take music they've purchased and put it on a different (indiscernible). You're going to have the same battle again over movies and books. If you look at where music is headed, they end up taking the (indiscernible) management and getting rid of it because customers don't like that. It's the same – so many has taken this approach of designing their own hardware to be different so they can control the accessories and make more money off of it. Customers in general, especially with the social http://weigend.com/files/teaching/ischool/2012 33 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 media are going to push back more and more and say I want to read my book on this new device, but I can't because it's locked into Amazon Nook or whatever Andreas: I have some questions too. First one is what is the price of attention? How do people price their attention like for ads? Equally well what's the price of convenience? You can always find somewhere where that .mp3 sits but maybe you would rather buy it from Amazon where you can get it. Who really makes the decisions? Consumers often say or we often say that consumer decide, but how much do they really decide? Very few people do carry five devices with them. The cloud question is interesting. There are a number of questions. How easy is it to get stuff back out? Queena: If often feel that way about iTunes. I don't really buy stuff off iTunes because it's too much of a headache to figure out how to put it in whatever device it is, and somehow I always lose it and I can't find it in the other computer I wanted it in. I don't mind buying music, but I sort of don't because it's more of a pain for me to figure out iTunes. It's gotten a lot easier and I may be a bit retarded in terms of figuring out how to really work it, but it's not as easy as downloading something online and putting it on whatever I want to put it on. Student: Did that cause you to switch to listening to Pandora? Queena: It caused me to not use iTunes very much. And then just download stuff. Student: I listen to Pandora now because I get so pissed off at the whole – Andreas: Let's get a feeling here. What do people do to organize their .mp3s? Queena: Who buys off iTunes as their primary way of getting music? Andreas: These are two questions. One is how do you buy the stuff and the other one is how do you manage to navigate your 50GB of music you acquired in some way or other? What do people use? Student: Pandora exclusively. Student: I use Pandora and if anything really stands out then I'll go into iTunes and buy it if I want to listen to it again and again. Student: Could you ask that question again; how many people actually buy off iTunes? Andreas: Who has spent more than 20 bucks this year from buying music? Student: On iTunes or in general? Andreas: First of all who has spent more than 20 bucks buying music? Who has spent more than 20 bucks buying music on iTunes? You bought on Amazon? Student: So no one is buying music now. http://weigend.com/files/teaching/ischool/2012 34 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Andreas: Next question, every month I pay 14 bucks for Rhapsody. Rhapsody is a service which runs well on my solar system at home. If I want to listen to the North Korean broadcasting station I can. Student: But you don't own – Andreas: On the Android phone Rhapsody has some hack where they cache stuff and you can download it. Student: You don't own that music. That's a distinction because Apple – when they invented iTunes that was a huge thing. People want to own their music. They don't want to rent their music. They don't want to stream it. They want to own the files. Andreas: Let's make a couple distinctions. One is I personally don’t care at all about owning anything if I can listen to something on the plane for instance, when I feel like it. I don't know if I were to ever lapse my Rhapsody subscription, would all my music disappear, evaporate like be nuked off my device? I don't know. A couple of comments, for Facebook, Facebook is deep in data and deep in social. Student: It seems like a whole different generation likes buying stuff. The younger generation doesn't. iTunes sales are actually high. In this classroom nobody buys it. Andreas: I think people want to listen to stuff and they want to listen to stuff in decent quality. Sometimes the quality you have on Rhapsody is just super sucky. It's the worst recordings. Student: Is the reason why the other population prefers to own – do they conceptually understand the difference between owning and streaming and sharing? Andreas: I think they have a more refined taste. No, I have no idea. Student: If you think it's free versus paying for it, why would you pay more? Is it a quality thing? Queena: I think it's convenience. I find myself doing that. I used to rip stuff all the time but now that I've got a job I don't want to spend the time looking for the song that I want. If it's on iTunes I'll pay for it. I can now. But there was a time where I couldn’t so I would do the searching and look and try to figure it out, but I don't know. Andreas: I think we have one minute left. What am I going to tell her? Irrespective of what you ask, one thing you learn when you deal with reporters is you just tell them irrespective of the question whatever you want to tell them. But I have no message I'm pushing here, so I'm asking you; what is it that you want the world via Marketplace to hear? What do we have that we want to share with the world? http://weigend.com/files/teaching/ischool/2012 35 Andreas Weigend (www.weigend.com) The Social Data Revolution (SDR), INFO 290A-03 UC Berkeley, School of Information, Fall 2012 Class2 – September 10, 2012 Student: The Social Data Revolution is just getting started and we want to (indiscernible). Andreas: We want open devices. Student: I think more importantly the concept of open because I don't think users actually care who made it or who's selling it to them essentially. It's more about the perceived interaction with it. They want to be able to buy a song and message it to (indiscernible). To them the concept of open or who made it doesn't matter. As long as there is the interoperability between things is what matters. Student: It's the open concept that you can migrate from one device to another. Andreas: If you're looking into this classroom, we only see Macs. JJ and I are the only two. Basically the perception is because of this Apple thing and the glowing of the back. It's interesting. So much for open devices. Student: I wanted to use the key term compatibility because I think a lot of people bought Windows as opposed to Macs in the '90s because Macs didn't have a lot of software you could use that people were using on Windows. I think that goes back to the concept of openness. It doesn't matter what book or what format you buy it in as long as you can use it on a Windows and Mac, or on Kindle and on other devices, so compatibility. Queena: I agree with you but it's interesting that it's going the other way. It feels like it is at least. Maybe not entirely now but the thing is you can't read your Kindle book on your iPad. I guess you can now because you've got the app. Andreas: As you know, even if I was super late today, and I want to apologize again for my car problems, we should end on time. We're having dinner. If you are free for dinner after we have our little chat, we would love to invite you. Any questions? Any stumbling blocks? Then Ben if anybody is not sure about his identity, Ben is our identity czar. Make sure you're on the list. I'll upload the audio. Thank you, see you next Monday at 3:30. http://weigend.com/files/teaching/ischool/2012 36