Getting Started With Condor 1

advertisement

Getting Started With Condor

1

Contents

 Getting Started

 Collecting Web Content

 OneDegreeCollector

 Building your own Startlists

 Collecting your E-Mail

 Collecting Facebook Data

 Collecting Wikipedia Data

 Collecting CoolPeople

 Coolhunting Blueprin

2 t

3

Getting Data into Condor

IMAP E-Mail (Mailcollector)

Eudora mailboxes

Communication View (social net)

Term view (semantic net)

Web/Blog/News/Scholar

(WebCollector)

Communication View (link net)

Term view (semantic net)

Communication View (semantic net)

Wikipedia (WikiFactFetcher)

Term view (semantic net)

S nippets (OneDegreeCollector)

Twitter (TwitterCollector)

Communication View (social net)

Term view (semantic net)

FlatFiles (FileLoader)

Communication View (social net)

Term view (semantic net)

PeopleNetworks (CoolPeople)

Communication View (social net)

Facebook

Communication View (social net)

4

Temporal Visualization by a Sliding Time Frame time

1 n

2 n+1

3 n+2

4 n+3

5 n+4

5

With and without history

6

Preparation

 Install MySQL

 Install Java (only Windows)

 Install Java 3D (only Windows)

 Start Java (if it does not run yet)

7

Contents

 Getting Started

 Collecting Web Content

 OneDegreeCollector

 Building your own Startlists

 Collecting your E-Mail

 Collecting Facebook Data

 Collecting Wikipedia Data

 Collecting CoolPeople

 Coolhunting Blueprin

8 t

Collect Web Content

9

Communication View

10

Term view index

11

Term view index - 2

12

Contents

 Getting Started

 Collecting Web Content

 OneDegreeCollector

 Building your own Startlists

 Collecting your E-Mail

 Collecting Facebook Data

 Collecting Wikipedia Data

 Collecting CoolPeople

 Coolhunting Blueprin

13 t

One-Degree-Collector

 Complementary to the Blog Collector

 Fetches only one degree

 Retrieved websites are not aggregate

14

One-Degree-Collector - UI

 GUI resembles

Blog

Collector

15

One-Degree-Collector - result

 typical result of one-degree search

16

Contents

 Getting Started

 Collecting Web Content

 OneDegreeCollector

 Building your own Startlists

 Collecting your E-Mail

 Collecting Facebook Data

 Collecting Wikipedia Data

 Collecting CoolPeople

 Coolhunting Blueprin

17 t

Creating Term View Without OneDegreeCollector Start

List: Create Stoplist First

18

… then use this stop list for the term view

19

Creating Term View With Start AND Stop List

20

Contents

 Getting Started

 Collecting Web Content

 OneDegreeCollector

 Building your own Startlists

 Collecting your E-Mail

 Collecting Facebook Data

 Collecting Wikipedia Data

 Collecting CoolPeople

 Coolhunting Blueprin

21 t

Collect E-Mail

 java -Xmx2048M -jar condor-2.1.jar

Condor Key

MySQL password

(default: no password)

22

Tools to collect data

23

Left side: enter here the specification of the mailbox

Right side: database related data, eg no pass

. username: root, word

For username, host, port, and ssl check with your email provider (for gmail, see next slide)

Anonymize will replace email addresses with random identifiers

Content: yes will download the whole emails, w/o content only the sender, recipients and the subject line are downloaded

Here you can choose specific folders to dow nload

Delete the present data in the database?

Settings for gmail yourname@gmail.com

Your gmail password

. imap.gmail.com

Don’t forget the access information for your mysql database on the right, then press start.

It might take a while (esp. with huge mailboxes) before you see a progress bar.

3

1

Visualize Mail-Data

2

4

26

Visualize E-Mail Data (3)

7

9

8

2 7

Dynamic View of Communication

2 8

1

Visualize E-Mail Contents

2

3

29

Visualize E-Mail Contents (2)

4

5

3 0

Dynamic View of Terms

3 1

Contents

 Getting Started

 Collecting Web Content

 OneDegreeCollector

 Building your own Startlists

 Collecting your E-Mail

 Collecting Facebook Data

 Collecting Wikipedia Data

 Collecting CoolPeople

 Coolhunting Blueprin

3 2 t

MIT OpenCourseWare http://ocw.mit.edu

15.599 Workshop in IT: Collaborative Innovation Networks

Fall 2011

For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms .

Download