kin260_lec10

advertisement
Email & Web Searches
Kin 260
Jackie Kiwata
Overview

Email
– Message Transfer Agents
– User Agents
– Envelope & Contents

Web Searches
– Search Engines
– Key Words
– Operators
– CSLA Library Research
Electronic Mail

Use of email by general public exploded in
the 1990s
 Prior to 90’s, used mostly in academia

Email systems composed of 2 subsystems
– User Agents
– Message Transfer Agents

Key idea behind email system: envelope
and contents
– Separate routing information from message
Message Transfer Agents
Move messages from source to destination
 Use daemons

– Computer Science definition: processes that
run in the background
– NOT:
Daemons

Purpose:
– Establish connection to destination
– Reliably transfer message
– Close connection
– Perform steps automatically without bothering
user
Daemon Example
from <mailer-daemon@googlemail.com>
to
jkiwata2@csla.edu,
date Nov 8, 2007 1:42 PM
Subject: Delivery Status Notification (Failure)
This is an automatically generated Delivery Status
Notification
Delivery to the following recipient failed permanently:
ladyleedoc@yahoo.com
Technical details of permanent failure:
PERM_FAILURE: SMTP Error (state 13): 550
<ladyleedoc@yahoo.com> recipient rejected
User Agents
Normally a program
 Accepts a variety of commands, including:

– Composing messages
– Receiving messages
– Replying to messages
– Manipulating mailboxes

e.g. Outlook, Gmail, Hotmail
Email - Envelope

Read by Message Transfer Agents
– Not seen by person reading email
Encapsulates the message
 Similar to paper envelopes transported by the
post office
 Contains all the information needed to transport
the message

– Destination Address
– Priority
– Security Level
Email - Content

Content consists of two parts
1.
Header
-
2.
contains information for User Agents
Body
- Contains the message read by the human
recipient
Anatomy of an Email message
Header
Return-Path: <example-return-9792ccsxxx=bath.ac@gmail.com>
Received: from roche.bath.ucla.edu ([138.38.32.21]
ident=yalrla9a1j69szla2ydr) by serena.bath.smtp.gmail
with esmtp; Sat, 07 Jun 2007 20:17:35 +0100
Date: Sat, 7 Jun 2001 12:35:58 -0700
From: Jane Martin<jmartin@gmail.com>
To: Bob Richards <brichards@ucla.edu>
Subject: Athletic Director Position
Message-ID:
<20030607123558.A2408@ring.CS.example.com>
User-Agent: Mutt/1.2.5.1i
Body
Dear Mr. Richards:
Please find my resume attached.
Sincerely,
Jane Martin
Transfer Protocols
How User Agents talk to Message Transfer
Agents
 Daemons will speak one of the following
protocols:

– SMTP: Simple Mail Transfer Protocol
– POP3: Post Office Protocol 3
– IMAP: Internet Message Access Protocol

e.g. Gmail uses POP3 to receive mail and
SMTP to send mail
Web Searches
Web is a terrific place to obtain info on
any topic
 But all too often, waste time on reviewing
useless web site results
 There has to be a better way!

Making Searches Better
1. Use a sophisticated search engine
i.e. Google
2. Choose key words wisely
i.e. “Exercise Science programs” instead
of “Exercise”
3. Refine search query with Operators
i.e. +, -, “”
Web Search Engines



Provide an interface to search for pages,
images or other types of web files
Use algorithms
Work by:
1. Web crawling
2. Indexing
3. Searching
Web Crawling

Accomplished using web crawlers, or
spiders
– Automated programs that browse pages on
the WWW

Crawlers create a copy of visited pages for
use in later analysis
Indexing

Contents of stored pages are analyzed
– Words extracted from titles, headings, and
special fields called meta tags
Data about web pages are stored in an
index database
 When a user enters a search query, search
engine checks the index database

Searching & Results
Search engine returns results based on
search query
 Search engine will rank the results and
show most relevant higher in the list

– Most relevant = how closely key words match
on web page
– Issue: In 2006, spammers created websites
containing random sequences of high-traffic
keywords. Results were polluted with pages
that contained little or no relevance.
Choosing Key Words
Key to finding information quickly
 Get better results if use multiple search
terms that are specific
 E.g. “Sports” vs. “Sports medicine
programs in California”
 Note: Google will ignore capitalization and
common words like “and”, “or”, “how”

Basic Operators
Quotes “”
1.
- Will search for pages with exact phrase
- Good if know title of article
- E.g. “Expression of the bilateral deficit”
Positive Terms +
2.
-
-
If put + before a word, search engine must
return results that include the word
Use if search is omitting common words
essential to results
E.g. Star Wars Episode +1
Basic Operators, con’t.
3. Negative Terms –
- If put – before a word, search engine will
omit results containing word
- Use if search term has more than one
meaning
-e.g. vo2max test -cycle
-e.g. rehab -winehouse
Advanced Operators
1.
2.
3.
-
OR search
Find results that include either of 2 search
terms
e.g. vo2max test cycle OR treadmill
Domain search
Find results on one specific website
E.g. biomechanics site:www.usc.edu
Fill in the blanks search *
Ask Google to fill in the blank for you
Add an * at the part you want filled in
e.g. glycolysis discovered by *
Helpful Specialized Search Sites

CSLA Library
– http://www.calstatela.edu/library/

Google Scholar
– http://scholar.google.com
– Searches research publications, books

Google Image search
– http://images.google.com
– Only returns results that contain images
Practical Ex. – CSLA Library Search
Usually, will use Article search
Then can choose
database by Subject
(i.e. Kinesiology)
Or choose database
directly by name (i.e.
Medline)
Helpful Hints

Can access CSLA library databases from home
–
–

Must have NIS account
If use ERIC, can only access abstracts, not full text
Most efficient to least efficient searching terms:
1.
2.
3.
4.
5.
Title
Keyword
Author
Abstract
All text
Ex – Medline Search
You want to find studies on caffeine
supplementation during high-intensity
exercise bouts of varying duration on a
cycle ergometer.
 Use the MEDLINE database through the
CSULA library
 Must find the full text version (html or
pdf).

Extra Credit
In preparation for next lecture, do the
following:
1. Sign up for gmail (free)
2. Email me from your gmail account
3. Do this by 5:00 pm on Friday and I will
add 1 extra point to your overall lab
score
 If you already have a gmail account, just
do steps 2-3.
References
Google Search Guide
http://www.google.com/intl/en/help/basics.html
 Wiki Search Engine page

http://en.wikipedia.org/wiki/Web_search_engine
Email
Tanenbaum, A. S. (2003). Computer Networks.
Upper Saddle River, NJ: Prentice Hall

Download