WWW6 Trip Report

advertisement
WWW 6 Trip Report
Report on the Sixth International
WWW Conference held in Santa
Clara on April 7-11, 1997
Brian Kelly
UK Web Focus
UKOLN
University of Bath
BA2 7AY
1
Contents
•
•
•
•
•
•
B.Kelly@ukoln.ac.uk
http://www.ukoln.ac.uk/ •
•
About WWW 6
W3C Session - HTML
Microsoft IE 4.0
W3C Session - HTTP
Web Site Mapping Workshop
XML Tutorial
Technical Papers
Plenary and Panel Sessions
UK Web Focus
UK Web Focus:
• National Web coordination post
• Responsibilities include:
– Technology watch
– Information dissemination
– Coordination
– Representing JISC on W3C
• Based at UKOLN, University of Bath
• Brian Kelly appointed on 1st Nov 1996
• Formerly worked at Netskills (Newcastle University)
and universities of Leeds, Liverpool and
Loughborough
2
About WWW 6
WWW 6:
• Held at the Santa Clara Convention Centre
• Workshops and tutorial sessions on
Monday 7th April
• Technical papers from 8-10th April
• Developer's Day and History Day on 11th
April
• Various other BOFs and meeting (including
Web Accessibility Initiative day)
• About 1,800 participants (down on
previous years)
3
Before the Conference
• Conference details
including online booking
available on Web -
http://ice.www6conf.org/
http://www6conf.org/
• Online conference (ICE)
available for delegates
before, during and after
conference
• Conference proceedings
available online
http://proceedings
.www6conf.org/
4
Accessibility
Accessibility was
conference theme
Web Accessibility Initiative:
• About 80 participants
• WAI approval
Issues
5
• Accessibility is important
• WAI will address:
protocols, software
developers, information
providers and end users
http://www.lbc.co.uk/
• Need for university /
departmental / project policy?
Accessibility
Comments made by Peter Bosher (RNIB):
6
• Poorly designed navigation (when you follow
several links, and then get back to where you
started) is a particular problem for the blind
who don't have the visual clues that the
sighted have
• When navigating the Hotwired site his
browser said "link link link link image image
image without caption"
• Tables are difficult to process by speech
browsers
• Painstakingly competing a form and then
getting an error message because the form
was browser specific is annoying
Further Information
Further information on the Web Accessibility
Initiative, and on general accessibility
resources is available at:
http://www.w3.org/pub/WWW/Disabilities/
http://weber.u.washington.edu/~doit/
Other/design.html
http://www.igs.net/~starling/acces.htm
http://trace.wisc.edu/
http://access.adobe.com/
http://cs.cornell.edu/home/raman/
7
W3C Session
HTML Developments
Cougar:
• Code name for next version of HTML
• Needed as "forces driving evolution of HTML are
threatening media independence"
• First draft available summer 97
• Some parts already released:
WD-htmllink
WD-script
WD-frames
WD-forms
WD-object
WD-entities
WD-fileupload
(WD-printing, WD-positioning)
• See http://www.w3.org/pub/WWW/
MarkUp/Cougar/
8
Day 1 - 2 pm
W3C Session
WD-htmllink
WD-htmllink
• Hypertext links in HTML.
• Defines link relationships (e.g. <LINK REL=Next>
could be used to facilitate printing in a single
document split into several files)
• Related to Style Sheet work
<LINK REL=Stylesheet MEDIA=print
HREF=ukoln-print.css>
• Define TITLE attribute for hints for accessibility e.g.
<A HREF="welcome.html" TITLE="The
Welcome page for ACME Ltd">text</A>
In this example a browser could display the text in,
say, a pop-up window.
9
Day 1 - 2 pm
W3C Session
WD-htmllink
WD-htmllink (continued)
• Defines uses of META tag (e.g. for use with
PICS)
• Define language(s) for documents
• Define pages which can be indexed by robots
<META NAME="ROBOTS"
CONTENT="NOINDEX, NOFOLLOW">
prohibits robots from indexing and
following links
10
Day 1 - 2 pm
W3C Session
WD-script
WD-script
• Client-side Scripting and HTML
• Defines mechanism for embedding scripting
languages in HTML:
<SCRIPT type="text/javascript">
document.write ("<EM>This will work</EM>")
</SCRIPT>
• <SCRIPT SRC="url"> used for external scripts
• Default script language can be defined:
<META HTTP-EQUIV="Content-ScriptType" CONTENT="text/tcl">
11
Day 1 - 2 pm
W3C Session
WD-frames
WD-frames
• Defines existing usage of FRAME tag
• Includes IFRAME proposal for inclusion of frames
inline in body of HTML documents (FRAME tag
replaces BODY tag):
<BODY>
<P>This document contains an inline frame
<IFRAME src="foo.html" width=400 height=500>
that unfortunately your user agent doesn't
support. Alternatively you can get the
related document <A href="foo.html">here
</A>.</IFRAME> That's all folks!
</BODY>
12
• Initial release of FRAMES caused problems,and
there are still accessibility concerns
Day 1 - 2 pm
W3C Session
CSS
The CSS1 (Cascading Style Sheet) spec:
•
•
•
•
Replaces all HTML extensions
Can replace most uses of tables
Provides floating text elements
Provides control over background
WD-style
• Defines relationships between HTML document
and stylesheet
• Cascading style sheets can be defined by:
<LINK REL=stylesheet HREF="corporate.css">
<LINK REL=stylesheet HREF="techreport.css">
• Can define alternate style sheets for media types:
13
<LINK REL=stylesheet MEDIA=screen …>
Day 1 - 2.45pm
W3C Session
CSS - The Next Wave
In next version of CSS:
• Multiple media (using @media)
• Improved printing support (headers, footers,
margins, etc.) See WD-printing
• Better control over positioning (e.g. out-of-flow
elements, navigational bars, layering) See WDpositioning
• Better font control (mixed fonts - e.g. helvetica and
cyrillic for, say, language dictionaries)
• Aural cascading style sheets. See WD-acss For
visually impaired an incar use, industrial and
medical systems, entertainment, illiterate:
14
– <STRONG> rendered as loud voice, pause-before
– Left column in table spoken in left speaker Day 1 - 2.45 pm
W3C Session
CSS Questions
Q How closely matched are the CSS and XML?
A They are a good match
Q Features such as side bars (e.g. <P CLASS=side>)
are likely to be widely used. Should such attributes
be registered?
A It's likely that a set of standard style sheet definitions
will be released, and they'll be widely used
Q Will CSS held to reduce network bandwidth?
A Yes, see
http://www.w3.org/pub/WWW/Protocols/
NL-PerfNote.html
15
Q What about headers and footers?
A In next release
Day 1 - 2.45pm
W3C Session
HTML Math
The HTML Math spec:
• Released in May 97 - see
http://www.w3.org/TR/WD-math
16
• Core standard covering presentation and markup
• Will contain 20 presentation tags with 40 attributes
• Will contain 50 content tags (roughly equivalent to
functions on scientific calculator)
• Initially implemented using embedded elements
(ActiveX, Java) enabling 3rd parties to develop
rendering tools (won't need to wait for Netscape /
Microsoft)
• Software will be available in June 97
• Full implementation requires better browser APIs (e.g.
DOM)
Day 1 - 2 pm
W3C Session
Dynamic HTML
• Based on work of the Document Object Model
(DOM) working group
• Provides an API (applications programming
interface) for HTML page structure and style
• Microsoft's Internet Explorer 4.0 implements
many features of Dynamic HTML
• See:
– http://www.microsoft.com/gallery/
files/html/default.htm
– http://www.microsoft.com/workshop/
prog/aplatfrm/dynhtml-f.htm
17
Day 1 - 2 pm
Microsoft IE 4.0
Presentation in the Industrial Presentations track on
Internet Explorer's implementation of Dynamic HTML
HTML is limited:
• Animation using animated GIFs is limited
• Can't position or layer elements
• The web is slow - interaction with server needed
Dynamic HTML:
• Based on emerging W3C proposals
• Covers:
– Dynamic styles and dynamic contents
– CSS positioning
– Data binding
18
Day 2 - 4 pm
Microsoft IE 4.0
Dynamic HTML
• Positioning control (in X,
Y and Z planes)
• Position can be changed
at run time (e.g. on
mouseOver event)
Dynamic Content
The Document Object Model (DOM) can be used to
change the content at runtime:
function change() "new header"
...
<H1 id=foo onclick=change()>Old header</H1>
For example a table of contents could be built
dynamically
19
Day 2 - 4 pm
Microsoft IE 4.0
Dynamic Style Sheets
CSS properties can be changed at run time:
<H1 onMouseOver {colour: red;}
onMouseOut {color=yellow}>
A more elegant way is to store the Javascript
code in the document HEAD (or externally)
CSS Positioning
Elements can be positioned absolutely or relative to
each other
<IMG SRC="smiley.gif" : top: 25% ; zindex=1>
<DIV zindex=2>Text on top of image</DIV>
20
Text on top of image
Day 2 - 4 pm
Microsoft IE 4.0
Data Binding
21
Currently data binding
(merged data from
database with HTML
code) is done on the
server
Data binding proposal
enables it to be done on
the client so that, for
example, the output from
a search engine can be
resorted on the client)
But will this be in
Cougar?
http://www.microsoft.
com/gallery/files/
html/repeat.htm
Day 2 - 4 pm
Microsoft IE 4.0
Dynamic HTML
Multimedia Effects
22
Uses IE4 control to provide
multimedia effects
Removes need for animated
GIF
Variety of effects available:
• Transitions, filters,
structured graphics,
sequencing
How will Powerpoint be
• sprite control (scripted
marketed if this is
animated GIFs)
possible in HTML?
• Example - e rotating in 3D
Day 2 - 4 pm
Microsoft IE 4.0
Document Object Model (DOM)
• DOM enables every HTML element to be
processed as an object (i.e. DOM is an API for
accessing HTML elements)
• DOM is not a
set of tags or a
new language
(it is language
independent)
23
Day 2 - 4 pm
Netscape
Netscape have also expressed support for Cougar
See
• http://www.netscape.
com/focus3/comprod/
columns/intranet/
open_standards.html
• http://www.netscape.
com/flash1/comprod/
products/
communicator/
24
Cougar and Browser Issues
Standards
• IE 4.0 conforms to HTML 3.2 and CSS1, and
implements new W3C drafts (e.g. DOM, forms)
• Netscape have also expressed commitment to
Cougar
• Javascript is being standardised by ECMA
Support for legacy browsers?
• Don't use new features
• New features degrade gracefully
• New features won't work (e.g. Tetris example 10K HTML file, which used no ActiveX controls)
25
Day 2 - 4 pm
Support for Multiple Browsers
How do we deploy new features when there are
many different versions and many browsers to
support?
• Offer choice at client:
Click here if you have IE 4 or Netscape 4
otherwise click here
Maintenance of resources can be done manually
or using site management tool
• Offer choice at server:
if useragent=IE4.0 then serve index.dom.html
else serve index.html
• Various toolkits can provide this (Microsoft Active
Server Pages, Netscape Suitespot, PHP/FI)
26
Developer's Day - HTTP
Transparent Content Negotiation (TCN)
• IETF draft
• Enables:
– Deployment of new data formats and tags
– Tailoring of content for new platforms
– Internationalisation
• See ftp://ftp.ietf.org/internet-drafts
/draft-ietf-http-negotiation-01.txt
• TCN spec server sends list of variants:
<IMG SRC="logo">
HTML source
{logo.gif
{logo.png
type image.png}
type image.png}
Variant list sent by server
27
Day 4 - 9 am
Developer's Day - HTTP
Feature Negotiation
• IETF draft
• Part of TCN, addressing extensibility
• No more "Click here from frames, here for
tables" buttons
• Universal agreement on new features not
needed
• See ftp://ftp.ietf.org/internetdrafts/draft-ietf-http-featurereg-00.txt
28
Day 4 - 9 am
Web Site Mapping Workshop
Half day workshops on web collections:
• Defining relationships between groups of related
documents
• Useful for:
– printing
– off-line browsing
– indexing
• Presentations on:
– Protocols (Web Collections, MCF, Dublin Core)
– Applications (HotSauce, WebCutter,
WebMapper)
29
Day 0 - 9 am
Web Visualisation
http://dynamicdiagrams.com/
Various proprietary ways
of visualising web sites
are available (e.g. MAPA,
WebCutter)
Need a standard
for defining
relationships
between web pages,
to provide application
independence
30
http://www6.nttlabs.com/HyperNews/
get/PAPER40.html
Day 0 - 9 am
Conclusions
The Web Site Mapping workshop agreed:
• Split protocol into three:
1 Metadata Model
2 File Format Syntax
3 Metadata Manipulation Language
• Further discussions needed on:
– Is a new query language is needed (e.g. SQL,
HyTime Query Language)
– Do we need a metadata query language or a
general document query language
– A list of common site mapping operations
31
• XML is likely to be the preferred syntax
• Web Collections spec to be updated http://www.w3.org/pub/WWW/TR/
NOTE-XMLsubmit.html
Day 0 - 9 am
Metadata Architecture
Much work in progress in developing a
metadata architecture for the web
Addressing
URL
Transport Data format
HTTP
HTML
32
XML Tutorial
XML:
•
•
•
•
•
•
Extensible markup language
An SGML-lite designed for the Internet
Developed by the SGML community
Tools being developed by SGML tool vendors
Microsoft involvement
Two XML draft specs:
English
– XML
French
– XML-link A richer form of linking
• Stricter than HTML (to reduce client processing):
– Attribute quoting <IMG SRC="logo.gif">
– End tags needed <P>A paragraph</P>
33
Day 0 - 2 pm
What's It Look Like?
What does an XML document look like?
<?XML VERSION="1.0" RMD="ALL" ENCODING="UTF-8"?>
<!doctype titlepage system "typo.dtd"
[<!entity % active.links "INCLUDE">]>
More complex XML
<titlepage>
document with DTD
<whitespace type="vertical" amount="36"/>
specified
<title font="Baskerville" size="24/30"
alignment="centered">Hello, world!</title>
<whitespace type="vertical" amount="12"/>
<!--* In some copies the following decoration is
hand-colored, presumably by the author *-->
<image location="http://www.foo.bar/fleuron.eps" Simple XML document
type="URL" alignment="centered"/>
<?XML VERSION="1.0" RMD="NONE"?>
<whitespace type="vertical" amount="24"/>
<conversation>
<author font="Baskerville" size="18/22"
style="italic">
Munde Salutem</author>
<greeting>Hello, world!</greeting>
</titlepage>
<response>Stop the planet,
34
I want to get off!</response>
</conversation>
The Jumbo XML Browser
Jumbo is an XML
browser written in
Java.
It was developed to
view CML
(Chemical Markup
Language)
resources.
It can view other http://www.venus.co.uk/
XML applications. omf/cml/download.html
35
CDF
XML can be used to define:
• Document structure
• Structure for metadata
• Structure for applications
CDF (Channel Definition Format) is:
• An XML application
• A proposed standard for push
technology
• Developed by Microsoft and
implemented in IE4
• See http://www.w3.org/
TR/NOTE-CDFsubmit.html
36
Further Information on XML
For further information see:
http://www.textuality.com/xml/
http://www.w3.org/MarkUp/SGML/
Activity
http://sunsite.unc.edu/pub/suninfo/standards/xml/why/xmlapps.htm
http://www.ucc.ie/xml
http://www.webreview.com/97/05/16/
feature/
37
Technical Paper
Extending HTML in a
Principled Way with Displets
This paper describes a Java solution to adding
new tags (charts, links, maths, etc.) to HTML.
<TAG NAME=CHART ..
SRC=chart.class>…</TAG>
<CHART TYPE=BAR>
<TABLE>
<TH> Smith<TR><TH>125…
</TABLE>
</CHART>
HTML source
38
Jan
Jan
Smith 125
Green 137
Apr
Apr
257
140
Jul
Oct
Jul
327
110
Oct
250
160
HTML output
Day 1 - 11am
Extending HTML in a
Principled Way with Displets
Other extensions:
Multi-way links
Graphs
Issues:
How does this fit in with XML?
How does this fit in with the HTML Math work, which
proposes using Java to display Maths output?
Further Information:
http://proceedings.www6conf.org/
HyperNews/get/PAPER155.html
http://pegasus.rutgers.edu/
~chchiu/displets.html
39
Towards a Multimedia World-Wide Web
Information Retrieval Engine
Paper that integrates
searching for text with
searching for images.
• Search for "baseball
player"
• Select one of the images
retrieved, and search for
others similar to this one
• Give schematic drawing
(as shown)
• Provide 3D visualisation of
search results using VRML
40
Find an image that
looks like this will find
Mickey Mouse from
the Disney Web site
Day 1 - 4 pm
Towards a Multimedia World-Wide
Web Information Retrieval Engine
The software is called
AMORE.
Prototype available at
http://www.ccrl.
neclab.com/amore/
See
http://www6.nttlabs.
com/HyperNews/
get/PAPER3.html
41
Day 1 - 4 pm
Dynamic Reference Sifting:
A Case Study
Described Ahoy! - a
Web application used
for finding personal
home pages based
on recognition of
likely hits from
directory naming
conventions.
42
See http://www6.nttlabs.com/
HyperNews/get/PAPER39.html and
http://www.cs.washington.edu/
research/ahoy
Day 1 - 4 pm
Parasite: Mining Structural
Information on the Web
Based on heuristic assumptions including:
Hypertext Linking
• A linked page is likely to be on the same topic as
the original page (esp. for Yahoo type resources)
Directory Structure
• A URL containing a directory below a personal
home page (PHP) is likely to be authored by the
person identified in the PHP
Page Structure
• Links "near" each other on a page are likely to
have similar topics
43
Day 1 - 4 pm
Parasite: Mining Structural
Information on the Web
Used these assumptions to propose
applications for finding:
• moved pages
• related pages
• people
Comments
• Brute strength approach
• This and preceding paper (on Ahoy!) show
importance of directory naming conventions
(directory names provide metadata - what can we
guess from the URL
www.cs.acme.edu/staff/jsmith)
44
Day 1 - 4 pm
WebCutter: A System for Dynamic
and Tailorable Site Mapping
Paper on IBM/Lotus
product for website
visualisation
• Implemented in Java
• Map generated on
the fly
• Map can be edited
by end user (to
define user's view of
web site)
http://www6.nttlabs.com/HyperNews/
get/PAPER40.html
45
Day 3 - 2 pm
WebQuery
Paper on various
visualisation
techniques for
searching.
Based on
structural
information to
find "hot spots"
http://www6.nttlabs.com/HyperNews/
get/PAPER96.html
46
Day 3 - 2 pm
Transforming Command-Line Driven
Systems to Web Applications
This paper described work
which used Java to
provide access to a legacy
command line application
(a monolithic Fortran 77
program) on the Web:
• Input to application
via Java form
• Output in Java
applet window
http://www6.nttlabs.com
• Commercial product /HyperNews/get/PAPER41.
available soon
html
47
Day 3 - 2 pm
Responsive Interaction for a Large Web
Application The Meteor Shower Architecture in
the WebWriter II Editor
An HTML
authoring tool
with server and
client side
processing.
Makes use of
frames.
Uses <BLINK> to
define flashing
cursor
48
http://www6.nttlabs.com/
HyperNews/get/PAPER86.html
Day 3 - 2 pm
Seamless Integration of
Interactive Forms into the Web
This paper described
limitations of existing
forms on the Web and
described how Dynamic
Forms would overcome
these limitations.
Dynamic Forms is based
on Java.
But what about W3C
work on HTML (new
DOM and Forms
http://www6.nttlabs.com/
specs)?
49
HyperNews/get/PAPER83.html
Plenary Talk
All Authored Works Online: A Global
Infrastructure for Universal Access to
Information
Raj Reddy, Dean of the School of Computer
Science at Carnegie Mellon spoke on the Universal
Library project - when "All Authored Works of the
Human Race will be available to anyone in the world
instantaneously".
His talk is available at
http://www.ul.cs.cmu.edu/rr/aaw/
aaw.html
50
Web Design Panel Session
Good Web Design Panel Session
• A very popular session on Web design
• Panelists included Jakob Neilson (Sun),
David Seigel (a leading web designer and
author of the 1 bit clear GIF used to provide
spacing), Cathy Gill (HTML Writer's Guild)
and others
• Strong disagreements between the design
community (David Seigel) and the
structuralist community (Jakon Neilson)
51
What is Good Web Design?
What is Good Design?
52
• No frames - Jakob Neilson's response (see his
paper on 10 top mistakes in web design)
• The design must be related to a user's tasks
• A well-designed site downloads quickly
• A good design doesn't break browsers
• No gratuitous backgrounds or animation
• Should be internally consistent
• Should be externally consistent in use of navigation
(Jakob Neilson's comments on need for standard
navigational aids such as not changing colours of
hypertext links)
• Variety in navigational aids (David Siegel's
response)
Bad Web Design?
There was much
disagreement over the
Seigel "hack" illustrated
53
• Image on the next page
stored in a 1 pixel by 1 pixel
• The image is downloaded
while main image is being
examined
• Moving to the next image it
appears to load quickly
• "Neat trick" - David Seigel
• "Hack", "future maintenance
problems", "no semantics"
image4.gif
<IMG SRC=image5.gif
height=1 width=1>
Other Papers
The Proceedings contains papers on:
54
• Beyond HTML
• Multidimensional Web Search
• Searching Heterogeneous
Sources
• Strategies for Resource
Location
• Learning User Preferences
• Usability Issues
• Counting on the Web
• Site Mapping and Syntactic
Analysis
• Information Access
• Automatic Interface
Generation
•
•
•
•
•
•
•
•
•
Database Query Techniques
Caching
Embedded Systems
Highly Configurable Web
Services
Distributed Servers
Accessibility Using Audio
Real Time Video
Real Time Audio
Security and Payment
Conclusions
• The hit of the conference was XML
• Java was the hit two years ago - now it's
accepted as the language of the Web
• W3C are facilitating developments of web
protocols
• Closed nature of W3C standardisation
creates uncertainty but seems needed
following problems with over standardisation
(e.g. HTML 3.0)
55
Challenges
The rapid development of the web poses
various challenges for the community:
•
•
•
•
Deploying new facilities in a timely manner
Choosing the winners
Spotting the winners
Minimising future web re-engineering
How can the community address these
challenges:
56
•
•
•
•
Institutionally
Regionally
Nationally
Internationally
Download